postgresql/src/include/replication
Masahiko Sawada 2f3304ce13 Fix possibility of logical decoding partial transaction changes.
When creating and initializing a logical slot, the restart_lsn is set
to the latest WAL insertion point (or the latest replay point on
standbys). Subsequently, WAL records are decoded from that point to
find the start point for extracting changes in the
DecodingContextFindStartpoint() function. Since the initial
restart_lsn could be in the middle of a transaction, the start point
must be a consistent point where we won't see the data for partial
transactions.

Previously, when not building a full snapshot, serialized snapshots
were restored, and the SnapBuild jumps to the consistent state even
while finding the start point. Consequently, the slot's restart_lsn
and confirmed_flush could be set to the middle of a transaction. This
could lead to various unexpected consequences. Specifically, there
were reports of logical decoding decoding partial transactions, and
assertion failures occurred because only subtransactions were decoded
without decoding their top-level transaction until decoding the commit
record.

To resolve this issue, the changes prevent restoring the serialized
snapshot and jumping to the consistent state while finding the start
point.

On v17 and HEAD, a flag indicating whether snapshot restores should be
skipped has been added to the SnapBuild struct, and SNAPBUILD_VERSION
has been bumpded.

On backbranches, the flag is stored in the LogicalDecodingContext
instead, preserving on-disk compatibility.

Backpatch to all supported versions.

Reported-by: Drew Callahan
Reviewed-by: Amit Kapila, Hayato Kuroda
Discussion: https://postgr.es/m/2444AA15-D21B-4CCE-8052-52C7C2DAFE5C%40amazon.com
Backpatch-through: 12
2024-07-11 22:48:18 +09:00
..
decode.h Update copyright for 2023 2023-01-02 15:00:37 -05:00
logical.h Fix possibility of logical decoding partial transaction changes. 2024-07-11 22:48:18 +09:00
logicallauncher.h Track logrep apply workers' last start times to avoid useless waits. 2023-01-22 14:08:46 -05:00
logicalproto.h Fix the display of UNKNOWN message type in apply worker. 2023-07-25 09:01:29 +05:30
logicalrelation.h Remove unnecessary checks for indexes for REPLICA IDENTITY FULL tables. 2023-07-25 15:09:31 +09:00
logicalworker.h Perform apply of large transactions by parallel workers. 2023-01-09 07:52:45 +05:30
message.h Update copyright for 2023 2023-01-02 15:00:37 -05:00
origin.h Perform apply of large transactions by parallel workers. 2023-01-09 07:52:45 +05:30
output_plugin.h Fix some typos and some incorrectly duplicated words 2023-04-18 14:03:49 +12:00
pgoutput.h Perform apply of large transactions by parallel workers. 2023-01-09 07:52:45 +05:30
reorderbuffer.h Rename logical_replication_mode to debug_logical_replication_streaming 2023-08-29 15:24:09 +02:00
slot.h Support invalidating replication slots due to horizon and wal_level 2023-04-07 22:40:27 -07:00
snapbuild.h Update copyright for 2023 2023-01-02 15:00:37 -05:00
syncrep.h Update copyright for 2023 2023-01-02 15:00:37 -05:00
walreceiver.h Add new predefined role pg_create_subscription. 2023-03-30 11:37:19 -04:00
walsender.h For cascading replication, wake physical and logical walsenders separately 2023-04-08 01:06:00 -07:00
walsender_private.h Optimize walsender wake up logic using condition variables 2023-05-21 09:44:55 -07:00
worker_internal.h Fix invalid memory access during the shutdown of the parallel apply worker. 2023-05-09 09:28:06 +05:30