postgresql/src/include/optimizer
Richard Guo 383eb21ebf Convert NOT IN sublinks to anti-joins when safe
The planner has historically been unable to convert "x NOT IN (SELECT
y ...)" sublinks into anti-joins.  This is because standard SQL
semantics for NOT IN require that if no comparison "x = y" returns
true but at least one returns NULL, the "NOT IN" expression evaluates
to NULL (effectively false in a WHERE clause), causing the row to be
discarded.  In contrast, an anti-join preserves the row if no match is
found.  Due to this semantic mismatch regarding NULL handling, the
conversion was previously considered unsafe.
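
The mismatch can be illustrated with a small standalone model of SQL
three-valued logic.  This is only a sketch: the type and function
names (TriBool, NullableInt, sql_eq, sql_not_in, where_keeps_row,
anti_join_keeps_row) are hypothetical and are not PostgreSQL
internals.

```c
#include <stdbool.h>
#include <stddef.h>

/* SQL three-valued logic: a boolean result is false, true, or NULL. */
typedef enum { TV_FALSE, TV_TRUE, TV_NULL } TriBool;

/* Hypothetical nullable integer; is_null models SQL NULL. */
typedef struct
{
    bool is_null;
    int  value;
} NullableInt;

/* "a = b" under SQL semantics: NULL if either input is NULL. */
TriBool sql_eq(NullableInt a, NullableInt b)
{
    if (a.is_null || b.is_null)
        return TV_NULL;
    return (a.value == b.value) ? TV_TRUE : TV_FALSE;
}

/*
 * "x NOT IN (ys)": FALSE if any comparison is true; otherwise NULL if
 * any comparison is NULL; otherwise TRUE.
 */
TriBool sql_not_in(NullableInt x, const NullableInt *ys, size_t n)
{
    bool saw_null = false;

    for (size_t i = 0; i < n; i++)
    {
        TriBool r = sql_eq(x, ys[i]);

        if (r == TV_TRUE)
            return TV_FALSE;
        if (r == TV_NULL)
            saw_null = true;
    }
    return saw_null ? TV_NULL : TV_TRUE;
}

/* A WHERE clause keeps a row only when its qual is TRUE. */
bool where_keeps_row(TriBool qual)
{
    return qual == TV_TRUE;
}

/* An anti-join keeps the outer row unless some inner row matches. */
bool anti_join_keeps_row(NullableInt x, const NullableInt *ys, size_t n)
{
    for (size_t i = 0; i < n; i++)
    {
        if (sql_eq(x, ys[i]) == TV_TRUE)
            return false;
    }
    return true;
}
```

With x = 2 and an inner set containing 1 and NULL, sql_not_in yields
NULL, so the WHERE filter drops the row, while anti_join_keeps_row
retains it.  When no NULL is present on either side, the two functions
agree.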

However, if we can prove that neither side of the comparison can yield
NULL values, and further that the operator itself cannot return NULL
for non-null inputs, the behavior of NOT IN and anti-join becomes
identical.  Enabling this conversion allows the planner to treat the
sublink as a first-class relation rather than an opaque SubPlan
filter.  This unlocks global join ordering optimization and permits
the selection of the most efficient join algorithm based on cost,
often yielding significant performance improvements for large
datasets.

This patch verifies that neither side of the comparison can be NULL
and that the operator is safe regarding NULL results before performing
the conversion.

To verify operator safety, we require that the operator be a member of
a B-tree or Hash operator family.  This serves as a proxy for standard
boolean behavior, ensuring the operator does not return NULL on valid
non-null inputs, as doing so would break index integrity.

For operand non-nullability, this patch makes use of several existing
mechanisms.  It leverages the outer-join-aware-Var infrastructure to
verify that a Var does not come from the nullable side of an outer
join, and consults the NOT-NULL-attnums hash table to efficiently
verify schema-level NOT NULL constraints.  Additionally, it employs
find_nonnullable_vars to identify Vars forced non-nullable by qual
clauses, and expr_is_nonnullable to deduce non-nullability for other
expression types.
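
As a rough sketch of how the operator and operand checks described
above combine, the following standalone model may help.  It is not the
patch's code: OperandFacts, OperatorFacts, and the three functions are
hypothetical names, the boolean fields merely stand in for what the
real mechanisms would report, and the order in which the facts are
consulted is a simplifying assumption.

```c
#include <stdbool.h>

/* Hypothetical summary of what is known about one comparison operand. */
typedef struct
{
    bool is_var;                  /* plain column reference? */
    bool nulled_by_outer_join;    /* from the nullable side of an outer join */
    bool has_not_null_constraint; /* schema-level NOT NULL on the column */
    bool forced_nonnull_by_qual;  /* a qual clause rejects NULLs for it */
    bool expr_proven_nonnull;     /* non-Var expression proven non-null */
} OperandFacts;

/* Hypothetical summary of the comparison operator's opfamily membership. */
typedef struct
{
    bool in_btree_opfamily;
    bool in_hash_opfamily;
} OperatorFacts;

/* Proxy for "never returns NULL on non-null inputs". */
bool operator_is_safe(const OperatorFacts *opr)
{
    return opr->in_btree_opfamily || opr->in_hash_opfamily;
}

/* Assumed combination of the non-nullability facts for one operand. */
bool operand_is_nonnullable(const OperandFacts *op)
{
    if (!op->is_var)
        return op->expr_proven_nonnull;
    if (op->forced_nonnull_by_qual)
        return true;
    /* A NOT NULL column can still be nulled by an outer join. */
    return op->has_not_null_constraint && !op->nulled_by_outer_join;
}

/* The conversion is attempted only if every check passes. */
bool can_convert_not_in_to_antijoin(const OperatorFacts *opr,
                                    const OperandFacts *outer_side,
                                    const OperandFacts *inner_side)
{
    return operator_is_safe(opr) &&
           operand_is_nonnullable(outer_side) &&
           operand_is_nonnullable(inner_side);
}
```

In this model a NOT NULL column on the nullable side of an outer join
is rejected unless a qual clause forces it non-null, which mirrors why
a schema-level constraint alone is not sufficient.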

The logic for verifying the non-nullability of the subquery outputs
was adapted from prior work by David Rowley and Tom Lane.

Author: Richard Guo <guofenglinux@gmail.com>
Reviewed-by: wenhui qiu <qiuwenhuifx@gmail.com>
Reviewed-by: Zhang Mingli <zmlpostgres@gmail.com>
Reviewed-by: Japin Li <japinli@hotmail.com>
Discussion: https://postgr.es/m/CAMbWs495eF=-fSa5CwJS6B-BaEi3ARp0UNb4Lt3EkgUGZJwkAQ@mail.gmail.com
2026-03-12 09:45:18 +09:00
appendinfo.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
clauses.h Convert NOT IN sublinks to anti-joins when safe 2026-03-12 09:45:18 +09:00
cost.h Allow for plugin control over path generation strategies. 2026-01-28 11:28:34 -05:00
extendplan.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
geqo.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
geqo_copy.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
geqo_gene.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
geqo_misc.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
geqo_mutation.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
geqo_pool.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
geqo_random.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
geqo_recombination.h Fix typos and inconsistencies in code and comments 2026-01-05 09:19:15 +09:00
geqo_selection.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
inherit.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
joininfo.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
optimizer.h Convert NOT IN sublinks to anti-joins when safe 2026-03-12 09:45:18 +09:00
orclauses.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
paramassign.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
pathnode.h Replace get_relation_info_hook with build_simple_rel_hook. 2026-03-09 09:48:26 -04:00
paths.h Allow for plugin control over path generation strategies. 2026-01-28 11:28:34 -05:00
placeholder.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
plancat.h Replace get_relation_info_hook with build_simple_rel_hook. 2026-03-09 09:48:26 -04:00
planmain.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
planner.h Pass cursorOptions to planner_setup_hook. 2026-02-10 11:50:28 -05:00
prep.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
restrictinfo.h Update copyright for 2026 2026-01-01 13:24:10 -05:00
subselect.h Convert NOT IN sublinks to anti-joins when safe 2026-03-12 09:45:18 +09:00
tlist.h Update copyright for 2026 2026-01-01 13:24:10 -05:00