mirror of
https://github.com/postgres/postgres.git
synced 2026-02-10 14:23:26 -05:00
This patch makes two closely related sets of changes: 1. For UPDATE, the subplan of the ModifyTable node now only delivers the new values of the changed columns (i.e., the expressions computed in the query's SET clause) plus row identity information such as CTID. ModifyTable must re-fetch the original tuple to merge in the old values of any unchanged columns. The core advantage of this is that the changed columns are uniform across all tables of an inherited or partitioned target relation, whereas the other columns might not be. A secondary advantage, when the UPDATE involves joins, is that less data needs to pass through the plan tree. The disadvantage of course is an extra fetch of each tuple to be updated. However, that seems to be very nearly free in context; even worst-case tests don't show it to add more than a couple percent to the total query cost. At some point it might be interesting to combine the re-fetch with the tuple access that ModifyTable must do anyway to mark the old tuple dead; but that would require a good deal of refactoring and it seems it wouldn't buy all that much, so this patch doesn't attempt it. 2. For inherited UPDATE/DELETE, instead of generating a separate subplan for each target relation, we now generate a single subplan that is just exactly like a SELECT's plan, then stick ModifyTable on top of that. To let ModifyTable know which target relation a given incoming row refers to, a tableoid junk column is added to the row identity information. This gets rid of the horrid hack that was inheritance_planner(), eliminating O(N^2) planning cost and memory consumption in cases where there were many unprunable target relations. Point 2 of course requires point 1, so that there is a uniform definition of the non-junk columns to be returned by the subplan. We can't insist on uniform definition of the row identity junk columns however, if we want to keep the ability to have both plain and foreign tables in a partitioning hierarchy. Since it wouldn't scale very far to have every child table have its own row identity column, this patch includes provisions to merge similar row identity columns into one column of the subplan result. In particular, we can merge the whole-row Vars typically used as row identity by FDWs into one column by pretending they are type RECORD. (It's still okay for the actual composite Datums to be labeled with the table's rowtype OID, though.) There is more that can be done to file down residual inefficiencies in this patch, but it seems to be committable now. FDW authors should note several API changes: * The argument list for AddForeignUpdateTargets() has changed, and so has the method it must use for adding junk columns to the query. Call add_row_identity_var() instead of manipulating the parse tree directly. You might want to reconsider exactly what you're adding, too. * PlanDirectModify() must now work a little harder to find the ForeignScan plan node; if the foreign table is part of a partitioning hierarchy then the ForeignScan might not be the direct child of ModifyTable. See postgres_fdw for sample code. * To check whether a relation is a target relation, it's no longer sufficient to compare its relid to root->parse->resultRelation. Instead, check it against all_result_relids or leaf_result_relids, as appropriate. Amit Langote and Tom Lane Discussion: https://postgr.es/m/CA+HiwqHpHdqdDn48yCEhynnniahH78rwcrv1rEX65-fsZGBOLQ@mail.gmail.com
304 lines
7.7 KiB
C
304 lines
7.7 KiB
C
/*-------------------------------------------------------------------------
|
|
*
|
|
* execJunk.c
|
|
* Junk attribute support stuff....
|
|
*
|
|
* Portions Copyright (c) 1996-2021, PostgreSQL Global Development Group
|
|
* Portions Copyright (c) 1994, Regents of the University of California
|
|
*
|
|
*
|
|
* IDENTIFICATION
|
|
* src/backend/executor/execJunk.c
|
|
*
|
|
*-------------------------------------------------------------------------
|
|
*/
|
|
#include "postgres.h"
|
|
|
|
#include "executor/executor.h"
|
|
|
|
/*-------------------------------------------------------------------------
|
|
* XXX this stuff should be rewritten to take advantage
|
|
* of ExecProject() and the ProjectionInfo node.
|
|
* -cim 6/3/91
|
|
*
|
|
* An attribute of a tuple living inside the executor, can be
|
|
* either a normal attribute or a "junk" attribute. "junk" attributes
|
|
* never make it out of the executor, i.e. they are never printed,
|
|
* returned or stored on disk. Their only purpose in life is to
|
|
* store some information useful only to the executor, mainly the values
|
|
* of system attributes like "ctid", or sort key columns that are not to
|
|
* be output.
|
|
*
|
|
* The general idea is the following: A target list consists of a list of
|
|
* TargetEntry nodes containing expressions. Each TargetEntry has a field
|
|
* called 'resjunk'. If the value of this field is true then the
|
|
* corresponding attribute is a "junk" attribute.
|
|
*
|
|
* When we initialize a plan we call ExecInitJunkFilter to create a filter.
|
|
*
|
|
* We then execute the plan, treating the resjunk attributes like any others.
|
|
*
|
|
* Finally, when at the top level we get back a tuple, we can call
|
|
* ExecFindJunkAttribute/ExecGetJunkAttribute to retrieve the values of the
|
|
* junk attributes we are interested in, and ExecFilterJunk to remove all the
|
|
* junk attributes from a tuple. This new "clean" tuple is then printed,
|
|
* inserted, or updated.
|
|
*
|
|
*-------------------------------------------------------------------------
|
|
*/
|
|
|
|
/*
|
|
* ExecInitJunkFilter
|
|
*
|
|
* Initialize the Junk filter.
|
|
*
|
|
* The source targetlist is passed in. The output tuple descriptor is
|
|
* built from the non-junk tlist entries.
|
|
* An optional resultSlot can be passed as well; otherwise, we create one.
|
|
*/
|
|
JunkFilter *
|
|
ExecInitJunkFilter(List *targetList, TupleTableSlot *slot)
|
|
{
|
|
JunkFilter *junkfilter;
|
|
TupleDesc cleanTupType;
|
|
int cleanLength;
|
|
AttrNumber *cleanMap;
|
|
|
|
/*
|
|
* Compute the tuple descriptor for the cleaned tuple.
|
|
*/
|
|
cleanTupType = ExecCleanTypeFromTL(targetList);
|
|
|
|
/*
|
|
* Use the given slot, or make a new slot if we weren't given one.
|
|
*/
|
|
if (slot)
|
|
ExecSetSlotDescriptor(slot, cleanTupType);
|
|
else
|
|
slot = MakeSingleTupleTableSlot(cleanTupType, &TTSOpsVirtual);
|
|
|
|
/*
|
|
* Now calculate the mapping between the original tuple's attributes and
|
|
* the "clean" tuple's attributes.
|
|
*
|
|
* The "map" is an array of "cleanLength" attribute numbers, i.e. one
|
|
* entry for every attribute of the "clean" tuple. The value of this entry
|
|
* is the attribute number of the corresponding attribute of the
|
|
* "original" tuple. (Zero indicates a NULL output attribute, but we do
|
|
* not use that feature in this routine.)
|
|
*/
|
|
cleanLength = cleanTupType->natts;
|
|
if (cleanLength > 0)
|
|
{
|
|
AttrNumber cleanResno;
|
|
ListCell *t;
|
|
|
|
cleanMap = (AttrNumber *) palloc(cleanLength * sizeof(AttrNumber));
|
|
cleanResno = 0;
|
|
foreach(t, targetList)
|
|
{
|
|
TargetEntry *tle = lfirst(t);
|
|
|
|
if (!tle->resjunk)
|
|
{
|
|
cleanMap[cleanResno] = tle->resno;
|
|
cleanResno++;
|
|
}
|
|
}
|
|
Assert(cleanResno == cleanLength);
|
|
}
|
|
else
|
|
cleanMap = NULL;
|
|
|
|
/*
|
|
* Finally create and initialize the JunkFilter struct.
|
|
*/
|
|
junkfilter = makeNode(JunkFilter);
|
|
|
|
junkfilter->jf_targetList = targetList;
|
|
junkfilter->jf_cleanTupType = cleanTupType;
|
|
junkfilter->jf_cleanMap = cleanMap;
|
|
junkfilter->jf_resultSlot = slot;
|
|
|
|
return junkfilter;
|
|
}
|
|
|
|
/*
|
|
* ExecInitJunkFilterConversion
|
|
*
|
|
* Initialize a JunkFilter for rowtype conversions.
|
|
*
|
|
* Here, we are given the target "clean" tuple descriptor rather than
|
|
* inferring it from the targetlist. The target descriptor can contain
|
|
* deleted columns. It is assumed that the caller has checked that the
|
|
* non-deleted columns match up with the non-junk columns of the targetlist.
|
|
*/
|
|
JunkFilter *
|
|
ExecInitJunkFilterConversion(List *targetList,
|
|
TupleDesc cleanTupType,
|
|
TupleTableSlot *slot)
|
|
{
|
|
JunkFilter *junkfilter;
|
|
int cleanLength;
|
|
AttrNumber *cleanMap;
|
|
ListCell *t;
|
|
int i;
|
|
|
|
/*
|
|
* Use the given slot, or make a new slot if we weren't given one.
|
|
*/
|
|
if (slot)
|
|
ExecSetSlotDescriptor(slot, cleanTupType);
|
|
else
|
|
slot = MakeSingleTupleTableSlot(cleanTupType, &TTSOpsVirtual);
|
|
|
|
/*
|
|
* Calculate the mapping between the original tuple's attributes and the
|
|
* "clean" tuple's attributes.
|
|
*
|
|
* The "map" is an array of "cleanLength" attribute numbers, i.e. one
|
|
* entry for every attribute of the "clean" tuple. The value of this entry
|
|
* is the attribute number of the corresponding attribute of the
|
|
* "original" tuple. We store zero for any deleted attributes, marking
|
|
* that a NULL is needed in the output tuple.
|
|
*/
|
|
cleanLength = cleanTupType->natts;
|
|
if (cleanLength > 0)
|
|
{
|
|
cleanMap = (AttrNumber *) palloc0(cleanLength * sizeof(AttrNumber));
|
|
t = list_head(targetList);
|
|
for (i = 0; i < cleanLength; i++)
|
|
{
|
|
if (TupleDescAttr(cleanTupType, i)->attisdropped)
|
|
continue; /* map entry is already zero */
|
|
for (;;)
|
|
{
|
|
TargetEntry *tle = lfirst(t);
|
|
|
|
t = lnext(targetList, t);
|
|
if (!tle->resjunk)
|
|
{
|
|
cleanMap[i] = tle->resno;
|
|
break;
|
|
}
|
|
}
|
|
}
|
|
}
|
|
else
|
|
cleanMap = NULL;
|
|
|
|
/*
|
|
* Finally create and initialize the JunkFilter struct.
|
|
*/
|
|
junkfilter = makeNode(JunkFilter);
|
|
|
|
junkfilter->jf_targetList = targetList;
|
|
junkfilter->jf_cleanTupType = cleanTupType;
|
|
junkfilter->jf_cleanMap = cleanMap;
|
|
junkfilter->jf_resultSlot = slot;
|
|
|
|
return junkfilter;
|
|
}
|
|
|
|
/*
|
|
* ExecFindJunkAttribute
|
|
*
|
|
* Locate the specified junk attribute in the junk filter's targetlist,
|
|
* and return its resno. Returns InvalidAttrNumber if not found.
|
|
*/
|
|
AttrNumber
|
|
ExecFindJunkAttribute(JunkFilter *junkfilter, const char *attrName)
|
|
{
|
|
return ExecFindJunkAttributeInTlist(junkfilter->jf_targetList, attrName);
|
|
}
|
|
|
|
/*
|
|
* ExecFindJunkAttributeInTlist
|
|
*
|
|
* Find a junk attribute given a subplan's targetlist (not necessarily
|
|
* part of a JunkFilter).
|
|
*/
|
|
AttrNumber
|
|
ExecFindJunkAttributeInTlist(List *targetlist, const char *attrName)
|
|
{
|
|
ListCell *t;
|
|
|
|
foreach(t, targetlist)
|
|
{
|
|
TargetEntry *tle = lfirst(t);
|
|
|
|
if (tle->resjunk && tle->resname &&
|
|
(strcmp(tle->resname, attrName) == 0))
|
|
{
|
|
/* We found it ! */
|
|
return tle->resno;
|
|
}
|
|
}
|
|
|
|
return InvalidAttrNumber;
|
|
}
|
|
|
|
/*
|
|
* ExecFilterJunk
|
|
*
|
|
* Construct and return a slot with all the junk attributes removed.
|
|
*/
|
|
TupleTableSlot *
|
|
ExecFilterJunk(JunkFilter *junkfilter, TupleTableSlot *slot)
|
|
{
|
|
TupleTableSlot *resultSlot;
|
|
AttrNumber *cleanMap;
|
|
TupleDesc cleanTupType;
|
|
int cleanLength;
|
|
int i;
|
|
Datum *values;
|
|
bool *isnull;
|
|
Datum *old_values;
|
|
bool *old_isnull;
|
|
|
|
/*
|
|
* Extract all the values of the old tuple.
|
|
*/
|
|
slot_getallattrs(slot);
|
|
old_values = slot->tts_values;
|
|
old_isnull = slot->tts_isnull;
|
|
|
|
/*
|
|
* get info from the junk filter
|
|
*/
|
|
cleanTupType = junkfilter->jf_cleanTupType;
|
|
cleanLength = cleanTupType->natts;
|
|
cleanMap = junkfilter->jf_cleanMap;
|
|
resultSlot = junkfilter->jf_resultSlot;
|
|
|
|
/*
|
|
* Prepare to build a virtual result tuple.
|
|
*/
|
|
ExecClearTuple(resultSlot);
|
|
values = resultSlot->tts_values;
|
|
isnull = resultSlot->tts_isnull;
|
|
|
|
/*
|
|
* Transpose data into proper fields of the new tuple.
|
|
*/
|
|
for (i = 0; i < cleanLength; i++)
|
|
{
|
|
int j = cleanMap[i];
|
|
|
|
if (j == 0)
|
|
{
|
|
values[i] = (Datum) 0;
|
|
isnull[i] = true;
|
|
}
|
|
else
|
|
{
|
|
values[i] = old_values[j - 1];
|
|
isnull[i] = old_isnull[j - 1];
|
|
}
|
|
}
|
|
|
|
/*
|
|
* And return the virtual tuple.
|
|
*/
|
|
return ExecStoreVirtualTuple(resultSlot);
|
|
}
|