postgresql

Commit Graph

Author	SHA1	Message	Date
Bruce Momjian	6b603e67dc	Add DOMAIN check constraints. Rod Taylor	2002-11-15 02:50:21 +00:00
Tom Lane	3779f7fd9f	Push qual clauses containing subplans to the back of the qual list at each plan node. Per gripe from Ross Reedstrom.	2002-11-15 02:36:53 +00:00
Tom Lane	89caf56b86	Fix planning bug introduced in recent code reorganization for hashed aggregates: tuple_fraction has to be adjusted before passing it to compare_fractional_path_costs().	2002-11-14 19:00:36 +00:00
Bruce Momjian	9b12ab6d5d	Add new palloc0 call as merge of palloc and MemSet(0).	2002-11-13 00:39:48 +00:00
Bruce Momjian	75fee4535d	Back out use of palloc0 in place if palloc/MemSet. Seems constant len to MemSet is a performance boost.	2002-11-11 03:02:20 +00:00
Bruce Momjian	8fee9615cc	Merge palloc()/MemSet(0) calls into a single palloc0() call.	2002-11-10 07:25:14 +00:00
Tom Lane	2103b7baa2	Phase 2 of hashed-aggregation project. nodeAgg.c now knows how to do hashed aggregation, but there's not yet planner support for it.	2002-11-06 22:31:24 +00:00
Tom Lane	f6dba10e62	First phase of implementing hash-based grouping/aggregation. An AGG plan node now does its own grouping of the input rows, and has no need for a preceding GROUP node in the plan pipeline. This allows elimination of the misnamed tuplePerGroup option for GROUP, and actually saves more code in nodeGroup.c than it costs in nodeAgg.c, as well as being presumably faster. Restructure the API of query_planner so that we do not commit to using a sorted or unsorted plan in query_planner; instead grouping_planner makes the decision. (Right now it isn't any smarter than query_planner was, but that will change as soon as it has the option to select a hash- based aggregation step.) Despite all the hackery, no initdb needed since only in-memory node types changed.	2002-11-06 00:00:45 +00:00
Tom Lane	884cd4b6be	Reduce a couple of debugging messages from LOG to DEBUG1 category.	2002-11-01 19:33:09 +00:00
Tom Lane	c0f7dcdac1	Fix range-query estimation to not double-exclude NULLs, per gripe from Ray Ontko 28-June-02. Also, fix prefix_selectivity for NAME lefthand variables (it was bogusly assuming binary compatibility), and adjust make_greater_string() to not call pg_mbcliplen() with invalid multibyte data (this last per bug report that I can't find at the moment, but it was in July '02).	2002-10-19 02:56:16 +00:00
Tom Lane	5bb46e7cd0	Fix for bug #795 : two clauses that seem redundant are not really, if one is pushed down into an outer join and the other is not.	2002-10-12 22:24:49 +00:00
Tom Lane	83fd58dff0	Add missing correction of sublevelsup when pulling up a subquery. Fixes problem with cases like SELECT * FROM foo t WHERE NOT EXISTS (SELECT remoteid FROM (SELECT f1 as remoteid FROM foo WHERE f1 = t.f1) AS t1)	2002-09-24 18:38:23 +00:00
Tom Lane	b26dfb9522	Extend pg_cast castimplicit column to a three-way value; this allows us to be flexible about assignment casts without introducing ambiguity in operator/function resolution. Introduce a well-defined promotion hierarchy for numeric datatypes (int2->int4->int8->numeric->float4->float8). Change make_const to initially label numeric literals as int4, int8, or numeric (never float8 anymore). Explicitly mark Func and RelabelType nodes to indicate whether they came from a function call, explicit cast, or implicit cast; use this to do reverse-listing more accurately and without so many heuristics. Explicit casts to char, varchar, bit, varbit will truncate or pad without raising an error (the pre-7.2 behavior), while assigning to a column without any explicit cast will still raise an error for wrong-length data like 7.3. This more nearly follows the SQL spec than 7.2 behavior (we should be reporting a 'completion condition' in the explicit-cast cases, but we have no mechanism for that, so just do silent truncation). Fix some problems with enforcement of typmod for array elements; it didn't work at all in 'UPDATE ... SET array[n] = foo', for example. Provide a generalized array_length_coerce() function to replace the specialized per-array-type functions that used to be needed (and were missing for NUMERIC as well as all the datetime types). Add missing conversions int8<->float4, text<->numeric, oid<->int8. initdb forced.	2002-09-18 21:35:25 +00:00
Tom Lane	6fdc44be71	Tweak querytree-dependency-extraction code so that columns of tables that are explicitly JOINed are not considered dependencies unless they are actually used in the query: mere presence in the joinaliasvars list of a JOIN RTE doesn't count as being used. The patch touches a number of files because I needed to generalize the API of query_tree_walker to support an additional flag bit, but the changes are otherwise quite small.	2002-09-11 14:48:55 +00:00
Tom Lane	52c9d25933	Be careful to include postgres.h before any system headers, to ensure that the right flavors of largefile-related definitions are seen. Most of these changes are probably unnecessary, but better safe than sorry.	2002-09-05 00:43:07 +00:00
Bruce Momjian	e50f52a074	pgindent run.	2002-09-04 20:31:48 +00:00
Bruce Momjian	595a5a78e0	> Okay. When you get back to the original issue, the gold is hidden in > src/backend/optimizer/path/indxpath.c; see the "special indexable > operators" stuff near the bottom of that file. (It's a bit of a crock > that this code is hardwired there, and not somehow accessed through a > system catalog, but it's what we've got at the moment.) The attached patch re-enables a bytea right hand argument (as compared to a text right hand argument), and enables index usage, for bytea LIKE Joe Conway	2002-09-02 06:22:20 +00:00
Bruce Momjian	97ac103289	Remove sys/types.h in files that include postgres.h, and hence c.h, because c.h has sys/types.h.	2002-09-02 02:47:07 +00:00
Tom Lane	845a6c3acc	Code review for domain-constraints patch. Use a new ConstraintTest node type for runtime constraint checks, instead of misusing the parse-time Constraint node for the purpose. Fix some damage introduced into type coercion logic; in particular ensure that a coerced expression tree will read out the correct result type when inspected (patch had broken some RelabelType cases). Enforce domain NOT NULL constraints against columns that are omitted from an INSERT.	2002-08-31 22:10:48 +00:00
Tom Lane	0201dac1c3	Push down outer qualification clauses into UNION and INTERSECT subqueries. Per pghackers discussion from back around 1-August.	2002-08-29 16:03:49 +00:00
Bruce Momjian	81dfa2ce43	backend where a statically sized buffer is written to. Most of these should be pretty safe in practice, but it's probably better to be safe than sorry. I was actually looking for cases where NAMEDATALEN is assumed to be 32, but only found one. That's fixed too, as well as a few bits of code cleanup. Neil Conway	2002-08-28 20:46:24 +00:00
Bruce Momjian	39e331be72	Add Bob Devine's name to the optimizer README.	2002-08-25 22:39:37 +00:00
Peter Eisentraut	f1d820494c	Fix failure to relink postmaster executable in the first make run if only a single source file a few directories deep in the backend tree has changed.	2002-08-10 17:59:28 +00:00
Tom Lane	38bb77a5d1	ALTER TABLE DROP COLUMN works. Patch by Christopher Kings-Lynne, code review by Tom Lane. Remaining issues: functions that take or return tuple types are likely to break if one drops (or adds!) a column in the table defining the type. Need to think about what to do here. Along the way: some code review for recent COPY changes; mark system columns attnotnull = true where appropriate, per discussion a month ago.	2002-08-02 18:15:10 +00:00
Tom Lane	76099408f6	If we're cleaning out _deadcode, might as well zap this one too.	2002-07-30 18:54:59 +00:00
Peter Eisentraut	43515ba3f8	Remove _deadcode.	2002-07-24 19:16:43 +00:00
Bruce Momjian	b0f5086e41	oid is needed, it is added at the end of the struct (after the null bitmap, if present). Per Tom Lane's suggestion the information whether a tuple has an oid or not is carried in the tuple descriptor. For debugging reasons tdhasoid is of type char, not bool. There are predefined values for WITHOID, WITHOUTOID and UNDEFOID. This patch has been generated against a cvs snapshot from last week and I don't expect it to apply cleanly to current sources. While I post it here for public review, I'm working on a new version against a current snapshot. (There's been heavy activity recently; hope to catch up some day ...) This is a long patch; if it is too hard to swallow, I can provide it in smaller pieces: Part 1: Accessor macros Part 2: tdhasoid in TupDesc Part 3: Regression test Part 4: Parameter withoid to heap_addheader Part 5: Eliminate t_oid from HeapTupleHeader Part 2 is the most hairy part because of changes in the executor and even in the parser; the other parts are straightforward. Up to part 4 the patched postmaster stays binary compatible to databases created with an unpatched version. Part 5 is small (100 lines) and finally breaks compatibility. Manfred Koizar	2002-07-20 05:16:59 +00:00
Bruce Momjian	38dd3ae7d0	The attached patch fixes a build problem with GEQO when using the PX recombination operator, changes some elog() messages from LOG to DEBUG1, puts some debugging functions inside the appropriate #ifdef (not enabled by default), and makes a few other minor cleanups. BTW, the elog() change is motivated by at least one user who has sent a concerned email to -general asking exactly what the "ERX recombination operator" is, and what it is doing to their DBMS. Neil Conway	2002-07-20 04:59:10 +00:00
Bruce Momjian	7d78bac108	Back out BETWEEN node patch, was causing initdb failure.	2002-07-18 17:14:20 +00:00
Bruce Momjian	3e22406ec6	Finished the Between patch Christopher started. Implements between (symmetric / asymmetric) as a node. Executes the left or right expression once, makes a Const out of the resulting Datum and executes the >=, <= portions out of the Const sets. Of course, the parser does a fair amount of preparatory work for this to happen. Rod Taylor	2002-07-18 04:41:46 +00:00
Tom Lane	942a2e94fa	Fix testing of partial-index predicates to work correctly in cases where varno of index's relation is not 1. This embarrassing oversight pointed out by Dmitry Tkach 12-Jul-02.	2002-07-13 19:20:34 +00:00
Bruce Momjian	1666970275	I've fixed up the way domain constraints (not null and type length) are managed as per request. Moved from merging with table attributes to applying themselves during coerce_type() and coerce_type_typmod. Regression tests altered to test the cast() scenarios. Rod Taylor	2002-07-06 20:16:36 +00:00
Thomas G. Lockhart	68d9fbeb55	Implement the IS DISTINCT FROM operator per SQL99. Reused the Expr node to hold DISTINCT which strongly resembles the existing OP info. Define DISTINCT_EXPR which strongly resembles the existing OPER_EXPR opType, but with handling for NULLs required by SQL99. We have explicit support for single-element DISTINCT comparisons all the way through to the executor. But, multi-element DISTINCTs are handled by expanding into a comparison tree in gram.y as is done for other row comparisons. Per discussions, it might be desirable to move this into one or more purpose-built nodes to be handled in the backend. Define the optional ROW keyword and token per SQL99. This allows single-element row constructs, which were formerly disallowed due to shift/reduce conflicts with parenthesized a_expr clauses. Define the SQL99 TREAT() function. Currently, use as a synonym for CAST().	2002-07-04 15:24:11 +00:00
Bruce Momjian	73ad6ca96c	The attached patch fixes some spelling mistakes, makes the comments on one of the optimizer functions a lot more clear, adds a summary of the recent KSQO discussion to the comments in the code, adds regression tests for the bug with sequence state Tom fixed recently and another reg. test, and removes some PostQuel legacy stuff: ExecAppend -> ExecInsert, ExecRetrieve -> ExecSelect, etc. Error messages remain unchanged until a vote. Neil Conway	2002-06-26 21:58:56 +00:00
Bruce Momjian	e2c007046f	Back out cleanup patch. Got old version and needs work. Neil Conway	2002-06-25 17:58:10 +00:00
Bruce Momjian	ed275aea42	The attached patch fixes some spelling mistakes, makes the comments on one of the optimizer functions a lot more clear, adds a summary of the recent KSQO discussion to the comments in the code, adds regression tests for the bug with sequence state Tom fixed recently and another reg. test, and removes some PostQuel legacy stuff: ExecAppend -> ExecInsert, ExecRetrieve -> ExecSelect, etc. This was changed because the elog() messages from this routine are user-visible, so we should be using the SQL terms. Neil Conway	2002-06-25 17:27:20 +00:00
Bruce Momjian	d84fe82230	Update copyright to 2002.	2002-06-20 20:29:54 +00:00
Bruce Momjian	0dbfea39f3	Remove KSQO from GUC and move file to _deadcode.	2002-06-16 00:09:12 +00:00
Tom Lane	f67a931aa4	Make WHERE conditions pulled up from subqueries be executed before outer WHERE conditions, if there is no reason to do it differently.	2002-06-13 15:10:25 +00:00
Tom Lane	44fbe20d62	Restructure indexscan API (index_beginscan, index_getnext) per yesterday's proposal to pghackers. Also remove unnecessary parameters to heap_beginscan, heap_rescan. I modified pg_proc.h to reflect the new numbers of parameters for the AM interface routines, but did not force an initdb because nothing actually looks at those fields.	2002-05-20 23:51:44 +00:00
Tom Lane	a5b370943e	Teach query_tree_walker, query_tree_mutator, and SS_finalize_plan to process function RTE expressions, which they were previously missing. This allows outer-Var references and subselects to work correctly in the arguments of a function RTE. Install check to prevent function RTEs from cross-referencing Vars of sibling FROM-items, which doesn't make any sense (if you want to join, write a JOIN or WHERE clause).	2002-05-18 18:49:41 +00:00
Tom Lane	51fd22abdd	Change set_plan_references and join_references to take an rtable List rather than a Query node; this allows set_plan_references to recurse into subplans correctly. Fixes core dump on full outer joins in subplans. Also, invoke preprocess_expression on function RTEs' function expressions. This seems to fix the planner's problems with outer-level Vars in function RTEs.	2002-05-18 02:25:50 +00:00
Tom Lane	0a757154bd	Add missing fix_expr_references() step for the funcexpr of a FunctionScan plan node.	2002-05-18 00:42:55 +00:00
Tom Lane	22d641a7d4	Get rid of the last few uses of typeidTypeName() rather than format_type_be() in error messages.	2002-05-17 22:35:13 +00:00
Tom Lane	3389a110d4	Get rid of long-since-vestigial Iter node type, in favor of adding a returns-set boolean field in Func and Oper nodes. This allows cleaner, more reliable tests for expressions returning sets in the planner and parser. For example, a WHERE clause returning a set is now detected and complained of in the parser, not only at runtime.	2002-05-12 23:43:04 +00:00
Tom Lane	f9e4f611a1	First pass at set-returning-functions in FROM, by Joe Conway with some kibitzing from Tom Lane. Not everything works yet, and there's no documentation or regression test, but let's commit this so Joe doesn't need to cope with tracking changes in so many files ...	2002-05-12 20:10:05 +00:00
Tom Lane	6c59886942	Second try at fixing join alias variables. Instead of attaching miscellaneous lists to join RTEs, attach a list of Vars and COALESCE expressions that will replace the join's alias variables during planning. This simplifies flatten_join_alias_vars while still making it easy to fix up varno references when transforming the query tree. Add regression test cases for interactions of subqueries with outer joins.	2002-04-28 19:54:29 +00:00
Tom Lane	6cef5d2549	Operators live in namespaces. CREATE/DROP/COMMENT ON OPERATOR take qualified operator names directly, for example CREATE OPERATOR myschema.+ ( ... ). To qualify an operator name in an expression you need to write OPERATOR(myschema.+) (thanks to Peter for suggesting an escape hatch). I also took advantage of having to reformat pg_operator to fix something that'd been bugging me for a while: mergejoinable operators should have explicit links to the associated cross-data-type comparison operators, rather than hardwiring an assumption that they are named < and >.	2002-04-16 23:08:12 +00:00
Tom Lane	9999f5a10e	Checking to decide whether relations are system relations now depends on the namespace not the name; pg_ is not a reserved prefix for table names anymore. From Fernando Nasser.	2002-04-12 20:38:31 +00:00
Tom Lane	902a6a0a4b	Restructure representation of aggregate functions so that they have pg_proc entries, per pghackers discussion. This fixes aggregates to live in namespaces, and also simplifies/speeds up lookup in parse_func.c. Also, add a 'proimplicit' flag to pg_proc that controls whether a type coercion function may be invoked implicitly, or only explicitly. The current settings of these flags are more permissive than I would like, but we will need to debate and refine the behavior; for now, I avoided breaking regression tests as much as I could.	2002-04-11 20:00:18 +00:00
Tom Lane	b9ae55f2aa	Undo not-so-hot decision to postpone insertion of default values into INSERT statements to the planner. Taking it out of the parser was right (so that defaults don't get into stored rules), but it has to happen before rewrite rule expansion, else references to NEW.field behave incorrectly. Accordingly, add a step to the rewriter to insert defaults just before rewrite-rule expansion.	2002-04-05 05:47:05 +00:00
Tom Lane	4bdb4be62e	Divide functions into three volatility classes (immutable, stable, and volatile), rather than the old cachable/noncachable distinction. This allows indexscan optimizations in many places where we formerly didn't. Also, add a pronamespace column to pg_proc (it doesn't do anything yet, however).	2002-04-05 00:31:36 +00:00
Hiroshi Inoue	c26a44db08	Removed obsolete DROP_COLUMN_HACK stuff.	2002-04-02 08:51:52 +00:00
Tom Lane	108a0ec87d	A little further progress on schemas: push down RangeVars into addRangeTableEntry calls. Remove relname field from RTEs, since it will no longer be a useful unique identifier of relations; we want to encourage people to rely on the relation OID instead. Further work on dumping qual expressions in EXPLAIN, too.	2002-03-22 02:56:37 +00:00
Tom Lane	95ef6a3448	First phase of SCHEMA changes, concentrating on fixing the grammar and the parsetree representation. As yet we don't do anything with schema names, just drop 'em on the floor; but you can enter schema-compatible command syntax, and there's even a primitive CREATE SCHEMA command. No doc updates yet, except to note that you can now extract a field from a function-returning-row's result with (foo(...)).fieldname.	2002-03-21 16:02:16 +00:00
Tom Lane	337b22cb47	Code review for DOMAIN patch.	2002-03-20 19:45:13 +00:00
Bruce Momjian	d3788c3305	Add DOMAIN support. Includes manual pages and regression tests, from Rod Taylor.	2002-03-19 02:18:25 +00:00
Tom Lane	6eeb95f0f5	Restructure representation of join alias variables. An explicit JOIN now has an RTE of its own, and references to its outputs now are Vars referencing the JOIN RTE, rather than CASE-expressions. This allows reverse-listing in ruleutils.c to use the correct alias easily, rather than painfully reverse-engineering the alias namespace as it used to do. Also, nested FULL JOINs work correctly, because the result of the inner joins are simple Vars that the planner can cope with. This fixes a bug reported a couple times now, notably by Tatsuo on 18-Nov-01. The alias Vars are expanded into COALESCE expressions where needed at the very end of planning, rather than during parsing. Also, beginnings of support for showing plan qualifier expressions in EXPLAIN. There are probably still cases that need work. initdb forced due to change of stored-rule representation.	2002-03-12 00:52:10 +00:00
Bruce Momjian	b976b8af80	Back out domain patch until it works properly.	2002-03-07 16:35:41 +00:00
Bruce Momjian	01c76f7411	Ok. Updated patch attached. - domain.patch -> source patch against pgsql in cvs - drop_domain.sgml and create_domain.sgml -> New doc/src/sgml/ref docs - dominfo.txt -> basic domain related queries I used for testing [ ADDED TO /doc] Enables domains of array elements -> CREATE DOMAIN dom int4[3][2]; Uses a typbasetype column to describe the origin of the domain. Copies data to attnotnull rather than processing in execMain(). Some documentation differences from earlier. If this is approved, I'll start working on pg_dump, and a \dD <domain> option in psql, and regression tests. I don't really feel like doing those until the system table structure settles for pg_type. CHECKS when added, will also be copied to to the table attributes. FK Constraints (if I ever figure out how) will be done similarly. Both will lbe handled by MergeDomainAttributes() which is called shortly before MergeAttributes(). Rod Taylor	2002-03-06 20:35:02 +00:00
Bruce Momjian	92288a1cf9	Change made to elog: o Change all current CVS messages of NOTICE to WARNING. We were going to do this just before 7.3 beta but it has to be done now, as you will see below. o Change current INFO messages that should be controlled by client_min_messages to NOTICE. o Force remaining INFO messages, like from EXPLAIN, VACUUM VERBOSE, etc. to always go to the client. o Remove INFO from the client_min_messages options and add NOTICE. Seems we do need three non-ERROR elog levels to handle the various behaviors we need for these messages. Regression passed.	2002-03-06 06:10:59 +00:00
Tom Lane	944671820f	Previous patch to mark UNION outputs with common typmod (if any) breaks three-or-more-way UNIONs, as per example from Josh Berkus. Cause is a fragile assumption that one tlist's entries will exactly match another. Restructure code to make that assumption a little less fragile.	2002-03-05 05:10:24 +00:00
Bruce Momjian	a033daf566	Commit to match discussed elog() changes. Only update is that LOG is now just below FATAL in server_min_messages. Added more text to highlight ordering difference between it and client_min_messages. --------------------------------------------------------------------------- REALLYFATAL => PANIC STOP => PANIC New INFO level the prints to client by default New LOG level the prints to server log by default Cause VACUUM information to print only to the client NOTICE => INFO where purely information messages are sent DEBUG => LOG for purely server status messages DEBUG removed, kept as backward compatible DEBUG5, DEBUG4, DEBUG3, DEBUG2, DEBUG1 added DebugLvl removed in favor of new DEBUG[1-5] symbols New server_min_messages GUC parameter with values: DEBUG[5-1], INFO, NOTICE, ERROR, LOG, FATAL, PANIC New client_min_messages GUC parameter with values: DEBUG[5-1], LOG, INFO, NOTICE, ERROR, FATAL, PANIC Server startup now logged with LOG instead of DEBUG Remove debug_level GUC parameter elog() numbers now start at 10 Add test to print error message if older elog() values are passed to elog() Bootstrap mode now has a -d that requires an argument, like postmaster	2002-03-02 21:39:36 +00:00
Tom Lane	54f7f62d4a	Fix thinko: cost_mergejoin must pay attention to which side of the mergeclause is which when extracting selectivity info.	2002-03-01 20:50:20 +00:00
Tom Lane	8f0a9e85b3	Second thoughts dept: arrange to cache mergejoin scan selectivity in RestrictInfo nodes, instead of recomputing on every use.	2002-03-01 06:01:20 +00:00
Tom Lane	f8c109528c	Teach planner about the idea that a mergejoin won't necessarily read both input streams to the end. If one variable's range is much less than the other, an indexscan-based merge can win by not scanning all of the other table. Per example from Reinhard Max.	2002-03-01 04:09:28 +00:00
Tom Lane	7863404417	A bunch of changes aimed at reducing backend startup time... Improve 'pg_internal.init' relcache entry preload mechanism so that it is safe to use for all system catalogs, and arrange to preload a realistic set of system-catalog entries instead of only the three nailed-in-cache indexes that were formerly loaded this way. Fix mechanism for deleting out-of-date pg_internal.init files: this must be synchronized with transaction commit, not just done at random times within transactions. Drive it off relcache invalidation mechanism so that no special-case tests are needed. Cache additional information in relcache entries for indexes (their pg_index tuples and index-operator OIDs) to eliminate repeated lookups. Also cache index opclass info at the per-opclass level to avoid repeated lookups during relcache load. Generalize 'systable scan' utilities originally developed by Hiroshi, move them into genam.c, use in a number of places where there was formerly ugly code for choosing either heap or index scan. In particular this allows simplification of the logic that prevents infinite recursion between syscache and relcache during startup: we can easily switch to heapscans in relcache.c when and where needed to avoid recursion, so IndexScanOK becomes simpler and does not need any expensive initialization. Eliminate useless opening of a heapscan data structure while doing an indexscan (this saves an mdnblocks call and thus at least one kernel call).	2002-02-19 20:11:20 +00:00
Tom Lane	f7fb29dec3	Shouldn't try to copy null datums with datumCopy.	2002-01-03 18:01:59 +00:00
Tom Lane	63cc56de54	Suppress subquery pullup and pushdown when the subquery has any set-returning functions in its target list. This ensures that we won't rewrite the query in a way that places set-returning functions into quals (WHERE clauses). Cf. bug reports from Joe Conway.	2001-12-10 22:54:12 +00:00
Tom Lane	c31bcbc8d6	Repair failure to mark an inserted Materialize node with the appropriate extParam/locParam lists. Per bug #526.	2001-11-30 19:24:15 +00:00
Tom Lane	e433bf5a5e	If the inputs of a UNION/INTERSECT/EXCEPT construct all agree on the typmod of a particular column, mark the output with that same typmod, not -1 as formerly. -1 is still used if there is any disagreement. Part of response to bug#513.	2001-11-12 20:04:20 +00:00
Tom Lane	c5c97318f9	In find_mergeclauses_for_pathkeys, it's okay to return multiple merge clauses per path key. Indeed, we must do so or we will be unable to form a valid plan for FULL JOIN with overlapping join conditions, eg select * from a full join b on a.v1 = b.v1 and a.v2 = b.v2 and a.v1 = b.v2.	2001-11-11 20:33:53 +00:00
Tom Lane	ad511a3ff3	sort_inner_and_outer needs a check to ensure that it's consumed all the mergeclauses in RIGHT/FULL join cases, just like the other routines have. I'm not quite sure why I thought it didn't need one --- but Nick Fankhauser's recent bug report proves that it does.	2001-11-11 19:18:54 +00:00
Bruce Momjian	ea08e6cd55	New pgindent run with fixes suggested by Tom. Patch manually reviewed, initdb/regression tests pass.	2001-11-05 17:46:40 +00:00
Tom Lane	9685afb0b2	Add default expressions to INSERTs during planning, not during parse analysis. This keeps stored rules from prematurely absorbing default information, which is necessary for ALTER TABLE SET DEFAULT to work unsurprisingly with rules. See pgsql-bugs discussion 24-Oct-01.	2001-11-02 20:23:02 +00:00
Tom Lane	96ca8ffebc	Fix problems with subselects used in GROUP BY expressions, per gripe from Philip Warner. Side effect of change is that GROUP BY expressions will not be re-evaluated at multiple plan levels anymore, whereas this sometimes happened with old code.	2001-10-30 19:58:58 +00:00
Bruce Momjian	c41b6b1b9c	Fix small problem Tom Lane found with pgindent run.	2001-10-30 05:38:56 +00:00
Bruce Momjian	6783b2372e	Another pgindent run. Fixes enum indenting, and improves #endif spacing. Also adds space for one-line comments.	2001-10-28 06:26:15 +00:00
Bruce Momjian	b81844b173	pgindent run on all C files. Java run to follow. initdb/regression tests pass.	2001-10-25 05:50:21 +00:00
Tom Lane	6254465d06	Extend code that deduces implied equality clauses to detect whether a clause being added to a particular restriction-clause list is redundant with those already in the list. This avoids useless work at runtime, and (perhaps more importantly) keeps the selectivity estimation routines from generating too-small estimates of numbers of output rows. Also some minor improvements in OPTIMIZER_DEBUG displays.	2001-10-18 16:11:42 +00:00
Tom Lane	6f33c179b9	Produce slightly saner-looking EXPLAIN output for a Result node.	2001-09-21 04:06:04 +00:00
Tom Lane	6c91eef7b7	Fix handling of pg_type.typdefault per bug report from Dave Blasby. If there's anyone out there who's actually using datatype-defined default values, this will be an incompatible change in behavior ... but the old behavior was so broken that I doubt anyone was using it.	2001-09-06 02:07:42 +00:00
Tom Lane	f933766ba7	Restructure pg_opclass, pg_amop, and pg_amproc per previous discussions in pgsql-hackers. pg_opclass now has a row for each opclass supported by each index AM, not a row for each opclass name. This allows pg_opclass to show directly whether an AM supports an opclass, and furthermore makes it possible to store additional information about an opclass that might be AM-dependent. pg_opclass and pg_amop now store "lossy" and "haskeytype" information that we previously expected the user to remember to provide in CREATE INDEX commands. Lossiness is no longer an index-level property, but is associated with the use of a particular operator in a particular index opclass. Along the way, IndexSupportInitialize now uses the syscaches to retrieve pg_amop and pg_amproc entries. I find this reduces backend launch time by about ten percent, at the cost of a couple more special cases in catcache.c's IndexScanOK. Initial work by Oleg Bartunov and Teodor Sigaev, further hacking by Tom Lane. initdb forced.	2001-08-21 16:36:06 +00:00
Tom Lane	4bc9f5e9ba	Fix brokenness of nested EXCEPT/INTERSECT queries. prepunion was being a tad sloppy about generating the targetlist for some nodes, by generating a tlist entry that claimed to be a constant when the value wasn't actually constant. This caused setrefs.c to do the wrong thing later on.	2001-08-14 17:12:57 +00:00
Tom Lane	246793469e	Modify partial-index-predicate applicability tester to test whether clauses are equal(), before trying to match them up using btree opclass inference rules. This allows it to recognize many simple cases involving non-btree operations, for example 'x IS NULL'. Clean up code a little.	2001-08-06 18:09:45 +00:00
Tom Lane	0889bd00bd	Further thought shows that has_distinct_on_clause() needs to take much more care with resjunk tlist entries than it was doing. The original coding ignored resjunk entries entirely, but a resjunk entry that is in either the distinctClause or sortClause lists indicates that DISTINCT ON was used. It's important for ruleutils.c to get this right, else we may dump views using DISTINCT ON incorrectly.	2001-07-31 20:16:33 +00:00
Tom Lane	421467cdc8	Fix optimizer to not try to push WHERE clauses down into a sub-SELECT that has a DISTINCT ON clause, per bug report from Anthony Wood. While at it, improve the DISTINCT-ON-clause recognizer routine to not be fooled by out- of-order DISTINCT lists.	2001-07-31 17:56:31 +00:00
Tom Lane	40db52af34	Do not push down quals into subqueries that have LIMIT/OFFSET clauses, since the added qual could change the set of rows that get past the LIMIT. Per discussion on pgsql-sql 7/15/01.	2001-07-16 17:57:02 +00:00
Tom Lane	f31dc0ada7	Partial indexes work again, courtesy of Martijn van Oosterhout. Note: I didn't force an initdb, figuring that one today was enough. However, there is a new function in pg_proc.h, and pg_dump won't be able to dump partial indexes until you add that function.	2001-07-16 05:07:00 +00:00
Tom Lane	c8076f09d2	Restructure index AM interface for index building and index tuple deletion, per previous discussion on pghackers. Most of the duplicate code in different AMs' ambuild routines has been moved out to a common routine in index.c; this means that all index types now do the right things about inserting recently-dead tuples, etc. (I also removed support for EXTEND INDEX in the ambuild routines, since that's about to go away anyway, and it cluttered the code a lot.) The retail indextuple deletion routines have been replaced by a "bulk delete" routine in which the indexscan is inside the access method. I haven't pushed this change as far as it should go yet, but it should allow considerable simplification of the internal bookkeeping for deletions. Also, add flag columns to pg_am to eliminate various hardcoded tests on AM OIDs, and remove unused pg_am columns. Fix rtree and gist index types to not attempt to store NULLs; before this, gist usually crashed, while rtree managed not to crash but computed wacko bounding boxes for NULL entries (which might have had something to do with the performance problems we've heard about occasionally). Add AtEOXact routines to hash, rtree, and gist, all of which have static state that needs to be reset after an error. We discovered this need long ago for btree, but missed the other guys. Oh, one more thing: concurrent VACUUM is now the default.	2001-07-15 22:48:19 +00:00
Tom Lane	4d58a7ca87	Optimizer can now estimate selectivity of IS NULL, IS NOT NULL, IS TRUE, etc, with some degree of verisimilitude. Split out selectivity support functions from builtins.h into a new header file selfuncs.h, so as to reduce the number of header files builtins.h must depend on. Fix a few missing inclusions exposed thereby. From Joe Conway, with some kibitzing from Tom Lane.	2001-06-25 21:11:45 +00:00
Tom Lane	116d2bba7e	Add IS UNKNOWN, IS NOT UNKNOWN boolean tests, fix the existing boolean tests to return the correct results per SQL9x when given NULL inputs. Reimplement these tests as well as IS [NOT] NULL to have their own expression node types, instead of depending on special functions. From Joe Conway, with a little help from Tom Lane.	2001-06-19 22:39:12 +00:00
Tom Lane	1f1ca182be	Make inet/cidr << and <<= operators indexable. From Alex Pilosov <alex@pilosoft.com>.	2001-06-17 02:05:20 +00:00
Tom Lane	01a819abe3	Make planner compute the number of hash buckets the same way that nodeHash.c will compute it (by sharing code).	2001-06-11 00:17:08 +00:00
Tom Lane	a8fe109ac1	Fix thinko in hash cost estimation: average frequency should be computed from total number of distinct values in whole relation, not # distinct values we expect to have after restriction clauses are applied.	2001-06-10 02:59:35 +00:00
Tom Lane	cdd230d628	Improve planning of OR indexscan plans: for quals like WHERE (a = 1 or a = 2) and b = 42 and an index on (a,b), include the clause b = 42 in the indexquals generated for each arm of the OR clause. Essentially this is an index- driven conversion from CNF to DNF. Implementation is a bit klugy, but better than not exploiting the extra quals at all ...	2001-06-05 17:13:52 +00:00
Tom Lane	7c579fa12d	Further work on making use of new statistics in planner. Adjust APIs of costsize.c routines to pass Query root, so that costsize can figure more things out by itself and not be so dependent on its callers to tell it everything it needs to know. Use selectivity of hash or merge clause to estimate number of tuples processed internally in these joins (this is more useful than it would've been before, since eqjoinsel is somewhat more accurate than before).	2001-06-05 05:26:05 +00:00
Peter Eisentraut	12c1552066	Mark many strings in backend not covered by elog for translation. Also, make strings in xlog.c look more like English and less like binary noise.	2001-06-03 14:53:56 +00:00
Tom Lane	be03eb25f3	Modify optimizer data structures so that IndexOptInfo lists built for create_index_paths are not immediately discarded, but are available for subsequent planner work. This allows avoiding redundant syscache lookups in several places. Change interface to operator selectivity estimation procedures to allow faster and more flexible estimation. Initdb forced due to change of pg_proc entries for selectivity functions!	2001-05-20 20:28:20 +00:00
Tom Lane	248182560c	Current implementation of FOR UPDATE has no hope of working correctly for relations on the nullable side of an OUTER JOIN. For now I think we'd better refuse such queries.	2001-05-14 20:25:00 +00:00
Tom Lane	c23bc6fbb0	First cut at making indexscan cost estimates depend on correlation between index order and table order.	2001-05-09 23:13:37 +00:00
Tom Lane	6cda3ad8fe	Cause planner to make use of average-column-width statistic that is now collected by ANALYZE. Also, add some modest amount of intelligence to guesses that are used for varlena columns in the absence of any ANALYZE statistics. The 'width' reported by EXPLAIN is finally something less than totally bogus for varlena columns ... and, in consequence, hashjoin estimating should be a little better ...	2001-05-09 00:35:09 +00:00
Bruce Momjian	857abb0e57	Add newlines around debug output in optimizer showing total costs.	2001-05-08 17:25:28 +00:00
Tom Lane	f905d65ee3	Rewrite of planner statistics-gathering code. ANALYZE is now available as a separate statement (though it can still be invoked as part of VACUUM, too). pg_statistic redesigned to be more flexible about what statistics are stored. ANALYZE now collects a list of several of the most common values, not just one, plus a histogram (not just the min and max values). Random sampling is used to make the process reasonably fast even on very large tables. The number of values and histogram bins collected is now user-settable via an ALTER TABLE command. There is more still to do; the new stats are not being used everywhere they could be in the planner. But the remaining changes for this project should be localized, and the behavior is already better than before. A not-very-related change is that sorting now makes use of btree comparison routines if it can find one, rather than invoking '<' twice.	2001-05-07 00:43:27 +00:00
Tom Lane	e2004dfc69	Suppress pull-up of subqueries that are in the nullable side of an outer join. This is needed to avoid improper evaluation of expressions that should be nulled out, as in Victor Wagner's bug report of 4/27/01. Pretty ugly solution, but no time to do anything better for 7.1.1.	2001-04-30 19:24:47 +00:00
Tom Lane	a43f20cb0a	Tweak nestloop costing to weight restart cost of inner path more heavily. Without this, it was making some pretty silly decisions about whether an expensive sub-SELECT should be the inner or outer side of a join...	2001-04-25 22:04:37 +00:00
Tom Lane	d5096af2c4	Make the world safe for passing whole rows of views to functions. This already worked fine for whole rows of tables, but not so well for views...	2001-04-18 20:42:56 +00:00
Tom Lane	cdcaec5c53	Avoid reversing user-given order of WHERE clauses while attaching clauses to specific base or join RelOptInfo nodes during planning. This preserves the more-intuitive behavior of 7.0.* --- if you write an expensive clause (such as a sub-select) last, it should get evaluated last. Someday we ought to try to have some intelligence about the order of evaluation of WHERE clauses, but for now we should not override what the user wrote.	2001-04-16 19:44:10 +00:00
Tom Lane	f9094c44c0	Prevent generation of invalid plans for RIGHT or FULL joins with multiple join clauses. The mergejoin executor wants all the join clauses to appear as merge quals, not as extra joinquals, for these kinds of joins. But the planner would consider plans in which partially-sorted input paths were used, leading to only some of the join clauses becoming merge quals. This is fine for inner/left joins, not fine for right/full joins.	2001-04-15 00:48:17 +00:00
Tom Lane	2ef99ee708	Planner wasn't correctly handling adjustment of tuple_fraction for the case of LIMIT in a sub-select.	2001-04-01 22:37:19 +00:00
Tom Lane	f155cc82ec	Quick hack to fix Oliver Elphick's problem with subselects in an inheritance query: make duplicate copies of subplans in adjust_inherited_attrs. When we redesign querytrees we really gotta do something about this issue of whether querytrees are read-only and can share substructure or not.	2001-03-27 18:02:19 +00:00
Tom Lane	fa0f2c6577	Repair pgindent damage to comments.	2001-03-27 17:12:34 +00:00
Bruce Momjian	7cf952e7b4	Fix comments that were mis-wrapped, for Tom Lane.	2001-03-23 04:49:58 +00:00
Bruce Momjian	0686d49da0	Remove dashes in comments that don't need them, rewrap with pgindent.	2001-03-22 06:16:21 +00:00
Bruce Momjian	9e1552607a	pgindent run. Make it all clean.	2001-03-22 04:01:46 +00:00
Tom Lane	d73e9df087	A subplan invoked within an aggregate function's argument should be allowed to receive ungrouped variables of the current query level. Curious that no one reported this bug before...	2001-03-08 01:49:01 +00:00
Tom Lane	13cc7eb3e2	Clean up two rather nasty bugs in operator selection code. 1. If there is exactly one pg_operator entry of the right name and oprkind, oper() and related routines would return that entry whether its input type had anything to do with the request or not. This is just premature optimization: we shouldn't return the single candidate until after we verify that it really is a valid candidate, ie, is at least coercion-compatible with the given types. 2. oper() and related routines only promise a coercion-compatible result. Unfortunately, there were quite a few callers that assumed the returned operator is binary-compatible with the given datatype; they would proceed to call it without making any datatype coercions. These callers include sorting, grouping, aggregation, and VACUUM ANALYZE. In general I think it is appropriate for these callers to require an exact or binary-compatible match, so I've added a new routine compatible_oper() that only succeeds if it can find an operator that doesn't require any run-time conversions. Callers now call oper() or compatible_oper() depending on whether they are prepared to deal with type conversion or not. The upshot of these bugs is revealed by the following silliness in PL/Tcl's selftest: it creates an operator @< on int4, and then tries to use it to sort a char(N) column. The system would let it do that :-( (and evidently has done so since 6.3 :-( :-(). The result in this case was just a silly sort order, but the reverse combination would've provoked coredump from trying to dereference integers. With this fix you get more reasonable behavior: pltcl_test=# select * from T_pkey1 order by key1, key2 using @<; ERROR: Unable to identify an operator '@<' for types 'bpchar' and 'bpchar' You will have to retype this query using an explicit cast	2001-02-16 03:16:58 +00:00
Tom Lane	b29f68f611	Take OUTER JOIN semantics into account when estimating the size of join relations. It's not very bright, but at least it now knows that A LEFT JOIN B must produce at least as many rows as are in A ...	2001-02-16 00:03:08 +00:00
Tom Lane	83b4ab53ad	Update a couple of obsolete comments.	2001-02-15 17:46:40 +00:00
Bruce Momjian	d8c4cb740c	Cleanup	2001-02-12 18:46:40 +00:00
Bruce Momjian	281b7d84fc	Add // -> /* */ mapping to pgindent.	2001-02-12 18:30:53 +00:00
Tom Lane	503f042cd7	Fix inappropriate attempt to push down qual clauses into a view that has UNION/INTERSECT/EXCEPT operations. Per bug report from Ferrier.	2001-02-03 21:17:52 +00:00
Tom Lane	f44639e1bf	Don't crash if subquery appears multiple times in jointree. This should not happen anyway, but let's try not to get completely confused if it does (due to rewriter bugs or whatever).	2001-01-27 04:42:32 +00:00
Bruce Momjian	623bf843d2	Change Copyright from PostgreSQL, Inc to PostgreSQL Global Development Group.	2001-01-24 19:43:33 +00:00
Tom Lane	b06fbc7ad2	Fix performance issue with qualifications on VIEWs: outer query should try to push restrictions on the view down into the view subquery, so that they can become indexscan quals or what-have-you rather than being applied at the top level of the subquery. 7.0 and before were able to do this, though in a much klugier way, and I'd hate to have anyone complaining that 7.1 is stupider than 7.0 ...	2001-01-18 07:12:37 +00:00
Bruce Momjian	5088f0748a	Change lcons(x, NIL) to makeList(x) where appropriate.	2001-01-17 17:26:45 +00:00
Bruce Momjian	26e0321191	Move structure comments from the top block down to the line entries for this file to match all the other files, and to be clearer.	2001-01-17 06:41:31 +00:00
Tom Lane	07c741e61c	Fix oversight in planning of GROUP queries: when an expression is used as both a GROUP BY item and an output expression, the top-level Group node should just copy up the evaluated expression value from its input, rather than re-evaluating the expression. Aside from any performance benefit this might offer, this avoids a crash when there is a sub-SELECT in said expression.	2001-01-09 03:48:51 +00:00
Tom Lane	7df721af0e	Compute reasonable cost and output-row-count estimates for LIMIT plan nodes.	2000-12-23 18:49:41 +00:00
Tom Lane	97cfb9d606	Make sure make_rels_by_clause_joins doesn't return multiple references to same joinrel. Although make_rels_by_joins doesn't mind, GEQO has an Assert that doesn't like this.	2000-12-18 06:50:51 +00:00
Tom Lane	ea166f1146	Planner speedup hacking. Avoid saving useless pathkeys, so that path comparison does not consider paths different when they differ only in uninteresting aspects of sort order. (We had a special case of this consideration for indexscans already, but generalize it to apply to ordered join paths too.) Be stricter about what is a canonical pathkey to allow faster pathkey comparison. Cache canonical pathkeys and dispersion stats for left and right sides of a RestrictInfo's clause, to avoid repeated computation. Total speedup will depend on number of tables in a query, but I see about 4x speedup of planning phase for a sample seven-table query.	2000-12-14 22:30:45 +00:00
Tom Lane	17b843d677	Cache eval cost of qualification expressions in RestrictInfo nodes to avoid repeated evaluations in cost_qual_eval(). This turns out to save a useful fraction of planning time. No change to external representation of RestrictInfo --- although that node type doesn't appear in stored rules anyway.	2000-12-12 23:33:34 +00:00
Tom Lane	73d2a3595a	Clean up handling of FOR UPDATE inside views and subselects ... make it work where we can (given that the executor only handles it at top level) and generate an error where we can't. Note that while the parser has been allowing views to say SELECT FOR UPDATE for a few weeks now, that hasn't actually worked until just now.	2000-12-06 23:55:19 +00:00
Tom Lane	bbea3643a3	Store current LC_COLLATE and LC_CTYPE settings in pg_control during initdb; re-adopt these settings at every postmaster or standalone-backend startup. This should fix problems with indexes becoming corrupt due to failure to provide consistent locale environment for postmaster at all times. Also, refuse to start up a non-locale-enabled compilation in a database originally initdb'd with a non-C locale. Suppress LIKE index optimization if locale is not "C" or "POSIX" (are there any other locales where it's safe?). Issue NOTICE during initdb if selected locale disables LIKE optimization.	2000-11-25 20:33:54 +00:00
Tom Lane	48437f5c3a	Ensure that mergejoin plan will be considered for FULL OUTER JOIN even if enable_mergejoin = OFF. Must do this, because we have no other implementation method for full joins.	2000-11-23 03:57:31 +00:00
Peter Eisentraut	a70e74b060	Put external declarations into header files.	2000-11-21 21:16:06 +00:00
Tom Lane	3030189b69	Fix erroneous handling of parameters at SubqueryScan plan nodes, per bug report from Don Baccus.	2000-11-21 00:17:59 +00:00
Tom Lane	a933ee38bb	Change SearchSysCache coding conventions so that a reference count is maintained for each cache entry. A cache entry will not be freed until the matching ReleaseSysCache call has been executed. This eliminates worries about cache entries getting dropped while still in use. See my posting to pg-hackers of even date for more info.	2000-11-16 22:30:52 +00:00
Tom Lane	6543d81d65	Restructure handling of inheritance queries so that they work with outer joins, and clean things up a good deal at the same time. Append plan node no longer hacks on rangetable at runtime --- instead, all child tables are given their own RT entries during planning. Concept of multiple target tables pushed up into execMain, replacing bug-prone implementation within nodeAppend. Planner now supports generating Append plans for inheritance sets either at the top of the plan (the old way) or at the bottom. Expanding at the bottom is appropriate for tables used as sources, since they may appear inside an outer join; but we must still expand at the top when the target of an UPDATE or DELETE is an inheritance set, because we actually need a different targetlist and junkfilter for each target table in that case. Fortunately a target table can't be inside an outer join... Bizarre mutual recursion between union_planner and prepunion.c is gone --- in fact, union_planner doesn't really have much to do with union queries anymore, so I renamed it grouping_planner.	2000-11-12 00:37:02 +00:00
Tom Lane	a1d133990f	Repair some bugs in new union/intersect/except code. Thanks to Kevin O'Gorman for finding these...	2000-11-09 02:46:17 +00:00
Tom Lane	11f7b29054	Allow ORDER BY, LIMIT in sub-selects. Fix most (not all) cases where the grammar did not allow redundant parentheses around sub-selects. Distinguish LIMIT ALL from LIMIT 0; make the latter behave as one would expect.	2000-11-05 00:15:54 +00:00
Tom Lane	2f35b4efdb	Re-implement LIMIT/OFFSET as a plan node type, instead of a hack in ExecutorRun. This allows LIMIT to work in a view. Also, LIMIT in a cursor declaration will behave in a reasonable fashion, whereas before it was overridden by the FETCH count.	2000-10-26 21:38:24 +00:00
Tom Lane	09a8912f73	Ensure clause_selectivity() behaves sanely when examining an uplevel Var or a Var that references a subquery output.	2000-10-25 21:48:12 +00:00
Bruce Momjian	b32685a999	Add proofreader's changes to docs. Fix misspelling of disbursion to dispersion.	2000-10-05 19:48:34 +00:00
Tom Lane	05e3d0ee86	Reimplementation of UNION/INTERSECT/EXCEPT. INTERSECT/EXCEPT now meet the SQL92 semantics, including support for ALL option. All three can be used in subqueries and views. DISTINCT and ORDER BY work now in views, too. This rewrite fixes many problems with cross-datatype UNIONs and INSERT/SELECT where the SELECT yields different datatypes than the INSERT needs. I did that by making UNION subqueries and SELECT in INSERT be treated like subselects-in-FROM, thereby allowing an extra level of targetlist where the datatype conversions can be inserted safely. INITDB NEEDED!	2000-10-05 19:11:39 +00:00
Tom Lane	3a94e789f5	Subselects in FROM clause, per ISO syntax: FROM (SELECT ...) [AS] alias. (Don't forget that an alias is required.) Views reimplemented as expanding to subselect-in-FROM. Grouping, aggregates, DISTINCT in views actually work now (he says optimistically). No UNION support in subselects/views yet, but I have some ideas about that. Rule-related permissions checking moved out of rewriter and into executor. INITDB REQUIRED!	2000-09-29 18:21:41 +00:00
Tom Lane	8bdc2bf030	Use variable aliases, if supplied, rather than real column names in complaints about ungrouped variables. This is for consistency with behavior elsewhere, notably the fact that the relname is reported as an alias in these same complaints. Also, it'll work with subselect- in-FROM where old code didn't.	2000-09-25 18:14:55 +00:00
Tom Lane	164caa3951	System neglected to complain about ungrouped variables passed to sublinks when outer query contained aggregates but no GROUP clause.	2000-09-25 18:09:28 +00:00
Tom Lane	ba2ea6e0f5	Fix GEQO optimizer to work correctly with new outer-join-capable query representation. Note that GEQO_RELS setting is now interpreted as the number of top-level items in the FROM list, not necessarily the number of relations in the query. This seems appropriate since we are only doing join-path searching over the top-level items.	2000-09-19 18:42:34 +00:00
Tom Lane	8ae9ad1cb8	Reimplement LIKE/ESCAPE as operators so that indexscan optimization can still work, per recent discussion on pghackers. Correct some bugs in ILIKE implementation.	2000-09-15 18:45:31 +00:00
Tom Lane	ed5003c584	First cut at full support for OUTER JOINs. There are still a few loose ends to clean up (see my message of same date to pghackers), but mostly it works. INITDB REQUIRED!	2000-09-12 21:07:18 +00:00
Peter Eisentraut	424f0edcb8	Fix relative path references so that make knowns which dependencies refer to one another. Sort out builddir vs srcdir variable namings. Remove some now obsoleted make variables.	2000-08-31 16:12:35 +00:00
Tom Lane	782c16c6a1	SQL-language functions are now callable in ordinary fmgr contexts ... for example, an SQL function can be used in a functional index. (I make no promises about speed, but it'll work ;-).) Clean up and simplify handling of functions returning sets.	2000-08-24 03:29:15 +00:00
Tom Lane	7893462e44	Move pg_checkretval out of the planner (where it never belonged) into pg_proc.c (where it's actually used). Fix it to correctly handle tlists that contain resjunk target items, and improve error messages. This addresses bug reported by Krupnikov 6-July-00.	2000-08-21 20:55:31 +00:00
Tom Lane	e67ff6b670	fmgr interface mopup work. Use new DatumGetBool and BoolGetDatum macros where appropriate (the code used to have several different ways of doing that, including Int32, Int8, UInt8, ...). Remove last few references to float32 and float64 typedefs --- it's all float4/float8 now. The typedefs themselves should probably stay in c.h for a release or two, though, to avoid breaking user-written C functions.	2000-08-21 17:22:36 +00:00
Tom Lane	37168b8da4	Clean up handling of variable-free qual clauses. System now does the right thing with variable-free clauses that contain noncachable functions, such as 'WHERE random() < 0.5' --- these are evaluated once per potential output tuple. Expressions that contain only Params are now candidates to be indexscan quals --- for example, 'var = ($1 + 1)' can now be indexed. Cope with RelabelType nodes atop potential indexscan variables --- this oversight prevents 7.0.* from recognizing some potentially indexscanable situations.	2000-08-13 02:50:35 +00:00
Tom Lane	62e29fe2e7	Remove 'func_tlist' from Func expression nodes, likewise 'param_tlist' from Param nodes, per discussion a few days ago on pghackers. Add new expression node type FieldSelect that implements the functionality where it's actually needed. Clean up some other unused fields in Func nodes as well. NOTE: initdb forced due to change in stored expression trees for rules.	2000-08-08 15:43:12 +00:00
Tom Lane	9426047021	Clean up bogosities in use of random(3) and srandom(3) --- do not assume that RAND_MAX applies to them, since it doesn't. Instead add a config.h parameter MAX_RANDOM_VALUE. This is currently set at 2^31-1 but could be auto-configured if that ever proves necessary. Also fix some outright bugs like calling srand() where srandom() is appropriate.	2000-08-07 00:51:42 +00:00
Tom Lane	465a3b0a24	Copy sub-Query nodes to avoid trouble when same sub-Query is linked to multiple times in the parsetree (can happen in COALESCE or BETWEEN contexts, for example). This is a pretty grotty solution --- it will do for now, but perhaps we can do better when we redesign querytrees. What we need is a consistent policy about whether querytrees should be considered read-only structures or not ...	2000-08-06 04:13:22 +00:00
Tom Lane	c298d74d49	More functions updated to new fmgr style --- money, name, tid datatypes. We're reaching the mopup stage here (good thing too, this is getting tedious).	2000-08-03 16:35:08 +00:00
Tom Lane	87cdaf5491	Remove <values.h> inclusions, no-longer-needed MAXINT definitions.	2000-07-28 02:13:52 +00:00
Tom Lane	ff7da2f498	Make planner safe for recursive calls --- needed for cases where eval_const_expressions tries to simplify an SQL function.	2000-07-27 23:16:04 +00:00
Tom Lane	1cffbfcb56	Arrange to free planning memory (or most of it, anyway) at completion of planning. This should reduce memory requirements for large joins.	2000-07-27 04:51:04 +00:00
Tom Lane	90451fe7f3	When dealing with OR-of-ANDs quals, extract multiple subclauses of an AND to use with a multiple-key index. Formerly we would only extract clauses that had to do with the first key of the index, which was correct but didn't exploit the index fully.	2000-07-26 23:46:22 +00:00
Tom Lane	da1ad323b7	Update comments.	2000-07-25 04:30:42 +00:00
Tom Lane	cd9f0ca545	Deduce equality constraints that are implied by transitivity of mergejoinable qual clauses, and add them to the query quals. For example, WHERE a = b AND b = c will cause us to add AND a = c. This is necessary to ensure that it's safe to use these variables as interchangeable sort keys, which is something 7.0 knows how to do. Should provide a useful improvement in planning ability, too.	2000-07-24 03:11:01 +00:00
Tom Lane	a5a12887a1	Make update lists like 'UPDATE tab SET foo[1] = bar, foo[3] = baz' work as expected. THe underlying implementation is essentially 'SET foo = array_set(foo, 1, bar)', so we have to turn the items into nested invocations of array_set() to make it work correctly. Side effect: we now complain about 'UPDATE tab SET foo = bar, foo = baz' which is illegal per SQL92 but we didn't detect it before.	2000-07-22 06:19:04 +00:00
Peter Eisentraut	8a3cbc84ef	Repair parallel make in backend tree (and make it really parallel). Make Gen_fmgrtab.sh reasonably robust against concurrent invocation.	2000-07-13 16:07:14 +00:00
Tom Lane	9191d684a7	Planner did the wrong thing with index-scan-backward plans: generated them, but forgot to attach relevant restriction clauses, so that the plan represented a scan over the whole table with restrictions applied as qpquals not indexquals. Another day, another bug...	2000-07-13 05:47:29 +00:00
Peter Eisentraut	cb292206c5	Remove a bunch of unused configure tests, in particular cases where * the result is not recorded anywhere * the result is not used anywhere * the result is only used in some places, whereas others have been getting away with it * the result is used improperly Also make command line options handling a little better (e.g., --disable-locale, while redundant, should really still disable).	2000-07-12 22:59:15 +00:00
Tom Lane	badce86a2c	First stage of reclaiming memory in executor by resetting short-term memory contexts. Currently, only leaks in expressions executed as quals or projections are handled. Clean up some old dead cruft in executor while at it --- unused fields in state nodes, that sort of thing.	2000-07-12 02:37:39 +00:00
Tom Lane	40f64064ff	Update textin() and textout() to new fmgr style. This is just phase one of updating the whole text datatype, but there are so dang many calls of these two routines that it seems worth a separate commit.	2000-07-05 23:12:09 +00:00
Tom Lane	1aebc3618a	First phase of memory management rewrite (see backend/utils/mmgr/README for details). It doesn't really do that much yet, since there are no short-term memory contexts in the executor, but the infrastructure is in place and long-term contexts are handled reasonably. A few long- standing bugs have been fixed, such as 'VACUUM; anything' in a single query string crashing. Also, out-of-memory is now considered a recoverable ERROR, not FATAL. Eliminate a large amount of crufty, now-dead code in and around memory management. Fix problem with holding off SIGTRAP, SIGSEGV, etc in postmaster and backend startup.	2000-06-28 03:33:33 +00:00
Tom Lane	38db5fab29	Make inheritance planning logic a little simpler and clearer, hopefully even a little faster.	2000-06-20 04:22:21 +00:00
Tom Lane	1ee26b7764	Reimplement nodeMaterial to use a temporary BufFile (or even memory, if the materialized tupleset is small enough) instead of a temporary relation. This was something I was thinking of doing anyway for performance, and Jan says he needs it for TOAST because he doesn't want to cope with toasting noname relations. With this change, the 'noname table' support in heap.c is dead code, and I have accordingly removed it. Also clean up 'noname' plan handling in planner --- nonames are either sort or materialize plans, and it seems less confusing to handle them separately under those names.	2000-06-18 22:44:35 +00:00
Tom Lane	d03a933ec5	Fix performance problems with pg_index lookups (see, for example, discussion of 5/19/00). pg_index is now searched for indexes of a relation using an indexscan. Moreover, this is done once and cached in the relcache entry for the relation, in the form of a list of OIDs for the indexes. This list is used by the parser and executor to drive lookups in the pg_index syscache when they want to know the properties of the indexes. Net result: index information will be fully cached for repetitive operations such as inserts.	2000-06-17 21:49:04 +00:00
Bruce Momjian	df43800fc8	Clean up #include's.	2000-06-15 03:33:12 +00:00
Tom Lane	ce7746201b	Cause inheritance patch to meet minimum coding standards (no gcc warnings).	2000-06-09 03:17:13 +00:00
Bruce Momjian	8c1d09d591	Inheritance overhaul by Chris Bitmead <chris@bitmead.com>	2000-06-09 01:44:34 +00:00
Bruce Momjian	20ad43b576	Mark functions as static and ifdef NOT_USED as appropriate.	2000-06-08 22:38:00 +00:00
Tom Lane	2190cf2926	Repair bug reported by ldm@apartia.com: Append nodes, which don't actually use their targetlist, are given a targetlist that is just a pointer to the first appended plan's targetlist. This is OK, but what is not OK is that any sub-select expressions in said tlist were being entered in the subPlan lists of both the Append and the first appended plan. That led to two startup and two shutdown calls for the same plan node at exec time, which led to crashes. Fix is to not generate a list of subPlans for an Append node. Same problem and fix apply to other node types that don't have a real, functioning targetlist: Material, Sort, Unique, Hash.	2000-06-04 20:50:50 +00:00
Tom Lane	cbf503180f	Tweak recognition of range-clause pairs so that 'var > $1 AND var < $2' (ie, parameters instead of consts) will be treated as a range query. We do not know the actual selectivities involved, but it seems like a good idea to use a smaller estimate than we would use for two unrelated inequalities.	2000-05-31 15:38:53 +00:00
Peter Eisentraut	6a68f42648	The heralded `Grand Unified Configuration scheme' (GUC) That means you can now set your options in either or all of $PGDATA/configuration, some postmaster option (--enable-fsync=off), or set a SET command. The list of options is in backend/utils/misc/guc.c, documentation will be written post haste. pg_options is gone, so is that pq_geqo config file. Also removed were backend -K, -Q, and -T options (no longer applicable, although -d0 does the same as -Q). Added to configure an --enable-syslog option. changed all callers from TPRINTF to elog(DEBUG)	2000-05-31 00:28:42 +00:00
Tom Lane	0f1e39643d	Third round of fmgr updates: eliminate calls using fmgr() and fmgr_faddr() in favor of new-style calls. Lots of cleanup of sloppy casts to use XXXGetDatum and DatumGetXXX ...	2000-05-30 04:25:00 +00:00
Bruce Momjian	a12a23f0d0	Remove unused include files. Do not touch /port or includes used by defines.	2000-05-30 00:49:57 +00:00
Tom Lane	091126fa28	Generated header files parse.h and fmgroids.h are now copied into the src/include tree, so that -I backend is no longer necessary anywhere. Also, clean up some bit rot in contrib tree.	2000-05-29 05:45:56 +00:00
Tom Lane	ab843085f1	Constant-expression simplifier now knows how to simplify strict functions that have at least one constant-NULL input, even if other inputs are not constants.	2000-05-28 20:33:28 +00:00
Tom Lane	0a7fb4e918	First round of changes for new fmgr interface. fmgr itself and the key call sites are changed, but most called functions are still oldstyle. An exception is that the PL managers are updated (so, for example, NULL handling now behaves as expected in plperl and plpgsql functions). NOTE initdb is forced due to added column in pg_proc.	2000-05-28 17:56:29 +00:00
Tom Lane	1c5b902018	Fix problem in which sloppily-coded test in ExecInitIndexScan would think that both sides of indexqual look like index keys. An example is create table inside (f1 float8 primary key); create table outside (g1 float8, g2 float8); select * from inside,outside where f1 = atan2(g1+1, g2); ERROR: ExecInitIndexScan: both left and right ops are rel-vars (note that failure is potentially platform-dependent). Solution is a cleanup I had had in mind to make anyway: functional index keys should be represented as Var nodes in the fixed indexqual, just like regular index keys.	2000-05-23 16:56:37 +00:00
Tom Lane	d6eac08f11	Repair problem noted by Elphick: make_rels_by_joins failed to handle cases where joinclauses were present but some joins have to be made by cartesian-product join anyway. An example is SELECT * FROM a,b,c WHERE (a.f1 + b.f2 + c.f3) = 0; Even though all the rels have joinclauses, we must join two of them in cartesian style before we can use the join clause...	2000-04-27 18:35:04 +00:00
Tom Lane	32e192d712	Repair coredump seen when a view refers to an inheritance group (SELECT FROM table*). Cause was reference to 'eref' field of an RTE, which is null in an RTE loaded from a stored rule parsetree. There wasn't any good reason to be touching the refname anyway...	2000-04-18 05:52:35 +00:00
Tom Lane	25442d8d2f	Correct oversight in hashjoin cost estimation: nodeHash sizes its hash table for an average of NTUP_PER_BUCKET tuples/bucket, but cost_hashjoin was assuming a target load of one tuple/bucket. This was causing a noticeable underestimate of hashjoin costs.	2000-04-18 05:43:02 +00:00
Tom Lane	82849df6c6	Add new selectivity estimation functions for pattern-matching operators (LIKE and regexp matches). These are not yet referenced in pg_operator, so by default the system will continue to use eqsel/neqsel. Also, tweak convert_to_scalar() logic so that common prefixes of strings are stripped off, allowing better accuracy when all strings in a table share a common prefix.	2000-04-16 04:41:03 +00:00
Tom Lane	8064a49f6f	get_relattval() should treat a NULL constant as a non-constant expression, since it has no way to indicate to its caller that the constant is actually NULL. This prevents coredump in cases like WHERE textfield < null::text;	2000-04-16 01:55:45 +00:00
Tom Lane	9d91db4fde	Repair bug reported by Wickstrom: backend would crash if WHERE clause contained a sub-SELECT nested within an AND/OR tree that cnfify() thought it should rearrange. Same physical sub-SELECT node could end up linked into multiple places in resulting expression tree. This is harmless for most node types, but not for SubLink. Repair bug by making physical copies of subexpressions that get logically duplicated by cnfify(). Also, tweak the heuristic that decides whether it's a good idea to do cnfify() --- we don't really want that to happen when it would cause multiple copies of a subselect to be generated, I think.	2000-04-14 00:19:17 +00:00
Bruce Momjian	52f77df613	Ye-old pgindent run. Same 4-space tabs.	2000-04-12 17:17:23 +00:00
Tom Lane	9c38a8d296	Further tweaking of indexscan cost estimates.	2000-04-09 04:31:37 +00:00
Tom Lane	1c72a8a37a	Fix extremely nasty little bug observed when a sub-SELECT appears in WHERE in a place where it can be part of a nestloop inner indexqual. As the code stood, it put the same physical sub-Plan node into both indxqual and indxqualorig of the IndexScan plan node. That confused later processing in the optimizer (which expected that tracing the subPlan list would visit each subplan node exactly once), and would probably have blown up in the executor if the planner hadn't choked first. Fix by making the 'fixed' indexqual be a complete deep copy of the original indexqual, rather than trying to share nodes below the topmost operator node. This had further ramifications though, because we were making the aforesaid list of sub-Plan nodes during SS_process_sublinks which is run before construction of the 'fixed' indexqual, meaning that the copy of the sub-Plan didn't show up in that list. Fix by rearranging logic so that the sub-Plan list is built by the final set_plan_references pass, not in SS_process_sublinks. This may sound like a mess, but it's actually a good deal cleaner now than it was before, because we are no longer dependent on the assumption that planning will never make a copy of a sub-Plan node.	2000-04-04 01:21:48 +00:00
Tom Lane	e55985d3be	Tweak indexscan cost estimation: round estimated # of tuples visited up to next integer. Previously, if selectivity was small, we could compute very tiny scan cost on the basis of estimating that only 0.001 tuple would be fetched, which is silly. This naturally led to some rather silly plans...	2000-03-30 00:53:30 +00:00
Tom Lane	8cbeb5f131	Save a few cycles in simple cases: no need to call cost_sort() when there is no presorted path to compare with.	2000-03-24 21:40:43 +00:00
Tom Lane	7177bbac29	A little further tweaking of the range-query selectivity logic: to avoid undue sensitivity to roundoff error, believe that a zero or slightly negative range estimate should represent a small positive selectivity, rather than falling back on a generic default estimate.	2000-03-23 23:35:47 +00:00
Tom Lane	1afaa2557a	If we cannot get a real estimate for the selectivity of a range query, use a default value that's fairly small. We were generating a result of about 0.1, but I think 0.01 is probably better --- want to encourage use of an indexscan in this situation.	2000-03-23 00:58:36 +00:00
Tom Lane	1d5e7a6f46	Repair logic flaw in cost estimator: cost_nestloop() was estimating CPU costs using the inner path's parent->rows count as the number of tuples processed per inner scan iteration. This is wrong when we are using an inner indexscan with indexquals based on join clauses, because the rows count in a Relation node reflects the selectivity of the restriction clauses for that rel only. Upshot was that if join clause was very selective, we'd drastically overestimate the true cost of the join. Fix is to calculate correct output-rows estimate for an inner indexscan when the IndexPath node is created and save it in the path node. Change of path node doesn't require initdb, since path nodes don't appear in saved rules.	2000-03-22 22:08:35 +00:00
Tom Lane	3ee8f7e207	Restructure planning code so that preprocessing of targetlist and quals to simplify constant expressions and expand SubLink nodes into SubPlans is done in a separate routine subquery_planner() that calls union_planner(). We formerly did most of this work in query_planner(), but that's the wrong place because it may never see the real targetlist. Splitting union_planner into two routines also allows us to avoid redundant work when union_planner is invoked recursively for UNION and inheritance cases. Upshot is that it is now possible to do something like select float8(count()) / (select count() from int4_tbl) from int4_tbl group by f1; which has never worked before.	2000-03-21 05:12:12 +00:00
Tom Lane	d6429e552d	Minor code rearrangement & doc improvement in eval_const_expressions().	2000-03-19 18:20:38 +00:00
Tom Lane	341b328b18	Fix a bunch of minor portability problems and maybe-bugs revealed by running gcc and HP's cc with warnings cranked way up. Signed vs unsigned comparisons, routines declared static and then defined not-static, that kind of thing. Tedious, but perhaps useful...	2000-03-17 02:36:41 +00:00
Thomas G. Lockhart	6456810078	Implement column aliases on views "CREATE VIEW name (collist)". Implement TIME WITH TIME ZONE type (timetz internal type). Remap length() for character strings to CHAR_LENGTH() for SQL92 and to remove the ambiguity with geometric length() functions. Keep length() for character strings for backward compatibility. Shrink stored views by removing internal column name list from visible rte. Implement min(), max() for time and timetz data types. Implement conversion of TIME to INTERVAL. Implement abs(), mod(), fac() for the int8 data type. Rename some math functions to generic names: round(), sqrt(), cbrt(), pow(), etc. Rename NUMERIC power() function to pow(). Fix int2 factorial to calculate result in int4. Enhance the Oracle compatibility function translate() to work with string arguments (from Edwin Ramirez). Modify pg_proc system table to remove OID holes.	2000-03-14 23:06:59 +00:00
Tom Lane	6217a8c7ba	Fix some bogosities in the code that deals with estimating the fraction of tuples we are going to retrieve from a sub-SELECT. Must have been half asleep when I did this code the first time :-(	2000-03-14 02:23:15 +00:00
Tom Lane	1879175b18	Fix performance bug in constant-expression simplifier. After finding that the inputs to a given operator can be recursively simplified to constants, it was evaluating the operator using the op's original (unsimplified) arg list, so that any subexpressions had to be evaluated again. A constant subexpression at depth N got evaluated N times. Probably not very important in practical situations, but it made us look real slow in MySQL's 'crashme' test...	2000-03-12 19:32:06 +00:00
Tom Lane	e8be8ffaf0	Further tweaking of logic that decides when to materialize an uncorrelated subplan: do it if subplan has subplans itself, and always do it if the subplan is an indexscan. (I originally set it to materialize an indexscan only if the indexqual is fairly selective, but I dunno what I was thinking ... an unselective indexscan is still expensive ...)	2000-03-11 23:53:41 +00:00
Hiroshi Inoue	fd9ff86bd9	Trial implementation of ALTER DROP COLUMN. They are #ifdef'd. Add -D_DROP_COLUMN_HACK__ compile option to evaluate it.	2000-03-09 05:00:26 +00:00
Tom Lane	0eb5ab8250	Apply a MATERIAL node to the result of an uncorrelated subplan, if it looks like it will save computation to do so.	2000-03-02 04:08:16 +00:00
Tom Lane	84ccfdf087	Avoid a little bit of unnecessary computation in canonicalize_qual.	2000-02-27 19:45:44 +00:00
Tom Lane	be05edd812	Tweak planner to use OFFSET+LIMIT, not just LIMIT, as estimate of the portion of the query result that will be retrieved. As far as I could tell, the consensus was that we should let the planner do the best it can with a LIMIT query, and require the user to add ORDER BY if he wants consistent results from different LIMIT values.	2000-02-21 01:13:04 +00:00
Tom Lane	57b30e8e22	Create a new expression node type RelabelType, which exists solely to represent the result of a binary-compatible type coercion. At runtime it just evaluates its argument --- but during type resolution, exprType will pick up the output type of the RelabelType node instead of the type of the argument. This solves some longstanding problems with dropped type coercions, an example being 'select now()::abstime::int4' which used to produce date-formatted output, not an integer, because the coercion to int4 was dropped on the floor.	2000-02-20 21:32:16 +00:00
Tom Lane	3cbcb78a3d	Plug some more memory leaks in the planner. It still leaks like a sieve, but this is as good as it'll get for this release...	2000-02-18 23:47:31 +00:00
Hiroshi Inoue	e3a97b370c	Implement reindex command	2000-02-18 09:30:20 +00:00
Tom Lane	598ea2c359	Finish repairing 6.5's problems with r-tree indexes: create appropriate selectivity functions and make the r-tree operators use them. The estimation functions themselves are just stubs, unfortunately, but perhaps someday someone will make them compute realistic estimates. Change pg_am so that the optimizer can reliably tell the difference between ordered and unordered indexes --- before it would think that an r-tree index can be scanned in '<<' order, which is not right AFAIK. Repair broken negator links for network_sup and related ops. Initdb forced. This might be my last initdb force for 7.0 ... hope so anyway ...	2000-02-17 03:40:02 +00:00
Tom Lane	47dde30222	Remove long-dead code.	2000-02-15 23:12:26 +00:00
Tom Lane	b1577a7c78	New cost model for planning, incorporating a penalty for random page accesses versus sequential accesses, a (very crude) estimate of the effects of caching on random page accesses, and cost to evaluate WHERE- clause expressions. Export critical parameters for this model as SET variables. Also, create SET variables for the planner's enable flags (enable_seqscan, enable_indexscan, etc) so that these can be controlled more conveniently than via PGOPTIONS. Planner now estimates both startup cost (cost before retrieving first tuple) and total cost of each path, so it can optimize queries with LIMIT on a reasonable basis by interpolating between these costs. Same facility is a win for EXISTS(...) subqueries and some other cases. Redesign pathkey representation to achieve a major speedup in planning (I saw as much as 5X on a 10-way join); also minor changes in planner to reduce memory consumption by recycling discarded Path nodes and not constructing unnecessary lists. Minor cleanups to display more-plausible costs in some cases in EXPLAIN output. Initdb forced by change in interface to index cost estimation functions.	2000-02-15 20:49:31 +00:00
Thomas G. Lockhart	a344a6e7b5	Carry column aliases from the parser frontend. Enables queries like SELECT a FROM t1 tx (a); Allow join syntax, including queries like SELECT * FROM t1 NATURAL JOIN t2; Update RTE structure to hold column aliases in an Attr structure.	2000-02-15 03:38:29 +00:00
Tom Lane	d8733ce674	Repair planning bugs caused by my misguided removal of restrictinfo link fields in JoinPaths --- turns out that we do need that after all :-(. Also, rearrange planner so that only one RelOptInfo is created for a particular set of joined base relations, no matter how many different subsets of relations it can be created from. This saves memory and processing time compared to the old method of making a bunch of RelOptInfos and then removing the duplicates. Clean up the jointree iteration logic; not sure if it's better, but I sure find it more readable and plausible now, particularly for the case of 'bushy plans'.	2000-02-07 04:41:04 +00:00
Tom Lane	81fc1d5edb	Rename same() to sameseti() to have a slightly less generic name. Move nonoverlap_sets() and is_subset() to list.c, where they should have lived to begin with, and rename to nonoverlap_setsi and is_subseti since they only work on integer lists.	2000-02-06 03:27:35 +00:00
Tom Lane	78296c2797	Further cleanup for OR-of-AND WHERE-clauses. orindxpath can now handle extracting from an AND subclause just those opclauses that are relevant for a particular index. For example, we can now consider using an index on x to process WHERE (x = 1 AND y = 2) OR (x = 2 AND y = 4) OR ...	2000-02-05 18:26:09 +00:00
Tom Lane	d24ef0d08f	Make EXPLAIN results for Append, Group, Agg, Unique nodes more plausible. Group and Unique use an arbitrary assumption that there will be about 10% as many groups as input tuples --- perhaps someday we can refine this.	2000-02-03 06:12:19 +00:00
Tom Lane	003dd965d2	Apply the heuristic proposed by Taral (see pgsql-general archives for 2-Oct-98 or TODO.detail/cnfify) to decide whether we want to reduce WHERE clause to CNF form, DNF form, or neither. This is a HUGE win. The heuristic conditions could probably still use a little tweaking to make sure we don't pick CNF when DNF would be better, or vice versa, but the risk of exponential explosion in cnfify() is gone. I was able to run ten-thousand-AND-subclause queries through the planner in a reasonable amount of time.	2000-01-28 03:22:36 +00:00
Tom Lane	dd979f66be	Redesign DISTINCT ON as discussed in pgsql-sql 1/25/00: syntax is now SELECT DISTINCT ON (expr [, expr ...]) targetlist ... and there is a check to make sure that the user didn't specify an ORDER BY that's incompatible with the DISTINCT operation. Reimplement nodeUnique and nodeGroup to use the proper datatype-specific equality function for each column being compared --- they used to do bitwise comparisons or convert the data to text strings and strcmp(). (To add insult to injury, they'd look up the conversion functions once for each tuple...) Parse/plan representation of DISTINCT is now a list of SortClause nodes. initdb forced by querytree change...	2000-01-27 18:11:50 +00:00
Bruce Momjian	5c25d60244	Add: * Portions Copyright (c) 1996-2000, PostgreSQL, Inc to all files copyright Regents of Berkeley. Man, that's a lot of files.	2000-01-26 05:58:53 +00:00
Tom Lane	0dbffa704a	First cut at making useful selectivity estimates for range queries (ie, WHERE x > lowbound AND x < highbound). It's not very bright yet but it does something useful. Also, rename intltsel/intgtsel to scalarltsel/scalargtsel to reflect usage better. Extend convert_to_scalar to do something a little bit useful with string data types. Still need to make it do something with date/time datatypes, but I'll wait for Thomas's datetime unification dust to settle first. Eventually the routine ought not have any type-specific knowledge at all; it ought to be calling a type-dependent routine found via a pg_type column; but that's a task for another day.	2000-01-24 07:16:52 +00:00
Tom Lane	8449df8a67	First cut at unifying regular selectivity estimation with indexscan selectivity estimation wasn't right. This is better...	2000-01-23 02:07:00 +00:00
Tom Lane	71ed7eb494	Revise handling of index-type-specific indexscan cost estimation, per pghackers discussion of 5-Jan-2000. The amopselect and amopnpages estimators are gone, and in their place is a per-AM amcostestimate procedure (linked to from pg_am, not pg_amop).	2000-01-22 23:50:30 +00:00
Peter Eisentraut	1cd4c14116	Fixed all elog related warnings, as well as a few others.	2000-01-15 02:59:43 +00:00
Tom Lane	421d4f9bd7	Put back erroneously removed zeroing of sentinel elements in indexkeys, classlist arrays.	2000-01-12 00:53:21 +00:00
Bruce Momjian	bd52f4bffd	More cleanups. Still doesn't work.	2000-01-11 03:33:14 +00:00
Tom Lane	166b5c1def	Another round of planner/optimizer work. This is just restructuring and code cleanup; no major improvements yet. However, EXPLAIN does produce more intuitive outputs for nested loops with indexscans now...	2000-01-09 00:26:47 +00:00
Tom Lane	d8f3752133	Generate double-sided LIKE indexquals that work even in weird locales, by continuing to increment the rightmost character until we get a string that is demonstrably greater than the pattern prefix.	1999-12-31 05:38:25 +00:00
Tom Lane	5f68d5c38f	Clean up loose end in LIKE optimization fix: parser's code would generate <= and >= indexquals from a LIKE even if the index in question didn't support those operators. (As, for example, a hash index does not.)	1999-12-31 03:41:03 +00:00
Tom Lane	7431796b46	fix_parsetree_attnums was not nearly smart enough about walking parse trees. Also rewrite find_all_inheritors() in a more intelligible style.	1999-12-14 03:35:28 +00:00
Bruce Momjian	a82f9ffde6	New LDOUT makefile variable for QNX os.	1999-12-13 22:35:27 +00:00
Tom Lane	a8ae19ec3d	aggregate(DISTINCT ...) works, per SQL spec. Note this forces initdb because of change of Aggref node in stored rules.	1999-12-13 01:27:21 +00:00
Bruce Momjian	3ffd3d82db	Make LD -r as macros that can be changed for QNX.	1999-12-09 19:15:45 +00:00
Tom Lane	f7f41c7c8c	Replace generic 'Illegal use of aggregates' error message with one that shows the specific ungrouped variable being complained of. Perhaps this will reduce user confusion...	1999-12-09 05:58:56 +00:00
Bruce Momjian	6f9ff92cc0	Tid access method feature from Hiroshi Inoue, Inoue@tpf.co.jp	1999-11-23 20:07:06 +00:00
Bruce Momjian	fc955b14ea	Add system indexes to match all caches. Make all system indexes unique. Make all cache loads use system indexes. Rename rel to relid in inheritance tables. Rename cache names to be clearer.	1999-11-22 17:56:41 +00:00
Tom Lane	610dfa6d55	Combine index_info and find_secondary_indexes into a single routine that returns a list of RelOptInfos, eliminating the need for static state in index_info. That static state was a direct cause of coredumps; if anything decided to elog(ERROR) partway through an index_info search of pg_index, the next query would try to close a scan pointer that was pointing at no-longer-valid memory. Another example of the reasons to avoid static state variables...	1999-11-21 23:25:47 +00:00
Tom Lane	f68e11f373	Implement subselects in target lists. Also, relax requirement that subselects can only appear on the righthand side of a binary operator. That's still true for quantified predicates like x = ANY (SELECT ...), but a subselect that delivers a single result can now appear anywhere in an expression. This is implemented by changing EXPR_SUBLINK sublinks to represent just the (SELECT ...) expression, without any 'left hand side' or combining operator --- so they're now more like EXISTS_SUBLINK. To handle the case of '(x, y, z) = (SELECT ...)', I added a new sublink type MULTIEXPR_SUBLINK, which acts just like EXPR_SUBLINK used to. But the grammar will only generate one for a multiple-left-hand-side row expression.	1999-11-15 02:00:15 +00:00
Bruce Momjian	86ef36c907	New NameStr macro to convert Name to Str. No need for var.data anymore. Fewer calls to nameout. Better use of RelationGetRelationName.	1999-11-07 23:08:36 +00:00
Tom Lane	57ea208477	Skip invoking set_uppernode_references() for a RESULT node that has no subplan --- saves a material amount of time for a simple INSERT ... VALUES query.	1999-10-30 23:07:55 +00:00
Tom Lane	e2a29eb52c	Rewrite preprocess_targetlist() to reduce overhead for simple INSERTs. In particular, don't bother to look up type information for attributes where we're not actually going to use it, and avoid copying entire tlist structure when it's not necessary.	1999-10-30 23:06:32 +00:00
Tom Lane	3eb1c82277	Fix planner and rewriter to follow SQL semantics for tables that are mentioned in FROM but not elsewhere in the query: such tables should be joined over anyway. Aside from being more standards-compliant, this allows removal of some very ugly hacks for COUNT(*) processing. Also, allow HAVING clause without aggregate functions, since SQL does. Clean up CREATE RULE statement-list syntax the same way Bruce just fixed the main stmtmulti production. CAUTION: addition of a field to RangeTblEntry nodes breaks stored rules; you will have to initdb if you have any rules.	1999-10-07 04:23:24 +00:00
Tom Lane	fc43696d1a	Fix make_clause and make_opclause to record valid type info in the Expr nodes they produce. This fixes a few cases of errors like 'typeidTypeRelid: Invalid type - oid = 0' caused by calling parser-related routines on expression trees that have already been processed by planner- related routines.	1999-10-02 04:37:52 +00:00
Tom Lane	40f6524161	Implement constant-expression simplification per Bernard Frankpitt, plus some improvements from yours truly. The simplifier depends on the proiscachable field of pg_proc to tell it whether a function is safe to pre-evaluate --- things like nextval() are not, for example. Update pg_proc.h to contain reasonable cacheability information; as of 6.5.* hardly any functions were marked cacheable. I may have erred too far in the other direction; see recent mail to pghackers for more info. This update does not force an initdb, exactly, but you won't see much benefit from the simplifier until you do one.	1999-09-26 02:28:44 +00:00
Bruce Momjian	ad604ac372	values.h patch from Alex Howansky	1999-09-21 20:58:25 +00:00
Tom Lane	bd272cace6	Mega-commit to make heap_open/heap_openr/heap_close take an additional argument specifying the kind of lock to acquire/release (or 'NoLock' to do no lock processing). Ensure that all relations are locked with some appropriate lock level before being examined --- this ensures that relevant shared-inval messages have been processed and should prevent problems caused by concurrent VACUUM. Fix several bugs having to do with mismatched increment/decrement of relation ref count and mismatched heap_open/close (which amounts to the same thing). A bogus ref count on a relation doesn't matter much unless a SI Inval message happens to arrive at the wrong time, which is probably why we got away with this sloppiness for so long. Repair missing grab of AccessExclusiveLock in DROP TABLE, ALTER/RENAME TABLE, etc, as noted by Hiroshi. Recommend 'make clean all' after pulling this update; I modified the Relation struct layout slightly. Will post further discussion to pghackers list shortly.	1999-09-18 19:08:25 +00:00
Tom Lane	43d32d3683	First cut at doing something reasonable with OR-of-ANDs WHERE conditions. There are some pretty bogus heuristics in prepqual.c that try to decide whether to output CNF or DNF format; they need to be replaced, likely. Right now the code is probably too willing to choose DNF form, which might hurt performance in some cases that used to work OK. But at least we have a foundation to build on.	1999-09-13 00:17:25 +00:00
Tom Lane	2119cc0670	Further improvements in cnfify: reduce amount of self-recursion in or_normalize, remove detection of duplicate subexpressions (since it's highly unlikely to be worth the amount of time it takes), and introduce a dnfify() entry point so that unintelligible backwards logic in UNION processing can be eliminated. This is just an intermediate step --- next thing is to look at not forcing the qual into CNF form when it would be better off in DNF form.	1999-09-12 18:08:17 +00:00
Tom Lane	51db6455ea	Repair error noticed by Roberto Cornacchia: selectivity code was rejecting negative attnums as bogus, which of course they are not. Add code to get_attdisbursion to produce a useful value for OID attribute, since VACUUM does not store stats for system attributes. Also, repair bug that's been in eqjoinsel for a long time: it was taking the max of the two columns' disbursions, whereas it should use the min.	1999-09-09 02:36:04 +00:00
Tom Lane	8759f175db	Performance improvements in cnfify(): get rid of exponential space consumption in pull_args, and avoid doing the full CNF transform on operands of operator clauses, where it's really not particularly helpful. This answers the TODO item about large numbers of OR clauses, at least partially. I was able to do a ten-thousand-OR-clause query with about 20Mb memory consumption ... it took an obscenely long time, but it worked...	1999-09-07 03:47:06 +00:00
Tom Lane	37d20eb855	Clean up some mistakes in handling of uplevel Vars in planner. Most parts of the planner should ignore, or indeed never even see, uplevel Vars because they will be or have been replaced by Params. There were a couple of places that got it wrong though, probably my fault from recent changes...	1999-08-26 05:09:06 +00:00
Tom Lane	42af56e1ea	Revise implementation of SubLinks so that there is a consistent, documented intepretation of the lefthand and oper fields. Fix a number of obscure problems while at it --- for example, the old code failed if the parser decided to insert a type-coercion function just below the operator of a SubLink. CAUTION: this will break stored rules that contain subplans. You may need to initdb.	1999-08-25 23:21:43 +00:00
Tom Lane	e8140adb10	Further sort-order twiddling in optimizer: be smart about case where ORDER BY and GROUP BY request the same sort order.	1999-08-22 23:56:45 +00:00
Tom Lane	78114cd4d4	Further planner/optimizer cleanups. Move all set_tlist_references and fix_opids processing to a single recursive pass over the plan tree executed at the very tail end of planning, rather than haphazardly here and there at different places. Now that tlist Vars do not get modified until the very end, it's possible to get rid of the klugy var_equal and match_varid partial-matching routines, and just use plain equal() throughout the optimizer. This is a step towards allowing merge and hash joins to be done on expressions instead of only Vars ...	1999-08-22 20:15:04 +00:00
Tom Lane	db436adf76	Major revision of sort-node handling: push knowledge of query sort order down into planner, instead of handling it only at the very top level of the planner. This fixes many things. An explicit sort is now avoided if there is a cheaper alternative (typically an indexscan) not only for ORDER BY, but also for the internal sort of GROUP BY. It works even when there is no other reason (such as a WHERE condition) to consider the indexscan. It works for indexes on functions. It works for indexes on functions, backwards. It's just so cool... CAUTION: I have changed the representation of SortClause nodes, therefore THIS UPDATE BREAKS STORED RULES. You will need to initdb.	1999-08-21 03:49:17 +00:00
Tom Lane	abee4c299f	Remove extraneous SeqScan node that make_noname was inserting above a Sort or Materialize node. As far as I can tell, the only place that actually needed that was set_tlist_references, which was being lazy about checking to see if it had a noname node to fix or not...	1999-08-18 04:15:16 +00:00
Tom Lane	91f82de48a	Assign sort keys properly when there are duplicate entries in pathkey list --- corrects misbehavior seen with multiple mergejoin clauses mentioning same variable.	1999-08-16 23:07:20 +00:00
Tom Lane	e6381966c1	Major planner/optimizer revision: get rid of PathOrder node type, store all ordering information in pathkeys lists (which are now lists of lists of PathKeyItem nodes, not just lists of lists of vars). This was a big win --- the code is smaller and IMHO more understandable than it was, even though it handles more cases. I believe the node changes will not force an initdb for anyone; planner nodes don't show up in stored rules.	1999-08-16 02:17:58 +00:00
Tom Lane	47f18ec702	Update comments about pathkeys.	1999-08-13 01:17:16 +00:00
Tom Lane	8f9f6e51a8	Clean up optimizer's handling of indexscan quals that need to be commuted (ie, the index var appears on the right). These are now handled the same way as merge and hash join quals that need to be commuted: the actual reversing of the clause only happens if we actually choose the path and generate a plan from it. Furthermore, the clause is only reversed in the 'indexqual' field of the plan, not in the 'indxqualorig' field. This allows the clause to still be recognized and removed from qpquals of upper level join plans. Also, simplify and generalize match_clause_to_indexkey; now it recognizes binary-compatible indexes for join as well as restriction clauses.	1999-08-12 04:32:54 +00:00
Tom Lane	2ae51c86c9	Minor cleanups and code beautification; eliminate some routines that are now dead code.	1999-08-10 03:00:15 +00:00
Tom Lane	4a1c5cb953	Revise create_nestloop_node's handling of inner indexscan to work under a wider range of scenarios than it did --- it formerly did not handle a multi-pass inner scan, nor cases in which the inner scan's indxqualorig or non-index qual contained outer var references. I am not sure that these limitations could be hit in the existing optimizer, but they need to be fixed for future expansion.	1999-08-10 02:58:56 +00:00
Bruce Momjian	158fd5f1c4	> > Prevent sorting if result is already sorted > > > > was implemented by Jan Wieck. > > His work is for ascending order cases. > > > > Here is a patch to prevent sorting also in descending > > order cases. > > Because I had already changed _bt_first() to position > > backward correctly before v6.5,this patch would work. > > Hiroshi Inoue Inoue@tpf.co.jp	1999-08-09 06:20:27 +00:00
Tom Lane	5efe31214a	Clean up tlist.c tree-walking routines with expression_tree_mutator.	1999-08-09 05:34:13 +00:00
Tom Lane	14f84cd821	Store -1 in attdisbursion to signal 'no duplicates in column'. Centralize att_disbursion readout logic.	1999-08-09 03:16:47 +00:00
Tom Lane	5af4b04f31	Move get_attdisbursion to lsyscache. Clean up get_typdefault.	1999-08-09 03:13:31 +00:00
Tom Lane	10d6d411a8	Rewrite fix_indxqual_references, which was entirely bogus for multi-scan indexscan plans; it tried to use the same table-to-index attribute mapping for all the scans, even if they used different indexes. It would klugily work as long as OR indexquals never used multikey indexes, but that's not likely to hold up much longer...	1999-08-09 01:01:42 +00:00
Tom Lane	ecef2caae9	Clean up routines in setrefs.c by replacing individual tree walking logic with expression_tree_walker/mutator calls.	1999-08-09 00:56:05 +00:00
Tom Lane	6bc601b648	Create a standardized expression_tree_mutator support routine to go along with expression_tree_walker. (_walker is not suitable for routines that need to alter the tree structure significantly.) Other minor cleanups in clauses.c.	1999-08-09 00:51:26 +00:00
Tom Lane	e1fad50a5d	Revise generation of hashjoin paths: generate one path per hashjoinable clause, not one path for a randomly-chosen element of each set of clauses with the same join operator. That is, if you wrote SELECT ... WHERE t1.f1 = t2.f2 and t1.f3 = t2.f4, and both '=' ops were the same opcode (say, all four fields are int4), then the system would either consider hashing on f1=f2 or on f3=f4, but it would not consider both possibilities. Boo hiss. Also, revise estimation of hashjoin costs to include a penalty when the inner join var has a high disbursion --- ie, the most common value is pretty common. This tends to lead to badly skewed hash bucket occupancy and way more comparisons than you'd expect on average. I imagine that the cost calculation still needs tweaking, but at least it generates a more reasonable plan than before on George Young's example.	1999-08-06 04:00:17 +00:00
Tom Lane	30da344cb1	Update comments about clause selectivity estimation.	1999-07-30 22:34:19 +00:00
Tom Lane	04578a9180	Further cleanups of indexqual processing: simplify control logic in indxpath.c, avoid generation of redundant indexscan paths for the same relation and index.	1999-07-30 04:07:25 +00:00
Tom Lane	7d572886d6	Fix coredump seen when doing mergejoin between indexed tables, for example in the regression test database, try select * from tenk1 t1, tenk1 t2 where t1.unique1 = t2.unique2; 6.5 has this same bug ...	1999-07-30 00:56:17 +00:00
Tom Lane	161be69544	Update comments for create_indexscan_node().	1999-07-30 00:44:23 +00:00
Tom Lane	ecbfafbe0e	Add support for Case exprs to fix_indxqual_references, so that Case works in WHERE join clauses. Temporary patch --- this routine is one of many that ought to be changed to use centralized expression-tree- walking logic.	1999-07-29 02:48:05 +00:00
Tom Lane	b62fdc13f0	Correct bug in best_innerjoin(): it should check all the rels that the inner path needs to join to, but it was only checking for the first one. Failure could only have been observed with an OR-clause that mentions 3 or more tables, and then only if the bogus path was actually selected as cheapest ...	1999-07-27 06:23:12 +00:00
Tom Lane	9e7e29e6c9	First cut at doing LIKE/regex indexing optimization in optimizer rather than parser. This has many advantages, such as not getting fooled by chance uses of operator names ~ and ~~ (the operators are identified by OID now), and not creating useless comparison operations in contexts where the comparisons will not actually be used as indexquals. The new code also recognizes exact-match LIKE and regex patterns, and produces an = indexqual instead of >= and <=. This change does NOT fix the problem with non-ASCII locales: the code still doesn't know how to generate an upper bound indexqual for non-ASCII collation order. But it's no worse than before, just the same deficiency in a different place... Also, dike out loc_restrictinfo fields in Plan nodes. These were doing nothing useful in the absence of 'expensive functions' optimization, and they took a considerable amount of processing to fill in.	1999-07-27 03:51:11 +00:00
Tom Lane	49ed4dd779	Further work on planning of indexscans. Cleaned up interfaces to index_selectivity so that it can be handed an indexqual clause list rather than a bunch of assorted derivative data.	1999-07-25 23:07:26 +00:00
Tom Lane	8ae29a1d40	Remove 'restrictinfojoinid' field from RestrictInfo nodes. The only place it was being used was as temporary storage in indxpath.c, and the logic was wrong: the same restrictinfo node could get chosen to carry the info for two different joins. Right fix is to return a second list of unjoined-relids parallel to the list of clause groups.	1999-07-25 17:53:27 +00:00
Tom Lane	ac4913a0dd	Clean up messy clause-selectivity code in clausesel.c; repair bug identified by Hiroshi (incorrect cost attributed to OR clauses after multiple passes through set_rest_selec()). I think the code was trying to allow selectivities of OR subclauses to be passed in from outside, but noplace was actually passing any useful data, and set_rest_selec() was passing wrong data. Restructure representation of "indexqual" in IndexPath nodes so that it is the same as for indxqual in completed IndexScan nodes: namely, a toplevel list with an entry for each pass of the index scan, having sublists that are implicitly-ANDed index qual conditions for that pass. You don't want to know what the old representation was :-( Improve documentation of OR-clause indexscan functions. Remove useless 'notclause' field from RestrictInfo nodes. (This might force an initdb for anyone who has stored rules containing RestrictInfos, but I do not think that RestrictInfo ever appears in completed plans.)	1999-07-24 23:21:14 +00:00
Tom Lane	348bdbce79	Minor code beautification, extensive improvement of comments. This file was full of obsolete and just plain wrong commentary...	1999-07-23 03:34:49 +00:00
Bruce Momjian	3406901a29	Move some system includes into c.h, and remove duplicates.	1999-07-17 20:18:55 +00:00
Bruce Momjian	a71802e12e	Final cleanup.	1999-07-16 05:00:38 +00:00
Bruce Momjian	9b645d481c	Update #include cleanups	1999-07-16 03:14:30 +00:00
Bruce Momjian	a9591ce66a	Change #include's to use <> and "" as appropriate.	1999-07-15 23:04:24 +00:00
Bruce Momjian	2e6b1e63a3	Remove unused #includes in *.c files.	1999-07-15 22:40:16 +00:00
Bruce Momjian	4b2c2850bf	Clean up #include in /include directory. Add scripts for checking includes.	1999-07-15 15:21:54 +00:00
Tom Lane	8aea617c03	Several routines failed to cope with CASE expressions, and indeed some of 'em were missing support for more node types than that...	1999-07-15 01:52:09 +00:00
Bruce Momjian	0cf1b79528	Cleanup of /include #include's, for 6.6 only.	1999-07-14 01:20:30 +00:00
Bruce Momjian	db15dc05ad	Fix for \do and ceil()/float.	1999-07-07 16:09:33 +00:00
Bruce Momjian	e9c977da7d	Fix spelling of variable name.	1999-07-07 09:36:45 +00:00
Bruce Momjian	9f7ac20e57	Cleanup of min tuple size.	1999-07-07 09:27:28 +00:00

... 4 5 6 7 8 ...

865 Commits