postgresql

mirror of https://git.postgresql.org/git/postgresql.git synced 2024-10-03 00:06:52 +02:00

Author	SHA1	Message	Date
Tom Lane	58f337a343	Centralize implementation of delay code by creating a pg_usleep() subroutine in src/port/pgsleep.c. Remove platform dependencies from miscadmin.h and put them in port.h where they belong. Extend recent vacuum cost-based-delay patch to apply to VACUUM FULL, ANALYZE, and non-btree index vacuuming. By the way, where is the documentation for the cost-based-delay patch?	2004-02-10 03:42:45 +00:00
Tom Lane	87bd956385	Restructure smgr API as per recent proposal. smgr no longer depends on the relcache, and so the notion of 'blind write' is gone. This should improve efficiency in bgwriter and background checkpoint processes. Internal restructuring in md.c to remove the not-very-useful array of MdfdVec objects --- might as well just use pointers. Also remove the long-dead 'persistent main memory' storage manager (mm.c), since it seems quite unlikely to ever get resurrected.	2004-02-10 01:55:27 +00:00
Jan Wieck	f425b605f4	Cost based vacuum delay feature. Jan	2004-02-06 19:36:18 +00:00
Tom Lane	391c3811a2	Rename SortMem and VacuumMem to work_mem and maintenance_work_mem. Make btree index creation and initial validation of foreign-key constraints use maintenance_work_mem rather than work_mem as their memory limit. Add some code to guc.c to allow these variables to be referenced by their old names in SHOW and SET commands, for backwards compatibility.	2004-02-03 17:34:04 +00:00
Tom Lane	2f0d43b251	Review uses of IsUnderPostmaster, change some tests to look at whereToSendOutput instead because they are really inquiring about the correct client communication protocol. Update some comments. This is pointing towards supporting regular FE/BE client protocol in a standalone backend, per discussion a month or so back.	2004-01-28 21:02:40 +00:00
Bruce Momjian	f4921e5ca3	Attached is a patch that fixes some trivial typos and alignment. Please apply. Alvaro Herrera	2004-01-26 22:51:56 +00:00
Tom Lane	c77f363384	Ensure that close() and fclose() are checked for errors, at least in cases involving writes. Per recent discussion about the possibility of close-time failures on some filesystems. There is a TODO item for this, too.	2004-01-26 22:35:32 +00:00
Tom Lane	be11fa26e3	Repair incorrect order of operations in GetNewTransactionId(). We must complete ExtendCLOG() before advancing nextXid, so that if that routine fails, the next incoming transaction will try it again. Per trouble report from Christopher Kings-Lynne.	2004-01-26 19:15:59 +00:00
Tom Lane	9bd681a522	Repair problem identified by Olivier Prenant: ALTER DATABASE SET search_path should not be too eager to reject paths involving unknown schemas, since it can't really tell whether the schemas exist in the target database. (Also, when reading pg_dumpall output, it could be that the schemas don't exist yet, but eventually will.) ALTER USER SET has a similar issue. So, reduce the normal ERROR to a NOTICE when checking search_path values for these commands. Supporting this requires changing the API for GUC assign_hook functions, which causes the patch to touch a lot of places, but the changes are conceptually trivial.	2004-01-19 19:04:40 +00:00
Tom Lane	0966516b75	Tighten short-circuit tests for deciding whether we need to invoke tuptoaster.c --- fields that are compressed in-line are not a reason to invoke the toaster. Along the way, add a couple more htup.h macros to eliminate confusing negated tests, and get rid of the already vestigial TUPLE_TOASTER_ACTIVE symbol.	2004-01-16 20:51:30 +00:00
Bruce Momjian	38081fd000	Change PG_DELAY from msec to usec and use it consistenly rather than select(). Add Win32 Sleep() for delay.	2004-01-09 21:08:50 +00:00
Neil Conway	192ad63bd7	More janitorial work: remove the explicit casting of NULL literals to a pointer type when it is not necessary to do so. For future reference, casting NULL to a pointer type is only necessary when (a) invoking a function AND either (b) the function has no prototype OR (c) the function is a varargs function.	2004-01-07 18:56:30 +00:00
Tom Lane	06288d4e22	Suppress compiler warning (xlog_outrec is unused if not WAL_DEBUG).	2004-01-06 22:22:37 +00:00
Neil Conway	bc028beb16	Make the 'wal_debug' GUC variable a boolean (rather than an integer), and hide it behind #ifdef WAL_DEBUG blocks.	2004-01-06 17:26:23 +00:00
Neil Conway	548523533f	Fix three trivial typos in comments.	2004-01-05 20:36:04 +00:00
Tom Lane	ef92b82dbb	Further cleanup in _bt_first: eliminate duplicate code paths.	2003-12-21 17:52:34 +00:00
Tom Lane	2a0caefeb5	Previous change exposed some opportunities for further simplification in _bt_first().	2003-12-21 03:00:04 +00:00
Tom Lane	569659ae16	Improve btree's initial-positioning-strategy code so that we never need to step more than one entry after descending the search tree to arrive at the correct place to start the scan. This can improve the behavior substantially when there are many entries equal to the chosen boundary value. Per suggestion from Dmitry Tkach, 14-Jul-03.	2003-12-21 01:23:06 +00:00
Bruce Momjian	d75b2ec4eb	This patch is the next step towards (re)allowing fork/exec. Claudio Natoli	2003-12-20 17:31:21 +00:00
Neil Conway	fef0c8345a	I posted some bufmgr cleanup a few weeks ago, but it conflicted with some concurrent changes Jan was making to the bufmgr. Here's an updated version of the patch -- it should apply cleanly to CVS HEAD and passes the regression tests. This patch makes the following changes: - remove the UnlockAndReleaseBuffer() and UnlockAndWriteBuffer() macros, and replace uses of them with calls to the appropriate functions. - remove a bunch of #ifdef BMTRACE code: it is ugly & broken (i.e. it doesn't compile) - make BufferReplace() return a bool, not an int - cleanup some logic in bufmgr.c; should be functionality equivalent to the previous code, just cleaner now - remove the BM_PRIVATE flag as it is unused - improve a few comments, etc.	2003-12-14 00:34:47 +00:00
Peter Eisentraut	2afacfc403	This patch properly sets the prototype for the on_shmem_exit and on_proc_exit functions, and adjust all other related code to use the proper types too. by Kurt Roeckx	2003-12-12 18:45:10 +00:00
Joe Conway	e2605c8311	Add a warning to AtEOXact_SPI() to catch cases where the current transaction has been committed without SPI_finish() being called first. Per recent discussion here: http://archives.postgresql.org/pgsql-patches/2003-11/msg00286.php	2003-12-02 19:26:47 +00:00
PostgreSQL Daemon	55b113257c	make sure the $Id tags are converted to $PostgreSQL as well ...	2003-11-29 22:41:33 +00:00
PostgreSQL Daemon	969685ad44	$Header: -> $PostgreSQL Changes ...	2003-11-29 19:52:15 +00:00
Tom Lane	fa5c8a055a	Cross-data-type comparisons are now indexable by btrees, pursuant to my pghackers proposal of 8-Nov. All the existing cross-type comparison operators (int2/int4/int8 and float4/float8) have appropriate support. The original proposal of storing the right-hand-side datatype as part of the primary key for pg_amop and pg_amproc got modified a bit in the event; it is easier to store zero as the 'default' case and only store a nonzero when the operator is actually cross-type. Along the way, remove the long-since-defunct bigbox_ops operator class.	2003-11-12 21:15:59 +00:00
Tom Lane	c1d62bfd00	Add operator strategy and comparison-value datatype fields to ScanKey. Remove the 'strategy map' code, which was a large amount of mechanism that no longer had any use except reverse-mapping from procedure OID to strategy number. Passing the strategy number to the index AM in the first place is simpler and faster. This is a preliminary step in planned support for cross-datatype index operations. I'm committing it now since the ScanKeyEntryInitialize() API change touches quite a lot of files, and I want to commit those changes before the tree drifts under me.	2003-11-09 21:30:38 +00:00
Tom Lane	90b2202975	Fix bad interaction between NOTIFY processing and V3 extended query protocol, per report from Igor Shevchenko. NOTIFY thought it could do its thing if transaction blockState is TBLOCK_DEFAULT, but in reality it had better check the low-level transaction state is TRANS_DEFAULT as well. Formerly it was not possible to wait for the client in a state where the first is true and the second is not ... but now we can have such a state. Minor cleanup in StartTransaction() as well.	2003-10-16 16:50:41 +00:00
Tom Lane	55d85f42a8	Repair RI trigger visibility problems (this time for sure ;-)) per recent discussion on pgsql-hackers: in READ COMMITTED mode we just have to force a QuerySnapshot update in the trigger, but in SERIALIZABLE mode we have to run the scan under a current snapshot and then complain if any rows would be updated/deleted that are not visible in the transaction snapshot.	2003-10-01 21:30:53 +00:00
Tom Lane	e33f205a94	Adjust btree index build procedure so that the btree metapage looks invalid (has the wrong magic number) until the build is entirely complete. This turns out to cost no additional writes in the normal case, since we were rewriting the metapage at the end of the process anyway. In normal scenarios there's no real gain in security, because a failed index build would roll back the transaction leaving an unused index file, but for rebuilding shared system indexes this seems to add some useful protection.	2003-09-29 23:40:26 +00:00
Tom Lane	8934790052	Add a mechanism to let dynamically loaded modules register post-commit/ post-abort cleanup hooks. I'm surprised that we have not needed this already, but I need it now to fix a plpgsql problem, and the usefulness for other dynamically loaded modules seems obvious.	2003-09-28 23:26:20 +00:00
Tom Lane	4f7a2fa0c3	Fix typo in message.	2003-09-27 18:16:35 +00:00
Peter Eisentraut	d84b6ef56b	Various message fixes, among those fixes for the previous round of fixes	2003-09-26 15:27:37 +00:00
Peter Eisentraut	feb4f44d29	Message editing: remove gratuitous variations in message wording, standardize terms, add some clarifications, fix some untranslatable attempts at dynamic message building.	2003-09-25 06:58:07 +00:00
Tom Lane	a56a016ceb	Repair some REINDEX problems per recent discussions. The relcache is now able to cope with assigning new relfilenode values to nailed-in-cache indexes, so they can be reindexed using the fully crash-safe method. This leaves only shared system indexes as special cases. Remove the 'index deactivation' code, since it provides no useful protection in the shared- index case. Require reindexing of shared indexes to be done in standalone mode, but remove other restrictions on REINDEX. -P (IgnoreSystemIndexes) now prevents using indexes for lookups, but does not disable index updates. It is therefore safe to allow from PGOPTIONS. Upshot: reindexing system catalogs can be done without a standalone backend for all cases except shared catalogs.	2003-09-24 18:54:02 +00:00
Tom Lane	db18703b5a	Fix LISTEN/NOTIFY race condition reported by Gavin Sherry. While a really general fix might be difficult, I believe the only case where AtCommit_Notify could see an uncommitted tuple is where the other guy has just unlistened and not yet committed. The best solution seems to be to just skip updating that tuple, on the assumption that the other guy does not want to hear about the notification anyway. This is not perfect --- if the other guy rolls back his unlisten instead of committing, then he really should have gotten this notify. But to do that, we'd have to wait to see if he commits or not, or make UNLISTEN hold exclusive lock on pg_listener until commit. Either of these answers is deadlock-prone, not to mention horrible for interactive performance. Do it this way for now. (What happened to that project to do LISTEN/NOTIFY in memory with no table, anyway?)	2003-09-15 23:33:43 +00:00
Tom Lane	7a3693716d	Reimplement hash index locking algorithms, per my recent proposal to pghackers. This fixes the problem recently reported by Markus KrÌutner (hash bucket split corrupts the state of scans being done concurrently), and I believe it also fixes all the known problems with deadlocks in hash index operations. Hash indexes are still not really ready for prime time (since they aren't WAL-logged), but this is a step forward.	2003-09-04 22:06:27 +00:00
Tom Lane	5ac2d7c0eb	In _bt_check_unique() loop, don't bother applying _bt_isequal() to killed items; just skip to the next item immediately. Only check for key equality when we reach a non-killed item or the end of the index page. This saves key comparisons when there are lots of killed items, as for example in a heavily-updated table that's not been vacuumed lately. Seems to be a win for pgbench anyway.	2003-09-02 22:10:16 +00:00
Tom Lane	d70610c4ee	Several fixes for hash indexes that involve changing the on-disk index layout; therefore, this change forces REINDEX of hash indexes (though not a full initdb). Widen hashm_ntuples to double so that hash space management doesn't get confused by more than 4G entries; enlarge the allowed number of free-space-bitmap pages; replace the useless bshift field with a useful bmshift field; eliminate 4 bytes of wasted space in the per-page special area.	2003-09-02 18:13:32 +00:00
Tom Lane	8b2450c831	Fix a couple typos, add some more comments.	2003-09-02 03:29:01 +00:00
Tom Lane	39673ca47b	Rewrite hashbulkdelete() to make it amenable to new bucket locking scheme. A pleasant side effect is that it is much faster when deleting a large fraction of the indexed tuples, because of elimination of redundant hash_step activity induced by hash_adjscans. Various other continuing code cleanup.	2003-09-02 02:18:38 +00:00
Tom Lane	65c2d427fb	Preliminary cleanup for hash index code (doesn't attack the locking problem yet). Fix a couple of bugs that would only appear if multiple bitmap pages are used, including a buffer reference leak and incorrect computation of bit indexes. Get rid of 'overflow address' concept, which accomplished nothing except obfuscating the code and creating a risk of failure due to limited range of offset field. Rename some misleadingly-named fields and routines, and improve documentation.	2003-09-01 20:26:34 +00:00
Tom Lane	eaeb8621f8	Add some internals documentation for hash indexes, including an explanation of the remarkably confusing page addressing scheme. The file also includes my planned-but-not-yet-implemented revision of the hash index locking scheme.	2003-09-01 20:24:49 +00:00
Tom Lane	302f1a86dc	Rewriter and planner should use only resno, not resname, to identify target columns in INSERT and UPDATE targetlists. Don't rely on resname to be accurate in ruleutils, either. This fixes bug reported by Donald Fraser, in which renaming a column referenced in a rule did not work very well.	2003-08-11 23:04:50 +00:00
Tom Lane	ffafacc1f6	Repair potential deadlock created by recent changes to recycle btree index pages: when _bt_getbuf asks the FSM for a free index page, it is possible (and, in some cases, even moderately likely) that the answer will be the same page that _bt_split is trying to split. _bt_getbuf already knew that the returned page might not be free, but it wasn't prepared for the possibility that even trying to lock the page could be problematic. Fix by doing a conditional rather than unconditional grab of the page lock.	2003-08-10 19:48:08 +00:00
Bruce Momjian	46785776c4	Another pgindent run with updated typedefs.	2003-08-08 21:42:59 +00:00
Tom Lane	870886affe	Suppress unused-variable warnings when building without Asserts.	2003-08-08 14:39:45 +00:00
Tom Lane	338aa57be0	Rename fields of DestReceiver to avoid collisions with (ill-considered) macros in some platforms' sys/socket.h.	2003-08-06 17:46:46 +00:00
Tom Lane	2f9c859ea1	Fix some copyright notices that weren't updated. Improve copyright tool so it won't miss 'em again.	2003-08-04 23:59:41 +00:00
Bruce Momjian	f3c3deb7d0	Update copyrights to 2003.	2003-08-04 02:40:20 +00:00
Bruce Momjian	089003fb46	pgindent run.	2003-08-04 00:43:34 +00:00
Tom Lane	892a51c367	Fix longstanding error in _bt_search(): should moveright at top of loop not bottom. Otherwise we fail to moveright when the root page was split while we were "in flight" to it. This is not a significant problem when the root is above the leaf level, but if the root was also a leaf (ie, a single-page index just got split) we may return the wrong leaf page to the caller, resulting in failure to find a key that is in fact present. Bug has existed at least since 7.1, probably forever.	2003-07-29 22:18:38 +00:00
Tom Lane	81b5c8a136	A visit from the message-style police ...	2003-07-28 00:09:16 +00:00
Tom Lane	ec7aa4b515	Error message editing in backend/access.	2003-07-21 20:29:40 +00:00
Tom Lane	fa3bd4dbd0	Error message editing: finish up undone task of reporting the problem xid when we fail to access pg_clog.	2003-07-19 21:37:37 +00:00
Tom Lane	8cf63ba920	Repair boundary-case bug introduced by patch of two months ago that fixed incorrect initial setting of StartUpID. The logic in XLogWrite() expects that Write->curridx is advanced to the next page as soon as LogwrtResult points to the end of the current page, but StartupXLOG() failed to make that happen when the old WAL ended exactly on a page boundary. Per trouble report from Hannu Krosing.	2003-07-17 16:45:04 +00:00
Tom Lane	0c985ab5a8	Add comment pointing out that XLByteToPrevSeg macro is not broken.	2003-06-26 18:23:07 +00:00
Tom Lane	bff0422b6c	Revise hash join and hash aggregation code to use the same datatype- specific hash functions used by hash indexes, rather than the old not-datatype-aware ComputeHashFunc routine. This makes it safe to do hash joining on several datatypes that previously couldn't use hashing. The sets of datatypes that are hash indexable and hash joinable are now exactly the same, whereas before each had some that weren't in the other.	2003-06-22 22:04:55 +00:00
Tom Lane	3fb6f1347f	Replace cryptic 'Unknown kind of return type' messages with something hopefully a little more useful.	2003-06-15 17:59:10 +00:00
Bruce Momjian	0abe7431c6	This patch extracts page buffer pooling and the simple least-recently-used strategy from clog.c into slru.c. It doesn't change any visible behaviour and passes all regression tests plus a TruncateCLOG test done manually. Apart from refactoring I made a little change to SlruRecentlyUsed, formerly ClogRecentlyUsed: It now skips incrementing lru_counts, if slotno is already the LRU slot, thus saving a few CPU cycles. To make this work, lru_counts are initialised to 1 in SimpleLruInit. SimpleLru will be used by pg_subtrans (part of the nested transactions project), so the main purpose of this patch is to avoid future code duplication. Manfred Koizar	2003-06-11 22:37:46 +00:00
Bruce Momjian	98b6f37e47	Make debug_ GUC varables output DEBUG1 rather than LOG, and mention in docs that CLIENT/LOG_MIN_MESSAGES now controls debug_* output location. Doc changes included.	2003-05-27 17:49:47 +00:00
Tom Lane	8c43300ccc	Make sure printtup() always sends the number of columns previously advertised in RowDescription message. Depending on the physical tuple's column count is not really correct, since according to heap_getattr() conventions the tuple may be short some columns, which will automatically get read as nulls. Problem has been latent since forever, but was only exposed by recent change to skip a projection step in SELECT * FROM...	2003-05-26 17:51:38 +00:00
Tom Lane	39e98d9563	Repair sometimes-incorrect computation of StartUpID after a crash, per example from Rao Kumar. This is a very corner corner-case, requiring a minimum of three closely-spaced database crashes and an unlucky positioning of the second recovery's checkpoint record before you'd notice any problem. But the consequences are dire enough that it's a must-fix.	2003-05-22 14:39:28 +00:00
Peter Eisentraut	2c0556068f	Indexing support for pattern matching operations via separate operator class when lc_collate is not C.	2003-05-15 15:50:21 +00:00
Tom Lane	f85f43dfb5	Backend support for autocommit removed, per recent discussions. The only remnant of this failed experiment is that the server will take SET AUTOCOMMIT TO ON. Still TODO: provide some client-side autocommit logic in libpq.	2003-05-14 03:26:03 +00:00
Tom Lane	d9b679c13a	In RowDescription messages, report columns of domain datatypes as having the type OID and typmod of the underlying base type. Per discussions a few weeks ago with Andreas Pflug and others. Note that this behavioral change affects both old- and new-protocol clients.	2003-05-13 18:39:50 +00:00
Tom Lane	30f609484d	Add binary I/O routines for a bunch more datatypes. Still a few to go, but that was enough tedium for one day. Along the way, move the few support routines for types xid and cid into a more logical place.	2003-05-12 23:08:52 +00:00
Tom Lane	8d86a96068	Adjust CreateCheckpoint so that buffer dumping activities and cleanup of dead xlog segments are not considered part of a critical section. It is not necessary to force a database-wide panic if we get a failure in these operations. Per recent trouble reports.	2003-05-10 18:01:31 +00:00
Tom Lane	0ac6298bb8	Implement new-protocol binary I/O support in DataRow, Bind, and FunctionCall messages. Binary I/O is now up and working, but only for a small set of datatypes (integers, text, bytea).	2003-05-09 18:08:48 +00:00
Tom Lane	c0a8c3ac13	Update 3.0 protocol support to match recent agreements about how to handle multiple 'formats' for data I/O. Restructure CommandDest and DestReceiver stuff one more time (it's finally starting to look a bit clean though). Code now matches latest 3.0 protocol document as far as message formats go --- but there is no support for binary I/O yet.	2003-05-08 18:16:37 +00:00
Tom Lane	79913910d4	Restructure command destination handling so that we pass around DestReceiver pointers instead of just CommandDest values. The DestReceiver is made at the point where the destination is selected, rather than deep inside the executor. This cleans up the original kluge implementation of tstoreReceiver.c, and makes it easy to support retrieving results from utility statements inside portals. Thus, you can now do fun things like Bind and Execute a FETCH or EXPLAIN command, and it'll all work as expected (e.g., you can Describe the portal, or use Execute's count parameter to suspend the output partway through). Implementation involves stuffing the utility command's output into a Tuplestore, which would be kind of annoying for huge output sets, but should be quite acceptable for typical uses of utility commands.	2003-05-06 20:26:28 +00:00
Tom Lane	2cf57c8f8d	Implement feature of new FE/BE protocol whereby RowDescription identifies the column by table OID and column number, if it's a simple column reference. Along the way, get rid of reskey/reskeyop fields in Resdoms. Turns out that representation was not convenient for either the planner or the executor; we can make the planner deliver exactly what the executor wants with no more effort. initdb forced due to change in stored rule representation.	2003-05-06 00:20:33 +00:00
Tom Lane	16503e6fa4	Extended query protocol: parse, bind, execute, describe FE/BE messages. Only lightly tested as yet, since libpq doesn't know anything about 'em.	2003-05-05 00:44:56 +00:00
Bruce Momjian	a7fd03e1de	Handle clog structure in shared memory in exec() case, for Win32.	2003-05-03 03:52:07 +00:00
Bruce Momjian	a2e038fbee	Back out last commit --- wrong patch.	2003-05-02 21:59:31 +00:00
Bruce Momjian	fb1f7ccec5	Dump/read non-default GUC values for use by exec'ed backends, for Win32.	2003-05-02 21:52:42 +00:00
Tom Lane	de28dc9a04	Portal and memory management infrastructure for extended query protocol. Both plannable queries and utility commands are now always executed within Portals, which have been revamped so that they can handle the load (they used to be good only for single SELECT queries). Restructure code to push command-completion-tag selection logic out of postgres.c, so that it won't have to be duplicated between simple and extended queries. initdb forced due to addition of a field to Query nodes.	2003-05-02 20:54:36 +00:00
Tom Lane	4db9689d1a	Add transaction status field to ReadyForQuery messages, and make room for tableID/columnID in RowDescription. (The latter isn't really implemented yet though --- the backend always sends zeroes, and libpq just throws away the data.)	2003-04-26 20:23:00 +00:00
Tom Lane	9cbaf72177	In the continuing saga of FE/BE protocol revisions, add reporting of initial values and runtime changes in selected parameters. This gets rid of the need for an initial 'select pg_client_encoding()' query in libpq, bringing us back to one message transmitted in each direction for a standard connection startup. To allow server version to be sent using the same GUC mechanism that handles other parameters, invent the concept of a never-settable GUC parameter: you can 'show server_version' but it's not settable by any GUC input source. Create 'lc_collate' and 'lc_ctype' never-settable parameters so that people can find out these settings without need for pg_controldata. (These side ideas were all discussed some time ago in pgsql-hackers, but not yet implemented.)	2003-04-25 19:45:10 +00:00
Tom Lane	5ed27e35f3	Another round of protocol changes. Backend-to-frontend messages now all have length words. COPY OUT reimplemented per new protocol: it doesn't need \. anymore, thank goodness. COPY BINARY to/from frontend works, at least as far as the backend is concerned --- libpq's PQgetline API is not up to snuff, and will have to be replaced with something that is null-safe. libpq uses message length words for performance improvement (no cycles wasted rescanning long messages), but not yet for error recovery.	2003-04-22 00:08:07 +00:00
Bruce Momjian	4d4953fc41	Make Win32 tests to match existing Cygwin tests, where appropriate.	2003-04-18 01:03:42 +00:00
Tom Lane	0851e12244	Reorganize clog's error reporting so that PANIC on clog I/O error can be reduced to a plain ERROR. Should make it at least a little less painful to deal with data-corruption problems.	2003-04-14 17:31:33 +00:00
Bruce Momjian	54f7338fa1	This patch implements holdable cursors, following the proposal (materialization into a tuple store) discussed on pgsql-hackers earlier. I've updated the documentation and the regression tests. Notes on the implementation: - I needed to change the tuple store API slightly -- it assumes that it won't be used to hold data across transaction boundaries, so the temp files that it uses for on-disk storage are automatically reclaimed at end-of-transaction. I added a flag to tuplestore_begin_heap() to control this behavior. Is changing the tuple store API in this fashion OK? - in order to store executor results in a tuple store, I added a new CommandDest. This works well for the most part, with one exception: the current DestFunction API doesn't provide enough information to allow the Executor to store results into an arbitrary tuple store (where the particular tuple store to use is chosen by the call site of ExecutorRun). To workaround this, I've temporarily hacked up a solution that works, but is not ideal: since the receiveTuple DestFunction is passed the portal name, we can use that to lookup the Portal data structure for the cursor and then use that to get at the tuple store the Portal is using. This unnecessarily ties the Portal code with the tupleReceiver code, but it works... The proper fix for this is probably to change the DestFunction API -- Tom suggested passing the full QueryDesc to the receiveTuple function. In that case, callers of ExecutorRun could "subclass" QueryDesc to add any additional fields that their particular CommandDest needed to get access to. This approach would work, but I'd like to think about it for a little bit longer before deciding which route to go. In the mean time, the code works fine, so I don't think a fix is urgent. - (semi-related) I added a NO SCROLL keyword to DECLARE CURSOR, and adjusted the behavior of SCROLL in accordance with the discussion on -hackers. - (unrelated) Cleaned up some SGML markup in sql.sgml, copy.sgml Neil Conway	2003-03-27 16:51:29 +00:00
Tom Lane	fddc2d94ce	Modify keys_are_unique optimization to release buffer pins before it returns NULL. This avoids out-of-buffers failures during many-way indexscans, as in Shraibman's complaint of 21-Mar.	2003-03-24 21:42:33 +00:00
Tom Lane	0489783011	Adjust amrescan code so that it's allowed to call index_rescan with a NULL key pointer, indicating that the existing scan key should be reused. This behavior isn't used yet but will be needed for my planned fix to the keys_are_unique code.	2003-03-23 23:01:03 +00:00
Bruce Momjian	9a9719e482	Allow error query to start transaction in autocommit off mode.	2003-03-21 04:33:15 +00:00
Bruce Momjian	c90354bad0	Remove unneeded dash blocks around function start comments.	2003-03-14 22:40:31 +00:00
Tom Lane	e4704001ea	This patch fixes a bunch of spelling mistakes in comments throughout the PostgreSQL source code. Neil Conway	2003-03-10 22:28:22 +00:00
Tom Lane	391eb5e5b6	Reimplement free-space-map management as per recent discussions. Adjustable threshold is gone in favor of keeping track of total requested page storage and doling out proportional fractions to each relation (with a minimum amount per relation, and some quantization of the results to avoid thrashing with small changes in page counts). Provide special- case code for indexes so as not to waste space storing useless page free space counts. Restructure internal data storage to be a flat array instead of list-of-chunks; this may cost a little more work in data copying when reorganizing, but allows binary search to be used during lookup_fsm_page_entry().	2003-03-04 21:51:22 +00:00
Tom Lane	0797bb5c50	During VACUUM FULL, truncate off any deletable pages that are at the end of a btree index. This isn't super-effective, since we won't move nondeletable pages, but it's better than nothing. Also, improve stats displayed during VACUUM VERBOSE.	2003-02-24 00:57:17 +00:00
Tom Lane	3981f2195f	Remove no-longer-used FixBTree GUC variable.	2003-02-23 23:27:21 +00:00
Tom Lane	61b22d3aab	btree page recycling can be done as soon as page's next-xact label is older than current Xmin; we don't have to wait till it's older than GlobalXmin.	2003-02-23 23:20:52 +00:00
Tom Lane	3bbd6af37c	Adjust btbulkdelete logic so that only one WAL record is issued while deleting multiple index entries on a single index page. This makes for a very substantial reduction in the amount of WAL traffic during a large delete operation.	2003-02-23 22:43:09 +00:00
Tom Lane	13dadef8b5	Improve coding of log_heap_clean() and heap_xlog_clean().	2003-02-23 20:32:12 +00:00
Tom Lane	88dc31e3f2	First cut at recycling space in btree indexes. Still some rough edges to fix, but it seems to basically work...	2003-02-23 06:17:13 +00:00
Tom Lane	799bc58dc7	More infrastructure for btree compaction project. Tree-traversal code now knows what to do upon hitting a dead page (in theory anyway, it's untested...). Add a post-VACUUM-cleanup entry point for index AMs, to provide a place for dead-page scavenging to happen. Also, fix oversight that broke btpo_prev links in temporary indexes. initdb forced due to additions in pg_am.	2003-02-22 00:45:05 +00:00
Tom Lane	70508ba7ae	Make btree index structure adjustments and WAL logging changes needed to support btree compaction, as per proposal of a few days ago. btree index pages no longer store parent links, instead they have a level indicator (counting up from zero for leaf pages). The FixBTree recovery logic is removed, and replaced by code that detects missing parent-level insertions during WAL replay. Also, generate appropriate WAL entries when updating btree metapage and when building a btree index from scratch. I believe btree indexes are now completely WAL-legal for the first time. initdb forced due to index and WAL changes.	2003-02-21 00:06:22 +00:00
Bruce Momjian	48ee6f4916	This trivial patch removes the usage of some old statistics code that no longer works -- IncrHeapAccessStat() didn't actually do anything anymore, so no reason to keep it around AFAICS. I also fixed a grammatical error in a comment. Neil Conway	2003-02-13 05:35:11 +00:00
Tom Lane	80727ce14f	Use stat(2) to probe for existing xlog segments in InstallXLogFileSegment, rather than actually opening the files. This eliminates some corner cases where the file indeed exists but open() fails for another reason, such as being out of file descriptors. The net reliability gain is probably tiny, since xlog.c is full of other file open calls that will elog(PANIC) if they fail for any reason; but this specific failure mode has been observed in the field, so we may as well fix it.	2003-01-25 03:06:04 +00:00
Peter Eisentraut	b65cd56240	Read-only transactions, as defined in SQL.	2003-01-10 22:03:30 +00:00
Tom Lane	cbca6c4896	Fix for bug #866 . 7.3 contains new logic for avoiding redundant calls to the index AM when we know we are fetching a unique row. However, this logic did not consider the possibility that it would be asked to fetch backwards. Also fix mark/restore to work correctly in this scenario.	2003-01-08 19:41:40 +00:00
Bruce Momjian	1b7f3cc02d	This patch implements FOR EACH STATEMENT triggers, per my email to -hackers a couple days ago. Notes/caveats: - added regression tests for the new functionality, all regression tests pass on my machine - added pg_dump support - updated PL/PgSQL to support per-statement triggers; didn't look at the other procedural languages. - there's (even) more code duplication in trigger.c than there was previously. Any suggestions on how to refactor the ExecXXXTriggers() functions to reuse more code would be welcome -- I took a brief look at it, but couldn't see an easy way to do it (there are several subtly-different versions of the code in question) - updated the documentation. I also took the liberty of removing a big chunk of duplicated syntax documentation in the Programmer's Guide on triggers, and moving that information to the CREATE TRIGGER reference page. - I also included some spelling fixes and similar small cleanups I noticed while making the changes. If you'd like me to split those into a separate patch, let me know. Neil Conway	2002-11-23 03:59:09 +00:00
Tom Lane	17ac74797a	Put back error test for DECLARE CURSOR outside a transaction block ... but do it correctly now.	2002-11-18 01:17:39 +00:00
Bruce Momjian	559b6c7ced	Rename show_btree_build_stats to log_btree_build_stats	2002-11-15 01:26:09 +00:00
Bruce Momjian	63e9734542	Update xact.c comments for clarity.	2002-11-13 03:12:05 +00:00
Bruce Momjian	9b12ab6d5d	Add new palloc0 call as merge of palloc and MemSet(0).	2002-11-13 00:39:48 +00:00
Tom Lane	f9b5b41ef9	Code review for ON COMMIT patch. Make the actual on-commit action happen before commit, not after :-( --- the original coding is not only unsafe if an error occurs while it's processing, but it generates an invalid sequence of WAL entries. Resurrect 7.2 logic for deleting items when no longer needed. Use an enum instead of random macros. Editorialize on names used for routines and constants. Teach backend/nodes routines about new field in CreateTable struct. Add a regression test.	2002-11-11 22:19:25 +00:00
Bruce Momjian	75fee4535d	Back out use of palloc0 in place if palloc/MemSet. Seems constant len to MemSet is a performance boost.	2002-11-11 03:02:20 +00:00
Bruce Momjian	8fee9615cc	Merge palloc()/MemSet(0) calls into a single palloc0() call.	2002-11-10 07:25:14 +00:00
Bruce Momjian	ebb531836a	Add code to handle [ON COMMIT { PRESERVE ROWS \| DELETE ROWS \| DROP }] for temp tables. Gavin Sherry	2002-11-09 23:56:39 +00:00
Bruce Momjian	bea4792125	This patch removes a bunch of superfluous #include directives: if postgres.h or c.h includes a system header (such as stdio.h or stdlib.h), there's no need to specifically include it in any of the .c files in the backend. Neil Conway	2002-11-08 20:23:57 +00:00
Tom Lane	f6e0130b5b	Clean up a few fprintf(stderr)'s that should be elog's.	2002-11-02 15:54:13 +00:00
Tom Lane	30963fc200	Perform transaction cleanup operations in a less ad-hoc, more principled order; in particular ensure that all shared resources are released before we release transaction locks. The code used to release locks before buffer pins, which might explain an ancient note I have about a bufmgr assertion failure I'd seen once several years ago, and been unable to reproduce since. (Theory: someone trying to drop a relation might be able to reach FlushRelationBuffers before the last user of the relation had gotten around to dropping his buffer pins.)	2002-10-22 22:44:36 +00:00
Tom Lane	200b151615	Fix places that were using IsTransactionBlock() as an (inadequate) check that they'd get to commit immediately on finishing. There's now a centralized routine PreventTransactionChain() that implements the necessary tests.	2002-10-21 22:06:20 +00:00
Tom Lane	e16f04cf72	Make CREATE/ALTER/DROP USER/GROUP transaction-safe, or at least pretty nearly so, by postponing write of flat password file until transaction commit.	2002-10-21 19:46:45 +00:00
Tom Lane	13416a1f8f	Fix potential problem with btbulkdelete deleting an indexscan's current item, if the page containing the current item is split while the indexscan is stopped and holds no read-lock on the page. The current item might move right onto a page that the indexscan holds no pin on. In the prior code this would allow btbulkdelete to reach and possibly delete the item, causing 'my bits moved right off the end of the world!' when the indexscan finally resumes. Fix by chaining read-locks to the right during _bt_restscan and requiring btbulkdelete to LockBufferForCleanup on every page it scans, not only those with deletable items. Per my pghackers message of 25-May-02. (Too bad no one could think of a better way.)	2002-10-20 20:47:31 +00:00
Tom Lane	4e9b159484	Change order of operations during XLogFlush so that we try to include in our write/flush operation any WAL entries that got queued while we were waiting to get the WALWriteLock. This improves throughput when transactions are small enough that several can be committed per WAL write (ie, per disk revolution).	2002-10-07 17:04:30 +00:00
Tom Lane	6d0d15c451	Make the world at least somewhat safe for zero-column tables, and remove the special case in ALTER DROP COLUMN to prohibit dropping a table's last column.	2002-09-28 20:00:19 +00:00
Tom Lane	cb253de21a	Don't mess with HEAP_XMAX_INVALID in heaptuple.c routines; there is no reason to worry about the tuple commit status bits until the tuple is inserted in a relation by heapam.c. Also, improve comments for heap_addheader().	2002-09-27 15:04:08 +00:00
Tom Lane	b2ab1e6bc9	Ensure that before truncating CLOG, we force a checkpoint even if no recent WAL activity has occurred. Without this, it's possible that a later crash might leave tuples on disk with un-updated commit status bits.	2002-09-26 22:58:34 +00:00
Tom Lane	c87469e64a	Fix problems with loss of tuple commit status bits during WAL redo of VACUUM FULL tuple moves. Store full-width t_infomask in WAL, rather than storing low 8 bits and expecting to be able to reconstruct upper bits. While at it, remove redundant t_oid field from WAL headers (the OID, if present, is now recorded in the data portion of the tuple). WAL version number bumped --- this does not force an initdb, you can instead run pg_resetxlog after a clean shutdown of the old postmaster.	2002-09-26 22:46:29 +00:00
Tom Lane	c328b6dd8b	Replace pg_attribute.attisinherited with attislocal and attinhcount columns, to allow more correct behavior in multiple-inheritance cases. Patch by Alvaro Herrera, review by Tom Lane.	2002-09-22 19:42:52 +00:00
Bruce Momjian	e50f52a074	pgindent run.	2002-09-04 20:31:48 +00:00
Bruce Momjian	97ac103289	Remove sys/types.h in files that include postgres.h, and hence c.h, because c.h has sys/types.h.	2002-09-02 02:47:07 +00:00
Tom Lane	c7a165adc6	Code review for HeapTupleHeader changes. Add version number to page headers (overlaying low byte of page size) and add HEAP_HASOID bit to t_infomask, per earlier discussion. Simplify scheme for overlaying fields in tuple header (no need for cmax to live in more than one place). Don't try to clear infomask status bits in tqual.c --- not safe to do it there. Don't try to force output table of a SELECT INTO to have OIDs, either. Get rid of unnecessarily complex three-state scheme for TupleDesc.tdhasoids, which has already caused one recent failure. Improve documentation.	2002-09-02 01:05:06 +00:00
Bruce Momjian	d64e6392fb	Remove code that suggested increasing wal_files.	2002-09-01 01:58:42 +00:00
Tom Lane	26993b2918	AUTOCOMMIT mode is now an available backend GUC variable; setting it to false provides more SQL-spec-compliant behavior than we had before. I am not sure that setting it false is actually a good idea yet; there is a lot of client-side code that will probably be broken by turning autocommit off. But it's a start. Loosely based on a patch by David Van Wie.	2002-08-30 22:18:07 +00:00
Tom Lane	e2d156fa6e	Add attisinherited column to pg_attribute; use it to guard against column additions, deletions, and renames that would let a child table get out of sync with its parent. Patch by Alvaro Herrera, with some kibitzing by Tom Lane.	2002-08-30 19:23:20 +00:00
Bruce Momjian	63653f7ffa	Complete TODO item: * Remove wal_files postgresql.conf option because WAL files are now recycled	2002-08-30 16:50:50 +00:00
Tom Lane	64505ed58b	Code review for standalone composite types, query-specified composite types, SRFs. Not happy with memory management yet, but I'll commit these other changes.	2002-08-29 00:17:06 +00:00
Tom Lane	58de480999	Clean up comments to be careful about the distinction between variable- width types and varlena types, since with the introduction of CSTRING as a more-or-less-real type, these concepts aren't identical. I've tried to use varlena consistently to denote datatypes with typlen = -1, ie, they have a length word and are potentially TOASTable; while the term variable width covers both varlena and cstring (and, perhaps, someday other types with other rules for computing the actual width). No code changes in this commit except for renaming a couple macros.	2002-08-25 17:20:01 +00:00
Tom Lane	976246cc7e	The cstring datatype can now be copied, passed around, etc. The typlen value '-2' is used to indicate a variable-width type whose width is computed as strlen(datum)+1. Everything that looks at typlen is updated except for array support, which Joe Conway is working on; at the moment it wouldn't work to try to create an array of cstring.	2002-08-24 15:00:47 +00:00
Tom Lane	b663f3443b	Add a bunch of pseudo-types to replace the behavior formerly associated with OPAQUE, as per recent pghackers discussion. I still want to do some more work on the 'cstring' pseudo-type, but I'm going to commit the bulk of the changes now before the tree starts shifting under me ...	2002-08-22 00:01:51 +00:00
Bruce Momjian	d04e9137c9	Reverse out XLogDir/-X write-ahead log handling, per discussion. Original patch from Thomas.	2002-08-17 15:12:07 +00:00
Tom Lane	0affc29e1e	Make sure that t_ctid is reset to equal t_self in heap_delete and heap_mark4update; this avoids situations where a deleted tuple might look like it is chained to something else. Also, cause all the WAL redo routines to set t_ctid to equal t_self, rather than leaving it undefined as before. Make heap_xlog_clean set the page's LSN and SUI correctly. All per past discussions in pghackers, ranging back to last December.	2002-08-13 20:11:03 +00:00
Peter Eisentraut	f1d820494c	Fix failure to relink postmaster executable in the first make run if only a single source file a few directories deep in the backend tree has changed.	2002-08-10 17:59:28 +00:00
Tom Lane	ba053de197	Still more paranoia in PageAddItem: disallow specification of an item offset past the last-used-item-plus-one, since that would result in leaving uninitialized holes in the item pointer array. AFAICT the only place that was depending on this was btree index build, which was being cavalier about when to fill in the P_HIKEY pointer; easily fixed. Also a small performance improvement: shuffle itemid's by means of memmove, not a one-at-a-time loop.	2002-08-06 19:41:23 +00:00
Tom Lane	5df307c778	Restructure local-buffer handling per recent pghackers discussion. The local buffer manager is no longer used for newly-created relations (unless they are TEMP); a new non-TEMP relation goes through the shared bufmgr and thus will participate normally in checkpoints. But TEMP relations use the local buffer manager throughout their lifespan. Also, operations in TEMP relations are not logged in WAL, thus improving performance. Since it's no longer necessary to fsync relations as they move out of the local buffers into shared buffers, quite a lot of smgr.c/md.c/fd.c code is no longer needed and has been removed: there's no concept of a dirty relation anymore in md.c/fd.c, and we never fsync anything but WAL. Still TODO: improve local buffer management algorithms so that it would be reasonable to increase NLocBuffer.	2002-08-06 02:36:35 +00:00
Tom Lane	07f9682de4	Preliminary code review for anonymous-composite-types patch: fix breakage of functions returning domain types, update documentation for typtype, move get_typtype to lsyscache.c (actually, resurrect the old version), add defense against creating pseudo-typed table columns, fix some bogus list-parsing in grammar. Issues remain with respect to alias handling and type checking; Joe is on those.	2002-08-05 02:30:50 +00:00
Thomas G. Lockhart	ac1a3dcf24	Fix compilation problem with assert checking enabled for recent xlog location feature.	2002-08-05 01:24:16 +00:00
Bruce Momjian	9218689b69	Attached are two patches to implement and document anonymous composite types for Table Functions, as previously proposed on HACKERS. Here is a brief explanation: 1. Creates a new pg_type typtype: 'p' for pseudo type (currently either 'b' for base or 'c' for catalog, i.e. a class). 2. Creates new builtin type of typtype='p' named RECORD. This is the first of potentially several pseudo types. 3. Modify FROM clause grammer to accept: SELECT * FROM my_func() AS m(colname1 type1, colname2 type1, ...) where m is the table alias, colname1, etc are the column names, and type1, etc are the column types. 4. When typtype == 'p' and the function return type is RECORD, a list of column defs is required, and when typtype != 'p', it is disallowed. 5. A check was added to ensure that the tupdesc provide via the parser and the actual return tupdesc match in number and type of attributes. When creating a function you can do: CREATE FUNCTION foo(text) RETURNS setof RECORD ... When using it you can do: SELECT * from foo(sqlstmt) AS (f1 int, f2 text, f3 timestamp) or SELECT * from foo(sqlstmt) AS f(f1 int, f2 text, f3 timestamp) or SELECT * from foo(sqlstmt) f(f1 int, f2 text, f3 timestamp) Included in the patches are adjustments to the regression test sql and expected files, and documentation. p.s. This potentially solves (or at least improves) the issue of builtin Table Functions. They can be bootstrapped as returning RECORD, and we can wrap system views around them with properly specified column defs. For example: CREATE VIEW pg_settings AS SELECT s.name, s.setting FROM show_all_settings()AS s(name text, setting text); Then we can also add the UPDATE RULE that I previously posted to pg_settings, and have pg_settings act like a virtual table, allowing settings to be queried and set. Joe Conway	2002-08-04 19:48:11 +00:00
Thomas G. Lockhart	c755f6027f	Implement WAL log location control using "-X" or PGXLOG.	2002-08-04 06:53:10 +00:00
Tom Lane	22c64f1834	When compiling with --enable-cassert, check for reference count leaks in the relcache. It's rather silly that we have reference count leak checks in bufmgr and in catcache, but not in relcache which will normally have many fewer entries. Chris K-L would have caught at least one bug in his recent DROP patch if he'd had this.	2002-08-02 22:36:05 +00:00
Tom Lane	38bb77a5d1	ALTER TABLE DROP COLUMN works. Patch by Christopher Kings-Lynne, code review by Tom Lane. Remaining issues: functions that take or return tuple types are likely to break if one drops (or adds!) a column in the table defining the type. Need to think about what to do here. Along the way: some code review for recent COPY changes; mark system columns attnotnull = true where appropriate, per discussion a month ago.	2002-08-02 18:15:10 +00:00
Tom Lane	ce7565ab91	Instead of having a configure-time DEFAULT_ATTSTATTARGET, store -1 in attstattarget to indicate 'use the default'. The default is now a GUC variable default_statistics_target, and so may be changed on the fly. Along the way we gain the ability to have pg_dump dump the per-column statistics target when it's not the default. Patch by Neil Conway, with some kibitzing from Tom Lane.	2002-07-31 17:19:54 +00:00
Bruce Momjian	ceb438ed8c	This patch fixes one serious bug (runaway INSERT) and a few rare (and hard to reproduce) error conditions. Manfred Koizar	2002-07-30 16:08:33 +00:00
Bruce Momjian	b0f5086e41	oid is needed, it is added at the end of the struct (after the null bitmap, if present). Per Tom Lane's suggestion the information whether a tuple has an oid or not is carried in the tuple descriptor. For debugging reasons tdhasoid is of type char, not bool. There are predefined values for WITHOID, WITHOUTOID and UNDEFOID. This patch has been generated against a cvs snapshot from last week and I don't expect it to apply cleanly to current sources. While I post it here for public review, I'm working on a new version against a current snapshot. (There's been heavy activity recently; hope to catch up some day ...) This is a long patch; if it is too hard to swallow, I can provide it in smaller pieces: Part 1: Accessor macros Part 2: tdhasoid in TupDesc Part 3: Regression test Part 4: Parameter withoid to heap_addheader Part 5: Eliminate t_oid from HeapTupleHeader Part 2 is the most hairy part because of changes in the executor and even in the parser; the other parts are straightforward. Up to part 4 the patched postmaster stays binary compatible to databases created with an unpatched version. Part 5 is small (100 lines) and finally breaks compatibility. Manfred Koizar	2002-07-20 05:16:59 +00:00
Bruce Momjian	c9a7345217	>the extra level of struct naming for pd_opaque has no obvious >usefulness. > >> [...] should I post a patch that puts pagesize directly into >> PageHeaderData? > >If you're so inclined. Given that pd_opaque is hidden in those macros, >there wouldn't be much of any gain in readability either, so I haven't >worried about changing the declaration. Thanks for the clarification. Here is the patch. Not much gain, but at least it saves the next junior hacker from scratching his head ... Manfred Koizar	2002-07-02 06:18:57 +00:00
Bruce Momjian	33f1687879	There already was a macro PageGetItemId; this is now used in (almost) all places, where pd_linp is accessed. Also introduce new macros SizeOfPageHeaderData and BTMaxItemSize. This is just source code cosmetic, no behaviour changed. Manfred Koizar	2002-07-02 05:48:44 +00:00
Bruce Momjian	97bfffe50e	This patch, which is built upon the "HeapTupleHeader accessor macros" patch from 2002-06-10, is supposed to reduce the heap tuple header size by four bytes on most architectures. Of course it changes the on-disk tuple format and therefore requires initdb. This overlays cmin/cmax/xmax fields into only two fields. Manfred Koizar	2002-07-02 05:46:14 +00:00
Bruce Momjian	8a9462867a	Here is a patch for a memory leak in rtree.c, version 7.2.1 (in code that I submitted last year, alas). Kenneth Been	2002-06-25 17:26:11 +00:00
Bruce Momjian	d84fe82230	Update copyright to 2002.	2002-06-20 20:29:54 +00:00
Bruce Momjian	ba790a5608	Here is a patch for Composite and Set returning function support. I made two small changes to the API since last patch, which hopefully completes the decoupling of composite function support from SRF specific support. Joe Conway	2002-06-20 17:19:08 +00:00
Bruce Momjian	918e864f14	Remove some pre-WAL relics: SharedBufferChanged BufferRelidLastDirtied BufferTagLastDirtied BufferDirtiedByMe Manfred Koizar	2002-06-15 19:55:38 +00:00
Bruce Momjian	3c35face41	This patch wraps all accesses to t_xmin, t_cmin, t_xmax, and t_cmax in HeapTupleHeaderData in setter and getter macros called HeapTupleHeaderGetXmin, HeapTupleHeaderSetXmin etc. It also introduces a "virtual" field xvac by defining HeapTupleHeaderGetXvac and HeapTupleHeaderSetXvac. Xvac is used by VACUUM, in fact it is stored in t_cmin. Manfred Koizar	2002-06-15 19:54:24 +00:00
Jan Wieck	469cb65aca	Katherine Ward wrote: > Changes to avoid collisions with WIN32 & MFC names... > 1. Renamed: > a. PROC => PGPROC > b. GetUserName() => GetUserNameFromId() > c. GetCurrentTime() => GetCurrentDateTime() > d. IGNORE => IGNORE_DTF in include/utils/datetime.h & utils/adt/datetim > > 2. Added _P to some lex/yacc tokens: > CONST, CHAR, DELETE, FLOAT, GROUP, IN, OUT Jan	2002-06-11 13:40:53 +00:00
Bruce Momjian	2f297a2fcf	The attached patch fixes a problem with InstallXLogFileSegment()'s use of link() under Cygwin: http://archives.postgresql.org/pgsql-cygwin/2002-04/msg00072.php Note that it appears that BeOS and Netware also have the above or similar problem. I have only verified that PostgreSQL builds under Cygwin with this patch. Since I cannot reproduce the problem, I cannot verify that the proposed patch solves it. Nevertheless, both Barry Pederson and David P. Caldwell attest that this patch solves the problem. See the following for details: http://archives.postgresql.org/pgsql-cygwin/2002-05/msg00043.php http://archives.postgresql.org/pgsql-cygwin/2002-05/msg00040.php Jason Tishler	2002-06-07 21:47:45 +00:00
Tom Lane	a71a53079c	Repair error with not adjusting active scans properly after gistSplit. Patch from Teodor Sigaev.	2002-05-28 15:22:33 +00:00
Tom Lane	3212cf9417	Distinguish between MaxHeapAttributeNumber and MaxTupleAttributeNumber, where the latter is made slightly larger to allow for in-memory tuples containing resjunk attributes. Responds to today's complaint that one cannot UPDATE a table containing the allegedly-legal maximum number of columns. Also, apply Manfred Koizar's recent patch to avoid extra alignment padding when there is a null bitmap. This saves bytes in some cases while not creating any backward-compatibility problem AFAICS.	2002-05-27 19:53:33 +00:00
Tom Lane	4d567013cf	Remove AMI_OVERRIDE tests from tqual.c routines; they aren't necessary and just slow down normal operations (only fractionally, but a cycle saved is a cycle earned). Improve documentation of AMI_OVERRIDE behavior.	2002-05-25 20:00:12 +00:00
Tom Lane	de09da547a	Wups, managed to break ANALYZE with one aspect of that heap_fetch change.	2002-05-24 19:52:43 +00:00
Tom Lane	3f4d488022	Mark index entries "killed" when they are no longer visible to any transaction, so as to avoid returning them out of the index AM. Saves repeated heap_fetch operations on frequently-updated rows. Also detect queries on unique keys (equality to all columns of a unique index), and don't bother continuing scan once we have found first match. Killing is implemented in the btree and hash AMs, but not yet in rtree or gist, because there isn't an equally convenient place to do it in those AMs (the outer amgetnext routine can't do it without re-pinning the index page). Did some small cleanup on APIs of HeapTupleSatisfies, heap_fetch, and index_insert to make this a little easier.	2002-05-24 18:57:57 +00:00
Tom Lane	a2597ef179	Modify sequence state storage to eliminate dangling-pointer problem exemplified by bug #671. Moving the storage to relcache turned out to be a bad idea because relcache might decide to discard the info. Instead, open and close the relcache entry on each sequence operation, and use a record of the current XID to discover whether we already hold AccessShareLock on the sequence.	2002-05-22 21:40:55 +00:00
Tom Lane	959e61e917	Remove global variable scanCommandId in favor of storing a command ID in snapshots, per my proposal of a few days ago. Also, tweak heapam.c routines (heap_insert, heap_update, heap_delete, heap_mark4update) to be passed the command ID to use, instead of doing GetCurrentCommandID. For catalog updates they'll still get passed current command ID, but for updates generated from the main executor they'll get passed the command ID saved in the snapshot the query is using. This should fix some corner cases associated with functions and triggers that advance current command ID while an outer query is still in progress.	2002-05-21 22:05:55 +00:00
Tom Lane	44fbe20d62	Restructure indexscan API (index_beginscan, index_getnext) per yesterday's proposal to pghackers. Also remove unnecessary parameters to heap_beginscan, heap_rescan. I modified pg_proc.h to reflect the new numbers of parameters for the AM interface routines, but did not force an initdb because nothing actually looks at those fields.	2002-05-20 23:51:44 +00:00
Tom Lane	940f772a29	Support temporary setting of search path during CREATE SCHEMA; this allows the example in the CREATE SCHEMA ref page to actually work now. Also, clean up when the transaction that initially creates a temp-table namespace is later aborted. Simplify internal representation of search path by folding special cases into the main list.	2002-05-17 20:53:33 +00:00
Tom Lane	f0811a74b3	Merge the last few variable.c configuration variables into the generic GUC support. It's now possible to set datestyle, timezone, and client_encoding from postgresql.conf and per-database or per-user settings. Also, implement rollback of SET commands that occur in a transaction that later fails. Create a SET LOCAL var = value syntax that sets the variable only for the duration of the current transaction. All per previous discussions in pghackers.	2002-05-17 01:19:19 +00:00
Peter Eisentraut	1944bff1d6	Make initdb print a message about which locale it is about to use. Re-add warning if the locale prevents LIKE-optimization. Done within initdb now.	2002-05-09 13:30:24 +00:00
Hiroshi Inoue	d1406f1b1e	Change heap_get_latest_tid() so that a transaction can see changes made by the transaction itself.	2002-05-01 01:23:37 +00:00
Bruce Momjian	d37134085b	xlog.c: If possible please add the following patch to better support NetWare. Ulrich Neumann	2002-04-24 01:54:43 +00:00
Thomas G. Lockhart	f56e8fec31	Add fields in the control file to check for whether the backend was compiled for integer date/time storage and to check the length of storage for the locale fields in the same data structure. Slightly reword some of the error messages to be more accurate on possible recovery options (e.g. recompile or re-initdb). Bump version number on this file.	2002-04-21 19:08:02 +00:00
Tom Lane	27a54ae282	Opclasses live in namespaces. I also took the opportunity to create an 'opclass owner' column in pg_opclass. Nothing is done with it at present, but since there are plans to invent a CREATE OPERATOR CLASS command soon, we'll probably want DROP OPERATOR CLASS too, which suggests that a notion of ownership would be a good idea.	2002-04-17 20:57:57 +00:00
Peter Eisentraut	867901db9e	Locale support is on by default. The choice of locale is done in initdb and/or with GUC variables.	2002-04-03 05:39:33 +00:00
Tom Lane	838fe25a95	Create a new GUC variable search_path to control the namespace search path. The default behavior if no per-user schemas are created is that all users share a 'public' namespace, thus providing behavior backwards compatible with 7.2 and earlier releases. Probably the semantics and default setting will need to be fine-tuned, but this is a start.	2002-04-01 03:34:27 +00:00
Tom Lane	3114102521	Reimplement temp tables using schemas. The temp table map is history; temp table entries in pg_class have the names the user would expect.	2002-03-31 06:26:32 +00:00
Tom Lane	d67442ccfd	Mop-up some infelicities in new relation lookup handling.	2002-03-29 22:10:34 +00:00
Tom Lane	d5e99ab4d6	pg_type has a typnamespace column; system now supports creating types in different namespaces. Also, cleanup work on relation namespace support: drop, alter, rename commands work for tables in non-default namespaces.	2002-03-29 19:06:29 +00:00
Tom Lane	1dbf8aa7a8	pg_class has a relnamespace column. You can create and access tables in schemas other than the system namespace; however, there's no search path yet, and not all operations work yet on tables outside the system namespace.	2002-03-26 19:17:02 +00:00
Tom Lane	01747692fe	Repair two problems with WAL logging of sequence nextvalI() ops, as per recent pghackers discussion: force a new WAL record at first nextval after a checkpoint, and ensure that xlog is flushed to disk if a nextval record is the only thing emitted by a transaction.	2002-03-15 19:20:36 +00:00
Tom Lane	c422b5ca6b	Code review for improved-hashing patch. Fix some portability issues (char != unsigned char, Datum != uint32); make use of new hash code in dynahash hash tables and hash joins.	2002-03-09 17:35:37 +00:00
Bruce Momjian	7ab7467318	I've attached a patch which implements Bob Jenkin's hash function for PostgreSQL. This hash function replaces the one used by hash indexes and the catalog cache. Hash joins use a different, relatively poor-quality hash function, but I'll fix that later. As suggested by Tom Lane, this patch also changes the size of the fixed hash table used by the catalog cache to be a power-of-2 (instead of a prime: I chose 256 instead of 257). This allows the catcache to lookup hash buckets using a simple bitmask. This should improve the performance of the catalog cache slightly, since the previous method (modulo a prime) was slow. In my tests, this improves the performance of hash indexes by between 4% and 8%; the performance when using btree indexes or seqscans is basically unchanged. Neil Conway <neilconway@rogers.com>	2002-03-06 20:49:46 +00:00
Bruce Momjian	92288a1cf9	Change made to elog: o Change all current CVS messages of NOTICE to WARNING. We were going to do this just before 7.3 beta but it has to be done now, as you will see below. o Change current INFO messages that should be controlled by client_min_messages to NOTICE. o Force remaining INFO messages, like from EXPLAIN, VACUUM VERBOSE, etc. to always go to the client. o Remove INFO from the client_min_messages options and add NOTICE. Seems we do need three non-ERROR elog levels to handle the various behaviors we need for these messages. Regression passed.	2002-03-06 06:10:59 +00:00
Bruce Momjian	03194432de	I attach a version of my toast-slicing patch, against current CVS (current as of a few hours ago.) This patch: 1. Adds PG_GETARG_xxx_P_SLICE() macros and associated support routines. 2. Adds routines in src/backend/access/tuptoaster.c for fetching only necessary chunks of a toasted value. (Modelled on latest changes to assume chunks are returned in order). 3. Amends text_substr and bytea_substr to use new methods. It now handles multibyte cases -and should still lead to a performance improvement in the multibyte case where the substring is near the beginning of the string. 4. Added new command: ALTER TABLE tabname ALTER COLUMN colname SET STORAGE {PLAIN \| EXTERNAL \| EXTENDED \| MAIN} to parser and documented in alter-table.sgml. (NB I used ColId as the item type for the storage mode string, rather than a new production - I hope this makes sense!). All this does is sets attstorage for the specified column. 4. AlterTableAlterColumnStatistics is now AlterTableAlterColumnFlags and handles both statistics and storage (it uses the subtype code to distinguish). The previous version of my patch also re-arranged other code in backend/commands/command.c but I have dropped that from this patch.(I plan to return to it separately). 5. Documented new macros (and also the PG_GETARG_xxx_P_COPY macros) in xfunc.sgml. ref/alter_table.sgml also contains documentation for ALTER COLUMN SET STORAGE. John Gray	2002-03-05 05:33:31 +00:00
Bruce Momjian	276fc7ce82	I was digging through the GiST code, and figured I'd fix up some of the "bad smell" in that code. Stuff like function parameters that aren't used, typos in the comments, comparison between signed and unsigned ints, etc. Attached is a pretty trivial patch; it compiles, but beyond that completely untested. Unless anyone sees any problems, please apply for 7.3. Neil Conway	2002-03-05 05:30:40 +00:00
Tom Lane	26ac217173	Catcaches can now store negative entries as well as positive ones, to speed up repetitive failed searches; per pghackers discussion in late January. inval.c logic substantially simplified, since we can now treat inserts and deletes alike as far as inval events are concerned. Some repair work needed in heap_create_with_catalog, which turns out to have been doing CommandCounterIncrement at a point where the new relation has non-self-consistent catalog entries. With the new inval code, that resulted in assert failures during a relcache entry rebuild.	2002-03-03 17:47:56 +00:00
Bruce Momjian	a033daf566	Commit to match discussed elog() changes. Only update is that LOG is now just below FATAL in server_min_messages. Added more text to highlight ordering difference between it and client_min_messages. --------------------------------------------------------------------------- REALLYFATAL => PANIC STOP => PANIC New INFO level the prints to client by default New LOG level the prints to server log by default Cause VACUUM information to print only to the client NOTICE => INFO where purely information messages are sent DEBUG => LOG for purely server status messages DEBUG removed, kept as backward compatible DEBUG5, DEBUG4, DEBUG3, DEBUG2, DEBUG1 added DebugLvl removed in favor of new DEBUG[1-5] symbols New server_min_messages GUC parameter with values: DEBUG[5-1], INFO, NOTICE, ERROR, LOG, FATAL, PANIC New client_min_messages GUC parameter with values: DEBUG[5-1], LOG, INFO, NOTICE, ERROR, FATAL, PANIC Server startup now logged with LOG instead of DEBUG Remove debug_level GUC parameter elog() numbers now start at 10 Add test to print error message if older elog() values are passed to elog() Bootstrap mode now has a -d that requires an argument, like postmaster	2002-03-02 21:39:36 +00:00
Tom Lane	6779c55c22	Clean up BeginCommand and related routines. BeginCommand and EndCommand are now both invoked once per received SQL command (raw parsetree) from pg_exec_query_string. BeginCommand is actually just an empty routine at the moment --- all its former operations have been pushed into tuple receiver setup routines in printtup.c. This makes for a clean distinction between BeginCommand/EndCommand (once per command) and the tuple receiver setup/teardown routines (once per ExecutorRun call), whereas the old code was quite ad hoc. Along the way, clean up the calling conventions for ExecutorRun a little bit.	2002-02-27 19:36:13 +00:00
Bruce Momjian	f5dff44736	I've attached a simple patch which should improve the performance of hashname() and reduce the penalty incured when NAMEDATALEN is increased. I posted this to -hackers a couple days ago, and there haven't been any major complaints. It passes the regression tests. See -hackers for more discussion, as well as the suggestion from Tom Lane on which this patch is based. Unless anyone sees any problems, please apply for 7.3. Cheers, Neil Conway	2002-02-25 04:06:52 +00:00
Tom Lane	7863404417	A bunch of changes aimed at reducing backend startup time... Improve 'pg_internal.init' relcache entry preload mechanism so that it is safe to use for all system catalogs, and arrange to preload a realistic set of system-catalog entries instead of only the three nailed-in-cache indexes that were formerly loaded this way. Fix mechanism for deleting out-of-date pg_internal.init files: this must be synchronized with transaction commit, not just done at random times within transactions. Drive it off relcache invalidation mechanism so that no special-case tests are needed. Cache additional information in relcache entries for indexes (their pg_index tuples and index-operator OIDs) to eliminate repeated lookups. Also cache index opclass info at the per-opclass level to avoid repeated lookups during relcache load. Generalize 'systable scan' utilities originally developed by Hiroshi, move them into genam.c, use in a number of places where there was formerly ugly code for choosing either heap or index scan. In particular this allows simplification of the logic that prevents infinite recursion between syscache and relcache during startup: we can easily switch to heapscans in relcache.c when and where needed to avoid recursion, so IndexScanOK becomes simpler and does not need any expensive initialization. Eliminate useless opening of a heapscan data structure while doing an indexscan (this saves an mdnblocks call and thus at least one kernel call).	2002-02-19 20:11:20 +00:00
Bruce Momjian	c448847378	Add better error text: elog(LOG, "XLogWrite: new log file created - " "consider increasing 'wal_files' in postgresql.conf.");	2002-02-18 05:44:45 +00:00
Tom Lane	028e13bc08	Tweak GiST code to work correctly on machines where 8-byte alignment of pointers is required. Patch from Teodor Sigaev per pghackers discussion. It's an ugly kluge but avoids forcing initdb; we'll put a better fix into 7.3 or later.	2002-02-11 22:41:59 +00:00
Tom Lane	cf97080fa4	TOAST needs to do at least minimal time-qual checking in order not to mess up after an aborted VACUUM FULL, per today's pghackers discussion. Add a suitable HeapTupleSatisfiesToast routine. Remove useless special- case test in HeapTupleSatisfiesVisibility macro for xmax = BootstrapTransactionId; perhaps that was needed at one time, but it's a waste of cycles now, not to mention actively wrong for SnapshotAny. Along the way, add some much-needed comments to tqual.c, and simplify toast_fetch_datum, which no longer needs to assume it may see chunks out-of-order.	2002-01-16 20:29:02 +00:00
Tom Lane	aa00e6134e	Add more sanity-checking to PageAddItem and PageIndexTupleDelete, to prevent spreading of corruption when page header pointers are bad. Merge PageZero into PageInit, since it was never used separately, and remove separate memset calls used at most other PageInit call points. Remove IndexPageCleanup, which wasn't used at all.	2002-01-15 22:14:17 +00:00
Tom Lane	2004337785	Reduce severity of 'XLogFlush: request is not satisfied' error condition, per my proposal of a couple days ago. This will eliminate the unable- to-restart-database class of problem that we have seen reported half a dozen times with 7.1.*.	2002-01-14 17:55:57 +00:00
Tom Lane	3b6cbce458	Add CHECK_FOR_INTERRUPTS() in various strategic spots, per comments from Hiroshi.	2002-01-06 00:37:44 +00:00
Tom Lane	1ccc67600b	Fix race condition that could allow two concurrent transactions to insert the same key into a supposedly unique index. The bug is of low probability, and may not explain any of the recent reports of duplicated rows; but a bug is a bug.	2002-01-01 20:32:37 +00:00
Tom Lane	d3fc362ec2	Ensure that all direct uses of spinlock-protected data structures use 'volatile' pointers to access those structures, so that optimizing compilers will not decide to move the structure accesses outside of the spinlock-acquire-to-spinlock-release sequence. There are no known bugs in these uses at present, but based on bad experience with lwlock.c, it seems prudent to ensure that we protect these other uses too. Per pghackers discussion around 12-Dec. (Note: it should not be necessary to worry about structures protected by LWLocks, since the LWLock acquire and release operations are not inline macros.)	2001-12-28 18:16:43 +00:00
Tom Lane	aed0c29f7e	Fix mispeling ...	2001-12-23 07:25:39 +00:00
Tom Lane	9aa2e7da51	Temporarily dike out GetUndoRecPtr() in checkpoint generation, since we do not use the undo pointer anyway. This is a quick-hack solution for the three-way deadlock condition discussed in pghackers 17-Dec-01. Need to find a better way of doing it.	2001-12-19 19:42:51 +00:00
Tom Lane	cd255bb070	Fix boundary condition in btbulkdelete: don't examine high key in case where rightmost index page splits while we are waiting to obtain exclusive lock on it. Not clear this would actually hurt (probably the callback would always fail), but better safe than sorry. Also, improve comments describing concurrency considerations in this code.	2001-11-23 23:41:54 +00:00
Tom Lane	f6ee99a062	Clean up usage-statistics display code (ShowUsage and friends). StatFp is gone, usage messages now go through elog(DEBUG).	2001-11-10 23:51:14 +00:00
Bruce Momjian	ea08e6cd55	New pgindent run with fixes suggested by Tom. Patch manually reviewed, initdb/regression tests pass.	2001-11-05 17:46:40 +00:00
Tom Lane	fb5f1b2c13	Merge three existing ways of signaling postmaster from child processes, so that only one signal number is used not three. Flags in shared memory tell the reason(s) for the current signal. This method is extensible to handle more signal reasons without chewing up even more signal numbers, but the immediate reason is to keep pg_pwd reloads separate from SIGHUP processing in the postmaster. Also clean up some problems in the postmaster with delayed response to checkpoint status changes --- basically, it wouldn't schedule a checkpoint if it wasn't getting connection requests on a regular basis.	2001-11-04 19:55:31 +00:00
Tom Lane	7d05310828	Fix problem reported by Alex Korn: if a relation has been dropped and recreated since the start of our transaction, our first reference to it errored out because we'd try to reuse our old relcache entry for it. Do this by accepting SI inval messages just before relcache search in heap_openr, so that dead relcache entries will be flushed before we search. Also, break heap_open/openr into two pairs of routines, relation_open(r) and heap_open(r). The relation_open routines make no tests on relkind and so can be used to open anything that has a pg_class entry. The heap_open routines are wrappers that add a relkind test to preserve their established behavior. Use the relation_open routines in several places that had various kluge solutions for opening rels that might be either heap or index rels. Also, remove the old 'heap stats' code that's been superseded by Jan's stats collector, and clean up some inconsistencies in error reporting between the different types of ALTER TABLE.	2001-11-02 16:30:29 +00:00
Tom Lane	bdea97ea95	Add missing #include.	2001-11-01 06:17:01 +00:00
Bruce Momjian	6783b2372e	Another pgindent run. Fixes enum indenting, and improves #endif spacing. Also adds space for one-line comments.	2001-10-28 06:26:15 +00:00
Tom Lane	22d9e91219	Fix a couple of places where lack of parenthesization of a cast causes pgindent to make weird formatting decisions. Easiest fix seems to be to put in the extra parens...	2001-10-25 20:37:30 +00:00
Bruce Momjian	b81844b173	pgindent run on all C files. Java run to follow. initdb/regression tests pass.	2001-10-25 05:50:21 +00:00
Thomas G. Lockhart	9310075a13	Accept an INTERVAL argument for SET TIME ZONE per SQL99. Modified the parser and the SET handlers to use full Node structures rather than simply a character string argument. Implement INTERVAL() YEAR TO MONTH (etc) syntax per SQL99. Does not yet accept the goofy string format that goes along with, but this should be fairly straight forward to fix now as a bug or later as a feature. Implement precision for the INTERVAL() type. Use the typmod mechanism for both of INTERVAL features. Fix the INTERVAL syntax in the parser: opt_interval was in the wrong place. INTERVAL is now a reserved word, otherwise we get reduce/reduce errors. Implement an explicit date_part() function for TIMETZ. Should fix coersion problem with INTERVAL reported by Peter E. Fix up some error messages for date/time types. Use all caps for type names within message. Fix recently introduced side-effect bug disabling 'epoch' as a recognized field for date_part() etc. Reported by Peter E. (??) Bump catalog version number. Rename "microseconds" current transaction time field from ...Msec to ...Usec. Duh! date/time regression tests updated for reference platform, but a few changes will be necessary for others.	2001-10-18 17:30:21 +00:00
Tom Lane	85801a4dbd	Rearrange fmgr.c and relcache so that it's possible to keep FmgrInfo lookup info in the relcache for index access method support functions. This makes a huge difference for dynamically loaded support functions, and should save a few cycles even for built-in ones. Also tweak dfmgr.c so that load_external_function is called only once, not twice, when doing fmgr_info for a dynamically loaded function. All per performance gripe from Teodor Sigaev, 5-Oct-01.	2001-10-06 23:21:45 +00:00
Tom Lane	8a52b893b3	Further cleanup of dynahash.c API, in pursuit of portability and readability. Bizarre '(long *) TRUE' return convention is gone, in favor of just raising an error internally in dynahash.c when we detect hashtable corruption. HashTableWalk is gone, in favor of using hash_seq_search directly, since it had no hope of working with non-LONGALIGNable datatypes. Simplify some other code that was made undesirably grotty by promixity to HashTableWalk.	2001-10-05 17:28:13 +00:00
Tom Lane	5999e78fc4	Another round of cleanups for dynahash.c (maybe it's finally clean of portability issues). Caller-visible data structures are now allocated on MAXALIGN boundaries, allowing safe use of datatypes wider than 'long'. Rejigger hash_create API so that caller specifies size of key and total size of entry, not size of key and size of rest of entry. This simplifies life considerably since each number is just a sizeof(), and padding issues etc. are taken care of automatically.	2001-10-01 05:36:17 +00:00
Tom Lane	1663f33838	Tweak btree page split logic so that when splitting a page that is rightmost on its tree level, we split 2/3 to the left and 1/3 to the new right page, rather than the even split we use elsewhere. The idea is that when faced with a steadily increasing series of inserted keys (such as sequence or timestamp values), we'll end up with a btree that's about 2/3ds full not 1/2 full, which is much closer to the desired steady-state load for a btree. Per suggestion from Ann Harrison of IBPhoenix.	2001-09-29 23:49:51 +00:00
Tom Lane	499abb0c0f	Implement new 'lightweight lock manager' that's intermediate between existing lock manager and spinlocks: it understands exclusive vs shared lock but has few other fancy features. Replace most uses of spinlocks with lightweight locks. All remaining uses of spinlocks have very short lock hold times (a few dozen instructions), so tweak spinlock backoff code to work efficiently given this assumption. All per my proposal on pghackers 26-Sep-01.	2001-09-29 04:02:27 +00:00
Bruce Momjian	818fb55ac4	I have made three changes to the rtree code: one bug fix and two performance improvements. I put an explanation of the changes at http://cs1.cs.nyu.edu/been/postgres-rtree.html The performance improvements are quite significant. All the changes are in the file src/backend/access/rtree/rtree.c I was working with the 7.1.3 code. I'm including the diff output as an attachment. Kenneth Been	2001-09-29 03:46:12 +00:00
Thomas G. Lockhart	6f58115ddd	Measure the current transaction time to milliseconds. Define a new function, GetCurrentTransactionStartTimeUsec() to get the time to this precision. Allow now() and timestamp 'now' to use this higher precision result so we now have fractional seconds in this "constant". Add timestamp without time zone type. Move previous timestamp type to timestamp with time zone. Accept another ISO variant for date/time values: yyyy-mm-ddThh:mm:ss (note the "T" separating the day from hours information). Remove 'current' from date/time types; convert to 'now' in input. Separate time and timetz regression tests. Separate timestamp and timestamptz regression test.	2001-09-28 08:09:14 +00:00
Tom Lane	1481b3b28b	Remove useless test for time field in pg_control being > 0. We don't need this, and it will create a Y2038 failure. Per report from David Wheeler, who is evidently running on a platform where time_t is already negative.	2001-09-26 20:24:02 +00:00
Tom Lane	f2b604ecf4	Add some debugging details to some of the elog(STOP) conditions for WAL. Standardize on %X/%X as the formatting for XLOG position display --- we had a couple of different formats before, and none of 'em were as useful as hex offsets IMHO.	2001-09-06 02:02:48 +00:00
Tom Lane	bc7d37a525	Transaction IDs wrap around, per my proposal of 13-Aug-01. More documentation to come, but the code is all here. initdb forced.	2001-08-26 16:56:03 +00:00
Tom Lane	ca86791a61	Fix portability problem in new CLOG code, per report from Rene Pijlman.	2001-08-25 23:24:39 +00:00
Tom Lane	2589735da0	Replace implementation of pg_log as a relation accessed through the buffer manager with 'pg_clog', a specialized access method modeled on pg_xlog. This simplifies startup (don't need to play games to open pg_log; among other things, OverrideTransactionSystem goes away), should improve performance a little, and opens the door to recycling commit log space by removing no-longer-needed segments of the commit log. Actual recycling is not there yet, but I felt I should commit this part separately since it'd still be useful if we chose not to do transaction ID wraparound.	2001-08-25 18:52:43 +00:00
Peter Eisentraut	968d7733a1	Rename config.h to pg_config.h and os.h to pg_config_os.h, fix a number of places that were including the wrong files.	2001-08-24 14:07:50 +00:00
Tom Lane	7326e78c42	Ensure that all TransactionId comparisons are encapsulated in macros (TransactionIdPrecedes, TransactionIdFollows, etc). First step on the way to transaction ID wrap solution ...	2001-08-23 23:06:38 +00:00
Tom Lane	a54075a6d6	Update GiST for new pg_opclass arrangement (finally a clean solution for haskeytype). Update GiST contrib modules too. Add linear-time split algorithm for R-tree GiST opclass. From Oleg Bartunov and Teodor Sigaev.	2001-08-22 18:24:26 +00:00
Tom Lane	f933766ba7	Restructure pg_opclass, pg_amop, and pg_amproc per previous discussions in pgsql-hackers. pg_opclass now has a row for each opclass supported by each index AM, not a row for each opclass name. This allows pg_opclass to show directly whether an AM supports an opclass, and furthermore makes it possible to store additional information about an opclass that might be AM-dependent. pg_opclass and pg_amop now store "lossy" and "haskeytype" information that we previously expected the user to remember to provide in CREATE INDEX commands. Lossiness is no longer an index-level property, but is associated with the use of a particular operator in a particular index opclass. Along the way, IndexSupportInitialize now uses the syscaches to retrieve pg_amop and pg_amproc entries. I find this reduces backend launch time by about ten percent, at the cost of a couple more special cases in catcache.c's IndexScanOK. Initial work by Oleg Bartunov and Teodor Sigaev, further hacking by Tom Lane. initdb forced.	2001-08-21 16:36:06 +00:00
Tom Lane	bf56f0759b	Make OIDs optional, per discussions in pghackers. WITH OIDS is still the default, but OIDS are removed from many system catalogs that don't need them. Some interesting side effects: TOAST pointers are 20 bytes not 32 now; pg_description has a three-column key instead of one. Bugs fixed in passing: BINARY cursors work again; pg_class.relhaspkey has some usefulness; pg_dump dumps comments on indexes, rules, and triggers in a valid order. initdb forced.	2001-08-10 18:57:42 +00:00
Bruce Momjian	13923be7c8	1. null-safe interface to GiST (as proposed in http://fts.postgresql.org/db/mw/msg.html?mid=1028327) 2. support for 'pass-by-value' arguments - to test this we used special opclass for int4 with values in range [0-2^15] More testing will be done after resolving problem with index_formtuple and implementation of B-tree using GiST 3. small patch to contrib modules (seg,cube,rtree_gist,intarray) - mark functions as 'isstrict' where needed. Oleg Bartunov	2001-08-10 14:34:28 +00:00
Tom Lane	94cb3fd875	Suppress gcc warning in USE_LOCALE case.	2001-07-22 22:01:04 +00:00
Tom Lane	7d4d5c00f0	Arrange to recycle old XLOG log segment files as new segment files, rather than deleting them only to have to create more. Steady state is 2*CHECKPOINT_SEGMENTS + WAL_FILES + 1 segment files, which will simply be renamed rather than constantly deleted and recreated. To make this safe, added current XLOG file/offset number to page header of XLOG pages, so that an un-overwritten page from an old incarnation of a logfile can be reliably told from a valid page. This change means that if you try to restart postmaster in a CVS-tip database after installing the change, you'll get a complaint about bad XLOG page magic number. If you don't want to initdb, run contrib/pg_resetxlog (and be sure you shut down the old postmaster cleanly).	2001-07-19 02:12:35 +00:00
Tom Lane	ed5c4e4a14	Improve documentation about reasoning behind the order of operations in GetSnapshotData, GetNewTransactionId, CommitTransaction, AbortTransaction, etc. Correct race condition in transaction status testing in HeapTupleSatisfiesVacuum --- this wasn't important for old VACUUM with exclusive lock on its table, but it sure is important now. All per pghackers discussion 7/11/01 and 7/12/01.	2001-07-16 22:43:34 +00:00
Tom Lane	c8076f09d2	Restructure index AM interface for index building and index tuple deletion, per previous discussion on pghackers. Most of the duplicate code in different AMs' ambuild routines has been moved out to a common routine in index.c; this means that all index types now do the right things about inserting recently-dead tuples, etc. (I also removed support for EXTEND INDEX in the ambuild routines, since that's about to go away anyway, and it cluttered the code a lot.) The retail indextuple deletion routines have been replaced by a "bulk delete" routine in which the indexscan is inside the access method. I haven't pushed this change as far as it should go yet, but it should allow considerable simplification of the internal bookkeeping for deletions. Also, add flag columns to pg_am to eliminate various hardcoded tests on AM OIDs, and remove unused pg_am columns. Fix rtree and gist index types to not attempt to store NULLs; before this, gist usually crashed, while rtree managed not to crash but computed wacko bounding boxes for NULL entries (which might have had something to do with the performance problems we've heard about occasionally). Add AtEOXact routines to hash, rtree, and gist, all of which have static state that needs to be reset after an error. We discovered this need long ago for btree, but missed the other guys. Oh, one more thing: concurrent VACUUM is now the default.	2001-07-15 22:48:19 +00:00
Tom Lane	20ca834ce9	Minor code cleanup/beautification in RelationPutHeapTuple.	2001-07-13 22:52:58 +00:00
Tom Lane	b9f3a929ee	Create a new HeapTupleSatisfiesVacuum() routine in tqual.c that embodies the validity checking rules for VACUUM. Make some other rearrangements of the VACUUM code to allow more code to be shared between full and lazy VACUUM. Minor code cleanups and added comments for TransactionId manipulations.	2001-07-12 04:11:13 +00:00
Tom Lane	55432fedd2	Implement LockBufferForCleanup(), which will allow concurrent VACUUM to wait until it's safe to remove tuples and compact free space in a shared buffer page. Miscellaneous small code cleanups in bufmgr, too.	2001-07-06 21:04:26 +00:00
Hiroshi Inoue	852a26f79e	Fix my old fault(returns auto variable reference).	2001-07-06 09:41:36 +00:00
Tom Lane	af5ced9cfd	Further work on connecting the free space map (which is still just a stub) into the rest of the system. Adopt a cleaner approach to preventing deadlock in concurrent heap_updates: allow RelationGetBufferForTuple to select any page of the rel, and put the onus on it to lock both buffers in a consistent order. Remove no-longer-needed isExtend hack from API of ReleaseAndReadBuffer.	2001-06-29 21:08:25 +00:00
Tom Lane	fb2c3289ff	Repair logic error for multi-key indexes. From Oleg Bartunov.	2001-06-28 16:00:07 +00:00
Tom Lane	e0c9301c87	Install infrastructure for shared-memory free space map. Doesn't actually do anything yet, but it has the necessary connections to initialization and so forth. Make some gestures towards allowing number of blocks in a relation to be BlockNumber, ie, unsigned int, rather than signed int. (I doubt I got all the places that are sloppy about it, yet.) On the way, replace the hardwired NLOCKS_PER_XACT fudge factor with a GUC variable.	2001-06-27 23:31:40 +00:00
Tom Lane	4d58a7ca87	Optimizer can now estimate selectivity of IS NULL, IS NOT NULL, IS TRUE, etc, with some degree of verisimilitude. Split out selectivity support functions from builtins.h into a new header file selfuncs.h, so as to reduce the number of header files builtins.h must depend on. Fix a few missing inclusions exposed thereby. From Joe Conway, with some kibitzing from Tom Lane.	2001-06-25 21:11:45 +00:00
Jan Wieck	8d80b0d980	Statistical system views (yet without the config stuff, but it's hard to keep such massive changes in sync with the tree so I need to get it in and work from there now). Jan	2001-06-22 19:16:24 +00:00
Tom Lane	695e575470	Tweak error message.	2001-06-21 19:45:45 +00:00
Tom Lane	bbbc00af88	Clean up some longstanding problems in shared-cache invalidation. SI messages now include the relevant database OID, so that operations in one database do not cause useless cache flushes in backends attached to other databases. Declare SI messages properly using a union, to eliminate the former assumption that Oid is the same size as int or Index. Rewrite the nearly-unreadable code in inval.c, and document it better. Arrange for catcache flushes at end of command/transaction to happen before relcache flushes do --- this avoids loading a new tuple into the catcache while setting up new relcache entry, only to have it be flushed again immediately.	2001-06-19 19:42:16 +00:00
Tom Lane	1d584f97b9	Clean up various to-do items associated with system indexes: pg_database now has unique indexes on oid and on datname. pg_shadow now has unique indexes on usename and on usesysid. pg_am now has unique index on oid. pg_opclass now has unique index on oid. pg_amproc now has unique index on amid+amopclaid+amprocnum. Remove pg_rewrite's unnecessary index on oid, delete unused RULEOID syscache. Remove index on pg_listener and associated syscache for performance reasons (caching rows that are certain to change before you need 'em again is rather pointless). Change pg_attrdef's nonunique index on adrelid into a unique index on adrelid+adnum. Fix various incorrect settings of pg_class.relisshared, make that the primary reference point for whether a relation is shared or not. IsSharedSystemRelationName() is now only consulted to initialize relisshared during initial creation of tables and indexes. In theory we might now support shared user relations, though it's not clear how one would get entries for them into pg_class &etc of multiple databases. Fix recently reported bug that pg_attribute rows created for an index all have the same OID. (Proof that non-unique OID doesn't matter unless it's actually used to do lookups ;-)) There's no need to treat pg_trigger, pg_attrdef, pg_relcheck as bootstrap relations. Convert them into plain system catalogs without hardwired entries in pg_class and friends. Unify global.bki and template1.bki into a single init script postgres.bki, since the alleged distinction between them was misleading and pointless. Not to mention that it didn't work for setting up indexes on shared system relations. Rationalize locking of pg_shadow, pg_group, pg_attrdef (no need to use AccessExclusiveLock where ExclusiveLock or even RowExclusiveLock will do). Also, hold locks until transaction commit where necessary.	2001-06-12 05:55:50 +00:00
Tom Lane	88e948216c	Nest macros with slightly less enthusiasm, for performance and to avoid having non-gcc compilers spit up.	2001-06-11 05:00:56 +00:00
Tom Lane	bdadc9bf1c	Remove RelationGetBufferWithBuffer(), which is horribly confused about appropriate pin-count manipulation, and instead use ReleaseAndReadBuffer. Make use of the fact that the passed-in buffer (if there is one) must be pinned to avoid grabbing the bufmgr spinlock when we are able to return this same buffer. Eliminate unnecessary 'previous tuple' and 'next tuple' fields of HeapScanDesc and IndexScanDesc, thereby removing a whole lot of bookkeeping from heap_getnext() and related routines.	2001-06-09 18:16:59 +00:00
Tom Lane	1173344e74	Adjust WAL code so that checkpoints truncate the xlog at the previous checkpoint's redo pointer, not its undo pointer, per discussion in pghackers a few days ago. No point in hanging onto undo information until we have the ability to do something with it --- and this solves a rather large problem with log space for long-running transactions. Also, change all calls of write() to detect the case where write returned a count less than requested, but failed to set errno. Presume that this situation indicates ENOSPC, and give the appropriate error message, rather than a random message associated with the previous value of errno.	2001-06-06 17:07:46 +00:00
Peter Eisentraut	12c1552066	Mark many strings in backend not covered by elog for translation. Also, make strings in xlog.c look more like English and less like binary noise.	2001-06-03 14:53:56 +00:00
Tom Lane	0b370ea7c8	Clean up some minor problems exposed by further thought about Panon's bug report on old-style functions invoked by RI triggers. We had a number of other places that were being sloppy about which memory context FmgrInfo subsidiary data will be allocated in. Turns out none of them actually cause a problem in 7.1, but this is for arcane reasons such as the fact that old-style triggers aren't supported anyway. To avoid getting burnt later, I've restructured the trigger support so that we don't keep trigger FmgrInfo structs in relcache memory. Some other related cleanups too: it's not really necessary to call fmgr_info at all while setting up the index support info in relcache entries, because those ScanKeyEntry structs are never used to invoke the functions. This should speed up relcache initialization a tiny bit.	2001-06-01 02:41:36 +00:00
Tom Lane	3043810d97	Updates to make GIST work with multi-key indexes (from Oleg Bartunov and Teodor Sigaev). Declare key values as Datum where appropriate, rather than char* (Tom Lane).	2001-05-31 18:16:55 +00:00
Tom Lane	f1d5d0905c	Tweak StrategyEvaluation data structure to eliminate hardwired limit on number of strategies supported by an index AM. Add missing copyright notices and CVS $Header$ markers to GIST source files.	2001-05-30 19:53:40 +00:00
Bruce Momjian	33f2614aa1	Remove SEP_CHAR, replace with / or '/' as appropriate.	2001-05-30 14:15:27 +00:00
Bruce Momjian	f6923ff3ac	Oops, only wanted python change in the last commit. Backing out.	2001-05-25 15:45:34 +00:00
Bruce Momjian	dffb673692	While changing Cygwin Python to build its core as a DLL (like Win32 Python) to support shared extension modules, I have learned that Guido prefers the style of the attached patch to solve the above problem. I feel that this solution is particularly appropriate in this case because the following: PglargeType PgType PgQueryType are already being handled in the way that I am proposing for PgSourceType. Jason Tishler	2001-05-25 15:34:50 +00:00
Bruce Momjian	f08245cfe3	I found the answer to this: the partition had filled up, and so the problem was lack of disk space. Oliver Elphick	2001-05-22 16:52:49 +00:00
Bruce Momjian	dc0ff5c67a	Small code cleanups,formatting.	2001-05-18 21:24:20 +00:00
Bruce Momjian	2d7795ebb4	Prevent forced blank line before comment block in pgindent.	2001-05-17 15:55:24 +00:00
Bruce Momjian	e044fc0599	Spacing cleanup.	2001-05-17 15:22:12 +00:00
Bruce Momjian	806aba49fd	Small cleanup of spacing.	2001-05-17 14:59:31 +00:00
Tom Lane	27336e4f7a	Repair race condition introduced into heap_update() in 7.1 --- PageGetFreeSpace() was being called while not holding the buffer lock, which not only could yield a garbage answer, but even if it's the right answer there might be less space available after we reacquire the buffer lock. Also repair potential deadlock introduced by my recent performance improvement in RelationGetBufferForTuple(): it was possible for two heap_updates to try to lock two buffers in opposite orders. The fix creates a global rule that buffers of a single heap relation should be locked in decreasing block number order. Currently, this only applies to heap_update; VACUUM can get away with ignoring the rule since it holds exclusive lock on the whole relation anyway. However, if we try to implement a VACUUM that can run in parallel with other transactions, VACUUM will also have to obey the lock order rule.	2001-05-16 22:35:12 +00:00
Bruce Momjian	d0e1091cfd	we found a problem in GiST with massive insert/update operations with many NULLs ( inserting of NULL into indexed field cause ERROR: MemoryContextAlloc: invalid request size) As a workaround 'vacuum analyze' could be used. This patch resolves the problem, please upply to 7.1.1 sources and current cvs tree. Oleg Bartunov	2001-05-15 14:14:49 +00:00
Bruce Momjian	ed6998d0b3	Re-add pg_index.indhaskeytype.	2001-05-15 03:49:35 +00:00
Bruce Momjian	783fbdab70	Remove columns pg_index.haskeytype and pg_index.indisclustered. Not used.	2001-05-14 21:53:16 +00:00
Bruce Momjian	1e7b79cebc	Remove unused tables pg_variable, pg_inheritproc, pg_ipl tables. Initdb forced.	2001-05-14 20:30:21 +00:00
Tom Lane	eedb7d18fa	Modify RelationGetBufferForTuple() so that we only do lseek and lock when we need to move to a new page; as long as we can insert the new tuple on the same page as before, we only need LockBuffer and not the expensive stuff. Also, twiddle bufmgr interfaces to avoid redundant lseeks in RelationGetBufferForTuple and BufferAlloc. Successive inserts now require one lseek per page added, rather than one per tuple with several additional ones at each page boundary as happened before. Lock contention when multiple backends are inserting in same table is also greatly reduced.	2001-05-12 19:58:28 +00:00
Tom Lane	f905d65ee3	Rewrite of planner statistics-gathering code. ANALYZE is now available as a separate statement (though it can still be invoked as part of VACUUM, too). pg_statistic redesigned to be more flexible about what statistics are stored. ANALYZE now collects a list of several of the most common values, not just one, plus a histogram (not just the min and max values). Random sampling is used to make the process reasonably fast even on very large tables. The number of values and histogram bins collected is now user-settable via an ALTER TABLE command. There is more still to do; the new stats are not being used everywhere they could be in the planner. But the remaining changes for this project should be localized, and the behavior is already better than before. A not-very-related change is that sorting now makes use of btree comparison routines if it can find one, rather than invoking '<' twice.	2001-05-07 00:43:27 +00:00
Tom Lane	e2e19ca0cd	Seems like we should not hold off cancel/die interrupts while we are running deferred triggers. They are really part of the regular transaction, and they could take awhile.	2001-05-04 18:39:16 +00:00
Tom Lane	2792374cff	Ensure that btree sort ordering functions and boolean comparison operators give consistent results for all datatypes. Types float4, float8, and numeric were broken for NaN values; abstime, timestamp, and interval were broken for INVALID values; timetz was just plain broken (some possible pairs of values were neither < nor = nor >). Also clean up text, bpchar, varchar, and bit/varbit to eliminate duplicate code and thereby reduce the probability of similar inconsistencies arising in the future.	2001-05-03 19:00:37 +00:00
Tom Lane	f10596c3ec	Fix comment that Vadim found confusing.	2001-04-05 16:55:21 +00:00
Vadim B. Mikheev	3092869233	StartupXLOG(): initialize XLogCtl->Insert to new page if there is no room for a record on last log page.	2001-04-05 09:34:32 +00:00
Tom Lane	ccd415c63f	Fix unportable assumptions about alignment of local char[n] variables.	2001-03-25 23:23:59 +00:00
Tom Lane	00713cb7cb	Fix code that incorrectly assumed a 'char foo[N]' local variable would be aligned on a word boundary. Per report from Steve Nicolai.	2001-03-25 00:45:20 +00:00
Bruce Momjian	7cf952e7b4	Fix comments that were mis-wrapped, for Tom Lane.	2001-03-23 04:49:58 +00:00
Bruce Momjian	0686d49da0	Remove dashes in comments that don't need them, rewrap with pgindent.	2001-03-22 06:16:21 +00:00
Bruce Momjian	9e1552607a	pgindent run. Make it all clean.	2001-03-22 04:01:46 +00:00
Tom Lane	af6e88a9cf	Remove NEXTXID xlog record type to avoid three-way deadlock risk. NEXTXID isn't really necessary, per previous discussion in pghackers, but I mulishy insisted we should put it in anyway. Mea culpa.	2001-03-18 20:18:59 +00:00
Tom Lane	ae293d33cf	Make sure ControlFile logId/logSeg don't go backwards (barely possible given a slow backend, if we update unconditionally as the code did before).	2001-03-18 00:30:27 +00:00
Tom Lane	5a38af7fd8	Rearrange XLogFileInit so that control-file spinlock is not held while filling the new log file with zeroes, only while renaming it into place. This should prevent problems with 'stuck spinlock' errors under heavy load.	2001-03-17 20:54:13 +00:00
Tom Lane	9d645fd84c	Support syncing WAL log to disk using either fsync(), fdatasync(), O_SYNC, or O_DSYNC (as available on a given platform). Add GUC parameter to control sync method. Also, add defense to XLogWrite to prevent it from going nuts if passed a target write position that's past the end of the buffers so far filled by XLogInsert.	2001-03-16 05:44:33 +00:00
Tom Lane	cfab4f6541	Use SEP_CHAR consistently in forming XLOG pathnames.	2001-03-14 20:23:04 +00:00
Tom Lane	1b87e24c4a	Change xlog page-header format to include StartUpID. Use the SUI to detect case that next page in log came from an older run than the prior page. This avoids the necessity to re-zero the log after recovery from a crash, which is good because we need not risk destroying valuable log information. This forces another initdb since yesterday :-(. Need to get that log reset utility done...	2001-03-13 20:32:37 +00:00
Tom Lane	4d14fe0048	XLOG (and related) changes: * Store two past checkpoint locations, not just one, in pg_control. On startup, we fall back to the older checkpoint if the newer one is unreadable. Also, a physical copy of the newest checkpoint record is kept in pg_control for possible use in disaster recovery (ie, complete loss of pg_xlog). Also add a version number for pg_control itself. Remove archdir from pg_control; it ought to be a GUC parameter, not a special case (not that it's implemented yet anyway). * Suppress successive checkpoint records when nothing has been entered in the WAL log since the last one. This is not so much to avoid I/O as to make it actually useful to keep track of the last two checkpoints. If the things are right next to each other then there's not a lot of redundancy gained... * Change CRC scheme to a true 64-bit CRC, not a pair of 32-bit CRCs on alternate bytes. Polynomial borrowed from ECMA DLT1 standard. * Fix XLOG record length handling so that it will work at BLCKSZ = 32k. * Change XID allocation to work more like OID allocation. (This is of dubious necessity, but I think it's a good idea anyway.) * Fix a number of minor bugs, such as off-by-one logic for XLOG file wraparound at the 4 gig mark. * Add documentation and clean up some coding infelicities; move file format declarations out to include files where planned contrib utilities can get at them. * Checkpoint will now occur every CHECKPOINT_SEGMENTS log segments or every CHECKPOINT_TIMEOUT seconds, whichever comes first. It is also possible to force a checkpoint by sending SIGUSR1 to the postmaster (undocumented feature...) * Defend against kill -9 postmaster by storing shmem block's key and ID in postmaster.pid lockfile, and checking at startup to ensure that no processes are still connected to old shmem block (if it still exists). * Switch backends to accept SIGQUIT rather than SIGUSR1 for emergency stop, for symmetry with postmaster and xlog utilities. Clean up signal handling in bootstrap.c so that xlog utilities launched by postmaster will react to signals better. * Standalone bootstrap now grabs lockfile in target directory, as added insurance against running it in parallel with live postmaster.	2001-03-13 01:17:06 +00:00
Tom Lane	b109b03fea	Repair a number of places that didn't bother to check whether PageAddItem succeeds or not. Revise rtree page split algorithm to take care about making a feasible split --- ie, will the incoming tuple actually fit? Failure to make a feasible split, combined with failure to notice the failure, account for Jim Stone's recent bug report. I suspect that hash and gist indices may have the same type of bug, but at least now we'll get error messages rather than silent failures if so. Also clean up rtree code to use Datum rather than char* where appropriate.	2001-03-07 21:20:26 +00:00
Tom Lane	9c9936587c	Implement COMMIT_SIBLINGS parameter to allow pre-commit delay to occur only if at least N other backends currently have open transactions. This is not a great deal of intelligence about whether a delay might be profitable ... but it beats no intelligence at all. Note that the default COMMIT_DELAY is still zero --- this new code does nothing unless that setting is changed. Also, mark ENABLEFSYNC as a system-wide setting. It's no longer safe to allow that to be set per-backend, since we may be relying on some other backend's fsync to have synced the WAL log.	2001-02-26 00:50:08 +00:00
Bruce Momjian	4f6c49fef0	Clean up index/btree comments/macros, as approved.	2001-02-22 21:48:49 +00:00
Hiroshi Inoue	50e3c60b95	Avoid 'FATAL: out of free buffers: time to abort !" error during WAL recovery. Recovery failure is always serious.	2001-02-22 08:59:40 +00:00
Tom Lane	57e0847180	Change default commit_delay to zero, update documentation.	2001-02-18 04:50:43 +00:00
Tom Lane	33cc5d8a4d	Change s_lock to not use any zero-delay select() calls; these are just a waste of cycles on single-CPU machines, and of dubious utility on multi-CPU machines too. Tweak s_lock_stuck so that caller can specify timeout interval, and increase interval before declaring stuck spinlock for buffer locks and XLOG locks. On systems that have fdatasync(), use that rather than fsync() to sync WAL log writes. Ensure that WAL file is entirely allocated during XLogFileInit.	2001-02-18 04:39:42 +00:00
Tom Lane	059e361481	Although we can't support out-of-line TOAST storage in indexes (yet), compressed storage works perfectly well. Might as well have a coherent strategy for applying it, rather than the haphazard store-what-you-get approach that was in the code before. The strategy I've set up here is to attempt compression of any compressible index value exceeding BLCKSZ/16, or about 500 bytes by default.	2001-02-15 20:57:01 +00:00
Vadim B. Mikheev	7e04843ba7	Comments about GetFreeXLBuffer(). GetFreeXLBuffer(): use Insert->LgwrResult instead of private LgwrResult copy if it's more fresh (attempt to avoid acquiring info_lck/lgwr_lck).	2001-02-13 20:40:25 +00:00
Vadim B. Mikheev	35273825dc	Removed abort() in XLogFileOpen.	2001-02-13 08:44:09 +00:00
Tom Lane	7fdca53711	When updating a tuple containing compressed-in-line fields, do not decompress the existing fields unnecessarily.	2001-02-09 17:30:03 +00:00
Vadim B. Mikheev	c19dadbf08	Runtime btree recovery is now ON by default.	2001-02-07 23:35:33 +00:00
Vadim B. Mikheev	b18c09ee3a	Runtime tree recovery is implemented, just testing is left -:)	2001-02-02 19:49:15 +00:00
Vadim B. Mikheev	dca0762efc	Couple additional functions to fix tree at runtime. Need in one more function to handle "my bits moved..." case. FixBTree is still FALSE.	2001-01-31 01:08:36 +00:00
Vadim B. Mikheev	598a12722a	Call _bt_fixroot() from _bt_insertonpg.	2001-01-29 07:28:17 +00:00
Tom Lane	0d54d6ac44	Clean up handling of tuple descriptors so that result-tuple descriptors allocated by plan nodes are not leaked at end of query. This doesn't really matter for normal queries, but it sure does for queries invoked repetitively inside SQL functions. Clean up some other grotty code associated with tupdescs, and fix a few other memory leaks exposed by tests with simple SQL functions.	2001-01-29 00:39:20 +00:00
Vadim B. Mikheev	c6e6d292bc	First step in attempt to fix tree at runtime: create upper levels and new root page if old root one was splitted but new root page wasn't created. New code is protected by FixBTree bool flag setted to FALSE, so nothing should be affected by this untested approach.	2001-01-26 01:24:31 +00:00
Bruce Momjian	623bf843d2	Change Copyright from PostgreSQL, Inc to PostgreSQL Global Development Group.	2001-01-24 19:43:33 +00:00
Tom Lane	4e27b308e2	Do _bt_wrtbuf() outside critical section, per discussion with Vadim 1/19.	2001-01-23 23:29:22 +00:00
Tom Lane	786f1a59cd	Fix all the places that called heap_update() and heap_delete() without bothering to check the return value --- which meant that in case the update or delete failed because of a concurrent update, you'd not find out about it, except by observing later that the transaction produced the wrong outcome. There are now subroutines simple_heap_update and simple_heap_delete that should be used anyplace that you're not prepared to do the full nine yards of coping with concurrent updates. In practice, that seems to mean absolutely everywhere but the executor, because noplace else was checking.	2001-01-23 04:32:23 +00:00
Tom Lane	6ce0ed2813	Make critical sections (elog->crash) and interrupt holdoff sections into distinct concepts, per recent discussion on pghackers.	2001-01-19 22:08:47 +00:00
Vadim B. Mikheev	0a12767004	Comment out xlrec in xact_redo - no support for file unlinking on commit yet.	2001-01-18 18:33:45 +00:00
Tom Lane	efd6cade83	Tweak heap_update/delete so that we do not hold the buffer context lock on the old tuple's page while we are doing TOAST pushups.	2001-01-15 05:29:19 +00:00
Tom Lane	36839c1927	Restructure backend SIGINT/SIGTERM handling so that 'die' interrupts are treated more like 'cancel' interrupts: the signal handler sets a flag that is examined at well-defined spots, rather than trying to cope with an interrupt that might happen anywhere. See pghackers discussion of 1/12/01.	2001-01-14 05:08:17 +00:00
Tom Lane	6162432de9	Add more critical-section calls: all code sections that hold spinlocks are now critical sections, so as to ensure die() won't interrupt us while we are munging shared-memory data structures. Avoid insecure intermediate states in some code that proc_exit will call, like palloc/pfree. Rename START/END_CRIT_CODE to START/END_CRIT_SECTION, since that seems to be what people tend to call them anyway, and make them be called with () like a function call, in hopes of not confusing pg_indent. I doubt that this is sufficient to make SIGTERM safe anywhere; there's just too much code that could get invoked during proc_exit().	2001-01-12 21:54:01 +00:00
Marc G. Fournier	0ad7db4be4	New feature: 1. Support of variable size keys - new algorithm of insertion to tree (GLI - gist layrered insertion). Previous algorithm was implemented as described in paper by Joseph M. Hellerstein et.al "Generalized Search Trees for Database Systems". This (old) algorithm was not suitable for variable size keys and could be not effective ( walking up-down ) in case of multiple levels split Bug fixed: 1. fixed bug in gistPageAddItem - key values were written to disk uncompressed. This caused failure if decompression function does real job. 2. NULLs handling - we keep NULLs in tree. Right way is to remove them, but we don't know how to inform vacuum about index statistics. This is just cosmetic warning message (like in case with R-Tree), but I'm not sure how to recognize real problem if we remove NULLs and suppress this warning as Tom suggested. 3. various memory leaks This work was done by Teodor Sigaev (teodor@stack.net) and Oleg Bartunov (oleg@sai.msu.su).	2001-01-12 00:12:58 +00:00
Vadim B. Mikheev	4b59366e57	1. Checkpoint.undo may be after checkpoint itself: - no more elog(STOP) in StartupXLOG(); - both checkpoint' undo & redo are used to define oldest on-line log file. 2. Ability to pre-allocate a few log files at checkpoint time (wal_files option). Off by default.	2001-01-09 06:24:33 +00:00
Tom Lane	a4ddbbd1a4	Correct nasty error in heap_update: it was releasing the buffer refcount before calling RelationInvalidateHeapTuple(), which is bad because the latter needs to look at the tuple data, which is in the shared disk buffer. If another backend manages to recycle the buffer while this is going on, we will compute the wrong hashindex for the tuple or maybe even crash outright. Must hold buffer refcount until afterwards. (This bug is not in 7.0.*; seems to be have introduced during WAL changes.)	2001-01-07 22:14:31 +00:00
Tom Lane	1b8a219eef	Clean up non-reentrant interface for hash_seq/HashTableWalk, so that starting a new hashtable search no longer clobbers any other search active anywhere in the system. Fix RelationCacheInvalidate() so that it will not crash or go into an infinite loop if invoked recursively, as for example by a second SI Reset message arriving while we are still processing a prior one.	2001-01-02 04:33:24 +00:00
Vadim B. Mikheev	3e059b3802	1. WAL needs in zero-ed content of newly initialized page. 2. Log record for PageRepaireFragmentation now keeps array of !LP_USED offnums to redo cleanup properly.	2000-12-30 15:19:57 +00:00
Vadim B. Mikheev	c193f19a39	Fixed misprint in heap update WALoging.	2000-12-30 06:52:34 +00:00
Tom Lane	7f60b81e1a	Fix failure in CreateCheckPoint on some Alpha boxes --- it's not OK to assume that TAS() will always succeed the first time, even if the lock is known to be free. Also, make sure that code will eventually time out and report a stuck spinlock, rather than looping forever. Small cleanups in s_lock.h, too.	2000-12-29 21:31:21 +00:00
Vadim B. Mikheev	7d363c4c33	MUST update (in-memory) data page BEFORE XLogInsert to log NEW page content if WAL will decide to backup page.	2000-12-29 20:47:17 +00:00
Vadim B. Mikheev	b3c4f03c9c	nbtree_xlog_newroot: set meta flag in meta page opaque.	2000-12-29 08:08:59 +00:00
Vadim B. Mikheev	7ceeeb662f	New WAL version - CRC and data blocks backup.	2000-12-28 13:00:29 +00:00
Tom Lane	8609d4abf2	Fix portability problems recently exposed by regression tests on Alphas. 1. Distinguish cases where a Datum representing a tuple datatype is an OID from cases where it is a pointer to TupleTableSlot, and make sure we use the right typlen in each case. 2. Make fetchatt() and related code support 8-byte by-value datatypes on machines where Datum is 8 bytes. Centralize knowledge of the available by-value datatype sizes in two macros in tupmacs.h, so that this will be easier if we ever have to do it again.	2000-12-27 23:59:14 +00:00
Tom Lane	6cc842abd3	Revise lock manager to support "session level" locks as well as "transaction level" locks. A session lock is not released at transaction commit (but it is released on transaction abort, to ensure recovery after an elog(ERROR)). In VACUUM, use a session lock to protect the master table while vacuuming a TOAST table, so that the TOAST table can be done in an independent transaction. I also took this opportunity to do some cleanup and renaming in the lock code. The previously noted bug in ProcLockWakeup, that it couldn't wake up any waiters beyond the first non-wakeable waiter, is now fixed. Also found a previously unknown bug of the same kind (failure to scan all members of a lock queue in some cases) in DeadLockCheck. This might have led to failure to detect a deadlock condition, resulting in indefinite waits, but it's difficult to characterize the conditions required to trigger a failure.	2000-12-22 00:51:54 +00:00
Bruce Momjian	1f159e562b	>> Here is a patch for the beos port (All regression tests are OK). >> xlog.c : special case for beos to avoid 'link' which does not work yet >> beos/sem.c : implementation of new sem_ctl call (GETPID) and a new >sem_op >> flag (IPCNOWAIT) >> dynloader/beos.c : add a verification of symbol validity (seem that the >> loader sometime return OK with an invalid symbol) >> postmaster.c : add beos forking support for the new checkpoint process >> postgres.c : remove beos special case for getrusage >> beos.h : Correction of a bas definition of AF_UNIX, misc defnitions >> >> >> thanks >> >> >> cyril Cyril VELTER	2000-12-18 18:45:05 +00:00
Tom Lane	a626b78c89	Clean up backend-exit-time cleanup behavior. Use on_shmem_exit callbacks to ensure that we have released buffer refcounts and so forth, rather than putting ad-hoc operations before (some of the calls to) proc_exit. Add commentary to discourage future hackers from repeating that mistake.	2000-12-18 00:44:50 +00:00
Vadim B. Mikheev	5bb4f723d2	Remove elog for online log files.	2000-12-11 19:27:42 +00:00
Vadim B. Mikheev	dae369d390	elog(LOG)-->elog(DEBUG) for skipped logs.	2000-12-11 18:02:25 +00:00
Hiroshi Inoue	6ef0219c34	Resolve complie error(was my fault).	2000-12-11 09:14:03 +00:00
Hiroshi Inoue	a8824ff257	redo: Heap move neglects to set t_cmin for MOVED_IN tuples.	2000-12-11 05:25:23 +00:00
Tom Lane	376784cf8a	Repair erroneous use of hashvarlena() for MACADDR, which is not a varlena type. (I did not force initdb, but you won't see the fix unless you do one.) Also, make sure all index support operators and functions are careful not to leak memory for toasted inputs; I had missed some hash and rtree support ops on this point before.	2000-12-08 23:57:03 +00:00
Tom Lane	fb47385fc8	Resurrect -F switch: it controls fsyncs again, though the fsyncs are mostly just on the WAL logfile nowadays. But if people want to disable fsync for performance, why should we say no?	2000-12-08 22:21:33 +00:00
Hiroshi Inoue	8bb4dab94d	RecordTransactionAbort() shouldn't log XLOG_XACT_ABORT if the transaction has already been committed ?	2000-12-07 10:03:46 +00:00
Tom Lane	06dde51ef0	Silence compiler warning.	2000-12-07 02:04:30 +00:00
Vadim B. Mikheev	65b362fae1	Disable elog(ERROR\|FATAL) in signal handlers in critical sections of code.	2000-12-03 10:27:29 +00:00
Tom Lane	217d1566bf	Make tuple receive/print routines TOAST-aware. Formerly, printtup would leak memory when printing a toasted attribute, and printtup_internal didn't work at all...	2000-12-01 22:10:31 +00:00
Tom Lane	1f5cc8c78a	Remove VARLENA_FIXED_SIZE hack, which is irreversibly broken now that both MULTIBYTE and TOAST prevent char(n) from being truly fixed-size. Simplify and speed up fastgetattr() and index_getattr() macros by eliminating special cases for attnum=1. It's just as fast to handle the first attribute by presetting its attcacheoff to zero; so do that instead when loading the tupledesc in relcache.c.	2000-11-30 18:38:47 +00:00
Vadim B. Mikheev	81c8c244b2	No more #ifdef XLOG.	2000-11-30 08:46:26 +00:00
Vadim B. Mikheev	741510521c	XLOG stuff for sequences. CommitDelay in guc.c	2000-11-30 01:47:33 +00:00
Tom Lane	680b7357ce	Rearrange bufmgr header files so that buf_internals.h need not be included by everything that includes bufmgr.h --- it's supposed to be internals, after all, not part of the API! This fixes the conflict against FreeBSD headers reported by Rosenman, by making it unnecessary for s_lock.h to be included by plperl.c.	2000-11-30 01:39:08 +00:00
Tom Lane	c715fdea26	Significant cleanups in SysV IPC handling (shared mem and semaphores). IPC key assignment will now work correctly even when multiple postmasters are using same logical port number (which is possible given -k switch). There is only one shared-mem segment per postmaster now, not 3. Rip out broken code for non-TAS case in bufmgr and xlog, substitute a complete S_LOCK emulation using semaphores in spin.c. TAS and non-TAS logic is now exactly the same. When deadlock is detected, "Deadlock detected" is now the elog(ERROR) message, rather than a NOTICE that comes out before an unhelpful ERROR.	2000-11-28 23:27:57 +00:00
Tom Lane	230cf8d373	Check for link(2) failure.	2000-11-27 05:36:12 +00:00
Tom Lane	bbea3643a3	Store current LC_COLLATE and LC_CTYPE settings in pg_control during initdb; re-adopt these settings at every postmaster or standalone-backend startup. This should fix problems with indexes becoming corrupt due to failure to provide consistent locale environment for postmaster at all times. Also, refuse to start up a non-locale-enabled compilation in a database originally initdb'd with a non-C locale. Suppress LIKE index optimization if locale is not "C" or "POSIX" (are there any other locales where it's safe?). Issue NOTICE during initdb if selected locale disables LIKE optimization.	2000-11-25 20:33:54 +00:00
Peter Eisentraut	403abf1ca5	Refine log/error messages. Print out the errno message, not the number. Remove timestamps from messages where this would be redundant with the log_timestamp option.	2000-11-21 22:27:26 +00:00
Peter Eisentraut	a70e74b060	Put external declarations into header files.	2000-11-21 21:16:06 +00:00
Vadim B. Mikheev	2536267404	misc	2000-11-21 10:17:57 +00:00
Vadim B. Mikheev	e8ff221d8b	Fix OID bootstraping.	2000-11-21 09:39:57 +00:00
Vadim B. Mikheev	01f2547c6b	Init ShmemVariableCache in BootStrapXLOG() (should fix OID bootstraping).	2000-11-21 02:11:06 +00:00
Tom Lane	3568cf50e5	Silence gcc warnings.	2000-11-20 21:14:13 +00:00
Peter Eisentraut	2b1d8bd29a	Include postgres.h before checking #ifdef XLOG.	2000-11-20 16:47:32 +00:00
Vadim B. Mikheev	a221d95f28	Compile WAL by default.	2000-11-20 05:18:40 +00:00
Tom Lane	a933ee38bb	Change SearchSysCache coding conventions so that a reference count is maintained for each cache entry. A cache entry will not be freed until the matching ReleaseSysCache call has been executed. This eliminates worries about cache entries getting dropped while still in use. See my posting to pg-hackers of even date for more info.	2000-11-16 22:30:52 +00:00
Bruce Momjian	a5046ad13a	That variable I removed broke XLOG, that part of the delta should have read: Alfred Perlstein	2000-11-16 06:16:00 +00:00
Bruce Momjian	312063c97b	Make pgsql compile on FreeBSD-alpha. Context diff this time. Remove -m486 compile args for FreeBSD-i386, compile -O2 on i386. Compile with only -O on alpha for codegen safety. Make the port use the TEST_AND_SET for alpha and i386 on FreeBSD. Fix a lot of bogus string formats for outputting pointers (cast to int and %u/%x replaced with no cast and %p), and 'Size'(size_t) are now cast to 'unsigned long' and output with %lu/ Remove an unused variable. Alfred Perlstein	2000-11-16 05:51:07 +00:00
Tom Lane	21e1e6643c	Minor cleanup of tableOid-related coding.	2000-11-14 21:04:32 +00:00
Tom Lane	ddeab22565	Clean up syscache so that recursive invocation is safe, and remove error message about recursive use of a syscache. Also remove most of the specialized indexscan routines in indexing.c --- it turns out that catcache.c is perfectly able to perform the indexscan for itself, in fact has already looked up all the information needed to do so! This should be faster as well as needing far less boilerplate code.	2000-11-10 00:33:12 +00:00
Vadim B. Mikheev	b0299c5d37	Auto checkpoint creation.	2000-11-09 11:26:00 +00:00
Tom Lane	3908473c80	Make DROP TABLE rollback-able: postpone physical file delete until commit. (WAL logging for this is not done yet, however.) Clean up a number of really crufty things that are no longer needed now that DROP behaves nicely. Make temp table mapper do the right things when drop or rename affecting a temp table is rolled back. Also, remove "relation modified while in use" error check, in favor of locking tables at first reference and holding that lock throughout the statement.	2000-11-08 22:10:03 +00:00
Vadim B. Mikheev	f0e37a8531	New CHECKPOINT command. Auto removing of offline log files and creating new file at checkpoint time.	2000-11-05 22:50:21 +00:00
Vadim B. Mikheev	b98ba2a04c	pg_variable is not used in WAL version now.	2000-11-03 11:39:36 +00:00
Vadim B. Mikheev	855ffa0be0	Forgot to check page LSN and unlock buffer in btree_xlog_delete - fixed. (Thanks to Tatsuo Ishii for finding bug)	2000-11-01 20:39:58 +00:00
Vadim B. Mikheev	3706f08ace	Fix recovery cache code (thanks to Peter Eisentraut for pointing to bug).	2000-10-31 23:56:36 +00:00
Vadim B. Mikheev	e3ba543525	WAL fixes.	2000-10-29 18:33:41 +00:00
Vadim B. Mikheev	5b0740d3fc	WAL	2000-10-28 16:21:00 +00:00
Tom Lane	fa9357d0b7	Fix AbortOutOfAnyTransaction logic to avoid notice about 'AbortTransaction and not in in-progress state' when client disconnects just after an error. Notice seems pretty harmless, so I'm not going to worry about back-patching this into 7.0.* ...	2000-10-24 20:06:39 +00:00
Vadim B. Mikheev	db2faa943a	WAL misc	2000-10-24 09:56:23 +00:00
Tom Lane	dea7d54151	If a field is incompressible ('compressed' data is actually larger than source, due to addition of header overhead), store it as plain data rather than pseudo-compressed data. This saves a few microseconds when reading it out, but much more importantly guarantees that the toaster won't actually expand tuples that contain incompressible data. That's essential to avoid 'Tuple too big' failures with large objects.	2000-10-23 23:42:04 +00:00
Vadim B. Mikheev	4b65a2840b	New relcache hash table with RelFileNode as key to be used from bufmgr - it would be nice to have separate hash in smgr for node <--> fd mappings, but for the moment it's easy to add new hash to relcache. Fixed small bug in xlog.c:ReadRecord.	2000-10-23 04:10:24 +00:00
Peter Eisentraut	fba790ad58	Makeover for Unixware 7.1.1 * Makefile: Add more standard targets. Improve shell redirection in GNU make detection. * src/backend/access/transam/rmgr.c: Fix incorrect(?) C. * src/backend/libpq/pqcomm.c (StreamConnection): Work around accept() bug. * src/include/port/unixware.h: ...with help from here. * src/backend/nodes/print.c (plannode_type): Remove some "break"s after "return"s. * src/backend/tcop/dest.c (DestToFunction): ditto. * src/backend/nodes/readfuncs.c: Add proper prototypes. * src/backend/utils/adt/numutils.c (pg_atoi): Cope specially with strtol() setting EINVAL. This saves us from creating an extra set of regression test output for the affected systems. * src/include/storage/s_lock.h (tas): Correct prototype. * src/interfaces/libpq/fe-connect.c (parseServiceInfo): Don't use variable as dimension in array definition. * src/makefiles/Makefile.unixware: Add support for GCC. * src/template/unixware: same here * src/test/regress/expected/abstime-solaris-1947.out: Adjust whitespace. * src/test/regress/expected/horology-solaris-1947.out: Part of this file was evidently missing. * src/test/regress/pg_regress.sh: Fix shell. mkdir -p returns non-zero if the directory exists. * src/test/regress/resultmap: Add entries for Unixware.	2000-10-22 22:15:13 +00:00
Vadim B. Mikheev	a7fcadd10a	WAL	2000-10-21 15:43:36 +00:00
Vadim B. Mikheev	b58c0411ba	redo/undo support functions and cleanups.	2000-10-20 11:01:21 +00:00
Vadim B. Mikheev	b33428d20c	Various utils for WAL	2000-10-13 12:06:40 +00:00
Vadim B. Mikheev	deee783052	WAL	2000-10-13 12:05:22 +00:00
Vadim B. Mikheev	25a26a7ab8	WAL	2000-10-13 02:03:02 +00:00
Bruce Momjian	f41f8eebe7	Fix temp relation handling for indexes, cleanup	2000-10-11 21:28:19 +00:00
Tom Lane	32616129cd	Suppress gcc warnings.	2000-10-05 20:10:20 +00:00
Bruce Momjian	b32685a999	Add proofreader's changes to docs. Fix misspelling of disbursion to dispersion.	2000-10-05 19:48:34 +00:00
Vadim B. Mikheev	5800c6b9aa	Btree WAL logging.	2000-10-04 00:04:43 +00:00
Peter Eisentraut	64610a82f2	Reset current user id to session user id during transaction abort	2000-09-27 10:41:55 +00:00
Tom Lane	acbbeffc29	Clean up some ugly coding (hardwired constants) in index_formtuple.	2000-09-23 22:40:12 +00:00
Vadim B. Mikheev	f2bfe8a24c	Heap redo/undo (except for tuple moving used by vacuum).	2000-09-07 09:58:38 +00:00
Peter Eisentraut	424f0edcb8	Fix relative path references so that make knowns which dependencies refer to one another. Sort out builddir vs srcdir variable namings. Remove some now obsoleted make variables.	2000-08-31 16:12:35 +00:00
Tom Lane	40549e9cb5	Tweak btree insertion to avoid O(N^2) slowdown with large numbers of equal keys. See discussion of today's date in pghackers list.	2000-08-25 23:13:33 +00:00
Hiroshi Inoue	b0d5036c7c	CREATE btree INDEX takes dead tuples into account when old transactions are running.	2000-08-10 02:33:20 +00:00
Tom Lane	925418d2fa	Ensure that catcache 'busy' flags are reset at transaction abort. Without this, an elog during cache-entry load leaves that catcache unusable. elog in that segment of code is pretty unusual but it can happen.	2000-08-06 04:17:47 +00:00
Tom Lane	dd8ad64118	Fix tuptoaster bugs induced by making bytea toastable. Durn thing was trying to toast tuples inserted into toast tables! Fix is two-pronged: first, ensure all columns of a toast table are marked attstorage='p', and second, alter the target chunk size so that it's less than the threshold for trying to toast a tuple. (Code tried to do that but the expression was wrong.) A few cosmetic cleanups in tuptoaster too. NOTE: initdb forced due to change in toaster chunk-size.	2000-08-04 04:16:17 +00:00
Tom Lane	61aca818c4	Modify heap_open()/heap_openr() API per pghackers discussion of 11 July. These two routines will now ALWAYS elog() on failure, whether you ask for a lock or not. If you really want to get a NULL return on failure, call the new routines heap_open_nofail()/heap_openr_nofail(). By my count there are only about three places that actually want that behavior. There were rather more than three places that were missing the check they needed to make under the old convention :-(.	2000-08-03 19:19:38 +00:00
Tom Lane	c298d74d49	More functions updated to new fmgr style --- money, name, tid datatypes. We're reaching the mopup stage here (good thing too, this is getting tedious).	2000-08-03 16:35:08 +00:00
Tom Lane	7d0c4188f1	Make acl-related functions safe for TOAST. Mark pg_class.relacl as compressible but not externally storable (since we're not sure about whether creating a toast relation for pg_class would work).	2000-07-31 22:39:17 +00:00
Tom Lane	3a9a74a09d	Convert all remaining geometric operators to new fmgr style. This allows fixing problems with operators that expected to be able to return a NULL, such as the '#' line-segment-intersection operator that tried to return NULL when the two segments don't intersect. (See, eg, bug report from 1-Nov-99 on pghackers.) Fix some other bugs in passing, such as backwards comparison in path_distance().	2000-07-30 20:44:02 +00:00
Tom Lane	d70d46fd60	PATH and POLYGON datatypes are now TOASTable. Associated functions updated to new fmgr style. Deleted hoary old functions for compatibility with pre-6.1 representations of these datatypes.	2000-07-29 18:46:12 +00:00
Tom Lane	742cd87999	Ensure that if the OID counter wraps around, we will not generate 0, nor any OID in the reserved range (1-16383).	2000-07-25 20:18:19 +00:00
Tom Lane	dc73e25a5e	Add commentary about varying usage of scankeys in btree code.	2000-07-25 05:26:40 +00:00
Tom Lane	916b2321ad	Clean up and document btree code for ordering keys. Neat stuff, actually, but who could understand it with no comments? Fix bug while at it: _bt_orderkeys would try to invoke comparisons on NULL inputs, given the right sort of redundant quals.	2000-07-25 04:47:59 +00:00
Jan Wieck	f67e79045d	2nd try for the index tuple toast hack. This time as suggested by Tom. Jan	2000-07-22 11:18:47 +00:00
Tom Lane	421f0baaff	Further cleanup of btbuild (CREATE INDEX). Avoid storing unneeded left keys during bottom-up index build, and leave some free space instead of packing the pages to the brim (so as to avoid vast numbers of page splits during the first interactive insertions).	2000-07-21 22:14:09 +00:00
Tom Lane	1ea912e16d	Fix sloppiness about alignment requirements in findsplitloc() space calculation, also make it stop when it has a 'good enough' split instead of exhaustively trying all split points.	2000-07-21 19:21:00 +00:00
Jan Wieck	0143d391c6	Need to switch to tuples memory context when replacing the toasted one with the plain one. Jan	2000-07-21 11:18:51 +00:00
Jan Wieck	82f3945a67	Temporary fix to make TOAST vacuum-safe. All values are forced to be in memory (plain or compressed) in the tuple returned from the heap-am. So no index will ever contain an external reference. Jan	2000-07-21 10:31:31 +00:00
Tom Lane	9e85183bfc	Major overhaul of btree index code. Eliminate special BTP_CHAIN logic for duplicate keys by letting search go to the left rather than right when an equal key is seen at an upper tree level. Fix poor choice of page split point (leading to insertion failures) that was forced by chaining logic. Don't store leftmost key in non-leaf pages, since it's not necessary. Don't create root page until something is first stored in the index, so an unused index is now 8K not 16K. (Doesn't seem to be as easy to get rid of the metadata page, unfortunately.) Massive cleanup of unreadable code, fix poor, obsolete, and just plain wrong documentation and comments. See src/backend/access/nbtree/README for the gory details.	2000-07-21 06:42:39 +00:00
Tom Lane	6bfe64032e	Cleanup of code for creating index entries. Functional indexes with pass-by-ref data types --- eg, an index on lower(textfield) --- no longer leak memory during index creation or update. Clean up a lot of redundant code ... did you know that copy, vacuum, truncate, reindex, extend index, and bootstrap each basically duplicated the main executor's logic for extracting information about an index and preparing index entries? Functional indexes should be a little faster now too, due to removal of repeated function lookups. CREATE INDEX 'opt_type' clause is deimplemented by these changes, but I haven't removed it from the parser yet (need to merge with Thomas' latest change set first).	2000-07-14 22:18:02 +00:00
Peter Eisentraut	8a3cbc84ef	Repair parallel make in backend tree (and make it really parallel). Make Gen_fmgrtab.sh reasonably robust against concurrent invocation.	2000-07-13 16:07:14 +00:00
Tom Lane	badce86a2c	First stage of reclaiming memory in executor by resetting short-term memory contexts. Currently, only leaks in expressions executed as quals or projections are handled. Clean up some old dead cruft in executor while at it --- unused fields in state nodes, that sort of thing.	2000-07-12 02:37:39 +00:00
Jan Wieck	793704d71e	Some security checks that we've found an external value completely when fetching toasted values. Jan	2000-07-11 12:32:03 +00:00
Jan Wieck	2a225ebf18	Bugfix. If toasted tuple containted NULLs, DataFill() was handed a wrong pointer causing the bitmap overwriting the tuple header. Jan	2000-07-06 18:22:45 +00:00
Jan Wieck	d819f5fe83	Moving toaster out of NO ELOG area in heap_update(). Jan	2000-07-04 17:11:40 +00:00
Tom Lane	3b61ba6d5c	DataFill() has no business resetting xact status bits in the infomask of the provided tuple.	2000-07-04 02:40:56 +00:00
Vadim B. Mikheev	d0273c07ac	misc	2000-07-04 01:49:44 +00:00

... 6 7 8 9 10 ...

1163 Commits