postgresql

mirror of https://git.postgresql.org/git/postgresql.git synced 2024-10-12 00:16:51 +02:00

Author	SHA1	Message	Date
Peter Eisentraut	968d7733a1	Rename config.h to pg_config.h and os.h to pg_config_os.h, fix a number of places that were including the wrong files.	2001-08-24 14:07:50 +00:00
Tom Lane	7326e78c42	Ensure that all TransactionId comparisons are encapsulated in macros (TransactionIdPrecedes, TransactionIdFollows, etc). First step on the way to transaction ID wrap solution ...	2001-08-23 23:06:38 +00:00
Tom Lane	a54075a6d6	Update GiST for new pg_opclass arrangement (finally a clean solution for haskeytype). Update GiST contrib modules too. Add linear-time split algorithm for R-tree GiST opclass. From Oleg Bartunov and Teodor Sigaev.	2001-08-22 18:24:26 +00:00
Tom Lane	f933766ba7	Restructure pg_opclass, pg_amop, and pg_amproc per previous discussions in pgsql-hackers. pg_opclass now has a row for each opclass supported by each index AM, not a row for each opclass name. This allows pg_opclass to show directly whether an AM supports an opclass, and furthermore makes it possible to store additional information about an opclass that might be AM-dependent. pg_opclass and pg_amop now store "lossy" and "haskeytype" information that we previously expected the user to remember to provide in CREATE INDEX commands. Lossiness is no longer an index-level property, but is associated with the use of a particular operator in a particular index opclass. Along the way, IndexSupportInitialize now uses the syscaches to retrieve pg_amop and pg_amproc entries. I find this reduces backend launch time by about ten percent, at the cost of a couple more special cases in catcache.c's IndexScanOK. Initial work by Oleg Bartunov and Teodor Sigaev, further hacking by Tom Lane. initdb forced.	2001-08-21 16:36:06 +00:00
Tom Lane	bf56f0759b	Make OIDs optional, per discussions in pghackers. WITH OIDS is still the default, but OIDS are removed from many system catalogs that don't need them. Some interesting side effects: TOAST pointers are 20 bytes not 32 now; pg_description has a three-column key instead of one. Bugs fixed in passing: BINARY cursors work again; pg_class.relhaspkey has some usefulness; pg_dump dumps comments on indexes, rules, and triggers in a valid order. initdb forced.	2001-08-10 18:57:42 +00:00
Bruce Momjian	13923be7c8	1. null-safe interface to GiST (as proposed in http://fts.postgresql.org/db/mw/msg.html?mid=1028327) 2. support for 'pass-by-value' arguments - to test this we used special opclass for int4 with values in range [0-2^15] More testing will be done after resolving problem with index_formtuple and implementation of B-tree using GiST 3. small patch to contrib modules (seg,cube,rtree_gist,intarray) - mark functions as 'isstrict' where needed. Oleg Bartunov	2001-08-10 14:34:28 +00:00
Tom Lane	94cb3fd875	Suppress gcc warning in USE_LOCALE case.	2001-07-22 22:01:04 +00:00
Tom Lane	7d4d5c00f0	Arrange to recycle old XLOG log segment files as new segment files, rather than deleting them only to have to create more. Steady state is 2*CHECKPOINT_SEGMENTS + WAL_FILES + 1 segment files, which will simply be renamed rather than constantly deleted and recreated. To make this safe, added current XLOG file/offset number to page header of XLOG pages, so that an un-overwritten page from an old incarnation of a logfile can be reliably told from a valid page. This change means that if you try to restart postmaster in a CVS-tip database after installing the change, you'll get a complaint about bad XLOG page magic number. If you don't want to initdb, run contrib/pg_resetxlog (and be sure you shut down the old postmaster cleanly).	2001-07-19 02:12:35 +00:00
Tom Lane	ed5c4e4a14	Improve documentation about reasoning behind the order of operations in GetSnapshotData, GetNewTransactionId, CommitTransaction, AbortTransaction, etc. Correct race condition in transaction status testing in HeapTupleSatisfiesVacuum --- this wasn't important for old VACUUM with exclusive lock on its table, but it sure is important now. All per pghackers discussion 7/11/01 and 7/12/01.	2001-07-16 22:43:34 +00:00
Tom Lane	c8076f09d2	Restructure index AM interface for index building and index tuple deletion, per previous discussion on pghackers. Most of the duplicate code in different AMs' ambuild routines has been moved out to a common routine in index.c; this means that all index types now do the right things about inserting recently-dead tuples, etc. (I also removed support for EXTEND INDEX in the ambuild routines, since that's about to go away anyway, and it cluttered the code a lot.) The retail indextuple deletion routines have been replaced by a "bulk delete" routine in which the indexscan is inside the access method. I haven't pushed this change as far as it should go yet, but it should allow considerable simplification of the internal bookkeeping for deletions. Also, add flag columns to pg_am to eliminate various hardcoded tests on AM OIDs, and remove unused pg_am columns. Fix rtree and gist index types to not attempt to store NULLs; before this, gist usually crashed, while rtree managed not to crash but computed wacko bounding boxes for NULL entries (which might have had something to do with the performance problems we've heard about occasionally). Add AtEOXact routines to hash, rtree, and gist, all of which have static state that needs to be reset after an error. We discovered this need long ago for btree, but missed the other guys. Oh, one more thing: concurrent VACUUM is now the default.	2001-07-15 22:48:19 +00:00
Tom Lane	20ca834ce9	Minor code cleanup/beautification in RelationPutHeapTuple.	2001-07-13 22:52:58 +00:00
Tom Lane	b9f3a929ee	Create a new HeapTupleSatisfiesVacuum() routine in tqual.c that embodies the validity checking rules for VACUUM. Make some other rearrangements of the VACUUM code to allow more code to be shared between full and lazy VACUUM. Minor code cleanups and added comments for TransactionId manipulations.	2001-07-12 04:11:13 +00:00
Tom Lane	55432fedd2	Implement LockBufferForCleanup(), which will allow concurrent VACUUM to wait until it's safe to remove tuples and compact free space in a shared buffer page. Miscellaneous small code cleanups in bufmgr, too.	2001-07-06 21:04:26 +00:00
Hiroshi Inoue	852a26f79e	Fix my old fault(returns auto variable reference).	2001-07-06 09:41:36 +00:00
Tom Lane	af5ced9cfd	Further work on connecting the free space map (which is still just a stub) into the rest of the system. Adopt a cleaner approach to preventing deadlock in concurrent heap_updates: allow RelationGetBufferForTuple to select any page of the rel, and put the onus on it to lock both buffers in a consistent order. Remove no-longer-needed isExtend hack from API of ReleaseAndReadBuffer.	2001-06-29 21:08:25 +00:00
Tom Lane	fb2c3289ff	Repair logic error for multi-key indexes. From Oleg Bartunov.	2001-06-28 16:00:07 +00:00
Tom Lane	e0c9301c87	Install infrastructure for shared-memory free space map. Doesn't actually do anything yet, but it has the necessary connections to initialization and so forth. Make some gestures towards allowing number of blocks in a relation to be BlockNumber, ie, unsigned int, rather than signed int. (I doubt I got all the places that are sloppy about it, yet.) On the way, replace the hardwired NLOCKS_PER_XACT fudge factor with a GUC variable.	2001-06-27 23:31:40 +00:00
Tom Lane	4d58a7ca87	Optimizer can now estimate selectivity of IS NULL, IS NOT NULL, IS TRUE, etc, with some degree of verisimilitude. Split out selectivity support functions from builtins.h into a new header file selfuncs.h, so as to reduce the number of header files builtins.h must depend on. Fix a few missing inclusions exposed thereby. From Joe Conway, with some kibitzing from Tom Lane.	2001-06-25 21:11:45 +00:00
Jan Wieck	8d80b0d980	Statistical system views (yet without the config stuff, but it's hard to keep such massive changes in sync with the tree so I need to get it in and work from there now). Jan	2001-06-22 19:16:24 +00:00
Tom Lane	695e575470	Tweak error message.	2001-06-21 19:45:45 +00:00
Tom Lane	bbbc00af88	Clean up some longstanding problems in shared-cache invalidation. SI messages now include the relevant database OID, so that operations in one database do not cause useless cache flushes in backends attached to other databases. Declare SI messages properly using a union, to eliminate the former assumption that Oid is the same size as int or Index. Rewrite the nearly-unreadable code in inval.c, and document it better. Arrange for catcache flushes at end of command/transaction to happen before relcache flushes do --- this avoids loading a new tuple into the catcache while setting up new relcache entry, only to have it be flushed again immediately.	2001-06-19 19:42:16 +00:00
Tom Lane	1d584f97b9	Clean up various to-do items associated with system indexes: pg_database now has unique indexes on oid and on datname. pg_shadow now has unique indexes on usename and on usesysid. pg_am now has unique index on oid. pg_opclass now has unique index on oid. pg_amproc now has unique index on amid+amopclaid+amprocnum. Remove pg_rewrite's unnecessary index on oid, delete unused RULEOID syscache. Remove index on pg_listener and associated syscache for performance reasons (caching rows that are certain to change before you need 'em again is rather pointless). Change pg_attrdef's nonunique index on adrelid into a unique index on adrelid+adnum. Fix various incorrect settings of pg_class.relisshared, make that the primary reference point for whether a relation is shared or not. IsSharedSystemRelationName() is now only consulted to initialize relisshared during initial creation of tables and indexes. In theory we might now support shared user relations, though it's not clear how one would get entries for them into pg_class &etc of multiple databases. Fix recently reported bug that pg_attribute rows created for an index all have the same OID. (Proof that non-unique OID doesn't matter unless it's actually used to do lookups ;-)) There's no need to treat pg_trigger, pg_attrdef, pg_relcheck as bootstrap relations. Convert them into plain system catalogs without hardwired entries in pg_class and friends. Unify global.bki and template1.bki into a single init script postgres.bki, since the alleged distinction between them was misleading and pointless. Not to mention that it didn't work for setting up indexes on shared system relations. Rationalize locking of pg_shadow, pg_group, pg_attrdef (no need to use AccessExclusiveLock where ExclusiveLock or even RowExclusiveLock will do). Also, hold locks until transaction commit where necessary.	2001-06-12 05:55:50 +00:00
Tom Lane	88e948216c	Nest macros with slightly less enthusiasm, for performance and to avoid having non-gcc compilers spit up.	2001-06-11 05:00:56 +00:00
Tom Lane	bdadc9bf1c	Remove RelationGetBufferWithBuffer(), which is horribly confused about appropriate pin-count manipulation, and instead use ReleaseAndReadBuffer. Make use of the fact that the passed-in buffer (if there is one) must be pinned to avoid grabbing the bufmgr spinlock when we are able to return this same buffer. Eliminate unnecessary 'previous tuple' and 'next tuple' fields of HeapScanDesc and IndexScanDesc, thereby removing a whole lot of bookkeeping from heap_getnext() and related routines.	2001-06-09 18:16:59 +00:00
Tom Lane	1173344e74	Adjust WAL code so that checkpoints truncate the xlog at the previous checkpoint's redo pointer, not its undo pointer, per discussion in pghackers a few days ago. No point in hanging onto undo information until we have the ability to do something with it --- and this solves a rather large problem with log space for long-running transactions. Also, change all calls of write() to detect the case where write returned a count less than requested, but failed to set errno. Presume that this situation indicates ENOSPC, and give the appropriate error message, rather than a random message associated with the previous value of errno.	2001-06-06 17:07:46 +00:00
Peter Eisentraut	12c1552066	Mark many strings in backend not covered by elog for translation. Also, make strings in xlog.c look more like English and less like binary noise.	2001-06-03 14:53:56 +00:00
Tom Lane	0b370ea7c8	Clean up some minor problems exposed by further thought about Panon's bug report on old-style functions invoked by RI triggers. We had a number of other places that were being sloppy about which memory context FmgrInfo subsidiary data will be allocated in. Turns out none of them actually cause a problem in 7.1, but this is for arcane reasons such as the fact that old-style triggers aren't supported anyway. To avoid getting burnt later, I've restructured the trigger support so that we don't keep trigger FmgrInfo structs in relcache memory. Some other related cleanups too: it's not really necessary to call fmgr_info at all while setting up the index support info in relcache entries, because those ScanKeyEntry structs are never used to invoke the functions. This should speed up relcache initialization a tiny bit.	2001-06-01 02:41:36 +00:00
Tom Lane	3043810d97	Updates to make GIST work with multi-key indexes (from Oleg Bartunov and Teodor Sigaev). Declare key values as Datum where appropriate, rather than char* (Tom Lane).	2001-05-31 18:16:55 +00:00
Tom Lane	f1d5d0905c	Tweak StrategyEvaluation data structure to eliminate hardwired limit on number of strategies supported by an index AM. Add missing copyright notices and CVS $Header$ markers to GIST source files.	2001-05-30 19:53:40 +00:00
Bruce Momjian	33f2614aa1	Remove SEP_CHAR, replace with / or '/' as appropriate.	2001-05-30 14:15:27 +00:00
Bruce Momjian	f6923ff3ac	Oops, only wanted python change in the last commit. Backing out.	2001-05-25 15:45:34 +00:00
Bruce Momjian	dffb673692	While changing Cygwin Python to build its core as a DLL (like Win32 Python) to support shared extension modules, I have learned that Guido prefers the style of the attached patch to solve the above problem. I feel that this solution is particularly appropriate in this case because the following: PglargeType PgType PgQueryType are already being handled in the way that I am proposing for PgSourceType. Jason Tishler	2001-05-25 15:34:50 +00:00
Bruce Momjian	f08245cfe3	I found the answer to this: the partition had filled up, and so the problem was lack of disk space. Oliver Elphick	2001-05-22 16:52:49 +00:00
Bruce Momjian	dc0ff5c67a	Small code cleanups,formatting.	2001-05-18 21:24:20 +00:00
Bruce Momjian	2d7795ebb4	Prevent forced blank line before comment block in pgindent.	2001-05-17 15:55:24 +00:00
Bruce Momjian	e044fc0599	Spacing cleanup.	2001-05-17 15:22:12 +00:00
Bruce Momjian	806aba49fd	Small cleanup of spacing.	2001-05-17 14:59:31 +00:00
Tom Lane	27336e4f7a	Repair race condition introduced into heap_update() in 7.1 --- PageGetFreeSpace() was being called while not holding the buffer lock, which not only could yield a garbage answer, but even if it's the right answer there might be less space available after we reacquire the buffer lock. Also repair potential deadlock introduced by my recent performance improvement in RelationGetBufferForTuple(): it was possible for two heap_updates to try to lock two buffers in opposite orders. The fix creates a global rule that buffers of a single heap relation should be locked in decreasing block number order. Currently, this only applies to heap_update; VACUUM can get away with ignoring the rule since it holds exclusive lock on the whole relation anyway. However, if we try to implement a VACUUM that can run in parallel with other transactions, VACUUM will also have to obey the lock order rule.	2001-05-16 22:35:12 +00:00
Bruce Momjian	d0e1091cfd	we found a problem in GiST with massive insert/update operations with many NULLs ( inserting of NULL into indexed field cause ERROR: MemoryContextAlloc: invalid request size) As a workaround 'vacuum analyze' could be used. This patch resolves the problem, please upply to 7.1.1 sources and current cvs tree. Oleg Bartunov	2001-05-15 14:14:49 +00:00
Bruce Momjian	ed6998d0b3	Re-add pg_index.indhaskeytype.	2001-05-15 03:49:35 +00:00
Bruce Momjian	783fbdab70	Remove columns pg_index.haskeytype and pg_index.indisclustered. Not used.	2001-05-14 21:53:16 +00:00
Bruce Momjian	1e7b79cebc	Remove unused tables pg_variable, pg_inheritproc, pg_ipl tables. Initdb forced.	2001-05-14 20:30:21 +00:00
Tom Lane	eedb7d18fa	Modify RelationGetBufferForTuple() so that we only do lseek and lock when we need to move to a new page; as long as we can insert the new tuple on the same page as before, we only need LockBuffer and not the expensive stuff. Also, twiddle bufmgr interfaces to avoid redundant lseeks in RelationGetBufferForTuple and BufferAlloc. Successive inserts now require one lseek per page added, rather than one per tuple with several additional ones at each page boundary as happened before. Lock contention when multiple backends are inserting in same table is also greatly reduced.	2001-05-12 19:58:28 +00:00
Tom Lane	f905d65ee3	Rewrite of planner statistics-gathering code. ANALYZE is now available as a separate statement (though it can still be invoked as part of VACUUM, too). pg_statistic redesigned to be more flexible about what statistics are stored. ANALYZE now collects a list of several of the most common values, not just one, plus a histogram (not just the min and max values). Random sampling is used to make the process reasonably fast even on very large tables. The number of values and histogram bins collected is now user-settable via an ALTER TABLE command. There is more still to do; the new stats are not being used everywhere they could be in the planner. But the remaining changes for this project should be localized, and the behavior is already better than before. A not-very-related change is that sorting now makes use of btree comparison routines if it can find one, rather than invoking '<' twice.	2001-05-07 00:43:27 +00:00
Tom Lane	e2e19ca0cd	Seems like we should not hold off cancel/die interrupts while we are running deferred triggers. They are really part of the regular transaction, and they could take awhile.	2001-05-04 18:39:16 +00:00
Tom Lane	2792374cff	Ensure that btree sort ordering functions and boolean comparison operators give consistent results for all datatypes. Types float4, float8, and numeric were broken for NaN values; abstime, timestamp, and interval were broken for INVALID values; timetz was just plain broken (some possible pairs of values were neither < nor = nor >). Also clean up text, bpchar, varchar, and bit/varbit to eliminate duplicate code and thereby reduce the probability of similar inconsistencies arising in the future.	2001-05-03 19:00:37 +00:00
Tom Lane	f10596c3ec	Fix comment that Vadim found confusing.	2001-04-05 16:55:21 +00:00
Vadim B. Mikheev	3092869233	StartupXLOG(): initialize XLogCtl->Insert to new page if there is no room for a record on last log page.	2001-04-05 09:34:32 +00:00
Tom Lane	ccd415c63f	Fix unportable assumptions about alignment of local char[n] variables.	2001-03-25 23:23:59 +00:00
Tom Lane	00713cb7cb	Fix code that incorrectly assumed a 'char foo[N]' local variable would be aligned on a word boundary. Per report from Steve Nicolai.	2001-03-25 00:45:20 +00:00
Bruce Momjian	7cf952e7b4	Fix comments that were mis-wrapped, for Tom Lane.	2001-03-23 04:49:58 +00:00
Bruce Momjian	0686d49da0	Remove dashes in comments that don't need them, rewrap with pgindent.	2001-03-22 06:16:21 +00:00
Bruce Momjian	9e1552607a	pgindent run. Make it all clean.	2001-03-22 04:01:46 +00:00
Tom Lane	af6e88a9cf	Remove NEXTXID xlog record type to avoid three-way deadlock risk. NEXTXID isn't really necessary, per previous discussion in pghackers, but I mulishy insisted we should put it in anyway. Mea culpa.	2001-03-18 20:18:59 +00:00
Tom Lane	ae293d33cf	Make sure ControlFile logId/logSeg don't go backwards (barely possible given a slow backend, if we update unconditionally as the code did before).	2001-03-18 00:30:27 +00:00
Tom Lane	5a38af7fd8	Rearrange XLogFileInit so that control-file spinlock is not held while filling the new log file with zeroes, only while renaming it into place. This should prevent problems with 'stuck spinlock' errors under heavy load.	2001-03-17 20:54:13 +00:00
Tom Lane	9d645fd84c	Support syncing WAL log to disk using either fsync(), fdatasync(), O_SYNC, or O_DSYNC (as available on a given platform). Add GUC parameter to control sync method. Also, add defense to XLogWrite to prevent it from going nuts if passed a target write position that's past the end of the buffers so far filled by XLogInsert.	2001-03-16 05:44:33 +00:00
Tom Lane	cfab4f6541	Use SEP_CHAR consistently in forming XLOG pathnames.	2001-03-14 20:23:04 +00:00
Tom Lane	1b87e24c4a	Change xlog page-header format to include StartUpID. Use the SUI to detect case that next page in log came from an older run than the prior page. This avoids the necessity to re-zero the log after recovery from a crash, which is good because we need not risk destroying valuable log information. This forces another initdb since yesterday :-(. Need to get that log reset utility done...	2001-03-13 20:32:37 +00:00
Tom Lane	4d14fe0048	XLOG (and related) changes: * Store two past checkpoint locations, not just one, in pg_control. On startup, we fall back to the older checkpoint if the newer one is unreadable. Also, a physical copy of the newest checkpoint record is kept in pg_control for possible use in disaster recovery (ie, complete loss of pg_xlog). Also add a version number for pg_control itself. Remove archdir from pg_control; it ought to be a GUC parameter, not a special case (not that it's implemented yet anyway). * Suppress successive checkpoint records when nothing has been entered in the WAL log since the last one. This is not so much to avoid I/O as to make it actually useful to keep track of the last two checkpoints. If the things are right next to each other then there's not a lot of redundancy gained... * Change CRC scheme to a true 64-bit CRC, not a pair of 32-bit CRCs on alternate bytes. Polynomial borrowed from ECMA DLT1 standard. * Fix XLOG record length handling so that it will work at BLCKSZ = 32k. * Change XID allocation to work more like OID allocation. (This is of dubious necessity, but I think it's a good idea anyway.) * Fix a number of minor bugs, such as off-by-one logic for XLOG file wraparound at the 4 gig mark. * Add documentation and clean up some coding infelicities; move file format declarations out to include files where planned contrib utilities can get at them. * Checkpoint will now occur every CHECKPOINT_SEGMENTS log segments or every CHECKPOINT_TIMEOUT seconds, whichever comes first. It is also possible to force a checkpoint by sending SIGUSR1 to the postmaster (undocumented feature...) * Defend against kill -9 postmaster by storing shmem block's key and ID in postmaster.pid lockfile, and checking at startup to ensure that no processes are still connected to old shmem block (if it still exists). * Switch backends to accept SIGQUIT rather than SIGUSR1 for emergency stop, for symmetry with postmaster and xlog utilities. Clean up signal handling in bootstrap.c so that xlog utilities launched by postmaster will react to signals better. * Standalone bootstrap now grabs lockfile in target directory, as added insurance against running it in parallel with live postmaster.	2001-03-13 01:17:06 +00:00
Tom Lane	b109b03fea	Repair a number of places that didn't bother to check whether PageAddItem succeeds or not. Revise rtree page split algorithm to take care about making a feasible split --- ie, will the incoming tuple actually fit? Failure to make a feasible split, combined with failure to notice the failure, account for Jim Stone's recent bug report. I suspect that hash and gist indices may have the same type of bug, but at least now we'll get error messages rather than silent failures if so. Also clean up rtree code to use Datum rather than char* where appropriate.	2001-03-07 21:20:26 +00:00
Tom Lane	9c9936587c	Implement COMMIT_SIBLINGS parameter to allow pre-commit delay to occur only if at least N other backends currently have open transactions. This is not a great deal of intelligence about whether a delay might be profitable ... but it beats no intelligence at all. Note that the default COMMIT_DELAY is still zero --- this new code does nothing unless that setting is changed. Also, mark ENABLEFSYNC as a system-wide setting. It's no longer safe to allow that to be set per-backend, since we may be relying on some other backend's fsync to have synced the WAL log.	2001-02-26 00:50:08 +00:00
Bruce Momjian	4f6c49fef0	Clean up index/btree comments/macros, as approved.	2001-02-22 21:48:49 +00:00
Hiroshi Inoue	50e3c60b95	Avoid 'FATAL: out of free buffers: time to abort !" error during WAL recovery. Recovery failure is always serious.	2001-02-22 08:59:40 +00:00
Tom Lane	57e0847180	Change default commit_delay to zero, update documentation.	2001-02-18 04:50:43 +00:00
Tom Lane	33cc5d8a4d	Change s_lock to not use any zero-delay select() calls; these are just a waste of cycles on single-CPU machines, and of dubious utility on multi-CPU machines too. Tweak s_lock_stuck so that caller can specify timeout interval, and increase interval before declaring stuck spinlock for buffer locks and XLOG locks. On systems that have fdatasync(), use that rather than fsync() to sync WAL log writes. Ensure that WAL file is entirely allocated during XLogFileInit.	2001-02-18 04:39:42 +00:00
Tom Lane	059e361481	Although we can't support out-of-line TOAST storage in indexes (yet), compressed storage works perfectly well. Might as well have a coherent strategy for applying it, rather than the haphazard store-what-you-get approach that was in the code before. The strategy I've set up here is to attempt compression of any compressible index value exceeding BLCKSZ/16, or about 500 bytes by default.	2001-02-15 20:57:01 +00:00
Vadim B. Mikheev	7e04843ba7	Comments about GetFreeXLBuffer(). GetFreeXLBuffer(): use Insert->LgwrResult instead of private LgwrResult copy if it's more fresh (attempt to avoid acquiring info_lck/lgwr_lck).	2001-02-13 20:40:25 +00:00
Vadim B. Mikheev	35273825dc	Removed abort() in XLogFileOpen.	2001-02-13 08:44:09 +00:00
Tom Lane	7fdca53711	When updating a tuple containing compressed-in-line fields, do not decompress the existing fields unnecessarily.	2001-02-09 17:30:03 +00:00
Vadim B. Mikheev	c19dadbf08	Runtime btree recovery is now ON by default.	2001-02-07 23:35:33 +00:00
Vadim B. Mikheev	b18c09ee3a	Runtime tree recovery is implemented, just testing is left -:)	2001-02-02 19:49:15 +00:00
Vadim B. Mikheev	dca0762efc	Couple additional functions to fix tree at runtime. Need in one more function to handle "my bits moved..." case. FixBTree is still FALSE.	2001-01-31 01:08:36 +00:00
Vadim B. Mikheev	598a12722a	Call _bt_fixroot() from _bt_insertonpg.	2001-01-29 07:28:17 +00:00
Tom Lane	0d54d6ac44	Clean up handling of tuple descriptors so that result-tuple descriptors allocated by plan nodes are not leaked at end of query. This doesn't really matter for normal queries, but it sure does for queries invoked repetitively inside SQL functions. Clean up some other grotty code associated with tupdescs, and fix a few other memory leaks exposed by tests with simple SQL functions.	2001-01-29 00:39:20 +00:00
Vadim B. Mikheev	c6e6d292bc	First step in attempt to fix tree at runtime: create upper levels and new root page if old root one was splitted but new root page wasn't created. New code is protected by FixBTree bool flag setted to FALSE, so nothing should be affected by this untested approach.	2001-01-26 01:24:31 +00:00
Bruce Momjian	623bf843d2	Change Copyright from PostgreSQL, Inc to PostgreSQL Global Development Group.	2001-01-24 19:43:33 +00:00
Tom Lane	4e27b308e2	Do _bt_wrtbuf() outside critical section, per discussion with Vadim 1/19.	2001-01-23 23:29:22 +00:00
Tom Lane	786f1a59cd	Fix all the places that called heap_update() and heap_delete() without bothering to check the return value --- which meant that in case the update or delete failed because of a concurrent update, you'd not find out about it, except by observing later that the transaction produced the wrong outcome. There are now subroutines simple_heap_update and simple_heap_delete that should be used anyplace that you're not prepared to do the full nine yards of coping with concurrent updates. In practice, that seems to mean absolutely everywhere but the executor, because noplace else was checking.	2001-01-23 04:32:23 +00:00
Tom Lane	6ce0ed2813	Make critical sections (elog->crash) and interrupt holdoff sections into distinct concepts, per recent discussion on pghackers.	2001-01-19 22:08:47 +00:00
Vadim B. Mikheev	0a12767004	Comment out xlrec in xact_redo - no support for file unlinking on commit yet.	2001-01-18 18:33:45 +00:00
Tom Lane	efd6cade83	Tweak heap_update/delete so that we do not hold the buffer context lock on the old tuple's page while we are doing TOAST pushups.	2001-01-15 05:29:19 +00:00
Tom Lane	36839c1927	Restructure backend SIGINT/SIGTERM handling so that 'die' interrupts are treated more like 'cancel' interrupts: the signal handler sets a flag that is examined at well-defined spots, rather than trying to cope with an interrupt that might happen anywhere. See pghackers discussion of 1/12/01.	2001-01-14 05:08:17 +00:00
Tom Lane	6162432de9	Add more critical-section calls: all code sections that hold spinlocks are now critical sections, so as to ensure die() won't interrupt us while we are munging shared-memory data structures. Avoid insecure intermediate states in some code that proc_exit will call, like palloc/pfree. Rename START/END_CRIT_CODE to START/END_CRIT_SECTION, since that seems to be what people tend to call them anyway, and make them be called with () like a function call, in hopes of not confusing pg_indent. I doubt that this is sufficient to make SIGTERM safe anywhere; there's just too much code that could get invoked during proc_exit().	2001-01-12 21:54:01 +00:00
Marc G. Fournier	0ad7db4be4	New feature: 1. Support of variable size keys - new algorithm of insertion to tree (GLI - gist layrered insertion). Previous algorithm was implemented as described in paper by Joseph M. Hellerstein et.al "Generalized Search Trees for Database Systems". This (old) algorithm was not suitable for variable size keys and could be not effective ( walking up-down ) in case of multiple levels split Bug fixed: 1. fixed bug in gistPageAddItem - key values were written to disk uncompressed. This caused failure if decompression function does real job. 2. NULLs handling - we keep NULLs in tree. Right way is to remove them, but we don't know how to inform vacuum about index statistics. This is just cosmetic warning message (like in case with R-Tree), but I'm not sure how to recognize real problem if we remove NULLs and suppress this warning as Tom suggested. 3. various memory leaks This work was done by Teodor Sigaev (teodor@stack.net) and Oleg Bartunov (oleg@sai.msu.su).	2001-01-12 00:12:58 +00:00
Vadim B. Mikheev	4b59366e57	1. Checkpoint.undo may be after checkpoint itself: - no more elog(STOP) in StartupXLOG(); - both checkpoint' undo & redo are used to define oldest on-line log file. 2. Ability to pre-allocate a few log files at checkpoint time (wal_files option). Off by default.	2001-01-09 06:24:33 +00:00
Tom Lane	a4ddbbd1a4	Correct nasty error in heap_update: it was releasing the buffer refcount before calling RelationInvalidateHeapTuple(), which is bad because the latter needs to look at the tuple data, which is in the shared disk buffer. If another backend manages to recycle the buffer while this is going on, we will compute the wrong hashindex for the tuple or maybe even crash outright. Must hold buffer refcount until afterwards. (This bug is not in 7.0.*; seems to be have introduced during WAL changes.)	2001-01-07 22:14:31 +00:00
Tom Lane	1b8a219eef	Clean up non-reentrant interface for hash_seq/HashTableWalk, so that starting a new hashtable search no longer clobbers any other search active anywhere in the system. Fix RelationCacheInvalidate() so that it will not crash or go into an infinite loop if invoked recursively, as for example by a second SI Reset message arriving while we are still processing a prior one.	2001-01-02 04:33:24 +00:00
Vadim B. Mikheev	3e059b3802	1. WAL needs in zero-ed content of newly initialized page. 2. Log record for PageRepaireFragmentation now keeps array of !LP_USED offnums to redo cleanup properly.	2000-12-30 15:19:57 +00:00
Vadim B. Mikheev	c193f19a39	Fixed misprint in heap update WALoging.	2000-12-30 06:52:34 +00:00
Tom Lane	7f60b81e1a	Fix failure in CreateCheckPoint on some Alpha boxes --- it's not OK to assume that TAS() will always succeed the first time, even if the lock is known to be free. Also, make sure that code will eventually time out and report a stuck spinlock, rather than looping forever. Small cleanups in s_lock.h, too.	2000-12-29 21:31:21 +00:00
Vadim B. Mikheev	7d363c4c33	MUST update (in-memory) data page BEFORE XLogInsert to log NEW page content if WAL will decide to backup page.	2000-12-29 20:47:17 +00:00
Vadim B. Mikheev	b3c4f03c9c	nbtree_xlog_newroot: set meta flag in meta page opaque.	2000-12-29 08:08:59 +00:00
Vadim B. Mikheev	7ceeeb662f	New WAL version - CRC and data blocks backup.	2000-12-28 13:00:29 +00:00
Tom Lane	8609d4abf2	Fix portability problems recently exposed by regression tests on Alphas. 1. Distinguish cases where a Datum representing a tuple datatype is an OID from cases where it is a pointer to TupleTableSlot, and make sure we use the right typlen in each case. 2. Make fetchatt() and related code support 8-byte by-value datatypes on machines where Datum is 8 bytes. Centralize knowledge of the available by-value datatype sizes in two macros in tupmacs.h, so that this will be easier if we ever have to do it again.	2000-12-27 23:59:14 +00:00
Tom Lane	6cc842abd3	Revise lock manager to support "session level" locks as well as "transaction level" locks. A session lock is not released at transaction commit (but it is released on transaction abort, to ensure recovery after an elog(ERROR)). In VACUUM, use a session lock to protect the master table while vacuuming a TOAST table, so that the TOAST table can be done in an independent transaction. I also took this opportunity to do some cleanup and renaming in the lock code. The previously noted bug in ProcLockWakeup, that it couldn't wake up any waiters beyond the first non-wakeable waiter, is now fixed. Also found a previously unknown bug of the same kind (failure to scan all members of a lock queue in some cases) in DeadLockCheck. This might have led to failure to detect a deadlock condition, resulting in indefinite waits, but it's difficult to characterize the conditions required to trigger a failure.	2000-12-22 00:51:54 +00:00
Bruce Momjian	1f159e562b	>> Here is a patch for the beos port (All regression tests are OK). >> xlog.c : special case for beos to avoid 'link' which does not work yet >> beos/sem.c : implementation of new sem_ctl call (GETPID) and a new >sem_op >> flag (IPCNOWAIT) >> dynloader/beos.c : add a verification of symbol validity (seem that the >> loader sometime return OK with an invalid symbol) >> postmaster.c : add beos forking support for the new checkpoint process >> postgres.c : remove beos special case for getrusage >> beos.h : Correction of a bas definition of AF_UNIX, misc defnitions >> >> >> thanks >> >> >> cyril Cyril VELTER	2000-12-18 18:45:05 +00:00
Tom Lane	a626b78c89	Clean up backend-exit-time cleanup behavior. Use on_shmem_exit callbacks to ensure that we have released buffer refcounts and so forth, rather than putting ad-hoc operations before (some of the calls to) proc_exit. Add commentary to discourage future hackers from repeating that mistake.	2000-12-18 00:44:50 +00:00
Vadim B. Mikheev	5bb4f723d2	Remove elog for online log files.	2000-12-11 19:27:42 +00:00
Vadim B. Mikheev	dae369d390	elog(LOG)-->elog(DEBUG) for skipped logs.	2000-12-11 18:02:25 +00:00

1 2 3 4 5 ...

643 Commits