postgresql

Commit Graph

Author	SHA1	Message	Date
Peter Eisentraut	88047e59ba	Fix an error when a set-returning function fails halfway through the execution If the function using yield to return rows fails halfway, the iterator stays open and subsequent calls to the function will resume reading from it. The fix is to unref the iterator and set it to NULL if there has been an error. Jan Urbański	2011-01-18 23:22:37 +02:00
Bruce Momjian	8995440e38	In test_fsync, adjust test headings to match wal_sync_method values; add more test cases for open_sync of different sizes.	2011-01-18 15:53:55 -05:00
Tom Lane	1b393f4e5d	Avoid detoast in texteq/textne/byteaeq/byteane for unequal-length strings. We can get the length of a compressed or out-of-line datum without actually detoasting it. If the lengths of two strings are unequal, we can then conclude they are unequal without detoasting. That saves considerable work in an admittedly less-common case, without costing anything much when the optimization doesn't apply. Noah Misch	2011-01-18 14:11:54 -05:00
Magnus Hagander	6e1726d082	Log replication connections only when log_connections is on Previously we'd always log replication connections, with no way to turn them off.	2011-01-18 20:02:25 +01:00
Heikki Linnakangas	b1dc45c11d	Fix thinko in comment. Spotted by Jim Nasby.	2011-01-18 10:46:13 +02:00
Bruce Momjian	4acfd43a7d	Remove "github test" that somehow got into my tree. Sorry.	2011-01-17 21:40:42 -05:00
Bruce Momjian	2c38cce1be	github test	2011-01-17 20:48:49 -05:00
Peter Eisentraut	46211da1b8	Use HTABs instead of Python dictionary objects to cache procedures Two separate hash tables are used for regular procedures and for trigger procedures, since the way trigger procedures work is quite different from normal stored procedures. Change the signatures of PLy_procedure_{get,create} to accept the function OID and a Boolean flag indicating whether it's a trigger. This should make implementing a PL/Python validator easier. Using HTABs instead of Python dictionaries makes error recovery easier, and allows for procedures to be cached based on their OIDs, not their names. It also allows getting rid of the PyCObject field that used to hold a pointer to PLyProcedure, since PyCObjects are deprecated in Python 2.7 and replaced by Capsules in Python 3. Jan Urbański	2011-01-17 21:46:36 +02:00
Tom Lane	bdd8ed973d	Fix miscalculation of itemsafter in array_set_slice(). If the slice to be assigned to was before the existing array lower bound (requiring at least one null element to spring into existence to fill the gap), the code miscalculated how many entries needed to be copied from the old array's null bitmap. This could result in trashing the array's data area (as seen in bug #5840 from Karsten Loesing), or worse. This has been broken since we first allowed the behavior of assigning to non-adjacent slices, in 8.2. Back-patch to all affected versions.	2011-01-17 12:38:52 -05:00
Alvaro Herrera	978445bece	Increment Py_None refcount for NULL array elements Per bug #5835 by Julien Demoor Author: Alex Hunsaker	2011-01-17 13:04:53 -03:00
Bruce Momjian	08af45f4ff	Add getopt() support to test_fsync; also fix printf() format problem.	2011-01-17 09:36:25 -05:00
Magnus Hagander	48075095ac	Set fallback_application_name in walreceiver Makes replication slaves identify themselves in the new pg_stat_replication view.	2011-01-17 11:42:53 +01:00
Heikki Linnakangas	34ef02b4d4	Before exiting walreceiver, fsync() all the WAL received. Otherwise WAL recovery will replay the un-flushed WAL after walreceiver has exited, which can lead to a non-recoverable standby if the system crashes hard at that point.	2011-01-17 12:27:35 +02:00
Bruce Momjian	e0c274679c	In test_fsync, use #define for printf format of ops/sec.	2011-01-16 08:36:43 -05:00
Bruce Momjian	6dc15e3bef	Use O_DIRECT in O_SYNC test of different size. Restructure O_DIRECT error reporting to be more consistent.	2011-01-15 19:40:49 -05:00
Bruce Momjian	3eebb33ddd	Reverse number of stars used for test_fsync details.	2011-01-15 18:40:10 -05:00
Bruce Momjian	431605f666	In test_fsync, warn about options without o_direct that are not used by Postgres, and cases where o_direct does not work with certain file systems.	2011-01-15 18:27:43 -05:00
Tom Lane	6ca452ba7f	Move a couple of declarations to reflect where the routines really are.	2011-01-15 16:09:05 -05:00
Tom Lane	36750dcef5	Add .gitignore to silence git complaints about parser/scanner output files.	2011-01-15 16:05:28 -05:00
Bruce Momjian	001d3664e3	Have test_fsync output details that fdatasync is the default wal_sync_method on Linux.	2011-01-15 15:00:20 -05:00
Bruce Momjian	169516ad93	Restructure test_fync to use modular C so there is less duplicate code and it can be enhanced easier.	2011-01-15 14:42:48 -05:00
Magnus Hagander	3866ff6149	Enumerate available tablespaces after starting the backup This closes a race condition where if a tablespace was created after the enumeration happened but before the do_pg_start_backup() was called, the backup would be incomplete. Now that it's done while we are in backup mode, WAL replay will recreate it during restore. Noted by Heikki.	2011-01-15 19:31:16 +01:00
Bruce Momjian	3ab80cfe03	Improve output display of test_fsync.	2011-01-15 12:24:05 -05:00
Bruce Momjian	677b06ca46	Apply patch for test_fsync to add tests for O_DIRECT. Adjusted patch by Josh Berkus	2011-01-15 11:55:13 -05:00
Heikki Linnakangas	8f5d65e916	Treat a WAL sender process that hasn't started streaming yet as a regular backend, as far as the postmaster shutdown logic is concerned. That means, fast shutdown will wait for WAL sender processes to exit before signaling bgwriter to finish. This avoids race conditions between a base backup stopping or starting, and bgwriter writing the shutdown checkpoint WAL record. We don't want e.g the end-of-backup WAL record to be written after the shutdown checkpoint.	2011-01-15 16:38:21 +02:00
Magnus Hagander	fcd810c69a	Use a lexer and grammar for parsing walsender commands Makes it easier to parse mainly the BASE_BACKUP command with it's options, and avoids having to manually deal with quoted identifiers in the label (previously broken), and makes it easier to add new commands and options in the future. In passing, refactor the case statement in the walsender to put each command in it's own function.	2011-01-14 16:30:33 +01:00
Magnus Hagander	688423d004	Exit from base backups when shutdown is requested When the exit waits until the whole backup completes, it may take a very long time. In passing, add back an error check in the main loop so we detect clients that disconnect much earlier if the backup is large.	2011-01-14 12:36:45 +01:00
Tom Lane	52948169bc	Code review for postmaster.pid contents changes. Fix broken test for pre-existing postmaster, caused by wrong code for appending lines to the lockfile; don't write a failed listen_address setting into the lockfile; don't arbitrarily change the location of the data directory in the lockfile compared to previous releases; provide more consistent and useful definitions of the socket path and listen_address entries; avoid assuming that pg_ctl has the same DEFAULT_PGSOCKET_DIR as the postmaster; assorted code style improvements.	2011-01-13 19:01:28 -05:00
Tom Lane	f0f36045b2	Revert incorrect memory-conservation hack in inheritance_planner(). This reverts commit `d1001a78ce` of 2010-12-05, which was broken as reported by Jeff Davis. The problem is that the individual planning steps may have side-effects on substructures of PlannerGlobal, not only the current PlannerInfo root. Arranging to keep all such side effects in the main planning context is probably possible, but it would change this from a quick local hack into a wide-ranging and rather fragile endeavor. Which it's not worth.	2011-01-13 14:33:19 -05:00
Magnus Hagander	9eacd427e8	Make sure walsender state is only read while holding the spinlock Noted by Robert Haas.	2011-01-13 18:51:13 +01:00
Heikki Linnakangas	a5a02a7445	Fix the logic in libpqrcv_receive() to determine if there's any incoming data that can be read without blocking. It used to conclude that there isn't, even though there was data in the socket receive buffer. That lead walreceiver to flush the WAL after every received chunk, potentially causing big performance issues. Backpatch to 9.0, because the performance impact can be very significant.	2011-01-13 18:26:39 +02:00
Peter Eisentraut	c667cc24e8	Workaround for recursive make breakage Changing a file two directory levels deep under src/backend/ would not cause the postgres binary to be rebuilt. This change fixes it, but no one knows why.	2011-01-13 09:32:06 +02:00
Peter Eisentraut	35eb0958be	Don't run regression tests in SQL_ASCII encoding by default Instead, run them in the encoding that the locale selects, which is more representative of real use. Also document how locale and encoding for regression test runs can be selected.	2011-01-13 09:16:55 +02:00
Tom Lane	d487afbb81	Fix PlanRowMark/ExecRowMark structures to handle inheritance correctly. In an inherited UPDATE/DELETE, each target table has its own subplan, because it might have a column set different from other targets. This means that the resjunk columns we add to support EvalPlanQual might be at different physical column numbers in each subplan. The EvalPlanQual rewrite I did for 9.0 failed to account for this, resulting in possible misbehavior or even crashes during concurrent updates to the same row, as seen in a recent report from Gordon Shannon. Revise the data structure so that we track resjunk column numbers separately for each subplan. I also chose to move responsibility for identifying the physical column numbers back to executor startup, instead of assuming that numbers derived during preprocess_targetlist would stay valid throughout subsequent massaging of the plan. That's a bit slower, so we might want to consider undoing it someday; but it would complicate the patch considerably and didn't seem justifiable in a bug fix that has to be back-patched to 9.0.	2011-01-12 20:47:02 -05:00
Robert Haas	7a32ff9732	Revert patch adding support for logging the current role. This reverts commit `a8a8867912`, committed by me earlier today (2011-01-12). This isn't safe inside an aborted transaction. Noted by Tom Lane.	2011-01-12 11:59:21 -05:00
Robert Haas	a8a8867912	Add support for logging the current role. Stephen Frost, with some editorialization by me.	2011-01-12 11:34:53 -05:00
Andrew Dunstan	b7a0b42641	Unbreak regression tests, apparently broken by commit `4c8e20f`	2011-01-11 22:27:20 -05:00
Peter Eisentraut	e3094fd3a8	Re-add recursive coverage target in src/backend/ This was lost during the recent recursive make change.	2011-01-12 00:26:20 +02:00
Magnus Hagander	4c8e20f815	Track walsender state in shared memory and expose in pg_stat_replication	2011-01-11 21:25:28 +01:00
Magnus Hagander	47a5f3e9da	Add missing function prototype, for consistency	2011-01-11 21:12:12 +01:00
Tom Lane	e6dce4e439	Adjust basebackup.c to suppress compiler warnings. Some versions of gcc complain about "variable `tablespaces' might be clobbered by `longjmp' or `vfork'" with the original coding. Fix by moving the PG_TRY block into a separate subroutine.	2011-01-11 13:41:13 -05:00
Tom Lane	9d1ac2f5fa	Tweak create_index_paths()'s test for whether to consider a bitmap scan. Per my note of a couple days ago, create_index_paths would refuse to consider any path at all for GIN indexes if the selectivity estimate came out as 1.0; not even if you tried to force it with enable_seqscan. While this isn't really a bad outcome in practice, it could be annoying for testing purposes. Adjust the test for "is this path only useful for sorting" so that it doesn't fire on paths with nil pathkeys, which will include all GIN paths.	2011-01-11 12:13:02 -05:00
Magnus Hagander	b7ebda9d8c	Reset walsender ps title in the main loop When in streaming mode we can never get out, so it will never be required, but after a base backup (or other operations) we can get back to the loop, so the title needs to be cleared.	2011-01-11 10:04:54 +01:00
Magnus Hagander	2e36343f82	Set process title to indicate base backup is running	2011-01-10 21:53:18 +01:00
Heikki Linnakangas	dc1305ce5f	Leave temporary files out of streaming base backups.	2011-01-10 19:42:05 +02:00
Magnus Hagander	0eb59c4591	Backend support for streaming base backups Add BASE_BACKUP command to walsender, allowing it to stream a base backup to the client (in tar format). The syntax is still far from ideal, that will be fixed in the switch to use a proper grammar for walsender. No client included yet, will come as a separate commit. Magnus Hagander and Heikki Linnakangas	2011-01-10 14:04:19 +01:00
Magnus Hagander	4448917d51	Split pg_start_backup() and pg_stop_backup() into two pieces Move the actual functionality into a separate function that's easier to call internally, and change the SQL-callable function to be a wrapper calling this. Also create a pg_abort_backup() function, only callable internally, that does only the most vital parts of pg_stop_backup(), making it safe(r) to call from error handlers.	2011-01-09 21:00:28 +01:00
Heikki Linnakangas	ca63029eac	Fix crash in the new GiST insertion code, when an update splits the root page. This bug was exercised by contrib/intarray/bench, as noted by Tom Lane.	2011-01-09 21:36:22 +02:00
Tom Lane	52fd2d65a3	Fix up core tsquery GIN support for new extractQuery API. No need for the empty-prefix-match kluge to force a full scan anymore.	2011-01-09 14:34:50 -05:00
Tom Lane	304845075c	Use array_contains_nulls instead of ARR_HASNULL on user-supplied arrays. This applies the fix for bug #5784 to remaining places where we wish to reject nulls in user-supplied arrays. In all these places, there's no reason not to allow a null bitmap to be present, so long as none of the current elements are actually null. I did not change some other places where we are looking at system catalog entries or aggregate transition values, as the presence of a null bitmap in such an array would be suspicious.	2011-01-09 13:09:07 -05:00
Magnus Hagander	361418be7c	Ensure the directory for gram.h is created on win32 Result of bad testing of my last commit.	2011-01-09 17:01:15 +01:00
Magnus Hagander	3457514c2d	Properly install gram.h on MSVC builds This file is now needed by pgAdmin builds, which started failing since it was missing in the installer builds.	2011-01-09 15:31:48 +01:00
Magnus Hagander	db4d22d0ef	Add pgreadlink() on Windows to read junction points Add support for reading back information about the symbolic links we've created with pgsymlink(), which are actually Junction Points. Just like pgsymlink() can only create directory symlinks, pgreadlink() can only read directory symlinks.	2011-01-09 15:09:19 +01:00
Michael Meskes	1066dbfb85	There is no need to have to identical functions in ecpg thus removing one of them.	2011-01-09 12:47:43 +01:00
Tom Lane	adf328c0e1	Add array_contains_nulls() function in arrayfuncs.c. This will support fixing contrib/intarray (and probably other places) so that they don't have to fail on arrays that contain a null bitmap but no live null entries.	2011-01-08 20:26:14 -05:00
Tom Lane	4d1b76e49e	Fix up gincostestimate for new extractQuery API. The only reason this wasn't crashing while testing the core anyarray operators was that it was disabled for those cases because of passing the wrong type information to get_opfamily_proc :-(. So fix that too, and make it insist on finding the support proc --- in hindsight, silently doing nothing is not as sane a coping mechanism as all that.	2011-01-08 20:26:13 -05:00
Michael Meskes	833a2b57bc	In ecpg's parser removed a fixed length limit for constants defining an array dimension.	2011-01-08 23:04:50 +01:00
Tom Lane	7e2f906201	Remove pg_am.amindexnulls. The only use we have had for amindexnulls is in determining whether an index is safe to cluster on; but since the addition of the amclusterable flag, that usage is pretty redundant. In passing, clean up assorted sloppiness from the last patch that touched pg_am.h: Natts_pg_am was wrong, and ambuildempty was not documented.	2011-01-08 16:08:05 -05:00
Tom Lane	56a57473a9	Refactor GIN's handling of duplicate search entries. The original coding could combine duplicate entries only when they originated from the same qual condition. In particular it could not combine cases where multiple qual conditions all give rise to full-index scan requests, which is an expensive case well worth optimizing. Refactor so that duplicates are recognized across all the quals.	2011-01-08 14:48:08 -05:00
Bruce Momjian	d8d3d2a4f3	Fix pg_upgrade of large object permissions by preserving pg_auth.oid, which is stored in pg_largeobject_metadata. No backpatch to 9.0 because you can't migrate from 9.0 to 9.0 with the same catversion (because of tablespace conflict), and a pre-9.0 migration to 9.0 has not large object permissions to migrate.	2011-01-07 21:59:29 -05:00
Bruce Momjian	2896c87ce4	Force pg_upgrade's to preserve pg_class.oid, not pg_class.relfilenode. Toast tables have identical pg_class.oid and pg_class.relfilenode, but for clarity it is good to preserve the pg_class.oid. Update comments regarding what is preserved, and do some variable/function renaming for clarity.	2011-01-07 21:26:13 -05:00
Tom Lane	a032d50128	Fix the built-in GIN support procedure declarations in pg_proc.h. Add more "internal" arguments so that these pg_proc entries reflect the current preferred API. This is purely a cosmetic change, since GIN doesn't actually consult the pg_proc entry when calling a support function. Accordingly, no catversion bump.	2011-01-07 20:40:48 -05:00
Tom Lane	73912e7fbd	Fix GIN to support null keys, empty and null items, and full index scans. Per my recent proposal(s). Null key datums can now be returned by extractValue and extractQuery functions, and will be stored in the index. Also, placeholder entries are made for indexable items that are NULL or contain no keys according to extractValue. This means that the index is now always complete, having at least one entry for every indexed heap TID, and so we can get rid of the prohibition on full-index scans. A full-index scan is implemented much the same way as partial-match scans were already: we build a bitmap representing all the TIDs found in the index, and then drive the results off that. Also, introduce a concept of a "search mode" that can be requested by extractQuery when the operator requires matching to empty items (this is just as cheap as matching to a single key) or requires a full index scan (which is not so cheap, but it sure beats failing or giving wrong answers). The behavior remains backward compatible for opclasses that don't return any null keys or request a non-default search mode. Using these features, we can now make the GIN index opclass for anyarray behave in a way that matches the actual anyarray operators for &&, <@, @>, and = ... which it failed to do before in assorted corner cases. This commit fixes the core GIN code and ginarrayprocs.c, updates the documentation, and adds some simple regression test cases for the new behaviors using the array operators. The tsearch and contrib GIN opclass support functions still need to be looked over and probably fixed. Another thing I intend to fix separately is that this is pretty inefficient for cases where more than one scan condition needs a full-index search: we'll run duplicate GinScanEntrys, each one of which builds a large bitmap. There is some existing logic to merge duplicate GinScanEntrys but it needs refactoring to make it work for entries belonging to different scan keys. Note that most of gin.h has been split out into a new file gin_private.h, so that gin.h doesn't export anything that's not supposed to be used by GIN opclasses or the rest of the backend. I did quite a bit of other code beautification work as well, mostly fixing comments and choosing more appropriate names for things.	2011-01-07 19:16:24 -05:00
Robert Haas	9b4271deb9	Document pg_stat_replication, bump catversion since that was overlooked. Itagaki Takahiro, edited by me.	2011-01-07 11:06:55 -05:00
Robert Haas	a9f72b4083	Improve recovery.conf.sample comments. Jehan-Guillaume de Rorthais, with some additional wordsmithing by me.	2011-01-07 11:01:25 -05:00
Itagaki Takahiro	a755ea33ae	New system view pg_stat_replication displays activity of wal sender processes. Itagaki Takahiro and Simon Riggs.	2011-01-07 20:35:38 +09:00
Bruce Momjian	46d28820b6	Improve C comments about backend variables set by pg_upgrade_support functions.	2011-01-06 22:45:36 -05:00
Tom Lane	6c596c29a3	Update sequence_1.out for recent changes in sequence regression test.	2011-01-06 10:58:32 -05:00
Bruce Momjian	5cff5b5779	Clarify pg_upgrade's creation of the map file structure. Also clean up pg_dump's calling of pg_upgrade_support functions.	2011-01-05 11:37:08 -05:00
Magnus Hagander	66a8a0428d	Give superusers REPLIACTION permission by default This can be overriden by using NOREPLICATION on the CREATE ROLE statement, but by default they will have it, making it backwards compatible and "less surprising" (given that superusers normally override all checks).	2011-01-05 14:24:17 +01:00
Itagaki Takahiro	14158f25cd	Improve psql tab completion for CREATE/ALTER ROLE [NO]REPLICATION. Missing support for VALID UNTIL in CREATE ROLE is also added.	2011-01-04 17:56:01 +09:00
Robert Haas	7f60be72b0	Fix crash in ALTER OPERATOR CLASS/FAMILY .. SET SCHEMA. In the previous coding, the parser emitted a List containing a C string, which is no good, because copyObject() can't handle it. Dimitri Fontaine	2011-01-03 22:08:55 -05:00
Robert Haas	dc8a14311a	Update comments in RecordTransactionCommit() to mention unlogged tables.	2011-01-03 10:29:22 -05:00
Magnus Hagander	77745cc7f1	Bump catversion, forgot in previous commit.	2011-01-03 12:50:30 +01:00
Magnus Hagander	40d9e94bd7	Add views and functions to monitor hot standby query conflicts Add the view pg_stat_database_conflicts and a column to pg_stat_database, and the underlying functions to provide the information.	2011-01-03 12:46:03 +01:00
Magnus Hagander	c0e96b49e5	perltidy run on the MSVC build system Forgot this with previuos commit, line it up so it's easier to submit (readable) patches against the MSVC build system.	2011-01-03 10:44:56 +01:00
Peter Eisentraut	39b8843296	Implement remaining fields of information_schema.sequences view Add new function pg_sequence_parameters that returns a sequence's start, minimum, maximum, increment, and cycle values, and use that in the view. (bug #5662; design suggestion by Tom Lane) Also slightly adjust the view's column order and permissions after review of SQL standard.	2011-01-02 15:15:21 +02:00
Robert Haas	e657b55e66	Fix typo. Noted by Magnus Hagander.	2011-01-02 07:26:10 -05:00
Robert Haas	0d692a0dc9	Basic foreign table support. Foreign tables are a core component of SQL/MED. This commit does not provide a working SQL/MED infrastructure, because foreign tables cannot yet be queried. Support for foreign table scans will need to be added in a future patch. However, this patch creates the necessary system catalog structure, syntax support, and support for ancillary operations such as COMMENT and SECURITY LABEL. Shigeru Hanada, heavily revised by Robert Haas	2011-01-01 23:48:11 -05:00
Robert Haas	d7acf6cc4a	Fix pg_dump support for security labels on columns. Along the way, correct an erroneous comment.	2011-01-01 17:44:28 -05:00
Peter Eisentraut	6a208aa404	Allow casting a table's row type to the table's supertype if it's a typed table This is analogous to the existing facility that allows casting a row type to a supertable's row type.	2011-01-01 23:04:14 +02:00
Bruce Momjian	92a73d2190	Add #include <time.h> to pg_ctl.c to fix compiler warning.	2011-01-01 15:55:36 -05:00
Bruce Momjian	5d950e3b0c	Stamp copyrights for year 2011.	2011-01-01 13:18:15 -05:00
Bruce Momjian	30aeda4394	Include the first valid listen address in pg_ctl to improve server start "wait" detection and add postmaster start time to help determine if the postmaster is actually using the specified data directory.	2010-12-31 17:25:02 -05:00
Tom Lane	39c8dd6620	Invert and rename flag variable to improve code readability. No change in functionality. Per discussion with Robert.	2010-12-31 11:59:38 -05:00
Tom Lane	7b46401557	Move symbols for ExecMergeJoin's state machine into nodeMergejoin.c. There's no reason for these values to be known anywhere else. After doing this, executor/execdefs.h is vestigial and can be removed.	2010-12-30 22:12:40 -05:00
Tom Lane	f4e4b32743	Support RIGHT and FULL OUTER JOIN in hash joins. This is advantageous first because it allows us to hash the smaller table regardless of the outer-join type, and second because hash join can be more flexible than merge join in dealing with arbitrary join quals in a FULL join. For merge join all the join quals have to be mergejoinable, but hash join will work so long as there's at least one hashjoinable qual --- the others can be any condition. (This is true essentially because we don't keep per-inner-tuple match flags in merge join, while hash join can do so.) To do this, we need a has-it-been-matched flag for each tuple in the hashtable, not just one for the current outer tuple. The key idea that makes this practical is that we can store the match flag in the tuple's infomask, since there are lots of bits there that are of no interest for a MinimalTuple. So we aren't increasing the size of the hashtable at all for the feature. To write this without turning the hash code into even more of a pile of spaghetti than it already was, I rewrote ExecHashJoin in a state-machine style, similar to ExecMergeJoin. Other than that decision, it was pretty straightforward.	2010-12-30 20:26:08 -05:00
Alvaro Herrera	55573990ca	Avoid unnecessary public struct declaration in slru.h Instead, declare a public wrapper of the sole function using it for external callers, so that they don't have to always pass a NULL argument. Author: Kevin Grittner	2010-12-30 12:09:17 -03:00
Robert Haas	d2bc1c9907	Bump XLOG_PAGE_MAGIC. The unlogged tables patch (commit `53dbc27c62`, 2010-12-29) should have done this, since it changes the format of an XLOG_SMGR_CREATE record.	2010-12-29 07:19:21 -05:00
Robert Haas	53dbc27c62	Support unlogged tables. The contents of an unlogged table are WAL-logged; thus, they are not available on standby servers and are truncated whenever the database system enters recovery. Indexes on unlogged tables are also unlogged. Unlogged GiST indexes are not currently supported.	2010-12-29 06:48:53 -05:00
Magnus Hagander	9b8aff8c19	Add REPLICATION privilege for ROLEs This privilege is required to do Streaming Replication, instead of superuser, making it possible to set up a SR slave that doesn't have write permissions on the master. Superuser privileges do NOT override this check, so in order to use the default superuser account for replication it must be explicitly granted the REPLICATION permissions. This is backwards incompatible change, in the interest of higher default security.	2010-12-29 11:05:03 +01:00
Tom Lane	f2ba1e994c	Avoid unexpected conversion overflow in planner for distant date values. The "date" type supports a wider range of dates than int64 timestamps do. However, there is pre-int64-timestamp code in the planner that assumes that all date values can be converted to timestamp with impunity. Fortunately, what we really need out of the conversion is always a double (float8) value; so even when the date is out of timestamp's range it's possible to produce a sane answer. All we need is a code path that doesn't try to force the result into int64. Per trouble report from David Rericha. Back-patch to all supported versions. Although this is surely a corner case, there's not much point in advertising a date range wider than timestamp's if we will choke on such values in unexpected places.	2010-12-28 22:49:57 -05:00
Tom Lane	81a530a65e	Fix ill-advised placement of PGRES_COPY_BOTH enum value. It must be added at the end of the ExecStatusType enum to avoid ABI breakage compared to previous libpq versions. Noted by Magnus.	2010-12-28 11:02:10 -05:00
Bruce Momjian	b4d3792daa	Another fix for larger postmaster.pid files.	2010-12-28 09:34:46 -05:00
Bruce Momjian	bada44a2a2	Fix code to properly pull out shared memory key now that the postmaster.pid file is larger than in previous major versions. This is a bug introduced when I added lines to the file recently.	2010-12-27 23:11:33 -05:00
Tom Lane	f79136439f	Remove -fno-operator-names switch from cpluspluscheck. No longer needed now that bitand() and bitor() have been renamed.	2010-12-27 15:03:24 -05:00
Tom Lane	84fc571395	Rename the C functions bitand(), bitor() to bit_and(), bit_or(). This is to avoid use of the C++ keywords "bitand" and "bitor" in the header file utils/varbit.h. Note the functions' SQL-level names are not changed, only their C-level names. In passing, make some comments in varbit.c conform to project-standard layout.	2010-12-27 14:57:41 -05:00
Tom Lane	8c61f81b31	Rearrange cpluspluscheck to check just one .h file at a time. This is slower than the original coding but avoids the problem of including files in an unpredictable order. Aside from being more trustworthy, we can get rid of some exclusions that were formerly made for what turn out to be ordering or re-inclusion problems. I also modified it to include libpq's exported files in the check. ecpg should be included as well, but I'm unclear on which ecpg .h files are meant to be included by clients.	2010-12-27 12:51:44 -05:00
Tom Lane	37b61a69f3	Fix failure of executor/hashjoin.h to compile standalone. Noted while experimenting with cpluspluscheck.	2010-12-27 12:20:09 -05:00
Tom Lane	a977db6f1c	Tweak cpluspluscheck to avoid directly #include'ing gram.h. gram.h has ordering dependencies, which are satisfied when it's included from gramparse.h, but might not be if it's pulled in directly.	2010-12-27 11:36:52 -05:00
Tom Lane	275411912d	Fix ill-chosen use of "private" as an argument and struct field name. "private" is a keyword in C++, so this breaks the poorly-enforced policy that header files should be include-able in C++ code. Per report from Craig Ringer and some investigation with cpluspluscheck.	2010-12-27 11:26:19 -05:00
Robert Haas	63676ebff4	Corrections to patch adding SQL/MED error codes. My previous commit, `85cff3ce7f` on 2010-12-25, failed to update errcodes.sgml or plerrcodes.h. This patch corrects that oversight, per a gripe from Tom Lane, and also corrects a typographical error.	2010-12-26 21:35:25 -05:00
Andrew Dunstan	a534728afb	Only build in crashdump support on Windows if there's a working dbghelp.h.	2010-12-26 10:34:47 -05:00
Robert Haas	85cff3ce7f	Add foreign data wrapper error code values for SQL/MED. Extracted from a much larger patch by Shigeru Hanada.	2010-12-25 13:57:39 -05:00
Andrew Dunstan	04ee0db6b2	Allow vpath builds and regression tests to succeed on Mingw. Backpatch to release 8.4 - earlier releases would require more changes and it's not worth the trouble.	2010-12-24 13:31:28 -05:00
Bruce Momjian	5000472112	Remove quotes from boolean recovery.conf.sample parameters, now that the quotes are not required. This now matches postgresql.conf's specification of booleans.	2010-12-24 11:51:51 -05:00
Bruce Momjian	075354ad1b	Improve "pg_ctl -w start" server detection by writing the postmaster port and socket directory into postmaster.pid, and have pg_ctl read from that file, for use by PQping().	2010-12-24 09:45:52 -05:00
Michael Meskes	727a5a1620	Added rule to ecpg lexer to accept "Unicode surrogate pair in extended quoted string". This is not really needed because the string gets copied to the output untranslated anyway, but by adding this rule the lexer stays in sync with the backend lexer.	2010-12-23 20:37:42 +01:00
Heikki Linnakangas	9de3aa65f0	Rewrite the GiST insertion logic so that we don't need the post-recovery cleanup stage to finish incomplete inserts or splits anymore. There was two reasons for the cleanup step: 1. When a new tuple was inserted to a leaf page, the downlink in the parent needed to be updated to contain (ie. to be consistent with) the new key. Updating the parent in turn might require recursively updating the parent of the parent. We now handle that by updating the parent while traversing down the tree, so that when we insert the leaf tuple, all the parents are already consistent with the new key, and the tree is consistent at every step. 2. When a page is split, we need to insert the downlink for the new right page(s), and update the downlink for the original page to not include keys that moved to the right page(s). We now handle that by setting a new flag, F_FOLLOW_RIGHT, on the non-rightmost pages in the split. When that flag is set, scans always follow the rightlink, regardless of the NSN mechanism used to detect concurrent page splits. That way the tree is consistent right after split, even though the downlink is still missing. This is very similar to the way B-tree splits are handled. When the downlink is inserted in the parent, the flag is cleared. To keep the insertion algorithm simple, when an insertion sees an incomplete split, indicated by the F_FOLLOW_RIGHT flag, it finishes the split before doing anything else. These changes allow removing the whole "invalid tuple" mechanism, but I retained the scan code to still follow invalid tuples correctly. While we don't create any such tuples anymore, we want to handle them gracefully in case you pg_upgrade a GiST index that has them. If we encounter any on an insert, though, we just throw an error saying that you need to REINDEX. The issue that got me into doing this is that if you did a checkpoint while an insert or split was in progress, and the checkpoint finishes quickly so that there is no WAL record related to the insert between RedoRecPtr and the checkpoint record, recovery from that checkpoint would not know to finish the incomplete insert. IOW, we have the same issue we solved with the rm_safe_restartpoint mechanism during normal operation too. It's highly unlikely to happen in practice, and this fix is far too large to backpatch, so we're just going to live with in previous versions, but this refactoring fixes it going forward. With this patch, you don't get the annoying 'index "FOO" needs VACUUM or REINDEX to finish crash recovery' notices anymore if you crash at an unfortunate moment.	2010-12-23 16:21:47 +02:00
Magnus Hagander	de9a4c27fe	Add PQlibVersion() function to libpq This function is like the PQserverVersion() function except it returns the version of libpq, making it possible for a client program or driver to determine which version of libpq is in use at runtime, and not just at link time. Suggested by Harald Armin Massa and several others.	2010-12-22 14:23:56 +01:00
Robert Haas	32ba2b5160	Use memcmp() rather than strncmp() when shorter string length is known. It appears that this will be faster for all but the shortest strings; at least one some platforms, memcmp() can use word-at-a-time comparisons. Noah Misch, somewhat pared down.	2010-12-21 22:11:40 -05:00
Robert Haas	c5160b7eec	Fix typos. Andreas Karlsson	2010-12-21 17:58:53 -05:00
Robert Haas	24ecde7742	Work around unfortunate getppid() behavior on BSD-ish systems. On MacOS X, and apparently also on other BSD-derived systems, attaching a debugger causes getppid() to return the pid of the debugging process rather than the actual parent PID. As a result, debugging the autovacuum launcher, startup process, or WAL sender on such systems causes it to exit, because the previous coding of PostmasterIsAlive() detects postmaster death by testing whether getppid() == PostmasterPid. Work around that behavior by checking the return value of getppid() more carefully. If it's PostmasterPid, the postmaster must be alive; if it's 1, assume the postmaster is dead. If it's any other value, assume we've been debugged and fall through to the less-reliable kill() test. Review by Tom Lane.	2010-12-21 06:30:32 -05:00
Robert Haas	f6a0863e3c	Allow transactions that don't write WAL to commit asynchronously. This case can arise if a transaction has written data, but only to temporary tables. Loss of the commit record in case of a crash won't matter, because the temporary tables will be lost anyway. Reviewed by Heikki Linnakangas and Simon Riggs.	2010-12-20 12:59:33 -05:00
Magnus Hagander	d382828f6e	Remove thread dumping constant that requires newer Platform SDK Since we're not multithreaded it only provides marginally useful information, and it does require a newer version of the Platform SDK than we target. We may want to reconsider this in the future along with a fix for MinGW.	2010-12-19 21:32:58 +01:00
Tom Lane	1b19e2c0ba	Fix up handling of simple-form CASE with constant test expression. eval_const_expressions() can replace CaseTestExprs with constants when the surrounding CASE's test expression is a constant. This confuses ruleutils.c's heuristic for deparsing simple-form CASEs, leading to Assert failures or "unexpected CASE WHEN clause" errors. I had put in a hack solution for that years ago (see commit `514ce7a331` of 2006-10-01), but bug #5794 from Peter Speck shows that that solution failed to cover all cases. Fortunately, there's a much better way, which came to me upon reflecting that Peter's "CASE TRUE WHEN" seemed pretty redundant: we can "simplify" the simple-form CASE to the general form of CASE, by simply omitting the constant test expression from the rebuilt CASE construct. This is intuitively valid because there is no need for the executor to evaluate the test expression at runtime; it will never be referenced, because any CaseTestExprs that would have referenced it are now replaced by constants. This won't save a whole lot of cycles, since evaluating a Const is pretty cheap, but a cycle saved is a cycle earned. In any case it beats kluging ruleutils.c still further. So this patch improves const-simplification and reverts the previous change in ruleutils.c. Back-patch to all supported branches. The bug exists in 8.1 too, but it's out of warranty.	2010-12-19 15:30:44 -05:00
Tom Lane	abc1026269	Fix erroneous parsing of tsquery input "... & !(subexpression) \| ..." After parsing a parenthesized subexpression, we must pop all pending ANDs and NOTs off the stack, just like the case for a simple operand. Per bug #5793. Also fix clones of this routine in contrib/intarray and contrib/ltree, where input of types query_int and ltxtquery had the same problem. Back-patch to all supported versions.	2010-12-19 12:48:34 -05:00
Magnus Hagander	dcb09b595f	Support for collecting crash dumps on Windows Add support for collecting "minidump" style crash dumps on Windows, by setting up an exception handling filter. Crash dumps will be generated in PGDATA/crashdumps if the directory is created (the existance of the directory is used as on/off switch for the generation of the dumps). Craig Ringer and Magnus Hagander	2010-12-19 16:45:28 +01:00
Bruce Momjian	7e95337d58	Properly print the IP number and "localhost" for failed localhost connections when the server is down, on Win32.	2010-12-18 11:26:17 -05:00
Magnus Hagander	4754dbf4c3	Make GUC variables for syslog and SSL always visible Make the variables visible (but not used) even when support is not compiled in.	2010-12-18 16:53:59 +01:00
Alvaro Herrera	3026027ec3	set_ps_display when calling functions via fastpath This improves tag output by log_line_prefix	2010-12-17 18:51:22 -03:00
Alvaro Herrera	b68193c0c7	Remove unnecessary definition for autovacuum in SignalSomeChildren.	2010-12-17 15:59:19 -03:00
Robert Haas	8bd4b89e24	Try to save a kernel call in ResolveRecoveryConflictWithVirtualXIDs. If there's no work to be done, just exit quickly, before initialization.	2010-12-17 11:32:02 -05:00
Robert Haas	611fed3712	Reset 'ps' display just once when resolving VXID conflicts. This prevents the word "waiting" from briefly disappearing from the ps status line when ResolveRecoveryConflictWithVirtualXIDs begins a new iteration of the outer loop. Along the way, remove some useless pgstat_report_waiting() calls; the startup process doesn't appear in pg_stat_activity. Fujii Masao	2010-12-17 08:30:57 -05:00
Tom Lane	14ed7735f5	Improve comments around startup_hacks() code. These comments were not updated when we added the EXEC_BACKEND mechanism for Windows, even though it rendered them inaccurate. Also unify two unnecessarily-separate #ifdef __alpha code blocks.	2010-12-16 17:57:57 -05:00
Tom Lane	61b53695fb	Remove optreset from src/port/ implementations of getopt and getopt_long. We don't actually need optreset, because we can easily fix the code to ensure that it's cleanly restartable after having completed a scan over the argv array; which is the only case we need to restart in. Getting rid of it avoids a class of interactions with the system libraries and allows reversion of my change of yesterday in postmaster.c and postgres.c. Back-patch to 8.4. Before that the getopt code was a bit different anyway.	2010-12-16 16:23:05 -05:00
Alvaro Herrera	cd1fefa973	Avoid clobbering errno, per comment from Tom.	2010-12-16 17:15:37 -03:00
Alvaro Herrera	83c759ea0e	Fix inconsequential FILE pointer leakage	2010-12-16 16:45:11 -03:00
Alvaro Herrera	e359b8496d	Add some minor missing error checks	2010-12-16 12:23:07 -03:00
Alvaro Herrera	16ca75baeb	Simplify SignalSomeChildren(BACKEND_TYPE_ALL) to SignalChildren()	2010-12-16 12:23:07 -03:00
Bruce Momjian	48da2b87e3	Fix crash caused by NULL lookup when reporting IP address of failed libpq connection, per report from Magnus. This happens only on GIT master and only on Win32 because that is the platform where "" maps to an IP address (localhost).	2010-12-16 10:13:43 -05:00
Tom Lane	5cdd65f324	Fix up getopt() reset management so it works on recent mingw. The mingw people don't appear to care about compatibility with non-GNU versions of getopt, so force use of our own copy of getopt on Windows. Also, ensure that we make use of optreset when using our own copy. Per report from Andrew Dunstan. Back-patch to all versions supported on Windows.	2010-12-15 23:50:41 -05:00
Robert Haas	290f1603b4	Some copy editing of pg_read_binary_file() patch.	2010-12-15 21:02:31 -05:00
Itagaki Takahiro	03db44eae3	Add pg_read_binary_file() and whole-file-at-once versions of pg_read_file(). One of the usages of the binary version is to read files in a different encoding from the server encoding. Dimitri Fontaine and Itagaki Takahiro.	2010-12-16 06:56:28 +09:00
Robert Haas	34c70c7ac4	Instrument checkpoint sync calls. Greg Smith, reviewed by Jeff Janes	2010-12-14 09:26:19 -05:00
Robert Haas	9878e295dc	Improved tab completion for views with triggers. Allow INSERT INTO, UPDATE, and DELETE FROM to be completed with either the name of a table (as before) or the name of a view with an appropriate INSTEAD OF rule. Along the way, allow CREATE TRIGGER to be completed with INSTEAD OF, as well as BEFORE and AFTER. David Fetter, reviewed by Itagaki Takahiro	2010-12-13 22:46:55 -05:00
Robert Haas	d368e1a2a7	Allow plugins to suppress inlining and hook function entry/exit/abort. This is intended as infrastructure to allow an eventual SE-Linux plugin to support trusted procedures. KaiGai Kohei	2010-12-13 19:15:53 -05:00
Tom Lane	f5e4f743e6	Update time zone data files to tzdata release 2010o: DST law changes in Fiji and Samoa. Historical corrections for Hong Kong.	2010-12-13 12:45:31 -05:00
Robert Haas	5f7b58fad8	Generalize concept of temporary relations to "relation persistence". This commit replaces pg_class.relistemp with pg_class.relpersistence; and also modifies the RangeVar node type to carry relpersistence rather than istemp. It also removes removes rd_istemp from RelationData and instead performs the correct computation based on relpersistence. For clarity, we add three new macros: RelationNeedsWAL(), RelationUsesLocalBuffers(), and RelationUsesTempNamespace(), so that we can clarify the purpose of each check that previous depended on rd_istemp. This is intended as infrastructure for the upcoming unlogged tables patch, as well as for future possible work on global temporary tables.	2010-12-13 12:34:26 -05:00
Tom Lane	0c90442355	Reset all database-level stats in pgstat_recv_resetcounter(). We were failing to zero out some pg_stat_database counters that have been added since the initial pgstats coding. This is a bug, but not back-patching the fix since changing this behavior in a minor release seems a cure worse than the disease. Report and patch by Tomas Vondra.	2010-12-12 15:09:53 -05:00
Tom Lane	5132ad8bdf	Make S_IRGRP etc available in mingw builds as well as MSVC. (Hm, I wonder whether BCC defines them either...) Also label dangling endifs a bit better in this area.	2010-12-12 13:43:44 -05:00
Tom Lane	1319002e2e	Provide a complete set of file-permission-bit macros in win32.h. My previous patch exposed the fact that we didn't have these. Those hard-wired octal constants were actually wrong on Windows, not just inconsistent.	2010-12-11 13:11:18 -05:00
Robert Haas	d3d414696f	Allow bidirectional copy messages in streaming replication mode. Fujii Masao. Review by Alvaro Herrera, Tom Lane, and myself.	2010-12-11 09:27:37 -05:00
Magnus Hagander	20f3964291	Add required new port files to MSVC builds.	2010-12-11 14:19:08 +01:00
Tom Lane	671199929d	Move a couple of initdb's subroutines into src/port/. mkdir_p and check_data_dir will be useful in CREATE TABLESPACE, since we have agreed that that command should handle subdirectory creation just like initdb creates the PGDATA directory. Push them into src/port/ so that they are available to both initdb and the backend. Rename to pg_mkdir_p and pg_check_dir, just to be on the safe side. Add FreeBSD's copyright notice to pgmkdirp.c, since that's where the code came from originally (this really should have been in initdb.c). Very marginal code/comment cleanup.	2010-12-10 19:42:44 -05:00
Tom Lane	04f4e10cfc	Use symbolic names not octal constants for file permission flags. Purely cosmetic patch to make our coding standards more consistent --- we were doing symbolic some places and octal other places. This patch fixes all C-coded uses of mkdir, chmod, and umask. There might be some other calls I missed. Inconsistency noted while researching tablespace directory permissions issue.	2010-12-10 17:35:33 -05:00
Tom Lane	244407a710	Fix efficiency problems in tuplestore_trim(). The original coding in tuplestore_trim() was only meant to work efficiently in cases where each trim call deleted most of the tuples in the store. Which, in fact, was the pattern of the original usage with a Material node supporting mark/restore operations underneath a MergeJoin. However, WindowAgg now uses tuplestores and it has considerably less friendly trimming behavior. In particular it can attempt to trim one tuple at a time off a large tuplestore. tuplestore_trim() had O(N^2) runtime in this situation because of repeatedly shifting its tuple pointer array. Fix by avoiding shifting the array until a reasonably large number of tuples have been deleted. This can waste some pointer space, but we do still reclaim the tuples themselves, so the percentage wastage should be pretty small. Per Jie Li's report of slow percent_rank() evaluation. cume_dist() and ntile() would certainly be affected as well, along with any other window function that has a moving frame start and requires reading substantially ahead of the current row. Back-patch to 8.4, where window functions were introduced. There's no need to tweak it before that.	2010-12-10 11:33:38 -05:00
Tom Lane	663fc32e26	Eliminate O(N^2) behavior in parallel restore with many blobs. With hundreds of thousands of TOC entries, the repeated searches in reduce_dependencies() become the dominant cost. Get rid of that searching by constructing reverse-dependency lists, which we can do in O(N) time during the fix_dependencies() preprocessing. I chose to store the reverse dependencies as DumpId arrays for consistency with the forward-dependency representation, and keep the previously-transient tocsByDumpId[] array around to locate actual TOC entry structs quickly from dump IDs. While this fixes the slow case reported by Vlad Arkhipov, there is still a potential for O(N^2) behavior with sufficiently many tables: fix_dependencies itself, as well as mark_create_done and inhibit_data_for_failed_table, are doing repeated searches to deal with table-to-table-data dependencies. Possibly this work could be extended to deal with that, although the latter two functions are also used in non-parallel restore where we currently don't run fix_dependencies. Another TODO is that we fail to parallelize restore of multiple blobs at all. This appears to require changes in the archive format to fix. Back-patch to 9.0 where the problem was reported. 8.4 has potential issues as well; but since it doesn't create a separate TOC entry for each blob, it's at much less risk of having enough TOC entries to cause real problems.	2010-12-09 13:03:11 -05:00
Simon Riggs	9975c683b1	Self review of previous patch. Fix assumption that xmax >= xmin.	2010-12-09 10:20:49 +00:00
Simon Riggs	b9075a6d2f	Reduce spurious Hot Standby conflicts from never-visible records. Hot Standby conflicts only with tuples that were visible at some point. So ignore tuples from aborted transactions or for tuples updated/deleted during the inserting transaction when generating the conflict transaction ids. Following detailed analysis and test case by Noah Misch. Original report covered btree delete records, correctly observed by Heikki Linnakangas that this applies to other cases also. Fix covers all sources of cleanup records via common code.	2010-12-09 09:41:47 +00:00

1 2 3 4 5 ...

21411 Commits