postgresql

mirror of https://git.postgresql.org/git/postgresql.git synced 2024-10-02 22:16:48 +02:00

Author	SHA1	Message	Date
Tom Lane	f24fa9c1a5	Fix pg_dump's dump order for collations versus extensions. Mixing them together alphabetically won't be nice. Per my gripe of 2011-02-12.	2011-03-06 18:26:53 -05:00
Simon Riggs	a8a8a3e096	Efficient transaction-controlled synchronous replication. If a standby is broadcasting reply messages and we have named one or more standbys in synchronous_standby_names then allow users who set synchronous_replication to wait for commit, which then provides strict data integrity guarantees. Design avoids sending and receiving transaction state information so minimises bookkeeping overheads. We synchronize with the highest priority standby that is connected and ready to synchronize. Other standbys can be defined to take over in case of standby failure. This version has very strict behaviour; more relaxed options may be added at a later date. Simon Riggs and Fujii Masao, with reviews by Yeb Havinga, Jaime Casanova, Heikki Linnakangas and Robert Haas, plus the assistance of many other design reviewers.	2011-03-06 22:49:16 +00:00
Tom Lane	149b2673c2	Fix incorrect access to pg_index.indcollation. Since this field is after a variable-length field, it can't simply be accessed via the C struct for pg_index. Fortunately, the relcache already did the dirty work of pulling the information out to where it can be accessed easily, so this is a one-line fix. Andres Freund	2011-03-06 12:10:50 -05:00
Bruce Momjian	c15c1f1c15	Fix parallel gmake for extension directory addition in PL languages.	2011-03-05 18:32:39 -05:00
Andrew Dunstan	a956b16026	Add PL extension files to MSVC Install procedure.	2011-03-05 16:21:37 -05:00
Tom Lane	bfd7f8cbb2	Make plpythonu language use plpython2 shared library directly. The original scheme for this was to symlink plpython.$DLSUFFIX to plpython2.$DLSUFFIX, but that doesn't work on Windows, and only accidentally failed to fail because of the way that CREATE LANGUAGE created or didn't create new C functions. My changes of yesterday exposed the weakness of that approach. To fix, get rid of the symlink and make pg_pltemplate show what's really going on.	2011-03-05 15:13:15 -05:00
Tom Lane	ba0c93a0f4	Convert createlang/droplang to use CREATE/DROP EXTENSION. In createlang this is a one-line change. In droplang there's a whole lot of cruft that can be discarded since the extension mechanism now manages removal of the language's support functions. Also, add deprecation notices to these two programs' reference pages, since per discussion we may toss them overboard altogether in a release or two.	2011-03-05 14:03:06 -05:00
Peter Eisentraut	9650364b7b	Update of SQL feature conformance	2011-03-05 17:03:21 +02:00
Tom Lane	63b656b7bf	Create extension infrastructure for the core procedural languages. This mostly just involves creating control, install, and update-from-unpackaged scripts for them. However, I had to adjust plperl and plpython to not share the same support functions between variants, because we can't put the same function into multiple extensions. catversion bump forced due to new contents of pg_pltemplate, and because initdb now installs plpgsql as an extension not a bare language. Add support for regression testing these as extensions not bare languages. Fix a couple of other issues that popped up while testing this: my initial hack at pg_dump binary-upgrade support didn't work right, and we don't want an extra schema permissions test after all. Documentation changes still to come, but I'm committing now to see whether the MSVC build scripts need work (likely they do).	2011-03-04 21:51:14 -05:00
Robert Haas	efa415da8c	Refactor seclabel.c to use the new check_object_ownership function. This avoids duplicate (and not-quite-matching) code, and makes the logic for SECURITY LABEL match COMMENT and ALTER EXTENSION ADD/DROP.	2011-03-04 17:26:37 -05:00
Peter Eisentraut	b9cff97fdf	Don't allow CREATE TABLE AS to create a column with invalid collation. It is possible that an expression ends up with a collatable type but without a collation. CREATE TABLE AS could then create a table based on that. But such a column cannot be dumped with valid SQL syntax, so we disallow creating such a column. Per test report from Noah Misch.	2011-03-04 23:42:07 +02:00
Tom Lane	8d3b421f5f	Allow non-superusers to create (some) extensions. Remove the unconditional superuser permissions check in CREATE EXTENSION, and instead define a "superuser" extension property, which when false (not the default) skips the superuser permissions check. In this case the calling user only needs enough permissions to execute the commands in the extension's installation script. The superuser property is also enforced in the same way for ALTER EXTENSION UPDATE cases. In other ALTER EXTENSION cases and DROP EXTENSION, test ownership of the extension rather than superuserness. ALTER EXTENSION ADD/DROP needs to insist on ownership of the target object as well; to do that without duplicating code, refactor comment.c's big switch for permissions checks into a separate function in objectaddress.c. I also removed the superuserness checks in pg_available_extensions and related functions; there's no strong reason why everybody shouldn't be able to see that info. Also invent an IF NOT EXISTS variant of CREATE EXTENSION, and use that in pg_dump, so that dumps won't fail for installed-by-default extensions. We don't have any of those yet, but we will soon. This is all per discussion of wrapping the standard procedural languages into extensions. I'll make those changes in a separate commit; this is just putting the core infrastructure in place.	2011-03-04 16:08:53 -05:00
Peter Eisentraut	4442e1975d	When creating a collation, check that the locales can be loaded. This is the same check that would happen later when the collation is used, but it's friendlier to check the collation already when it is created.	2011-03-04 22:14:37 +02:00
Tom Lane	bd58d9d883	In initialize_SSL, don't fail unnecessarily when home dir is unavailable. Instead, just act as though the certificate file(s) are not present. There is only one case where this need be a hard failure condition: when sslmode is verify-ca or verify-full, not having a root cert file is an error. Change the logic so that we complain only in that case, and otherwise fall through cleanly. This is how it used to behave pre-9.0, but my patch `4ed4b6c54e` of 2010-05-26 broke the case. Per report from Christian Kastner.	2011-03-04 11:38:45 -05:00
Heikki Linnakangas	ee3838b1d3	You must hold a lock on the heap page when you call CheckForSerializableConflictOut(), because it can set hint bits. YAMAMOTO Takashi	2011-03-04 15:43:11 +02:00
Andrew Dunstan	12bf602f3f	Add a comment explaining the recent fix for plpython breakage in commit `4c966d9`. Mostly text supplied by Jan Urbański.	2011-03-03 19:41:54 -05:00
Tom Lane	908ab80286	Further refine patch for commenting operator implementation functions. Instead of manually maintaining the "implementation of XXX operator" comments in pg_proc.h, delete all those entries and let initdb create them via a join. To let initdb figure out which name to use when there is a conflict, change the comments for deprecated operators to say they are deprecated --- which seems like a good thing to do anyway.	2011-03-03 15:55:47 -05:00
Tom Lane	6252c4f9e2	Run a portal's cleanup hook immediately when pushing it to DONE state. This works around the problem noted by Yamamoto Takashi in bug #5906, that there were code paths whereby we could reach AtCleanup_Portals with a portal's cleanup hook still unexecuted. The changes I made a few days ago were intended to prevent that from happening, and I think that on balance it's still a good thing to avoid, so I don't want to remove the Assert in AtCleanup_Portals. Hence do this instead.	2011-03-03 13:04:06 -05:00
Michael Meskes	32fce70564	Added new version of ecpg's parser generator script. This one was written by Andy Colson <andy@squeakycode.net>.	2011-03-03 13:43:50 +01:00
Heikki Linnakangas	8e2d8b1497	Add tab-completion for table name after JOIN. Andrey Popp	2011-03-03 09:42:49 +02:00
Tom Lane	94133a9354	Mark operator implementation functions as such in their comments. Historically, we've not had separate comments for built-in pg_operator entries, but relied on the comments for the underlying functions. The trouble with this approach is that there isn't much of anything to suggest to users that they'd be better off using the operators instead. So, move all the relevant comments into pg_operator, and give each underlying function a comment that just says "implementation of XXX operator". There are only about half a dozen cases where it seems reasonable to use the underlying function interchangeably with the operator; in these cases I left the same comment in place on the function as on the operator. While at it, establish a policy that every built-in function and operator entry should have a comment: there are now queries in the opr_sanity regression test that will complain if one doesn't. This only required adding a dozen or two more entries than would have been there anyway. I also spent some time trying to eliminate gratuitous inconsistencies in the style of the comments, though it's hopeless to suppose that more won't creep in soon enough. Per my proposal of 2010-10-15.	2011-03-03 01:34:17 -05:00
Peter Eisentraut	091bda0188	Add collations to information_schema.usage_privileges. This is faked information like for domains.	2011-03-02 23:17:56 +02:00
Andrew Dunstan	4c966d920f	Fix plpython breakage detected on certain Fedora machines on buildfarm. Patch from Jan Urbański.	2011-03-01 18:59:31 -05:00
Peter Eisentraut	2f363590c1	Additional PL/Python regression test expected file. The plpython_subtransaction test needs a separate expected file specifically for Python 2.5.	2011-03-01 23:35:18 +02:00
Heikki Linnakangas	6eba5a7c57	Change pg_last_xlog_receive_location() not to move backwards. That makes it a lot more useful for determining which standby is most up-to-date, for example. There were long discussions on whether overwriting existing WAL makes sense to begin with, and whether we should do some more extensive variable renaming, but this change nevertheless seems quite uncontroversial. Fujii Masao, reviewed by Jeff Janes, Robert Haas, Stephen Frost.	2011-03-01 20:54:35 +02:00
Heikki Linnakangas	47ad79122b	Fix bugs in Serializable Snapshot Isolation. Change the way UPDATEs are handled. Instead of maintaining a chain of tuple-level locks in shared memory, copy any existing locks on the old tuple to the new tuple at UPDATE. Any existing page-level lock needs to be duplicated too, as a lock on the new tuple. That was neglected previously. Store xmin on tuple-level predicate locks, to distinguish a lock on an old already-recycled tuple from a new tuple at the same physical location. Failure to distinguish them caused loops in the tuple-lock chains, as reported by YAMAMOTO Takashi. Although we don't use the chain representation of UPDATEs anymore, it seems like a good idea to store the xmin to avoid some false positives if no other reason. CheckSingleTargetForConflictsIn now correctly handles the case where a lock that's being held is not reflected in the local lock table. That happens if another backend acquires a lock on our behalf due to an UPDATE or a page split. PredicateLockPageCombine now retains locks for the page that is being removed, rather than removing them. This prevents a potentially dangerous false-positive inconsistency where the local lock table believes that a lock is held, but it is actually not. Dan Ports and Kevin Grittner	2011-03-01 19:05:16 +02:00
Peter Eisentraut	16143d6451	Dump the COLLATABLE attribute in CREATE TYPE. This was previously omitted by accident.	2011-03-01 18:45:34 +02:00
Tom Lane	97c4ee94ad	Include the target table in EXPLAIN output for ModifyTable nodes. Per discussion, this seems important for plans involving writable CTEs, since there can now be more than one ModifyTable node in the plan. To retain the same formatting as for target tables of scan nodes, we show only one target table, which will be the parent table in case of an UPDATE or DELETE on an inheritance tree. Individual child tables can be determined by inspecting the child plan trees if needed.	2011-03-01 11:37:01 -05:00
Robert Haas	59d6a75942	Avoid excessive Hot Standby feedback messages. Without this patch, when wal_receiver_status_interval=0, indicating that no status messages should be sent, Hot Standby feedback messages are instead sent extremely frequently. Fujii Masao, with documentation changes by me.	2011-03-01 11:34:25 -05:00
Tom Lane	c0b0076036	Rearrange snapshot handling to make rule expansion more consistent. With this patch, portals, SQL functions, and SPI all agree that there should be only a CommandCounterIncrement between the queries that are generated from a single SQL command by rule expansion. Fetching a whole new snapshot now happens only between original queries. This is equivalent to the existing behavior of EXPLAIN ANALYZE, and it was judged to be the best choice since it eliminates one source of concurrency hazards for rules. The patch should also make things marginally faster by reducing the number of snapshot push/pop operations. The patch removes pg_parse_and_rewrite(), which is no longer used anywhere. There was considerable discussion about more aggressive refactoring of the query-processing functions exported by postgres.c, but for the moment nothing more has been done there. I also took the opportunity to refactor snapmgr.c's API slightly: the former PushUpdatedSnapshot() has been split into two functions. Marko Tiikkaja, reviewed by Steve Singer and Tom Lane	2011-02-28 23:28:06 -05:00
Andrew Dunstan	57e9bda5ec	Unbreak vpath builds broken by commit `474a42473a`.	2011-02-28 21:31:39 -05:00
Robert Haas	92c30fd2ed	Rename pg_stat_replication.apply_location to replay_location. For consistency with pg_last_xlog_replay_location. Per discussion.	2011-02-28 12:49:57 -05:00
Peter Eisentraut	4b853c879d	Fix regression tests after PL/Python custom SPI exceptions patch	2011-02-28 19:43:36 +02:00
Peter Eisentraut	474a42473a	PL/Python custom SPI exceptions. This provides a separate exception class for each error code that the backend defines, as well as the ability to get the SQLSTATE from the exception object. Jan Urbański, reviewed by Steve Singer	2011-02-28 18:41:10 +02:00
Peter Eisentraut	22690719ea	PL/Python explicit subtransactions. Adds a context manager, obtainable by plpy.subtransaction(), to run a group of statements in a subtransaction. Jan Urbański, reviewed by Steve Singer, additional scribbling by me	2011-02-27 21:15:35 +02:00
Peter Eisentraut	438cdf6e48	Remove remaining expected file for Python 2.2. We don't have complete expected coverage for Python 2.2 anyway, so it doesn't seem worth keeping this one around that no one appears to be updating anyway. Visual inspection of the differences ought to be good enough for those few who care about this obsolete Python version.	2011-02-27 21:15:35 +02:00
Tom Lane	a874fe7b4c	Refactor the executor's API to support data-modifying CTEs better. The originally committed patch for modifying CTEs didn't interact well with EXPLAIN, as noted by myself, and also had corner-case problems with triggers, as noted by Dean Rasheed. Those problems show it is really not practical for ExecutorEnd to call any user-defined code; so split the cleanup duties out into a new function ExecutorFinish, which must be called between the last ExecutorRun call and ExecutorEnd. Some Asserts have been added to these functions to help verify correct usage. It is no longer necessary for callers of the executor to call AfterTriggerBeginQuery/AfterTriggerEndQuery for themselves, as this is now done by ExecutorStart/ExecutorFinish respectively. If you really need to suppress that and do it for yourself, pass EXEC_FLAG_SKIP_TRIGGERS to ExecutorStart. Also, refactor portal commit processing to allow for the possibility that PortalDrop will invoke user-defined code. I think this is not actually necessary just yet, since the portal-execution-strategy logic forces any non-pure-SELECT query to be run to completion before we will consider committing. But it seems like good future-proofing.	2011-02-27 13:44:12 -05:00
Bruce Momjian	67a5e727c8	Be less detailed about reporting shared memory failure by avoiding the output of actual Postgres parameter _values_ related to shared memory, and suggesting that these are only possible parameters to reduce.	2011-02-27 12:21:58 -05:00
Magnus Hagander	b04137a294	Fix verbose display of REPLICATION role attribute. Josh Kupershmidt	2011-02-27 12:35:31 +01:00
Heikki Linnakangas	be6668d6ef	Increase the default for wal_sender_delay from 200ms to 1s. Now that WAL sender is immediately woken up by transaction commit, there's no need to wake up so aggressively.	2011-02-26 23:38:25 +02:00
Peter Eisentraut	bc411f25c1	Table function support for PL/Python. This allows functions with multiple OUT parameters returning either one or multiple records (RECORD or SETOF RECORD). Jan Urbański, reviewed by Hitoshi Harada	2011-02-26 16:53:11 +02:00
Tom Lane	000128bc7f	Fix order of shutdown processing when CTEs contain inter-references. We need ExecutorEnd to run the ModifyTable nodes to completion in reverse order of initialization, not forward order. Easily done by constructing the list back-to-front.	2011-02-25 23:53:34 -05:00
Tom Lane	389af95155	Support data-modifying commands (INSERT/UPDATE/DELETE) in WITH. This patch implements data-modifying WITH queries according to the semantics that the updates all happen with the same command counter value, and in an unspecified order. Therefore one WITH clause can't see the effects of another, nor can the outer query see the effects other than through the RETURNING values. And attempts to do conflicting updates will have unpredictable results. We'll need to document all that. This commit just fixes the code; documentation updates are waiting on author. Marko Tiikkaja and Hitoshi Harada	2011-02-25 18:58:02 -05:00
Robert Haas	79ad8fc5f8	Named restore point improvements. Emit a log message when creating a named restore point, and improve documentation for pg_create_restore_point(). Euler Taveira de Oliveira, per suggestions from Thom Brown, with some additional wordsmithing by me.	2011-02-24 19:02:00 -05:00
Itagaki Takahiro	6079375431	More psql tab-completion for new commands. - ALTER FOREIGN DATA WRAPPER with HANDLER - ALTER TABLE VALIDATE CONSTRAINT - ALTER TYPE ADD VALUE - COPY with ENCODING and FORCE NOT NULL - CREATE FOREIGN DATA WRAPPER with HANDLER - CREATE TRIGGER ... INSTEAD OF	2011-02-24 21:05:40 +09:00
Itagaki Takahiro	4191e16cbe	Add tab-completion for CREATE UNLOGGED TABLE in psql, and fix unexpected completion for DROP TEMP and UNIQUE.	2011-02-24 10:13:27 +09:00
Itagaki Takahiro	5a922f13ef	Make the second words lowercase in psql's \d titles for unlogged tables.	2011-02-23 09:58:00 +09:00
Tom Lane	bdca82f44d	Add a relkind field to RangeTblEntry to avoid some syscache lookups. The recent additions for FDW support required checking foreign-table-ness in several places in the parse/plan chain. While it's not clear whether that would really result in a noticeable slowdown, it seems best to avoid any performance risk by keeping a copy of the relation's relkind in RangeTblEntry. That might have some other uses later, anyway. Per discussion.	2011-02-22 19:24:40 -05:00
Peter Eisentraut	1c51c7d5ff	Add PL/Python functions for quoting strings. Add functions plpy.quote_ident, plpy.quote_literal, plpy.quote_nullable, which wrap the equivalent SQL functions. To be able to propagate char * constness properly, make the argument of quote_literal_cstr() const char *. This also makes it more consistent with quote_identifier(). Jan Urbański, reviewed by Hitoshi Harada, some refinements by Peter Eisentraut	2011-02-22 23:41:23 +02:00
Robert Haas	3e6b305d9e	Fix a couple of unlogged tables goofs. "SELECT ... INTO UNLOGGED tabname" works, but wasn't documented; CREATE UNLOGGED SEQUENCE and CREATE UNLOGGED VIEW failed an assertion, instead of throwing a sensible error. Latter issue reported by Itagaki Takahiro; patch review by Tom Lane.	2011-02-22 14:46:19 -05:00
Tom Lane	1ab9b012bd	Allow binary I/O of type "void". void_send is useful for the same reason that void_out doesn't throw error, namely that someone might do "select void_returning_func(...)" from a client that prefers to operate in binary mode. The void_recv function may or may not have any practical use, but we provide it for symmetry. Radosław Smogura	2011-02-22 13:08:22 -05:00
Tom Lane	2e852e541c	Remove ExecRemoveJunk(), which is no longer used anywhere. This was a leftover from the pre-8.1 design of junkfilters. It doesn't seem to have any reason to live, since it's merely a combination of two easy function calls, and not a well-designed combination at that (it encourages callers to leak the result tuple).	2011-02-21 21:41:08 -05:00
Tom Lane	a210be7720	Fix dangling-pointer problem in before-row update trigger processing. ExecUpdate checked for whether ExecBRUpdateTriggers had returned a new tuple value by seeing if the returned tuple was pointer-equal to the old one. But the "old one" was in estate->es_junkFilter's result slot, which would be scribbled on if we had done an EvalPlanQual update in response to a concurrent update of the target tuple; therefore we were comparing a dangling pointer to a live one. Given the right set of circumstances we could get a false match, resulting in not forcing the tuple to be stored in the slot we thought it was stored in. In the case reported by Maxim Boguk in bug #5798, this led to "cannot extract system attribute from virtual tuple" failures when trying to do "RETURNING ctid". I believe there is a very-low-probability chance of more serious errors, such as generating incorrect index entries based on the original rather than the trigger-modified version of the row. In HEAD, change all of ExecBRInsertTriggers, ExecIRInsertTriggers, ExecBRUpdateTriggers, and ExecIRUpdateTriggers so that they continue to have similar APIs. In the back branches I just changed ExecBRUpdateTriggers, since there is no bug in the ExecBRInsertTriggers case.	2011-02-21 21:19:50 -05:00
Itagaki Takahiro	ca9cf85d54	Fix pg_server_to_client, which was broken in the previous commit.	2011-02-21 16:27:57 +09:00
Itagaki Takahiro	3cba8240a1	Add ENCODING option to COPY TO/FROM and file_fdw. File encodings can be specified separately from client encoding. If not specified, client encoding is used for backward compatibility. Cases when the encoding doesn't match client encoding are slower than matched cases because we don't have conversion procs for other encodings. Performance improvement would be future work. Original patch by Hitoshi Harada, and modified by me.	2011-02-21 14:32:40 +09:00
Tom Lane	7c5d0ae707	Add contrib/file_fdw foreign-data wrapper for reading files via COPY. This is both very useful in its own right, and an important test case for the core FDW support. This commit includes a small refactoring of copy.c to expose its option checking code as a separately callable function. The original patch submission duplicated hundreds of lines of that code, which seemed pretty unmaintainable. Shigeru Hanada, reviewed by Itagaki Takahiro and Tom Lane	2011-02-20 14:06:59 -05:00
Tom Lane	bb74240794	Implement an API to let foreign-data wrappers actually be functional. This commit provides the core code and documentation needed. A contrib module test case will follow shortly. Shigeru Hanada, Jan Urbanski, Heikki Linnakangas	2011-02-20 00:18:14 -05:00
Peter Eisentraut	b05186f8a4	Invalidate PL/Python functions with composite type argument when the type changes. The invalidation will cause the type information to be refetched, and everything will work. Jan Urbański, reviewed by Alex Hunsaker	2011-02-19 16:56:02 +02:00
Bruce Momjian	964b46d00e	Initialize variable to quiet compiler.	2011-02-19 08:14:32 -05:00
Peter Eisentraut	02e14562a8	Set psql client encoding from locale by default. Add a new libpq connection option client_encoding (which includes the existing PGCLIENTENCODING environment variable), which besides an encoding name accepts a special value "auto" that tries to determine the encoding from the locale in the client's environment, using the mechanisms that have been in use in initdb. psql sets this new connection option to "auto" when running from a terminal and not overridden by setting PGCLIENTENCODING. Original code by Heikki Linnakangas, with subsequent contributions by Jaime Casanova, Peter Eisentraut, Stephen Frost, Ibrar Ahmed	2011-02-19 08:54:58 +02:00
Tom Lane	327e025071	Create the catalog infrastructure for foreign-data-wrapper handlers. Add a fdwhandler column to pg_foreign_data_wrapper, plus HANDLER options in the CREATE FOREIGN DATA WRAPPER and ALTER FOREIGN DATA WRAPPER commands, plus pg_dump support for same. Also invent a new pseudotype fdw_handler with properties similar to language_handler. This is split out of the "FDW API" patch for ease of review; it's all stuff we will certainly need, regardless of any other details of the FDW API. FDW handler functions will not actually get called yet. In passing, fix some omissions and infelicities in foreigncmds.c. Shigeru Hanada, Jan Urbanski, Heikki Linnakangas	2011-02-19 00:07:15 -05:00
Tom Lane	82220e8832	Un-break building with BTREE_BUILD_STATS. This has been broken for awhile, but not clear it's worth back-patching. Euler Taveira de Oliveira	2011-02-18 14:06:16 -05:00
Tom Lane	4cff100d73	Fix parallel pg_restore to handle comments on POST_DATA items correctly. The previous coding would try to process all SECTION_NONE items in the initial sequential-restore pass, which failed if they were dependencies of not-yet-restored items. Fix by postponing such items into the parallel processing pass once we have skipped any non-PRE_DATA item. Back-patch into 9.0; the original parallel-restore coding in 8.4 did not have this bug, so no need to change it. Report and diagnosis by Arnd Hannemann.	2011-02-18 13:11:45 -05:00
Alvaro Herrera	a5dfc94c9a	Use $INDENT instead of `which` to find the indent binary. Per discussion after my commit of yesterday.	2011-02-18 12:49:16 -03:00
Simon Riggs	bc76695c4c	Make a hard state change from catchup to streaming mode. More useful state change for monitoring purposes, plus a required change for synchronous replication patch.	2011-02-18 15:07:26 +00:00
Simon Riggs	06828c5feb	Separate messages for standby replies and hot standby feedback. Allow messages to be sent at different times, and greatly reduce the frequency of hot standby feedback. Refactor to allow additional message types.	2011-02-18 11:31:49 +00:00
Magnus Hagander	45a6d79b17	Properly initialize variables. Kevin Grittner	2011-02-18 11:59:57 +01:00
Michael Meskes	bc423879cc	Applied a patch by Zoltán Böszörményi that makes ecpg's parser accept dynamic cursornames even in WHERE CURRENT OF clauses.	2011-02-18 11:16:16 +01:00
Itagaki Takahiro	5c63982af2	Fix an uninitialized field in DR_copy. Shigeru HANADA	2011-02-18 14:32:19 +09:00
Itagaki Takahiro	62c7bd31c8	Add transaction-level advisory locks. They share the same locking namespace with the existing session-level advisory locks, but they are automatically released at the end of the current transaction and cannot be released explicitly via unlock functions. Marko Tiikkaja, reviewed by me.	2011-02-18 14:05:12 +09:00
Alvaro Herrera	87bb2ade2c	Convert Postgres arrays to Perl arrays on PL/perl input arguments. More generally, arrays are turned into Perl array references, and row and composite types are turned into Perl hash references. This is done recursively, in a way that's natural to every Perl programmer. To avoid a backwards compatibility hit, the string representation of each structure is also available if the function requests it. Authors: Alexey Klyukin and Alex Hunsaker. Some code cleanups by me.	2011-02-17 22:20:40 -03:00
Alvaro Herrera	f7b51d175a	pgindent run on plperl.c	2011-02-17 22:20:39 -03:00
Alvaro Herrera	c4d124365b	Use $INDENT rather than indent throughout the pgindent code. This allows the user to change the path to be used more easily. Also, change URL in README.	2011-02-17 22:20:19 -03:00
Tom Lane	52b60530f2	Fix tsmatchsel() to account properly for null rows. ts_typanalyze.c computes MCE statistics as fractions of the non-null rows, which seems fairly reasonable, and anyway changing it in released versions wouldn't be a good idea. But then ts_selfuncs.c has to account for that. Failure to do so results in overestimates in columns with a significant fraction of null documents. Back-patch to 8.4 where this stuff was introduced. Jesper Krogh	2011-02-17 19:00:49 -05:00
Robert Haas	4a25bc145a	Add client_hostname field to pg_stat_activity. Peter Eisentraut, reviewed by Steve Singer, Alvaro Herrera, and me.	2011-02-17 16:03:28 -05:00
Robert Haas	a3e8486dff	Prevent possible compiler warnings. Simon Riggs reports that rnode.dbNode and rnode.spcNode were generating unused variable warnings on gcc 4.4.3 with CFLAGS=-O1	2011-02-17 16:01:46 -05:00
Robert Haas	f196738534	Add some words of caution to elog.c. Stephen Frost, somewhat rewritten by me	2011-02-17 10:29:42 -05:00
Tom Lane	93016983d1	Fix blatantly uninitialized variable in recent commit. Doesn't anybody around here pay attention to compiler warnings?	2011-02-16 19:53:20 -05:00
Tom Lane	a2095f7fb5	Fix bogus test for hypothetical indexes in get_actual_variable_range(). That function was supposing that indexoid == 0 for a hypothetical index, but that is not likely to be true in any non-toy implementation of an index adviser, since assigning a fake OID is the only way to know at EXPLAIN time which hypothetical index got selected. Fix by adding a flag to IndexOptInfo to mark hypothetical indexes. Back-patch to 9.0 where get_actual_variable_range() was added. Gurjeet Singh	2011-02-16 19:24:45 -05:00
Tom Lane	6595dd04d1	Add backwards-compatible declarations of some core GIN support functions. These are needed to support reloading dumps of 9.0 installations containing contrib/intarray or contrib/tsearch2. Since not only regular dump/reload but binary upgrade would fail, it seems worth the trouble to carry these stubs for awhile. Note that the contrib opclasses referencing these functions will still work fine, since GIN doesn't actually pay any attention to the declared signature of a support function.	2011-02-16 17:24:46 -05:00
Peter Eisentraut	b15fabf997	Also process psqlrc when running psql -l. This was previously not very useful, but with many people customizing the linestyle, it is nice for a consistent appearance.	2011-02-16 23:15:54 +02:00
Peter Eisentraut	66d6b4cb54	Fix for warnings-free compilation with Python 3.2. The first argument of PyEval_EvalCode() was changed from PyCodeObject* to PyObject* because of PEP 384.	2011-02-16 23:15:53 +02:00
Simon Riggs	bca8b7f16a	Hot Standby feedback for avoidance of cleanup conflicts on standby. Standby optionally sends back information about oldestXmin of queries which is then checked and applied to the WALSender's proc->xmin. GetOldestXmin() is modified slightly to agree with GetSnapshotData(), so that all backends on primary include WALSender within their snapshots. Note this does nothing to change the snapshot xmin on either master or standby. Feedback piggybacks on the standby reply message. vacuum_defer_cleanup_age is no longer used on standby, though parameter still exists on primary, since some use cases still exist. Simon Riggs, review comments from Fujii Masao, Heikki Linnakangas, Robert Haas	2011-02-16 19:29:37 +00:00
Tom Lane	65076269ea	Make a no-op ALTER EXTENSION UPDATE give just a NOTICE, not ERROR. This seems a bit more user-friendly.	2011-02-16 12:40:31 -05:00
Robert Haas	3a087369c0	WAL receiver shouldn't try to send a reply when dying. Per report from, and discussion with, Fujii Masao.	2011-02-16 10:27:35 -05:00
Tom Lane	6e02755b22	Add FOREACH IN ARRAY looping to plpgsql. (I'm not entirely sure that we've finished bikeshedding the syntax details, but the functionality seems OK.) Pavel Stehule, reviewed by Stephen Frost and Tom Lane	2011-02-16 01:53:03 -05:00
Robert Haas	4695da5ae9	pg_ctl promote. Fujii Masao, reviewed by Robert Haas, Stephen Frost, and Magnus Hagander.	2011-02-15 21:30:23 -05:00
Itagaki Takahiro	8ddc05fb01	Export the external file reader used in COPY FROM as APIs. They are expected to be used by extension modules like file_fdw. There are no user-visible changes. Itagaki Takahiro. Reviewed and tested by Kevin Grittner and Noah Misch.	2011-02-16 11:19:11 +09:00
Tom Lane	89c29c0331	Fix corner case for binary upgrade: extension functions in pg_catalog. Normally, pg_dump summarily excludes functions in pg_catalog from consideration. However, some extensions may create functions in pg_catalog (adminpack already does that, and extensions for procedural languages will likely do it too). In binary-upgrade mode, we have to dump such functions, or the extension will be incomplete after upgrading. Per experimentation with adminpack.	2011-02-15 18:10:22 -05:00
Tom Lane	eff027c432	Add CheckTableNotInUse calls in DROP TABLE and DROP INDEX. Recent releases had a check on rel->rd_refcnt in heap_drop_with_catalog, but failed to cover the possibility of pending trigger events at DROP time. (Before 8.4 we didn't even check the refcnt.) When the trigger events were eventually fired, you'd get "could not open relation with OID nnn" errors, as in recent report from strk. Better to throw a suitable error when the DROP is attempted. Also add a similar check in DROP INDEX. Back-patch to all supported branches.	2011-02-15 15:50:48 -05:00
Tom Lane	887dd041a6	Fix obsolete comment. Comment about MaxAllocSize was not updated when the TOAST-header macros were replaced in 8.3 "varvarlena" changes. Per report from Frederik Ramm.	2011-02-15 13:27:54 -05:00
Robert Haas	883a9659fa	Assorted corrections to the patch to add WAL receiver replies. Per reports from Fujii Masao.	2011-02-15 12:05:00 -05:00
Robert Haas	6a77e9385e	Rename max_predicate_locks_per_transaction. The new name, max_pred_locks_per_transaction, is shorter. Kevin Grittner, per discussion.	2011-02-15 08:04:55 -05:00
Peter Eisentraut	2fd77060a2	Allow make check in PL directories. Also add make check-world target, and refactor pg_regress invocation code in makefiles a bit.	2011-02-15 06:52:12 +02:00
Robert Haas	0d90dc16f8	Avoid a few more SET DATA TYPE table rewrites. When the new type is an unconstrained domain over the old type, we don't need to rewrite the table. Noah Misch and Robert Haas	2011-02-14 23:40:05 -05:00
Robert Haas	8e1124eeeb	Delete stray word from comment.	2011-02-14 22:38:08 -05:00
Simon Riggs	5c588be729	PITR can stop at a named restore point when recovery target = time though must not update the last transaction timestamp. Plus comment and message cleanup for recent named restore point. Fujii Masao, minor changes by me	2011-02-15 00:51:39 +00:00
Tom Lane	01ff8dd756	Fix MSVC build scripts for recent extension-related changes. Untested, but we'll soon see if the buildfarm likes this.	2011-02-14 19:45:46 -05:00
Tom Lane	555353c0c5	Rearrange extension-related views as per recent discussion. The original design of pg_available_extensions did not consider the possibility of version-specific control files. Split it into two views: pg_available_extensions shows information that is generic about an extension, while pg_available_extension_versions shows all available versions together with information that could be version-dependent. Also, add an SRF pg_extension_update_paths() to assist in checking that a collection of update scripts provide sane update path sequences.	2011-02-14 19:22:36 -05:00
Simon Riggs	f0b8a79c4b	Add version-sensitive SQL for psql when constraints NOT VALID. Bug report and fix by Andres Freund.	2011-02-15 00:08:15 +00:00
Tom Lane	2ee69ff65d	Remove no-longer-needed special case hacks in MSVC build scripts.	2011-02-13 23:42:57 -05:00
Tom Lane	e693e97d75	Support replacing MODULE_PATHNAME during extension script file execution. This avoids the need to find a way to make PGXS' .sql.in-to-.sql rule insert the right thing. We'll just deprecate use of that hack for extensions.	2011-02-13 22:54:43 -05:00
Tom Lane	27d5d7ab10	Change the naming convention for extension files to use double dashes. This allows us to have an unambiguous rule for deconstructing the names of script files and secondary control files, without having to forbid extension and version names from containing any dashes. We do have to forbid them from containing double dashes or leading/trailing dashes, but neither restriction is likely to bother anyone in practice. Per discussion, this seems like a better solution overall than the original design.	2011-02-13 22:54:42 -05:00
Bruce Momjian	8e7af60872	Fix reverse 'if' test in path_is_relative_and_below_cwd(), per Tom.	2011-02-13 00:14:47 -05:00
Tom Lane	6c2e734f0a	Refactor ALTER EXTENSION UPDATE to have cleaner multi-step semantics. This change causes a multi-step update sequence to behave exactly as if the updates had been commanded one at a time, including updating the "requires" dependencies afresh at each step. The initial implementation took the shortcut of examining only the final target version's "requires" and changing the catalog entry but once. But on reflection that's a bad idea, since it could lead to executing old update scripts under conditions different than they were designed/tested for. Better to expend a few extra cycles and avoid any surprises. In the same spirit, if a CREATE EXTENSION FROM operation involves applying a series of update files, it will act as though the CREATE had first been done using the initial script's target version and then the additional scripts were invoked with ALTER EXTENSION UPDATE. I also removed the restriction about not changing encoding in secondary control files. The new rule is that a script is assumed to be in whatever encoding the control file(s) specify for its target version. Since this reimplementation causes us to read each intermediate version's control file, there's no longer any uncertainty about which encoding setting would get applied.	2011-02-12 16:40:41 -05:00
Bruce Momjian	0de0cc150a	Properly handle Win32 paths of 'E:abc', which can be either absolute or relative, by creating a function path_is_relative_and_below_cwd() to check for specific requirements. It is unclear if this fixes a security problem or not but the new code is more robust.	2011-02-12 09:47:51 -05:00
Peter Eisentraut	b313bca0af	DDL support for collations - collowner field - CREATE COLLATION - ALTER COLLATION - DROP COLLATION - COMMENT ON COLLATION - integration with extensions - pg_dump support for the above - dependency management - psql tab completion - psql \dO command	2011-02-12 15:55:18 +02:00
Robert Haas	d31e2a495b	Teach ALTER TABLE .. SET DATA TYPE to avoid some table rewrites. When the old type is binary coercible to the new type and the using clause does not change the column contents, we can avoid a full table rewrite, though any indexes on the affected columns will still need to be rebuilt. This applies, for example, when changing a varchar column to be of type text. The prior coding assumed that the set of operations that force a rewrite is identical to the set of operations that must be propagated to tables making use of the affected table's rowtype. This is no longer true: even though the tuples in those tables wouldn't need to be modified, the data type change invalidates indexes built using those composite type columns. Indexes on the table we're actually modifying can be invalidated too, of course, but the existing machinery is sufficient to handle that case. Along the way, add some debugging messages that make it possible to understand what operations ALTER TABLE is actually performing in these cases. Noah Misch and Robert Haas	2011-02-12 08:27:55 -05:00
Tom Lane	24d1280c4d	Clean up installation directory choices for extensions. Arrange for the control files to be in $SHAREDIR/extension not $SHAREDIR/contrib, since we're generally trying to deprecate the term "contrib" and this is a once-in-many-moons opportunity to get rid of it in install paths. Fix PGXS to install the $EXTENSION file into that directory no matter what MODULEDIR is set to; a nondefault MODULEDIR should only affect the script and secondary extension files. Fix the control file directory parameter to be interpreted relative to $SHAREDIR, to avoid a surprising disconnect between how you specify that and what you set MODULEDIR to. Per discussion with David Wheeler.	2011-02-11 22:53:43 -05:00
Tom Lane	1214749901	Add support for multiple versions of an extension and ALTER EXTENSION UPDATE. This follows recent discussions, so it's quite a bit different from Dimitri's original. There will probably be more changes once we get a bit of experience with it, but let's get it in and start playing with it. This is still just core code. I'll start converting contrib modules shortly. Dimitri Fontaine and Tom Lane	2011-02-11 21:25:57 -05:00
Alvaro Herrera	60141eefaf	Fix comment recently obsoleted	2011-02-11 19:42:51 -03:00
Robert Haas	5917574539	Allow tab-completion of :variable even as first word on a line. Christoph Berg	2011-02-11 16:57:58 -05:00
Robert Haas	d309acf201	Typo fixes. receivedUpto should be capitalized consistently.	2011-02-11 11:55:12 -05:00
Robert Haas	2c20ba1fd2	Tweak find_composite_type_dependencies API a bit more. Per discussion with Noah Misch, the previous coding, introduced by my commit `65377e0b9c` on 2011-02-06, was really an abuse of RELKIND_COMPOSITE_TYPE, since the caller in typecmds.c is actually passing the name of a domain. So go back to having a type name argument, but make the first argument a Relation rather than just a string so we can tell whether it's a table or a foreign table and emit the proper error message.	2011-02-11 08:47:38 -05:00
Alvaro Herrera	61cf7bcdf7	Fix isolation tester Makefile so that it runs in a VPATH build	2011-02-10 19:50:43 -03:00
Tom Lane	01467d3e4f	Extend "ALTER EXTENSION ADD object" to permit "DROP object" as well. Per discussion, this is something we should have sooner rather than later, and it doesn't take much additional code to support it.	2011-02-10 17:37:22 -05:00
Alvaro Herrera	289d730655	Fix the isolation tester compilation on VPATH builds	2011-02-10 19:31:39 -03:00
Bruce Momjian	135724ec35	Fix "variable not used" warnings when USE_WIDE_UPPER_LOWER is not defined.	2011-02-10 16:58:02 -05:00
Peter Eisentraut	ff81aa3eda	Update comment. It was still claiming that the keyword list is in keywords.c, when it is now in kwlist.h.	2011-02-10 22:49:46 +02:00
Bruce Momjian	2432d10bf2	Fix pg_get_encoding_from_locale() function call parameters to match prototype for cases where there is no multi-language support.	2011-02-10 15:39:41 -05:00
Heikki Linnakangas	b186523fd9	Send status updates back from standby server to master, indicating how far the standby has written, flushed, and applied the WAL. At the moment, this is for informational purposes only, the values are only shown in pg_stat_replication system view, but in the future they will also be needed for synchronous replication. Extracted from Simon Riggs' synchronous replication patch by Robert Haas, with some tweaking by me.	2011-02-10 21:04:02 +02:00
Magnus Hagander	4c468b37a2	Track last time for statistics reset on databases and bgwriter. Tracks one counter for each database, which is reset whenever the statistics for any individual object inside the database is reset, and one counter for the background writer. Tomas Vondra, reviewed by Greg Smith	2011-02-10 15:14:04 +01:00
Magnus Hagander	a2e61ec319	Use NOWAIT when including WAL in base backup. Avoids warning and waiting for the last segment to be archived, which isn't necessary when we're including the required WAL in the backup itself.	2011-02-10 12:11:23 +01:00
Heikki Linnakangas	cecb5901b8	Allocate all entries in the serializable xid hash up-front, so that you don't run out of shared memory when you try to assign an xid to a transaction. Kevin Grittner	2011-02-10 12:03:21 +02:00
Tom Lane	e617f0d7e4	Fix improper matching of resjunk column names for FOR UPDATE in subselect. Flattening of subquery range tables during setrefs.c could lead to the rangetable indexes in PlanRowMark nodes not matching up with the column names previously assigned to the corresponding resjunk ctid (resp. tableoid or wholerow) columns. Typical symptom would be either a "cannot extract system attribute from virtual tuple" error or an Assert failure. This wasn't a problem before 9.0 because we didn't support FOR UPDATE below the top query level, and so the final flattening could never renumber an RTE that was relevant to FOR UPDATE. Fix by using a plan-tree-wide unique number for each PlanRowMark to label the associated resjunk columns, so that the number need not change during flattening. Per report from David Johnston (though I'm darned if I can see how this got past initial testing of the relevant code). Back-patch to 9.0.	2011-02-09 23:27:42 -05:00
Tom Lane	caddcb8f4b	Fix pg_upgrade to handle extensions. This follows my proposal of yesterday, namely that we try to recreate the previous state of the extension exactly, instead of allowing CREATE EXTENSION to run a SQL script that might create some entirely-incompatible on-disk state. In --binary-upgrade mode, pg_dump won't issue CREATE EXTENSION at all, but instead uses a kluge function provided by pg_upgrade_support to recreate the pg_extension row (and extension-level pg_depend entries) without creating any member objects. The member objects are then restored in the same way as if they weren't members, in particular using pg_upgrade's normal hacks to preserve OIDs that need to be preserved. Then, for each member object, ALTER EXTENSION ADD is issued to recreate the pg_depend entry that marks it as an extension member. In passing, fix breakage in pg_upgrade's enum-type support: somebody didn't fix it when the noise word VALUE got added to ALTER TYPE ADD. Also, rationalize parsetree representation of COMMENT ON DOMAIN and fix get_object_address() to allow OBJECT_DOMAIN.	2011-02-09 19:18:08 -05:00
Peter Eisentraut	2e2d56fea9	Information schema views for collation support. Add the views character_sets, collations, and collation_character_set_applicability.	2011-02-09 23:26:48 +02:00
Tom Lane	183d3cff85	Rethink order of operations for dumping extension member objects. My original idea of doing extension member identification during getDependencies() didn't work correctly: we have to mark member tables as not-to-be-dumped rather earlier than that, else their subsidiary objects like indexes get dumped anyway. Rearrange code to mark them early enough.	2011-02-09 14:05:34 -05:00
Tom Lane	5bc178b89f	Implement "ALTER EXTENSION ADD object". This is an essential component of making the extension feature usable; first because it's needed in the process of converting an existing installation containing "loose" objects of an old contrib module into the extension-based world, and second because we'll have to use it in pg_dump --binary-upgrade, as per recent discussion. Loosely based on part of Dimitri Fontaine's ALTER EXTENSION UPGRADE patch.	2011-02-09 11:56:37 -05:00
Heikki Linnakangas	036bb15872	Fix allocation of RW-conflict pool in the new predicate lock manager, and also take the RW-conflict pool into account in the PredicateLockShmemSize() estimate.	2011-02-09 12:23:07 +02:00
Magnus Hagander	3144c33a2f	Implement NOWAIT option for BASE_BACKUP command. Specifying this option makes the server not wait for the xlog to be archived, or emit a warning that it can't, instead leaving the responsibility with the client. This is useful when the log is being streamed using the streaming protocol in parallel with the backup, without having log archiving enabled.	2011-02-09 10:59:53 +01:00
Tom Lane	375e5b0a68	Suppress some compiler warnings in recent commits. Older versions of gcc tend to throw "variable might be clobbered by `longjmp' or `vfork'" warnings whenever a variable is assigned in more than one place and then used after the end of a PG_TRY block. That's reasonably easy to work around in execute_extension_script, and the overhead of unconditionally saving/restoring the GUC variables seems unlikely to be a serious concern. Also clean up logic in ATExecValidateConstraint to make it easier to read and less likely to provoke "variable might be used uninitialized in this function" warnings.	2011-02-08 18:12:17 -05:00
Tom Lane	0bc0bd07d4	Fix merge conflict.	2011-02-08 16:22:20 -05:00
Tom Lane	d9572c4e3b	Core support for "extensions", which are packages of SQL objects. This patch adds the server infrastructure to support extensions. There is still one significant loose end, namely how to make it play nice with pg_upgrade, so I am not yet committing the changes that would make all the contrib modules depend on this feature. In passing, fix a disturbingly large amount of breakage in AlterObjectNamespace() and callers. Dimitri Fontaine, reviewed by Anssi Kääriäinen, Itagaki Takahiro, Tom Lane, and numerous others	2011-02-08 16:13:22 -05:00
Peter Eisentraut	414c5a2ea6	Per-column collation support. This adds collation support for columns and domains, a COLLATE clause to override it per expression, and B-tree index support. Peter Eisentraut, reviewed by Pavel Stehule, Itagaki Takahiro, Robert Haas, Noah Misch	2011-02-08 23:04:18 +02:00
Simon Riggs	7a7d36ec33	Continue long tradition of bumping the catalog version a little late.	2011-02-08 19:44:50 +00:00
Simon Riggs	c016ce7281	Named restore points in recovery. Users can record named points, then new recovery.conf parameter recovery_target_name allows PITR to specify named points as recovery targets. Jaime Casanova, reviewed by Euler Taveira de Oliveira, plus minor edits	2011-02-08 19:39:08 +00:00
Simon Riggs	8c6e3adbf7	Basic Recovery Control functions for use in Hot Standby. Pause, Resume, Status check functions only. Also, new recovery.conf parameter pause_at_recovery_target, default on. Simon Riggs, reviewed by Fujii Masao	2011-02-08 18:30:22 +00:00
Heikki Linnakangas	f9f9d696a9	UINT64_MAX isn't defined on MSVC.	2011-02-08 18:15:53 +02:00
Simon Riggs	faa0550572	Remove rare corner case for data loss when triggering standby server. If the standby was streaming when trigger file arrives, check also in the archive for additional WAL files. This is a corner case since it is unlikely that we would trigger a failover while the master is still available and sending data to standby, while at the same time running in archive mode and also while the streaming standby has fallen behind archive. Someone would eventually be unlucky; we must plug all gaps however small. Fujii Masao	2011-02-08 14:38:02 +00:00
Simon Riggs	722bf7017b	Extend ALTER TABLE to allow Foreign Keys to be added without initial validation. FK constraints that are marked NOT VALID may later be VALIDATED, which uses a ShareUpdateExclusiveLock on constraint table and RowShareLock on referenced table. Significantly reduces lock strength and duration when adding FKs. New state visible from psql. Simon Riggs, with reviews from Marko Tiikkaja and Robert Haas	2011-02-08 12:23:20 +00:00
Heikki Linnakangas	7202ad7b8d	Fix copy-pasto in description of pg_serial, and silence compiler warning about uninitialized field you get on some compilers.	2011-02-08 09:05:13 +02:00
Robert Haas	32896c40ca	Avoid having autovacuum workers wait for relation locks. Waiting for relation locks can lead to starvation - it pins down an autovacuum worker for as long as the lock is held. But if we're doing an anti-wraparound vacuum, then we still wait; maintenance can no longer be put off. To assist with troubleshooting, if log_autovacuum_min_duration >= 0, we log whenever an autovacuum or autoanalyze is skipped for this reason. Per a gripe by Josh Berkus, and ensuing discussion.	2011-02-07 22:04:29 -05:00
Heikki Linnakangas	47082fa875	Oops, forgot to bump catversion in the Serializable Snapshot Isolation patch. I thought we didn't need that, but then I remembered that it added a new SLRU subdirectory, pg_serial. While we're at it, document what pg_serial is.	2011-02-08 00:24:23 +02:00
Heikki Linnakangas	dafaa3efb7	Implement genuine serializable isolation level. Until now, our Serializable mode has in fact been what's called Snapshot Isolation, which allows some anomalies that could not occur in any serialized ordering of the transactions. This patch fixes that using a method called Serializable Snapshot Isolation, based on research papers by Michael J. Cahill (see README-SSI for full references). In Serializable Snapshot Isolation, transactions run like they do in Snapshot Isolation, but a predicate lock manager observes the reads and writes performed and aborts transactions if it detects that an anomaly might occur. This method produces some false positives, ie. it sometimes aborts transactions even though there is no anomaly. To track reads we implement predicate locking, see storage/lmgr/predicate.c. Whenever a tuple is read, a predicate lock is acquired on the tuple. Shared memory is finite, so when a transaction takes many tuple-level locks on a page, the locks are promoted to a single page-level lock, and further to a single relation level lock if necessary. To lock key values with no matching tuple, a sequential scan always takes a relation-level lock, and an index scan acquires a page-level lock that covers the search key, whether or not there are any matching keys at the moment. A predicate lock doesn't conflict with any regular locks or with other predicate locks in the normal sense. They're only used by the predicate lock manager to detect the danger of anomalies. Only serializable transactions participate in predicate locking, so there should be no extra overhead for other transactions. Predicate locks can't be released at commit, but must be remembered until all the transactions that overlapped with it have completed. That means that we need to remember an unbounded amount of predicate locks, so we apply a lossy but conservative method of tracking locks for committed transactions. If we run short of shared memory, we overflow to a new "pg_serial" SLRU pool. We don't currently allow Serializable transactions in Hot Standby mode. That would be hard, because even read-only transactions can cause anomalies that wouldn't otherwise occur. Serializable isolation mode now means the new fully serializable level. Repeatable Read gives you the old Snapshot Isolation level that we have always had. Kevin Grittner and Dan Ports, reviewed by Jeff Davis, Heikki Linnakangas and Anssi Kääriäinen	2011-02-08 00:09:08 +02:00
Itagaki Takahiro	c18f51da17	Fix a comment for MergeAttributes. We forgot to adjust it when we changed relistemp to relpersistence.	2011-02-07 16:53:05 +09:00
Andrew Dunstan	c852e95b0b	Supply now required HeUTF8 macro for plperl where it's missing, per buildfarm results.	2011-02-06 21:36:56 -05:00
Itagaki Takahiro	fb7355e0ce	Fix error messages for FreeFile in COPY command. They are extracted from COPY API patch. suggested by Noah Misch	2011-02-07 10:46:56 +09:00
Andrew Dunstan	50d89d422f	Force strings passed to and from plperl to be in UTF8 encoding. Strings are converted to UTF8 on the way into perl and to the database encoding on the way back. This avoids a number of observed anomalies, and ensures Perl a consistent view of the world. Some minor code cleanups are also accomplished. Alex Hunsaker, reviewed by Andy Colson.	2011-02-06 17:29:26 -05:00
Bruce Momjian	97116ca417	Rename macro DECIMAL to DECIMAL_T to help pgindent; this is already done for a few other macros in that file, for other reasons. I also remove pgindent/README mention of the file.	2011-02-06 10:48:17 -05:00
Magnus Hagander	cedd6515ba	IDENTIFY_SYSTEM now returns 3 fields, not 2	2011-02-06 07:46:14 +01:00
Robert Haas	65377e0b9c	Tighten ALTER FOREIGN TABLE .. SET DATA TYPE checks. If the foreign table's rowtype is being used as the type of a column in another table, we can't just up and change its data type. This was already checked for composite types and ordinary tables, but we previously failed to enforce it for foreign tables.	2011-02-06 00:26:27 -05:00
Bruce Momjian	51dbc87dff	Add C comment about why older compilers complain about basebackup.c's longjump.	2011-02-04 23:28:14 -05:00
Andrew Dunstan	895ad83d70	Attempt to unbreak MSVC builds after pipe.c move.	2011-02-04 20:49:39 -05:00
Robert Haas	9e7e1172a5	Clarify comment in ATRewriteTable(). Make sure it's clear that the prohibition on adding a column with a default when the rowtype is used elsewhere is intentional, and be a bit more explicit about the other cases where we perform this check.	2011-02-04 16:14:54 -05:00
Robert Haas	b1e65c3216	Move pipe.c into the backend. It's full of backend-specific error reporting, so it's neither possible nor necessary for this to be used from frontend code.	2011-02-04 15:52:21 -05:00
Robert Haas	8201aea90c	Avoid including postgres.h in frontend compiles of src/port. This isn't kosher, and doesn't play nicely with my recent changes to the Makefile in this directory.	2011-02-04 13:11:53 -05:00
Robert Haas	6f59a5e5dd	Use $(MAKE) rather than make. Per buildfarm.	2011-02-04 09:48:32 -05:00
Robert Haas	356f2cbbb4	Make handling of errcodes.h more consistent with other generated headers. This fixes make distprep, and seems more robust in other ways as well. Some special handling is required because errcodes.txt is needed by some stuff in src/port, not just by src/backend as is the case for the other generated headers. While I'm at it, fix a few other things that were overlooked in the original patch.	2011-02-04 09:29:10 -05:00
Robert Haas	b87811ee27	Unbreak 'configure' followed immediately by 'make install'. More fallout from `ddfe26f644`. Report by Fujii Masao.	2011-02-04 07:06:36 -05:00
Magnus Hagander	39fbec73b0	Use single quotes when there are backslashes in the filename. In the hope of unbreaking the buildfarm.	2011-02-04 10:52:25 +01:00
Robert Haas	dde9684d65	Unbreak the VPATH build. My commit `ddfe26f644` of 2011-02-03 broke it. Per buildfarm.	2011-02-04 00:07:08 -05:00
Robert Haas	b8a0467e10	Preserve copyright notice from old errcodes.h file.	2011-02-03 22:38:02 -05:00
Robert Haas	ddfe26f644	Avoid maintaining three separate copies of the error codes list. src/pl/plpgsql/src/plerrcodes.h, src/include/utils/errcodes.h, and a big chunk of errcodes.sgml are now automatically generated from a single file, src/backend/utils/errcodes.txt. Jan Urbański, reviewed by Tom Lane.	2011-02-03 22:32:49 -05:00
Bruce Momjian	35b0a6b205	Simplify code used in is_absolute_path() macro; also add comment about 'E:abc' Win32 path handling.	2011-02-03 10:47:06 -05:00
Magnus Hagander	76129e7f14	Include more status information in walsender results. Add the current xlog insert location to the response of IDENTIFY_SYSTEM, and add result sets containing start and stop location of backups to BASE_BACKUP responses.	2011-02-03 13:46:23 +01:00
Bruce Momjian	426227850b	Rename function to first_path_var_separator() to clarify it works with path variables, not directory paths.	2011-02-02 22:49:54 -05:00
Bruce Momjian	bffb638d16	Clarify macro IS_PATH_VAR_SEP in path.c so it is clear this is a path variable, not a directory path.	2011-02-02 22:28:45 -05:00
Robert Haas	0af695fd43	Log restartpoints in the same fashion as checkpoints. Prior to 9.0, restartpoints never created, deleted, or recycled WAL files, but now they can. This code makes log_checkpoints treat checkpoints and restartpoints symmetrically. It also adjusts up the documentation of the parameter to mention restartpoints. Fujii Masao. Docs by me, as suggested by Itagaki Takahiro.	2011-02-02 21:08:53 -05:00
Tom Lane	907855ac75	Clean up missed change to plpython expected files.	2011-02-02 20:16:27 -05:00
Peter Eisentraut	0c5933d010	Wrap PL/Python SPI calls into subtransactions. This allows the language-specific try/catch construct to catch and handle exceptions arising from SPI calls, matching the behavior of other PLs. As an additional bonus you no longer get all the ugly "unrecognized error in PLy_spi_execute_query" errors. Jan Urbański, reviewed by Steve Singer	2011-02-02 22:06:10 +02:00
Andrew Dunstan	c73fe72e27	Add comment on why we're passing a useless 'false' to the plperl function compiler. It's for compatibility with modules like PostgreSQL::PLPerl::NYTProf.	2011-02-02 12:45:42 -05:00
Peter Eisentraut	15f55cc38a	Add validator to PL/Python. Jan Urbański, reviewed by Hitoshi Harada	2011-02-01 22:55:04 +02:00
Andrew Dunstan	ef19dc6d39	Set up PLPerl trigger data using C code instead of Perl code. This is an efficiency change, and means we now no longer have to run "our $_TD; local $_TD = shift;", which was especially pointless in the case of non-trigger functions where the passed value was always undef anyway. A tiny open issue is whether we should get rid of the $prolog argument of mkfunc, and the corresponding pushed value, which is now just a constant "false". Tim Bunce, reviewed by Alex Hunsaker.	2011-02-01 09:43:25 -05:00
Magnus Hagander	5273f21434	Undefine setlocale() macro on Win32. New versions of libintl redefine setlocale() to a macro which causes problems when the backend and libintl are linked against different versions of the runtime, which is often the case in msvc builds. Hiroshi Inoue, slightly updated comment by me	2011-02-01 13:19:18 +01:00
Simon Riggs	56b21b7ae3	Re-classify ERRCODE_DATABASE_DROPPED to 57P04	2011-02-01 08:44:01 +00:00
Itagaki Takahiro	0c707aa458	Fix wrong error reports in 'number of array dimensions exceeds the maximum allowed' messages, that have reported one-less dimensions. Alexey Klyukin	2011-02-01 15:21:32 +09:00
Simon Riggs	9e95c9ad55	Create new errcode for recovery conflict caused by db drop on master. Previously reported as ERRCODE_ADMIN_SHUTDOWN, this case is now reported as ERRCODE_T_R_DATABASE_DROPPED. No message text change. Unlikely to happen on most servers, so low impact change to allow session poolers to correctly handle this situation. Tatsuo Ishii, edits by me, review by Robert Haas	2011-02-01 00:20:53 +00:00
Simon Riggs	8585ad3625	Fix error code for canceling statement due to conflict with recovery. All retryable conflict errors now have an error code that indicates that a retry is possible, correcting my incomplete fix of 2010/05/12. Tatsuo Ishii and Simon Riggs, input from Robert Haas and Florian Pflug	2011-01-31 19:20:23 +00:00
Heikki Linnakangas	32866837f0	Fix typo	2011-01-31 18:29:38 +02:00
Heikki Linnakangas	997b48ed96	Support multiple concurrent pg_basebackup backups. With this patch, pg_basebackup doesn't write a backup_label file in the data directory, so it doesn't interfere with a pg_start/stop_backup() based backup anymore. backup_label is still included in the backup, but it is injected directly into the tar stream. Heikki Linnakangas, reviewed by Fujii Masao and Magnus Hagander.	2011-01-31 18:25:39 +02:00
Andrew Dunstan	48c9de8028	Fix typo	2011-01-30 20:34:05 -05:00
Andrew Dunstan	91812df4ed	Enable building with the Mingw64 compiler. This can be used to build 64 bit Windows binaries, not only on 64 bit Windows but on supported cross-compiling hosts including 32 bit Windows, Cygwin, Darwin and Linux.	2011-01-30 19:56:46 -05:00
Tom Lane	9688c4e6f1	Make reduce_outer_joins() smarter about semijoins. reduce_outer_joins() mistakenly treated a semijoin like a left join for purposes of deciding whether not-null constraints created by the join's quals could be passed down into the join's left-hand side (possibly resulting in outer-join simplification there). Actually, semijoin works like inner join for this purpose, ie, we do not need to see any rows that can't possibly satisfy the quals. Hence, two-line fix to treat semi and inner joins alike. Per observation by Andres Freund about a performance gripe from Yazan Suleiman. Back-patch to 8.4, since this oversight has been there since the current handling of semijoins was implemented.	2011-01-30 17:04:31 -05:00
Magnus Hagander	507069de6d	Add option to include WAL in base backup When included, this makes the base backup a complete working "clone" of the initial database, ready to have a postmaster started against it without the need to set up any log archiving or similar. Magnus Hagander, reviewed by Fujii Masao and Heikki Linnakangas	2011-01-30 21:30:09 +01:00
Magnus Hagander	4ea1a273fb	Use GSSAPI library for SSPI auth, when native SSPI is not available This allows non-Windows clients to connect to a Windows server with SSPI authentication. Christian Ullrich, largely modified by me	2011-01-29 17:06:55 +01:00
Robert Haas	7f242d880b	Try to avoid running with a full fsync request queue. When we need to insert a new entry and the queue is full, compact the entire queue in the hopes of making room for the new entry. Doing this on every insertion might worsen contention on BgWriterCommLock, but when the queue is full, it's far better than allowing the backend to perform its own fsync, per testing by Greg Smith as reported in http://archives.postgresql.org/pgsql-hackers/2011-01/msg02665.php Original idea from Greg Smith. Patch by me. Review by Chris Browne and Greg Smith	2011-01-29 08:08:41 -05:00
Tom Lane	0ac8c8df85	Don't include <asm/ia64regs.h> unnecessarily. We only need that header when compiling with icc, since the gcc variant of ia64_get_bsp() uses in-line assembly code. Per report from Frank Brendel, the header doesn't exist on all IA64 platforms; so don't include it unless we need it.	2011-01-27 16:27:27 -05:00
Heikki Linnakangas	1e4baa5c96	Update psql's \copyright to match the text we have in the COPYRIGHT file.	2011-01-27 20:20:49 +02:00
Robert Haas	a40b1e0bf3	Restore ALTER TABLE .. ADD COLUMN w/DEFAULT restriction. This reverts commit `a06e41deeb` of 2011-01-26. Per discussion, this behavior is not wanted, as it would need to change if we ever made composite types support DEFAULT.	2011-01-27 08:35:34 -05:00
Tom Lane	7ab6f2da23	Change inv_truncate() to not repeat its systable_getnext_ordered() scan. In the case where the initial call of systable_getnext_ordered() returned NULL, this function would nonetheless call it again. That's undefined behavior that only by chance failed to give visibly incorrect results. Put an if-test around the final loop to prevent that, and in passing improve some comments. No back-patch since there's no actual failure. Per report from YAMAMOTO Takashi.	2011-01-26 19:33:50 -05:00
Peter Eisentraut	6fe5e4e63e	autoreconf Synchronize pg_config.h.in with configure.in (someone must have forgotten to run autoheader or autoreconf), and clean up some spurious change in configure introduced by the last commit there.	2011-01-27 01:19:45 +02:00
Peter Eisentraut	5829738868	Do not prefix error messages with the string "PL/Python: " It is redundant, given the error context. Jan Urbański	2011-01-27 01:00:58 +02:00
Peter Eisentraut	582b5ac62e	Improve exception usage in PL/Python Use the built-in TypeError, not SPIError, for errors having to do with argument counts or types. Use SPIError, not simply plpy.Error, for errors in PLy_spi_execute_plan. Finally, do not set a Python exception if PyArg_ParseTuple failed, as it already sets the correct exception. Jan Urbański	2011-01-27 00:47:14 +02:00
Peter Eisentraut	418df3a5dd	Also save the error detail in SPIError The temporarily broken plpython_unicode test shows a case where this is used. Do remaining fix-ups on the expected files at the same time.	2011-01-27 00:35:28 +02:00
Peter Eisentraut	ddf8c16822	Fix compiler warnings Older versions of GCC appear to report these with the current standard option set, newer versions need -Wformat-security.	2011-01-27 00:19:15 +02:00
Robert Haas	5c2a7c6e97	Add a comment explaining why we force physical removal of OIDs. Noah Misch, slightly revised.	2011-01-26 06:42:51 -05:00
Robert Haas	a06e41deeb	Remove arbitrary ALTER TABLE .. ADD COLUMN restriction. The previous coding prevented ALTER TABLE .. ADD COLUMN from being used with a non-NULL default in situations where the table's rowtype was being used elsewhere. But this is a completely arbitrary restriction since you could do the same operation in multiple steps (add the column, add the default, update the table). Inspired by a patch from Noah Misch, though I didn't use his code.	2011-01-26 06:37:08 -05:00
Tom Lane	bd1ad1b019	Replace pg_class.relhasexclusion with pg_index.indisexclusion. There isn't any need to track this state on a table-wide basis, and trying to do so introduces undesirable semantic fuzziness. Move the flag to pg_index, where it clearly describes just a single index and can be immutable after index creation.	2011-01-25 17:51:59 -05:00
Tom Lane	88452d5ba6	Implement ALTER TABLE ADD UNIQUE/PRIMARY KEY USING INDEX. This feature allows a unique or pkey constraint to be created using an already-existing unique index. While the constraint isn't very functionally different from the bare index, it's nice to be able to do that for documentation purposes. The main advantage over just issuing a plain ALTER TABLE ADD UNIQUE/PRIMARY KEY is that the index can be created with CREATE INDEX CONCURRENTLY, so that there is not a long interval where the table is locked against updates. On the way, refactor some of the code in DefineIndex() and index_create() so that we don't have to pass through those functions in order to create the index constraint's catalog entries. Also, in parse_utilcmd.c, pass around the ParseState pointer in struct CreateStmtContext to save on notation, and add error location pointers to some error reports that didn't have one before. Gurjeet Singh, reviewed by Steve Singer and Tom Lane	2011-01-25 15:43:05 -05:00
Magnus Hagander	966d4f52c2	Typo fix for MemSet size. Fujii Masao	2011-01-25 10:50:04 +01:00
Peter Eisentraut	77ff840835	Document the "S" option for psql's \dn command in the psql help This option was recently introduced, but the documentation in help.c was not updated.	2011-01-25 01:51:35 +02:00
Peter Eisentraut	88dcdf9007	Call PLy_spi_execute_fetch_result inside the try/catch block This way errors from fetching tuples are correctly reported as errors in the SPI call. While at it, avoid palloc(0). Jan Urbański	2011-01-25 00:43:25 +02:00
Peter Eisentraut	52713d02c7	Refactor PLy_spi_prepare to save two levels of indentation Instead of checking whether the arglist is NULL and then if its length is 0, do it in one step, and outside of the try/catch block. Jan Urbański	2011-01-24 22:13:06 +02:00
Heikki Linnakangas	74be35b07c	Fix typo in the psql \d query handling, so that we use the correct query against 9.0 servers.	2011-01-24 14:34:15 +02:00
Magnus Hagander	9752080942	Exclude sepgsql from MSVC regression testing as well In passing, change exclusion in the build to follow the same pattern as other always-excluded modules.	2011-01-24 08:24:31 +01:00
Heikki Linnakangas	56d77c9e56	Silence compiler warning about uninitialized variable, noted by Itagaki Takahiro	2011-01-24 08:28:35 +02:00
Robert Haas	c26ac226e4	Blind attempt to exclude sepgsql from MSVC build system.	2011-01-23 22:57:32 -05:00
Robert Haas	968bc6fac9	sepgsql, an SE-Linux integration for PostgreSQL This is still pretty rough - among other things, the documentation needs work, and the messages need a visit from the style police - but this gets the basic framework in place. KaiGai Kohei	2011-01-23 20:48:27 -05:00
Magnus Hagander	e5487f65fd	Make walsender options order-independent While doing this, also move base backup options into a struct instead of increasing the number of parameters to multiple functions for each new option.	2011-01-23 23:39:18 +01:00
Magnus Hagander	39e911e28a	Reorder includes to unbreak MSVC	2011-01-23 22:44:07 +01:00
Heikki Linnakangas	7f508f1c6b	Add 'directory' format to pg_dump. The new directory format is compatible with the 'tar' format, in that untarring a tar format archive produces a valid directory format archive. Joachim Wieland and Heikki Linnakangas	2011-01-23 23:10:15 +02:00
Tom Lane	f36920796e	Fix another portability issue in pg_basebackup. The target of sscanf with a %o format had better be of integer width, but "mode_t" conceivably isn't that. Another compiler warning seen only on some platforms; this one I think is potentially a real bug and not just a warning.	2011-01-23 14:26:51 -05:00
Tom Lane	dd5f0db96b	Improve getObjectDescription's display of pg_amop and pg_amproc entries. Include the lefttype/righttype columns explicitly (instead of assuming the reader can deduce them from the operator or function description), and move the operator or function description to the end of the string, to make it clearer that it's a referenced object and not the amop or amproc item itself. Per extensive discussion of Andreas Karlsson's original patch. Andreas Karlsson, Tom Lane	2011-01-23 14:13:46 -05:00
Tom Lane	de3c2d6e92	Revert "Factor out functions responsible for caching I/O routines". This reverts commit `740e54ca84`, which seems to have tickled an optimization bug in gcc 4.5.x, as reported upstream at https://bugzilla.redhat.com/show_bug.cgi?id=671899 Since this patch had no purpose beyond code beautification, it's not worth expending a lot of effort to look for another workaround.	2011-01-23 13:12:55 -05:00
Tom Lane	10e99f15d4	Add .gitignore file to silence complaints about pg_basebackup.	2011-01-23 13:07:34 -05:00
Tom Lane	b3cfcdaad2	Suppress uninitialized-variable warning.	2011-01-23 13:06:38 -05:00
Andrew Dunstan	6c41cf5977	Silence flex warnings about DOS file paths in MSVC builds	2011-01-23 12:24:15 -05:00
Magnus Hagander	d13e0975c9	Use pg_strcasecmp instead of strcasecmp for portability Per buildfarm.	2011-01-23 17:35:02 +01:00
Magnus Hagander	f88a638199	Only show pg_stat_replication details to superusers	2011-01-23 17:28:19 +01:00
Magnus Hagander	fe12263c9f	filemode is parsed on win32 even if never used Per buildfarm failure.	2011-01-23 14:45:23 +01:00
Magnus Hagander	048d148fe6	Add pg_basebackup tool for streaming base backups This tool makes it possible to do the pg_start_backup/copy files/pg_stop_backup step in a single command. There are still some steps to be done before this is a complete backup solution, such as the ability to stream the required WAL logs, but it's still usable, and could do with some buildfarm coverage. In passing, make the checkpoint request optionally fast instead of hardcoding it. Magnus Hagander, reviewed by Fujii Masao and Dimitri Fontaine	2011-01-23 12:21:23 +01:00
Robert Haas	6f59777c65	Code cleanup for assign_transaction_read_only. As in commit `fb4c5d2798` on 2011-01-21, this avoids spurious debug messages and allows idempotent changes at any time. Along the way, make assign_XactIsoLevel allow idempotent changes even when not within a subtransaction, to be consistent with the new coding of assign_transaction_read_only and because there's no compelling reason to do otherwise. Kevin Grittner, with some adjustments.	2011-01-22 20:55:50 -05:00
Tom Lane	cc73c16050	Quick hack to un-break plpython regression tests. It's not clear to me what should happen to the other plpython_unicode variant expected files, but this patch gets things passing on my own machines and at least some of the buildfarm.	2011-01-22 20:43:54 -05:00
Tom Lane	0f73aae13d	Allow the wal_buffers setting to be auto-tuned to a reasonable value. If wal_buffers is initially set to -1 (which is now the default), it's replaced by 1/32nd of shared_buffers, with a minimum of 8 (the old default) and a maximum of the XLOG segment size. The allowed range for manual settings is still from 4 up to whatever will fit in shared memory. Greg Smith, with implementation correction by me.	2011-01-22 20:31:24 -05:00
Tom Lane	518b1e96c0	Suppress "control reaches end of non-void function" warning from gcc 4.5. Not sure why I'm seeing this on Fedora 14 and not earlier versions. Seems like a regression that gcc no longer knows that DIE() doesn't return. Still, adding a dummy return is harmless enough.	2011-01-22 18:01:31 -05:00
Tom Lane	e2627258c3	Suppress possibly-uninitialized-variable warnings from gcc 4.5. It appears that gcc 4.5 can issue such warnings for whole structs, not just scalar variables as in the past. Refactor some pg_dump code slightly so that the OutputContext local variables are always initialized, even if they won't be used. It's cheap enough to not be worth worrying about.	2011-01-22 17:56:42 -05:00
Peter Eisentraut	116ce2f4d0	Get rid of the global variable holding the error state Global error handling led to confusion and was hard to manage. With this change, errors from PostgreSQL are immediately reported to Python as exceptions. This requires setting a Python exception after reporting the caught PostgreSQL error as a warning, because PLy_elog destroys the Python exception state. Ideally, all places where PostgreSQL errors need to be reported back to Python should be wrapped in subtransactions, to make going back to Python from a longjmp safe. This will be handled in a separate patch. Jan Urbański	2011-01-22 22:12:32 +02:00
Magnus Hagander	f5a0fd2f3b	Link libpgport into pg_test_fsync on msvc	2011-01-22 18:18:27 +01:00
Robert Haas	a0c75f5539	Avoid treating WAL senders as normal backends. The previous coding treated anything that wasn't an autovacuum launcher as a normal backend, which is wrong now that we also have WAL senders. Fujii Masao, reviewed by Robert Haas, Alvaro Herrera, Tom Lane, and Bernd Helmle.	2011-01-21 22:23:01 -05:00
Robert Haas	fb4c5d2798	Code cleanup for assign_XactIsoLevel. The new coding avoids a spurious debug message when a transaction that has changed the isolation level has been rolled back. It also allows the property to be freely changed to the current value within a subtransaction. Kevin Grittner, with one small change by me.	2011-01-21 21:49:19 -05:00
Peter Eisentraut	4609caf364	Correctly add exceptions to the plpy module for Python 3 The way the exception types were added to the module was wrong for Python 3. Exception classes were not actually available from plpy. Fix that by factoring out code that is responsible for defining new Python exceptions and make it work with Python 3. New regression test makes sure the plpy module has the expected contents. Jan Urbański, slightly revised by me	2011-01-21 23:46:56 +02:00
Bruce Momjian	606a3d54fc	Move test_fsync to /contrib.	2011-01-21 12:47:54 -05:00
Heikki Linnakangas	8aea1373d8	Don't require usage privileges on the foreign data wrapper when creating a foreign table. We check for usage privileges on the foreign server, that ought to be enough. Shigeru HANADA	2011-01-21 15:05:20 +02:00
Robert Haas	8ceb245680	Make ALTER TABLE revalidate uniqueness and exclusion constraints. Failure to do so can lead to constraint violations. This was broken by commit `1ddc2703a9` on 2010-02-07, so back-patch to 9.0. Noah Misch. Regression test by me.	2011-01-20 22:44:10 -05:00
Peter Eisentraut	14b9f69cb2	Fix wrong comment Hitoshi Harada	2011-01-20 22:04:36 +02:00
Peter Eisentraut	81f79dbf2e	Fix typo Hitoshi Harada	2011-01-20 22:01:10 +02:00
Peter Eisentraut	740e54ca84	Factor out functions responsible for caching I/O routines This makes PLy_procedure_create a bit more manageable. Jan Urbański	2011-01-20 21:23:27 +02:00
Robert Haas	9c5e2c120b	Add new psql command \dL to list languages. Original patch by Fernando Ike, revived by Josh Kuperschmidt, reviewed by Andreas Karlsson, and in earlier versions by Tom Lane and Peter Eisentraut.	2011-01-20 00:00:30 -05:00
Peter Eisentraut	fbed5d4830	Add braces around an if block, for readability Jan Urbański, reviewed by Peter Eisentraut, Álvaro Herrera, Tom Lane :-)	2011-01-19 21:56:21 +02:00
Peter Eisentraut	847e8c7783	Free plan values in the PLyPlanObject dealloc function Jan Urbański	2011-01-19 00:10:19 +02:00
Peter Eisentraut	719461b7a2	Improve message for errors in compiling anonymous PL/Python blocks The previous code would try to print out a null pointer. Jan Urbański	2011-01-19 00:04:46 +02:00
Peter Eisentraut	d9a95c0adb	Use PyObject_New instead of PyObject_NEW The latter is undocumented and the speed gain is negligible. Jan Urbański	2011-01-18 23:53:10 +02:00
Peter Eisentraut	41282111e6	Skip dropped attributes when converting Python objects to tuples Pay attention to the attisdropped field and skip over TupleDesc fields that have it set. Not a real problem until we get table returning functions, but it's the right thing to do anyway. Jan Urbański	2011-01-18 23:39:09 +02:00
Peter Eisentraut	59ea9ef9aa	Use palloc in TopMemoryContext instead of malloc As discussed, even if the PL needs a permanent memory location, it should use palloc, not malloc. It also makes error handling easier. Jan Urbański	2011-01-18 23:27:53 +02:00
Peter Eisentraut	88047e59ba	Fix an error when a set-returning function fails halfway through the execution If the function using yield to return rows fails halfway, the iterator stays open and subsequent calls to the function will resume reading from it. The fix is to unref the iterator and set it to NULL if there has been an error. Jan Urbański	2011-01-18 23:22:37 +02:00
Bruce Momjian	8995440e38	In test_fsync, adjust test headings to match wal_sync_method values; add more test cases for open_sync of different sizes.	2011-01-18 15:53:55 -05:00
Tom Lane	1b393f4e5d	Avoid detoast in texteq/textne/byteaeq/byteane for unequal-length strings. We can get the length of a compressed or out-of-line datum without actually detoasting it. If the lengths of two strings are unequal, we can then conclude they are unequal without detoasting. That saves considerable work in an admittedly less-common case, without costing anything much when the optimization doesn't apply. Noah Misch	2011-01-18 14:11:54 -05:00
Magnus Hagander	6e1726d082	Log replication connections only when log_connections is on Previously we'd always log replication connections, with no way to turn them off.	2011-01-18 20:02:25 +01:00
Heikki Linnakangas	b1dc45c11d	Fix thinko in comment. Spotted by Jim Nasby.	2011-01-18 10:46:13 +02:00
Bruce Momjian	4acfd43a7d	Remove "github test" that somehow got into my tree. Sorry.	2011-01-17 21:40:42 -05:00
Bruce Momjian	2c38cce1be	github test	2011-01-17 20:48:49 -05:00
Peter Eisentraut	46211da1b8	Use HTABs instead of Python dictionary objects to cache procedures Two separate hash tables are used for regular procedures and for trigger procedures, since the way trigger procedures work is quite different from normal stored procedures. Change the signatures of PLy_procedure_{get,create} to accept the function OID and a Boolean flag indicating whether it's a trigger. This should make implementing a PL/Python validator easier. Using HTABs instead of Python dictionaries makes error recovery easier, and allows for procedures to be cached based on their OIDs, not their names. It also allows getting rid of the PyCObject field that used to hold a pointer to PLyProcedure, since PyCObjects are deprecated in Python 2.7 and replaced by Capsules in Python 3. Jan Urbański	2011-01-17 21:46:36 +02:00
Tom Lane	bdd8ed973d	Fix miscalculation of itemsafter in array_set_slice(). If the slice to be assigned to was before the existing array lower bound (requiring at least one null element to spring into existence to fill the gap), the code miscalculated how many entries needed to be copied from the old array's null bitmap. This could result in trashing the array's data area (as seen in bug #5840 from Karsten Loesing), or worse. This has been broken since we first allowed the behavior of assigning to non-adjacent slices, in 8.2. Back-patch to all affected versions.	2011-01-17 12:38:52 -05:00
Alvaro Herrera	978445bece	Increment Py_None refcount for NULL array elements Per bug #5835 by Julien Demoor Author: Alex Hunsaker	2011-01-17 13:04:53 -03:00
Bruce Momjian	08af45f4ff	Add getopt() support to test_fsync; also fix printf() format problem.	2011-01-17 09:36:25 -05:00
Magnus Hagander	48075095ac	Set fallback_application_name in walreceiver Makes replication slaves identify themselves in the new pg_stat_replication view.	2011-01-17 11:42:53 +01:00
Heikki Linnakangas	34ef02b4d4	Before exiting walreceiver, fsync() all the WAL received. Otherwise WAL recovery will replay the un-flushed WAL after walreceiver has exited, which can lead to a non-recoverable standby if the system crashes hard at that point.	2011-01-17 12:27:35 +02:00
Bruce Momjian	e0c274679c	In test_fsync, use #define for printf format of ops/sec.	2011-01-16 08:36:43 -05:00
Bruce Momjian	6dc15e3bef	Use O_DIRECT in O_SYNC test of different size. Restructure O_DIRECT error reporting to be more consistent.	2011-01-15 19:40:49 -05:00
Bruce Momjian	3eebb33ddd	Reverse number of stars used for test_fsync details.	2011-01-15 18:40:10 -05:00
Bruce Momjian	431605f666	In test_fsync, warn about options without o_direct that are not used by Postgres, and cases where o_direct does not work with certain file systems.	2011-01-15 18:27:43 -05:00
Tom Lane	6ca452ba7f	Move a couple of declarations to reflect where the routines really are.	2011-01-15 16:09:05 -05:00
Tom Lane	36750dcef5	Add .gitignore to silence git complaints about parser/scanner output files.	2011-01-15 16:05:28 -05:00
Bruce Momjian	001d3664e3	Have test_fsync output details that fdatasync is the default wal_sync_method on Linux.	2011-01-15 15:00:20 -05:00
Bruce Momjian	169516ad93	Restructure test_fsync to use modular C so there is less duplicate code and it can be enhanced more easily.	2011-01-15 14:42:48 -05:00
Magnus Hagander	3866ff6149	Enumerate available tablespaces after starting the backup This closes a race condition where if a tablespace was created after the enumeration happened but before the do_pg_start_backup() was called, the backup would be incomplete. Now that it's done while we are in backup mode, WAL replay will recreate it during restore. Noted by Heikki.	2011-01-15 19:31:16 +01:00
Bruce Momjian	3ab80cfe03	Improve output display of test_fsync.	2011-01-15 12:24:05 -05:00
Bruce Momjian	677b06ca46	Apply patch for test_fsync to add tests for O_DIRECT. Adjusted patch by Josh Berkus	2011-01-15 11:55:13 -05:00
Heikki Linnakangas	8f5d65e916	Treat a WAL sender process that hasn't started streaming yet as a regular backend, as far as the postmaster shutdown logic is concerned. That means, fast shutdown will wait for WAL sender processes to exit before signaling bgwriter to finish. This avoids race conditions between a base backup stopping or starting, and bgwriter writing the shutdown checkpoint WAL record. We don't want e.g. the end-of-backup WAL record to be written after the shutdown checkpoint.	2011-01-15 16:38:21 +02:00
Magnus Hagander	fcd810c69a	Use a lexer and grammar for parsing walsender commands Makes it easier to parse mainly the BASE_BACKUP command with its options, and avoids having to manually deal with quoted identifiers in the label (previously broken), and makes it easier to add new commands and options in the future. In passing, refactor the case statement in the walsender to put each command in its own function.	2011-01-14 16:30:33 +01:00
Magnus Hagander	688423d004	Exit from base backups when shutdown is requested When the exit waits until the whole backup completes, it may take a very long time. In passing, add back an error check in the main loop so we detect clients that disconnect much earlier if the backup is large.	2011-01-14 12:36:45 +01:00
Tom Lane	52948169bc	Code review for postmaster.pid contents changes. Fix broken test for pre-existing postmaster, caused by wrong code for appending lines to the lockfile; don't write a failed listen_address setting into the lockfile; don't arbitrarily change the location of the data directory in the lockfile compared to previous releases; provide more consistent and useful definitions of the socket path and listen_address entries; avoid assuming that pg_ctl has the same DEFAULT_PGSOCKET_DIR as the postmaster; assorted code style improvements.	2011-01-13 19:01:28 -05:00
Tom Lane	f0f36045b2	Revert incorrect memory-conservation hack in inheritance_planner(). This reverts commit `d1001a78ce` of 2010-12-05, which was broken as reported by Jeff Davis. The problem is that the individual planning steps may have side-effects on substructures of PlannerGlobal, not only the current PlannerInfo root. Arranging to keep all such side effects in the main planning context is probably possible, but it would change this from a quick local hack into a wide-ranging and rather fragile endeavor. Which it's not worth.	2011-01-13 14:33:19 -05:00
Magnus Hagander	9eacd427e8	Make sure walsender state is only read while holding the spinlock Noted by Robert Haas.	2011-01-13 18:51:13 +01:00
Heikki Linnakangas	a5a02a7445	Fix the logic in libpqrcv_receive() to determine if there's any incoming data that can be read without blocking. It used to conclude that there isn't, even though there was data in the socket receive buffer. That led walreceiver to flush the WAL after every received chunk, potentially causing big performance issues. Backpatch to 9.0, because the performance impact can be very significant.	2011-01-13 18:26:39 +02:00
Peter Eisentraut	c667cc24e8	Workaround for recursive make breakage Changing a file two directory levels deep under src/backend/ would not cause the postgres binary to be rebuilt. This change fixes it, but no one knows why.	2011-01-13 09:32:06 +02:00
Peter Eisentraut	35eb0958be	Don't run regression tests in SQL_ASCII encoding by default Instead, run them in the encoding that the locale selects, which is more representative of real use. Also document how locale and encoding for regression test runs can be selected.	2011-01-13 09:16:55 +02:00
Tom Lane	d487afbb81	Fix PlanRowMark/ExecRowMark structures to handle inheritance correctly. In an inherited UPDATE/DELETE, each target table has its own subplan, because it might have a column set different from other targets. This means that the resjunk columns we add to support EvalPlanQual might be at different physical column numbers in each subplan. The EvalPlanQual rewrite I did for 9.0 failed to account for this, resulting in possible misbehavior or even crashes during concurrent updates to the same row, as seen in a recent report from Gordon Shannon. Revise the data structure so that we track resjunk column numbers separately for each subplan. I also chose to move responsibility for identifying the physical column numbers back to executor startup, instead of assuming that numbers derived during preprocess_targetlist would stay valid throughout subsequent massaging of the plan. That's a bit slower, so we might want to consider undoing it someday; but it would complicate the patch considerably and didn't seem justifiable in a bug fix that has to be back-patched to 9.0.	2011-01-12 20:47:02 -05:00
Robert Haas	7a32ff9732	Revert patch adding support for logging the current role. This reverts commit `a8a8867912`, committed by me earlier today (2011-01-12). This isn't safe inside an aborted transaction. Noted by Tom Lane.	2011-01-12 11:59:21 -05:00
Robert Haas	a8a8867912	Add support for logging the current role. Stephen Frost, with some editorialization by me.	2011-01-12 11:34:53 -05:00
Andrew Dunstan	b7a0b42641	Unbreak regression tests, apparently broken by commit `4c8e20f`	2011-01-11 22:27:20 -05:00
Peter Eisentraut	e3094fd3a8	Re-add recursive coverage target in src/backend/ This was lost during the recent recursive make change.	2011-01-12 00:26:20 +02:00
Magnus Hagander	4c8e20f815	Track walsender state in shared memory and expose in pg_stat_replication	2011-01-11 21:25:28 +01:00
Magnus Hagander	47a5f3e9da	Add missing function prototype, for consistency	2011-01-11 21:12:12 +01:00
Tom Lane	e6dce4e439	Adjust basebackup.c to suppress compiler warnings. Some versions of gcc complain about "variable `tablespaces' might be clobbered by `longjmp' or `vfork'" with the original coding. Fix by moving the PG_TRY block into a separate subroutine.	2011-01-11 13:41:13 -05:00
Tom Lane	9d1ac2f5fa	Tweak create_index_paths()'s test for whether to consider a bitmap scan. Per my note of a couple days ago, create_index_paths would refuse to consider any path at all for GIN indexes if the selectivity estimate came out as 1.0; not even if you tried to force it with enable_seqscan. While this isn't really a bad outcome in practice, it could be annoying for testing purposes. Adjust the test for "is this path only useful for sorting" so that it doesn't fire on paths with nil pathkeys, which will include all GIN paths.	2011-01-11 12:13:02 -05:00
Magnus Hagander	b7ebda9d8c	Reset walsender ps title in the main loop When in streaming mode we can never get out, so it will never be required, but after a base backup (or other operations) we can get back to the loop, so the title needs to be cleared.	2011-01-11 10:04:54 +01:00
Magnus Hagander	2e36343f82	Set process title to indicate base backup is running	2011-01-10 21:53:18 +01:00
Heikki Linnakangas	dc1305ce5f	Leave temporary files out of streaming base backups.	2011-01-10 19:42:05 +02:00
Magnus Hagander	0eb59c4591	Backend support for streaming base backups Add BASE_BACKUP command to walsender, allowing it to stream a base backup to the client (in tar format). The syntax is still far from ideal, that will be fixed in the switch to use a proper grammar for walsender. No client included yet, will come as a separate commit. Magnus Hagander and Heikki Linnakangas	2011-01-10 14:04:19 +01:00
Magnus Hagander	4448917d51	Split pg_start_backup() and pg_stop_backup() into two pieces Move the actual functionality into a separate function that's easier to call internally, and change the SQL-callable function to be a wrapper calling this. Also create a pg_abort_backup() function, only callable internally, that does only the most vital parts of pg_stop_backup(), making it safe(r) to call from error handlers.	2011-01-09 21:00:28 +01:00
Heikki Linnakangas	ca63029eac	Fix crash in the new GiST insertion code, when an update splits the root page. This bug was exercised by contrib/intarray/bench, as noted by Tom Lane.	2011-01-09 21:36:22 +02:00
Tom Lane	52fd2d65a3	Fix up core tsquery GIN support for new extractQuery API. No need for the empty-prefix-match kluge to force a full scan anymore.	2011-01-09 14:34:50 -05:00
Tom Lane	304845075c	Use array_contains_nulls instead of ARR_HASNULL on user-supplied arrays. This applies the fix for bug #5784 to remaining places where we wish to reject nulls in user-supplied arrays. In all these places, there's no reason not to allow a null bitmap to be present, so long as none of the current elements are actually null. I did not change some other places where we are looking at system catalog entries or aggregate transition values, as the presence of a null bitmap in such an array would be suspicious.	2011-01-09 13:09:07 -05:00
Magnus Hagander	361418be7c	Ensure the directory for gram.h is created on win32 Result of bad testing of my last commit.	2011-01-09 17:01:15 +01:00
Magnus Hagander	3457514c2d	Properly install gram.h on MSVC builds This file is now needed by pgAdmin builds, which started failing since it was missing in the installer builds.	2011-01-09 15:31:48 +01:00
Magnus Hagander	db4d22d0ef	Add pgreadlink() on Windows to read junction points Add support for reading back information about the symbolic links we've created with pgsymlink(), which are actually Junction Points. Just like pgsymlink() can only create directory symlinks, pgreadlink() can only read directory symlinks.	2011-01-09 15:09:19 +01:00
Michael Meskes	1066dbfb85	There is no need to have two identical functions in ecpg, thus removing one of them.	2011-01-09 12:47:43 +01:00
Tom Lane	adf328c0e1	Add array_contains_nulls() function in arrayfuncs.c. This will support fixing contrib/intarray (and probably other places) so that they don't have to fail on arrays that contain a null bitmap but no live null entries.	2011-01-08 20:26:14 -05:00
Tom Lane	4d1b76e49e	Fix up gincostestimate for new extractQuery API. The only reason this wasn't crashing while testing the core anyarray operators was that it was disabled for those cases because of passing the wrong type information to get_opfamily_proc :-(. So fix that too, and make it insist on finding the support proc --- in hindsight, silently doing nothing is not as sane a coping mechanism as all that.	2011-01-08 20:26:13 -05:00
Michael Meskes	833a2b57bc	In ecpg's parser removed a fixed length limit for constants defining an array dimension.	2011-01-08 23:04:50 +01:00
Tom Lane	7e2f906201	Remove pg_am.amindexnulls. The only use we have had for amindexnulls is in determining whether an index is safe to cluster on; but since the addition of the amclusterable flag, that usage is pretty redundant. In passing, clean up assorted sloppiness from the last patch that touched pg_am.h: Natts_pg_am was wrong, and ambuildempty was not documented.	2011-01-08 16:08:05 -05:00
Tom Lane	56a57473a9	Refactor GIN's handling of duplicate search entries. The original coding could combine duplicate entries only when they originated from the same qual condition. In particular it could not combine cases where multiple qual conditions all give rise to full-index scan requests, which is an expensive case well worth optimizing. Refactor so that duplicates are recognized across all the quals.	2011-01-08 14:48:08 -05:00
Bruce Momjian	d8d3d2a4f3	Fix pg_upgrade of large object permissions by preserving pg_auth.oid, which is stored in pg_largeobject_metadata. No backpatch to 9.0 because you can't migrate from 9.0 to 9.0 with the same catversion (because of tablespace conflict), and a pre-9.0 migration to 9.0 has no large object permissions to migrate.	2011-01-07 21:59:29 -05:00
Bruce Momjian	2896c87ce4	Force pg_upgrade to preserve pg_class.oid, not pg_class.relfilenode. Toast tables have identical pg_class.oid and pg_class.relfilenode, but for clarity it is good to preserve the pg_class.oid. Update comments regarding what is preserved, and do some variable/function renaming for clarity.	2011-01-07 21:26:13 -05:00
Tom Lane	a032d50128	Fix the built-in GIN support procedure declarations in pg_proc.h. Add more "internal" arguments so that these pg_proc entries reflect the current preferred API. This is purely a cosmetic change, since GIN doesn't actually consult the pg_proc entry when calling a support function. Accordingly, no catversion bump.	2011-01-07 20:40:48 -05:00
Tom Lane	73912e7fbd	Fix GIN to support null keys, empty and null items, and full index scans. Per my recent proposal(s). Null key datums can now be returned by extractValue and extractQuery functions, and will be stored in the index. Also, placeholder entries are made for indexable items that are NULL or contain no keys according to extractValue. This means that the index is now always complete, having at least one entry for every indexed heap TID, and so we can get rid of the prohibition on full-index scans. A full-index scan is implemented much the same way as partial-match scans were already: we build a bitmap representing all the TIDs found in the index, and then drive the results off that. Also, introduce a concept of a "search mode" that can be requested by extractQuery when the operator requires matching to empty items (this is just as cheap as matching to a single key) or requires a full index scan (which is not so cheap, but it sure beats failing or giving wrong answers). The behavior remains backward compatible for opclasses that don't return any null keys or request a non-default search mode. Using these features, we can now make the GIN index opclass for anyarray behave in a way that matches the actual anyarray operators for &&, <@, @>, and = ... which it failed to do before in assorted corner cases. This commit fixes the core GIN code and ginarrayprocs.c, updates the documentation, and adds some simple regression test cases for the new behaviors using the array operators. The tsearch and contrib GIN opclass support functions still need to be looked over and probably fixed. Another thing I intend to fix separately is that this is pretty inefficient for cases where more than one scan condition needs a full-index search: we'll run duplicate GinScanEntrys, each one of which builds a large bitmap. There is some existing logic to merge duplicate GinScanEntrys but it needs refactoring to make it work for entries belonging to different scan keys. 
Note that most of gin.h has been split out into a new file gin_private.h, so that gin.h doesn't export anything that's not supposed to be used by GIN opclasses or the rest of the backend. I did quite a bit of other code beautification work as well, mostly fixing comments and choosing more appropriate names for things.	2011-01-07 19:16:24 -05:00
Robert Haas	9b4271deb9	Document pg_stat_replication, bump catversion since that was overlooked. Itagaki Takahiro, edited by me.	2011-01-07 11:06:55 -05:00
Robert Haas	a9f72b4083	Improve recovery.conf.sample comments. Jehan-Guillaume de Rorthais, with some additional wordsmithing by me.	2011-01-07 11:01:25 -05:00
Itagaki Takahiro	a755ea33ae	New system view pg_stat_replication displays activity of wal sender processes. Itagaki Takahiro and Simon Riggs.	2011-01-07 20:35:38 +09:00
Bruce Momjian	46d28820b6	Improve C comments about backend variables set by pg_upgrade_support functions.	2011-01-06 22:45:36 -05:00
Tom Lane	6c596c29a3	Update sequence_1.out for recent changes in sequence regression test.	2011-01-06 10:58:32 -05:00
Bruce Momjian	5cff5b5779	Clarify pg_upgrade's creation of the map file structure. Also clean up pg_dump's calling of pg_upgrade_support functions.	2011-01-05 11:37:08 -05:00
Magnus Hagander	66a8a0428d	Give superusers REPLICATION permission by default. This can be overridden by using NOREPLICATION on the CREATE ROLE statement, but by default they will have it, making it backwards compatible and "less surprising" (given that superusers normally override all checks).	2011-01-05 14:24:17 +01:00
Itagaki Takahiro	14158f25cd	Improve psql tab completion for CREATE/ALTER ROLE [NO]REPLICATION. Missing support for VALID UNTIL in CREATE ROLE is also added.	2011-01-04 17:56:01 +09:00
Robert Haas	7f60be72b0	Fix crash in ALTER OPERATOR CLASS/FAMILY .. SET SCHEMA. In the previous coding, the parser emitted a List containing a C string, which is no good, because copyObject() can't handle it. Dimitri Fontaine	2011-01-03 22:08:55 -05:00
Robert Haas	dc8a14311a	Update comments in RecordTransactionCommit() to mention unlogged tables.	2011-01-03 10:29:22 -05:00
Magnus Hagander	77745cc7f1	Bump catversion, forgot in previous commit.	2011-01-03 12:50:30 +01:00
Magnus Hagander	40d9e94bd7	Add views and functions to monitor hot standby query conflicts. Add the view pg_stat_database_conflicts and a column to pg_stat_database, and the underlying functions to provide the information.	2011-01-03 12:46:03 +01:00
Magnus Hagander	c0e96b49e5	perltidy run on the MSVC build system. Forgot this with previous commit; line it up so it's easier to submit (readable) patches against the MSVC build system.	2011-01-03 10:44:56 +01:00
Peter Eisentraut	39b8843296	Implement remaining fields of information_schema.sequences view. Add new function pg_sequence_parameters that returns a sequence's start, minimum, maximum, increment, and cycle values, and use that in the view. (bug #5662; design suggestion by Tom Lane) Also slightly adjust the view's column order and permissions after review of SQL standard.	2011-01-02 15:15:21 +02:00
Robert Haas	e657b55e66	Fix typo. Noted by Magnus Hagander.	2011-01-02 07:26:10 -05:00
Robert Haas	0d692a0dc9	Basic foreign table support. Foreign tables are a core component of SQL/MED. This commit does not provide a working SQL/MED infrastructure, because foreign tables cannot yet be queried. Support for foreign table scans will need to be added in a future patch. However, this patch creates the necessary system catalog structure, syntax support, and support for ancillary operations such as COMMENT and SECURITY LABEL. Shigeru Hanada, heavily revised by Robert Haas	2011-01-01 23:48:11 -05:00
Robert Haas	d7acf6cc4a	Fix pg_dump support for security labels on columns. Along the way, correct an erroneous comment.	2011-01-01 17:44:28 -05:00
Peter Eisentraut	6a208aa404	Allow casting a table's row type to the table's supertype if it's a typed table. This is analogous to the existing facility that allows casting a row type to a supertable's row type.	2011-01-01 23:04:14 +02:00
Bruce Momjian	92a73d2190	Add #include <time.h> to pg_ctl.c to fix compiler warning.	2011-01-01 15:55:36 -05:00
Bruce Momjian	5d950e3b0c	Stamp copyrights for year 2011.	2011-01-01 13:18:15 -05:00
Bruce Momjian	30aeda4394	Include the first valid listen address in pg_ctl to improve server start "wait" detection and add postmaster start time to help determine if the postmaster is actually using the specified data directory.	2010-12-31 17:25:02 -05:00
Tom Lane	39c8dd6620	Invert and rename flag variable to improve code readability. No change in functionality. Per discussion with Robert.	2010-12-31 11:59:38 -05:00
Tom Lane	7b46401557	Move symbols for ExecMergeJoin's state machine into nodeMergejoin.c. There's no reason for these values to be known anywhere else. After doing this, executor/execdefs.h is vestigial and can be removed.	2010-12-30 22:12:40 -05:00
Tom Lane	f4e4b32743	Support RIGHT and FULL OUTER JOIN in hash joins. This is advantageous first because it allows us to hash the smaller table regardless of the outer-join type, and second because hash join can be more flexible than merge join in dealing with arbitrary join quals in a FULL join. For merge join all the join quals have to be mergejoinable, but hash join will work so long as there's at least one hashjoinable qual --- the others can be any condition. (This is true essentially because we don't keep per-inner-tuple match flags in merge join, while hash join can do so.) To do this, we need a has-it-been-matched flag for each tuple in the hashtable, not just one for the current outer tuple. The key idea that makes this practical is that we can store the match flag in the tuple's infomask, since there are lots of bits there that are of no interest for a MinimalTuple. So we aren't increasing the size of the hashtable at all for the feature. To write this without turning the hash code into even more of a pile of spaghetti than it already was, I rewrote ExecHashJoin in a state-machine style, similar to ExecMergeJoin. Other than that decision, it was pretty straightforward.	2010-12-30 20:26:08 -05:00
Alvaro Herrera	55573990ca	Avoid unnecessary public struct declaration in slru.h. Instead, declare a public wrapper of the sole function using it for external callers, so that they don't have to always pass a NULL argument. Author: Kevin Grittner	2010-12-30 12:09:17 -03:00
Robert Haas	d2bc1c9907	Bump XLOG_PAGE_MAGIC. The unlogged tables patch (commit `53dbc27c62`, 2010-12-29) should have done this, since it changes the format of an XLOG_SMGR_CREATE record.	2010-12-29 07:19:21 -05:00
Robert Haas	53dbc27c62	Support unlogged tables. The contents of an unlogged table are not WAL-logged; thus, they are not available on standby servers and are truncated whenever the database system enters recovery. Indexes on unlogged tables are also unlogged. Unlogged GiST indexes are not currently supported.	2010-12-29 06:48:53 -05:00
Magnus Hagander	9b8aff8c19	Add REPLICATION privilege for ROLEs. This privilege is required to do Streaming Replication, instead of superuser, making it possible to set up a SR slave that doesn't have write permissions on the master. Superuser privileges do NOT override this check, so in order to use the default superuser account for replication it must be explicitly granted the REPLICATION permissions. This is a backwards-incompatible change, in the interest of higher default security.	2010-12-29 11:05:03 +01:00
Tom Lane	f2ba1e994c	Avoid unexpected conversion overflow in planner for distant date values. The "date" type supports a wider range of dates than int64 timestamps do. However, there is pre-int64-timestamp code in the planner that assumes that all date values can be converted to timestamp with impunity. Fortunately, what we really need out of the conversion is always a double (float8) value; so even when the date is out of timestamp's range it's possible to produce a sane answer. All we need is a code path that doesn't try to force the result into int64. Per trouble report from David Rericha. Back-patch to all supported versions. Although this is surely a corner case, there's not much point in advertising a date range wider than timestamp's if we will choke on such values in unexpected places.	2010-12-28 22:49:57 -05:00
Tom Lane	81a530a65e	Fix ill-advised placement of PGRES_COPY_BOTH enum value. It must be added at the end of the ExecStatusType enum to avoid ABI breakage compared to previous libpq versions. Noted by Magnus.	2010-12-28 11:02:10 -05:00
Bruce Momjian	b4d3792daa	Another fix for larger postmaster.pid files.	2010-12-28 09:34:46 -05:00
Bruce Momjian	bada44a2a2	Fix code to properly pull out shared memory key now that the postmaster.pid file is larger than in previous major versions. This is a bug introduced when I added lines to the file recently.	2010-12-27 23:11:33 -05:00
Tom Lane	f79136439f	Remove -fno-operator-names switch from cpluspluscheck. No longer needed now that bitand() and bitor() have been renamed.	2010-12-27 15:03:24 -05:00
Tom Lane	84fc571395	Rename the C functions bitand(), bitor() to bit_and(), bit_or(). This is to avoid use of the C++ keywords "bitand" and "bitor" in the header file utils/varbit.h. Note the functions' SQL-level names are not changed, only their C-level names. In passing, make some comments in varbit.c conform to project-standard layout.	2010-12-27 14:57:41 -05:00
Tom Lane	8c61f81b31	Rearrange cpluspluscheck to check just one .h file at a time. This is slower than the original coding but avoids the problem of including files in an unpredictable order. Aside from being more trustworthy, we can get rid of some exclusions that were formerly made for what turn out to be ordering or re-inclusion problems. I also modified it to include libpq's exported files in the check. ecpg should be included as well, but I'm unclear on which ecpg .h files are meant to be included by clients.	2010-12-27 12:51:44 -05:00
Tom Lane	37b61a69f3	Fix failure of executor/hashjoin.h to compile standalone. Noted while experimenting with cpluspluscheck.	2010-12-27 12:20:09 -05:00
Tom Lane	a977db6f1c	Tweak cpluspluscheck to avoid directly #include'ing gram.h. gram.h has ordering dependencies, which are satisfied when it's included from gramparse.h, but might not be if it's pulled in directly.	2010-12-27 11:36:52 -05:00
Tom Lane	275411912d	Fix ill-chosen use of "private" as an argument and struct field name. "private" is a keyword in C++, so this breaks the poorly-enforced policy that header files should be include-able in C++ code. Per report from Craig Ringer and some investigation with cpluspluscheck.	2010-12-27 11:26:19 -05:00
Robert Haas	63676ebff4	Corrections to patch adding SQL/MED error codes. My previous commit, `85cff3ce7f` on 2010-12-25, failed to update errcodes.sgml or plerrcodes.h. This patch corrects that oversight, per a gripe from Tom Lane, and also corrects a typographical error.	2010-12-26 21:35:25 -05:00
Andrew Dunstan	a534728afb	Only build in crashdump support on Windows if there's a working dbghelp.h.	2010-12-26 10:34:47 -05:00
Robert Haas	85cff3ce7f	Add foreign data wrapper error code values for SQL/MED. Extracted from a much larger patch by Shigeru Hanada.	2010-12-25 13:57:39 -05:00
Andrew Dunstan	04ee0db6b2	Allow vpath builds and regression tests to succeed on Mingw. Backpatch to release 8.4 - earlier releases would require more changes and it's not worth the trouble.	2010-12-24 13:31:28 -05:00
Bruce Momjian	5000472112	Remove quotes from boolean recovery.conf.sample parameters, now that the quotes are not required. This now matches postgresql.conf's specification of booleans.	2010-12-24 11:51:51 -05:00
Bruce Momjian	075354ad1b	Improve "pg_ctl -w start" server detection by writing the postmaster port and socket directory into postmaster.pid, and have pg_ctl read from that file, for use by PQping().	2010-12-24 09:45:52 -05:00
Michael Meskes	727a5a1620	Added rule to ecpg lexer to accept "Unicode surrogate pair in extended quoted string". This is not really needed because the string gets copied to the output untranslated anyway, but by adding this rule the lexer stays in sync with the backend lexer.	2010-12-23 20:37:42 +01:00
Heikki Linnakangas	9de3aa65f0	Rewrite the GiST insertion logic so that we don't need the post-recovery cleanup stage to finish incomplete inserts or splits anymore. There were two reasons for the cleanup step: 1. When a new tuple was inserted to a leaf page, the downlink in the parent needed to be updated to contain (ie. to be consistent with) the new key. Updating the parent in turn might require recursively updating the parent of the parent. We now handle that by updating the parent while traversing down the tree, so that when we insert the leaf tuple, all the parents are already consistent with the new key, and the tree is consistent at every step. 2. When a page is split, we need to insert the downlink for the new right page(s), and update the downlink for the original page to not include keys that moved to the right page(s). We now handle that by setting a new flag, F_FOLLOW_RIGHT, on the non-rightmost pages in the split. When that flag is set, scans always follow the rightlink, regardless of the NSN mechanism used to detect concurrent page splits. That way the tree is consistent right after split, even though the downlink is still missing. This is very similar to the way B-tree splits are handled. When the downlink is inserted in the parent, the flag is cleared. To keep the insertion algorithm simple, when an insertion sees an incomplete split, indicated by the F_FOLLOW_RIGHT flag, it finishes the split before doing anything else. These changes allow removing the whole "invalid tuple" mechanism, but I retained the scan code to still follow invalid tuples correctly. While we don't create any such tuples anymore, we want to handle them gracefully in case you pg_upgrade a GiST index that has them. If we encounter any on an insert, though, we just throw an error saying that you need to REINDEX.
The issue that got me into doing this is that if you did a checkpoint while an insert or split was in progress, and the checkpoint finishes quickly so that there is no WAL record related to the insert between RedoRecPtr and the checkpoint record, recovery from that checkpoint would not know to finish the incomplete insert. IOW, we have the same issue we solved with the rm_safe_restartpoint mechanism during normal operation too. It's highly unlikely to happen in practice, and this fix is far too large to backpatch, so we're just going to live with it in previous versions, but this refactoring fixes it going forward. With this patch, you don't get the annoying 'index "FOO" needs VACUUM or REINDEX to finish crash recovery' notices anymore if you crash at an unfortunate moment.	2010-12-23 16:21:47 +02:00
Magnus Hagander	de9a4c27fe	Add PQlibVersion() function to libpq. This function is like the PQserverVersion() function except it returns the version of libpq, making it possible for a client program or driver to determine which version of libpq is in use at runtime, and not just at link time. Suggested by Harald Armin Massa and several others.	2010-12-22 14:23:56 +01:00
Robert Haas	32ba2b5160	Use memcmp() rather than strncmp() when shorter string length is known. It appears that this will be faster for all but the shortest strings; at least on some platforms, memcmp() can use word-at-a-time comparisons. Noah Misch, somewhat pared down.	2010-12-21 22:11:40 -05:00
Robert Haas	c5160b7eec	Fix typos. Andreas Karlsson	2010-12-21 17:58:53 -05:00
Robert Haas	24ecde7742	Work around unfortunate getppid() behavior on BSD-ish systems. On MacOS X, and apparently also on other BSD-derived systems, attaching a debugger causes getppid() to return the pid of the debugging process rather than the actual parent PID. As a result, debugging the autovacuum launcher, startup process, or WAL sender on such systems causes it to exit, because the previous coding of PostmasterIsAlive() detects postmaster death by testing whether getppid() == PostmasterPid. Work around that behavior by checking the return value of getppid() more carefully. If it's PostmasterPid, the postmaster must be alive; if it's 1, assume the postmaster is dead. If it's any other value, assume we've been debugged and fall through to the less-reliable kill() test. Review by Tom Lane.	2010-12-21 06:30:32 -05:00
Robert Haas	f6a0863e3c	Allow transactions that don't write WAL to commit asynchronously. This case can arise if a transaction has written data, but only to temporary tables. Loss of the commit record in case of a crash won't matter, because the temporary tables will be lost anyway. Reviewed by Heikki Linnakangas and Simon Riggs.	2010-12-20 12:59:33 -05:00
Magnus Hagander	d382828f6e	Remove thread dumping constant that requires newer Platform SDK. Since we're not multithreaded it only provides marginally useful information, and it does require a newer version of the Platform SDK than we target. We may want to reconsider this in the future along with a fix for MinGW.	2010-12-19 21:32:58 +01:00
Tom Lane	1b19e2c0ba	Fix up handling of simple-form CASE with constant test expression. eval_const_expressions() can replace CaseTestExprs with constants when the surrounding CASE's test expression is a constant. This confuses ruleutils.c's heuristic for deparsing simple-form CASEs, leading to Assert failures or "unexpected CASE WHEN clause" errors. I had put in a hack solution for that years ago (see commit `514ce7a331` of 2006-10-01), but bug #5794 from Peter Speck shows that that solution failed to cover all cases. Fortunately, there's a much better way, which came to me upon reflecting that Peter's "CASE TRUE WHEN" seemed pretty redundant: we can "simplify" the simple-form CASE to the general form of CASE, by simply omitting the constant test expression from the rebuilt CASE construct. This is intuitively valid because there is no need for the executor to evaluate the test expression at runtime; it will never be referenced, because any CaseTestExprs that would have referenced it are now replaced by constants. This won't save a whole lot of cycles, since evaluating a Const is pretty cheap, but a cycle saved is a cycle earned. In any case it beats kluging ruleutils.c still further. So this patch improves const-simplification and reverts the previous change in ruleutils.c. Back-patch to all supported branches. The bug exists in 8.1 too, but it's out of warranty.	2010-12-19 15:30:44 -05:00
Tom Lane	abc1026269	Fix erroneous parsing of tsquery input "... & !(subexpression) | ..." After parsing a parenthesized subexpression, we must pop all pending ANDs and NOTs off the stack, just like the case for a simple operand. Per bug #5793. Also fix clones of this routine in contrib/intarray and contrib/ltree, where input of types query_int and ltxtquery had the same problem. Back-patch to all supported versions.	2010-12-19 12:48:34 -05:00
Magnus Hagander	dcb09b595f	Support for collecting crash dumps on Windows. Add support for collecting "minidump" style crash dumps on Windows, by setting up an exception handling filter. Crash dumps will be generated in PGDATA/crashdumps if the directory is created (the existence of the directory is used as on/off switch for the generation of the dumps). Craig Ringer and Magnus Hagander	2010-12-19 16:45:28 +01:00
Bruce Momjian	7e95337d58	Properly print the IP number and "localhost" for failed localhost connections when the server is down, on Win32.	2010-12-18 11:26:17 -05:00
Magnus Hagander	4754dbf4c3	Make GUC variables for syslog and SSL always visible. Make the variables visible (but not used) even when support is not compiled in.	2010-12-18 16:53:59 +01:00
Alvaro Herrera	3026027ec3	set_ps_display when calling functions via fastpath. This improves tag output by log_line_prefix.	2010-12-17 18:51:22 -03:00
Alvaro Herrera	b68193c0c7	Remove unnecessary definition for autovacuum in SignalSomeChildren.	2010-12-17 15:59:19 -03:00
Robert Haas	8bd4b89e24	Try to save a kernel call in ResolveRecoveryConflictWithVirtualXIDs. If there's no work to be done, just exit quickly, before initialization.	2010-12-17 11:32:02 -05:00
Robert Haas	611fed3712	Reset 'ps' display just once when resolving VXID conflicts. This prevents the word "waiting" from briefly disappearing from the ps status line when ResolveRecoveryConflictWithVirtualXIDs begins a new iteration of the outer loop. Along the way, remove some useless pgstat_report_waiting() calls; the startup process doesn't appear in pg_stat_activity. Fujii Masao	2010-12-17 08:30:57 -05:00
Tom Lane	14ed7735f5	Improve comments around startup_hacks() code. These comments were not updated when we added the EXEC_BACKEND mechanism for Windows, even though it rendered them inaccurate. Also unify two unnecessarily-separate #ifdef __alpha code blocks.	2010-12-16 17:57:57 -05:00
Tom Lane	61b53695fb	Remove optreset from src/port/ implementations of getopt and getopt_long. We don't actually need optreset, because we can easily fix the code to ensure that it's cleanly restartable after having completed a scan over the argv array; which is the only case we need to restart in. Getting rid of it avoids a class of interactions with the system libraries and allows reversion of my change of yesterday in postmaster.c and postgres.c. Back-patch to 8.4. Before that the getopt code was a bit different anyway.	2010-12-16 16:23:05 -05:00
Alvaro Herrera	cd1fefa973	Avoid clobbering errno, per comment from Tom.	2010-12-16 17:15:37 -03:00
Alvaro Herrera	83c759ea0e	Fix inconsequential FILE pointer leakage	2010-12-16 16:45:11 -03:00
Alvaro Herrera	e359b8496d	Add some minor missing error checks	2010-12-16 12:23:07 -03:00
Alvaro Herrera	16ca75baeb	Simplify SignalSomeChildren(BACKEND_TYPE_ALL) to SignalChildren()	2010-12-16 12:23:07 -03:00
Bruce Momjian	48da2b87e3	Fix crash caused by NULL lookup when reporting IP address of failed libpq connection, per report from Magnus. This happens only on GIT master and only on Win32 because that is the platform where "" maps to an IP address (localhost).	2010-12-16 10:13:43 -05:00
Tom Lane	5cdd65f324	Fix up getopt() reset management so it works on recent mingw. The mingw people don't appear to care about compatibility with non-GNU versions of getopt, so force use of our own copy of getopt on Windows. Also, ensure that we make use of optreset when using our own copy. Per report from Andrew Dunstan. Back-patch to all versions supported on Windows.	2010-12-15 23:50:41 -05:00
Robert Haas	290f1603b4	Some copy editing of pg_read_binary_file() patch.	2010-12-15 21:02:31 -05:00
Itagaki Takahiro	03db44eae3	Add pg_read_binary_file() and whole-file-at-once versions of pg_read_file(). One of the usages of the binary version is to read files in a different encoding from the server encoding. Dimitri Fontaine and Itagaki Takahiro.	2010-12-16 06:56:28 +09:00
Robert Haas	34c70c7ac4	Instrument checkpoint sync calls. Greg Smith, reviewed by Jeff Janes	2010-12-14 09:26:19 -05:00
Robert Haas	9878e295dc	Improved tab completion for views with triggers. Allow INSERT INTO, UPDATE, and DELETE FROM to be completed with either the name of a table (as before) or the name of a view with an appropriate INSTEAD OF rule. Along the way, allow CREATE TRIGGER to be completed with INSTEAD OF, as well as BEFORE and AFTER. David Fetter, reviewed by Itagaki Takahiro	2010-12-13 22:46:55 -05:00
Robert Haas	d368e1a2a7	Allow plugins to suppress inlining and hook function entry/exit/abort. This is intended as infrastructure to allow an eventual SE-Linux plugin to support trusted procedures. KaiGai Kohei	2010-12-13 19:15:53 -05:00
Tom Lane	f5e4f743e6	Update time zone data files to tzdata release 2010o: DST law changes in Fiji and Samoa. Historical corrections for Hong Kong.	2010-12-13 12:45:31 -05:00
Robert Haas	5f7b58fad8	Generalize concept of temporary relations to "relation persistence". This commit replaces pg_class.relistemp with pg_class.relpersistence; and also modifies the RangeVar node type to carry relpersistence rather than istemp. It also removes rd_istemp from RelationData and instead performs the correct computation based on relpersistence. For clarity, we add three new macros: RelationNeedsWAL(), RelationUsesLocalBuffers(), and RelationUsesTempNamespace(), so that we can clarify the purpose of each check that previously depended on rd_istemp. This is intended as infrastructure for the upcoming unlogged tables patch, as well as for future possible work on global temporary tables.	2010-12-13 12:34:26 -05:00
Tom Lane	0c90442355	Reset all database-level stats in pgstat_recv_resetcounter(). We were failing to zero out some pg_stat_database counters that have been added since the initial pgstats coding. This is a bug, but not back-patching the fix since changing this behavior in a minor release seems a cure worse than the disease. Report and patch by Tomas Vondra.	2010-12-12 15:09:53 -05:00
Tom Lane	5132ad8bdf	Make S_IRGRP etc available in mingw builds as well as MSVC. (Hm, I wonder whether BCC defines them either...) Also label dangling endifs a bit better in this area.	2010-12-12 13:43:44 -05:00
Tom Lane	1319002e2e	Provide a complete set of file-permission-bit macros in win32.h. My previous patch exposed the fact that we didn't have these. Those hard-wired octal constants were actually wrong on Windows, not just inconsistent.	2010-12-11 13:11:18 -05:00
Robert Haas	d3d414696f	Allow bidirectional copy messages in streaming replication mode. Fujii Masao. Review by Alvaro Herrera, Tom Lane, and myself.	2010-12-11 09:27:37 -05:00
Magnus Hagander	20f3964291	Add required new port files to MSVC builds.	2010-12-11 14:19:08 +01:00
Tom Lane	671199929d	Move a couple of initdb's subroutines into src/port/. mkdir_p and check_data_dir will be useful in CREATE TABLESPACE, since we have agreed that that command should handle subdirectory creation just like initdb creates the PGDATA directory. Push them into src/port/ so that they are available to both initdb and the backend. Rename to pg_mkdir_p and pg_check_dir, just to be on the safe side. Add FreeBSD's copyright notice to pgmkdirp.c, since that's where the code came from originally (this really should have been in initdb.c). Very marginal code/comment cleanup.	2010-12-10 19:42:44 -05:00
Tom Lane	04f4e10cfc	Use symbolic names not octal constants for file permission flags. Purely cosmetic patch to make our coding standards more consistent --- we were doing symbolic some places and octal other places. This patch fixes all C-coded uses of mkdir, chmod, and umask. There might be some other calls I missed. Inconsistency noted while researching tablespace directory permissions issue.	2010-12-10 17:35:33 -05:00
Tom Lane	244407a710	Fix efficiency problems in tuplestore_trim(). The original coding in tuplestore_trim() was only meant to work efficiently in cases where each trim call deleted most of the tuples in the store. Which, in fact, was the pattern of the original usage with a Material node supporting mark/restore operations underneath a MergeJoin. However, WindowAgg now uses tuplestores and it has considerably less friendly trimming behavior. In particular it can attempt to trim one tuple at a time off a large tuplestore. tuplestore_trim() had O(N^2) runtime in this situation because of repeatedly shifting its tuple pointer array. Fix by avoiding shifting the array until a reasonably large number of tuples have been deleted. This can waste some pointer space, but we do still reclaim the tuples themselves, so the percentage wastage should be pretty small. Per Jie Li's report of slow percent_rank() evaluation. cume_dist() and ntile() would certainly be affected as well, along with any other window function that has a moving frame start and requires reading substantially ahead of the current row. Back-patch to 8.4, where window functions were introduced. There's no need to tweak it before that.	2010-12-10 11:33:38 -05:00
Tom Lane	663fc32e26	Eliminate O(N^2) behavior in parallel restore with many blobs. With hundreds of thousands of TOC entries, the repeated searches in reduce_dependencies() become the dominant cost. Get rid of that searching by constructing reverse-dependency lists, which we can do in O(N) time during the fix_dependencies() preprocessing. I chose to store the reverse dependencies as DumpId arrays for consistency with the forward-dependency representation, and keep the previously-transient tocsByDumpId[] array around to locate actual TOC entry structs quickly from dump IDs. While this fixes the slow case reported by Vlad Arkhipov, there is still a potential for O(N^2) behavior with sufficiently many tables: fix_dependencies itself, as well as mark_create_done and inhibit_data_for_failed_table, are doing repeated searches to deal with table-to-table-data dependencies. Possibly this work could be extended to deal with that, although the latter two functions are also used in non-parallel restore where we currently don't run fix_dependencies. Another TODO is that we fail to parallelize restore of multiple blobs at all. This appears to require changes in the archive format to fix. Back-patch to 9.0 where the problem was reported. 8.4 has potential issues as well; but since it doesn't create a separate TOC entry for each blob, it's at much less risk of having enough TOC entries to cause real problems.	2010-12-09 13:03:11 -05:00
Simon Riggs	9975c683b1	Self review of previous patch. Fix assumption that xmax >= xmin.	2010-12-09 10:20:49 +00:00
Simon Riggs	b9075a6d2f	Reduce spurious Hot Standby conflicts from never-visible records. Hot Standby conflicts only with tuples that were visible at some point. So ignore tuples from aborted transactions or for tuples updated/deleted during the inserting transaction when generating the conflict transaction ids. Following detailed analysis and test case by Noah Misch. Original report covered btree delete records, correctly observed by Heikki Linnakangas that this applies to other cases also. Fix covers all sources of cleanup records via common code.	2010-12-09 09:41:47 +00:00
Tom Lane	576477e73c	Force default wal_sync_method to be fdatasync on Linux. Recent versions of the Linux system header files cause xlogdefs.h to believe that open_datasync should be the default sync method, whereas formerly fdatasync was the default on Linux. open_datasync is a bad choice, first because it doesn't actually outperform fdatasync (in fact the reverse), and second because we try to use O_DIRECT with it, causing failures on certain filesystems (e.g., ext4 with data=journal option). This part of the patch is largely per a proposal from Marti Raudsepp. More extensive changes are likely to follow in HEAD, but this is as much change as we want to back-patch. Also clean up confusing code and incorrect documentation surrounding the fsync_writethrough option. Those changes shouldn't result in any actual behavioral change, but I chose to back-patch them anyway to keep the branches looking similar in this area. In 9.0 and HEAD, also do some copy-editing on the WAL Reliability documentation section. Back-patch to all supported branches, since any of them might get used on modern Linux versions.	2010-12-08 20:01:09 -05:00
Simon Riggs	e620ee35b2	Optimize commit_siblings in two ways to improve group commit. First, avoid scanning the whole ProcArray once we know there are at least commit_siblings active; second, skip the check altogether if commit_siblings = 0. Greg Smith	2010-12-08 18:48:03 +00:00
Heikki Linnakangas	5a031a5556	Fix bugs in the hot standby known-assigned-xids tracking logic. If there's an old transaction running in the master, and a lot of transactions have started and finished since, and a WAL-record is written in the gap between creating the running-xacts snapshot and WAL-logging it, recovery will fail with "too many KnownAssignedXids" error. This bug was reported by Joachim Wieland on Nov 19th. In the same scenario, when fewer transactions have started so that all the xids fit in KnownAssignedXids despite the first bug, a more serious bug arises. We incorrectly initialize the clog code with the oldest still running transaction, and when we see the WAL record belonging to a transaction with an XID larger than one that committed already before the checkpoint we're recovering from, we zero the clog page containing the already committed transaction, leading to data loss. In hindsight, trying to track xids in the known-assigned-xids array before seeing the running-xacts record was too complicated. To fix that, hold XidGenLock while the running-xacts snapshot is taken and WAL-logged. That ensures that no transaction can begin or end in that gap, so that in recovery we know that the snapshot contains all transactions running at that point in WAL.	2010-12-07 09:23:30 +01:00
Tom Lane	8b56928097	Add a stack overflow check to copyObject(). There are some code paths, such as SPI_execute(), where we invoke copyObject() on raw parse trees before doing parse analysis on them. Since the bison grammar is capable of building heavily nested parsetrees while itself using only minimal stack depth, this means that copyObject() can be the front-line function that hits stack overflow before anything else does. Accordingly, it had better have a check_stack_depth() call. I did a bit of performance testing and found that this slows down copyObject() by only a few percent, so the hit ought to be negligible in the context of complete processing of a query. Per off-list report from Toshihide Katayama. Back-patch to all supported branches.	2010-12-06 22:55:43 -05:00
Andrew Dunstan	af1a614ec6	Allow the low level COPY routines to read arbitrary numbers of fields. This doesn't involve any user-visible change in behavior, but will be useful when the COPY routines are exposed to allow their use by Foreign Data Wrapper routines, which will be able to use these routines to read irregular CSV files, for example.	2010-12-06 15:31:55 -05:00
Heikki Linnakangas	95e42a2c29	Fix two typos, by Fujii Masao.	2010-12-06 12:38:05 +01:00
Peter Eisentraut	951d786121	Put only single space after "Sort Method:", for consistency	2010-12-06 13:35:47 +02:00
Tom Lane	d1001a78ce	Reduce memory consumption inside inheritance_planner(). Avoid eating quite so much memory for large inheritance trees, by reclaiming the space used by temporary copies of the original parsetree and range table, as well as the workspace needed during planning. The cost is needing to copy the finished plan trees out of the child memory context. Although this looks like it ought to slow things down, my testing shows it actually is faster, apparently because fewer interactions with malloc() are needed and/or we can do the work within a more readily cacheable amount of memory. That result might be platform-dependent, but I'll take it. Per a gripe from John Papandriopoulos, in which it was pointed out that the memory consumption actually grew as O(N^2) for sufficiently many child tables, since we were creating N copies of the N-element range table.	2010-12-05 15:10:28 -05:00
Tom Lane	d1f5a92e18	Fix two small bugs in new gistget.c logic. 1. Complain, rather than silently doing nothing, if an "invalid" tuple is found on a leaf page. Per off-list discussion with Heikki. 2. Fix oversight in code that removes a GISTSearchItem from the search queue: we have to reset lastHeap if this was the last heap item in the parent GISTSearchTreeItem. Otherwise subsequent additions will do the wrong thing. This was probably masked in early testing because in typical cases the parent item would now be completely empty and would be deleted on next call. You'd need a queued non-leaf page at exactly the same distance as a heap tuple to expose the bug.	2010-12-04 13:47:08 -05:00
Peter Eisentraut	387e468b82	Make output width consistent for all ways of invoking a regression test. run_schedule() and run_single_test() were using different output widths, which would show up in bigcheck/bigtest, for example.	2010-12-04 17:34:48 +02:00
Tom Lane	e194a942f9	Update comment to match later code changes.	2010-12-04 03:21:49 -05:00
Tom Lane	b576757d7e	Add external documentation for KNNGIST.	2010-12-03 23:49:06 -05:00
Tom Lane	04910a3ad5	Put back gistgettuple's check for backwards scan request. On reflection it's a bad idea for the KNNGIST patch to have removed that. We don't want it silently returning incorrect answers.	2010-12-03 22:43:01 -05:00
Tom Lane	554506871b	KNNGIST, otherwise known as order-by-operator support for GIST. This commit represents a rather heavily editorialized version of Teodor's builtin_knngist_itself-0.8.2 and builtin_knngist_proc-0.8.1 patches. I redid the opclass API to add a separate Distance method instead of turning the Consistent method into an illogical mess, fixed some bit-rot in the rbtree interfaces, and generally worked over the code style and comments. There's still no non-code documentation to speak of, but I'll work on that separately. Some contrib-module changes are also yet to come (right now, point <-> point is the only KNN-ified operator). Teodor Sigaev and Tom Lane	2010-12-03 20:53:29 -05:00
Robert Haas	5ef6c91383	Remove now-outdated mention of quotes being required in recovery.conf. Noted by Itagaki Takahiro.	2010-12-03 09:00:18 -05:00
Robert Haas	970a18687f	Use GUC lexer for recovery.conf parsing. This eliminates some crufty, special-purpose code and, as a non-trivial side benefit, allows recovery.conf parameters to be unquoted. Dimitri Fontaine, with review and cleanup by Alvaro Herrera, Itagaki Takahiro, and me.	2010-12-03 08:56:44 -05:00
Heikki Linnakangas	9cea52a5a3	Remove misleading comments. Move _Clone and _DeClone functions before the "END OF FORMAT CALLBACKS" comment, because they are format callbacks too.	2010-12-03 14:58:24 +02:00
Itagaki Takahiro	fd223c7407	Remove unnecessary string null-termination in pg_convert. We can directly verify the unterminated input with pg_verify_mbstr_len.	2010-12-03 12:00:27 +09:00
Tom Lane	d583f10b7e	Create core infrastructure for KNNGIST. This is a heavily revised version of builtin_knngist_core-0.9. The ordering operators are no longer mixed in with actual quals, which would have confused not only humans but significant parts of the planner. Instead, ordering operators are carried separately throughout planning and execution. Since the API for ambeginscan and amrescan functions had to be changed anyway, this commit takes the opportunity to rationalize that a bit. RelationGetIndexScan no longer forces a premature index_rescan call; instead, callers of index_beginscan must call index_rescan too. Aside from making the AM-side initialization logic a bit less peculiar, this has the advantage that we do not make a useless extra am_rescan call when there are runtime key values. AMs formerly could not assume that the key values passed to amrescan were actually valid; now they can. Teodor Sigaev and Tom Lane	2010-12-02 20:51:37 -05:00
Alvaro Herrera	d7e5d151da	Move private struct declaration to compress_io.c. Keep only the typedef in the header file.	2010-12-02 17:45:13 -03:00
Alvaro Herrera	0025b76f4f	Remove trailing whitespace	2010-12-02 17:45:13 -03:00
Alvaro Herrera	d67a39c326	Remove useless struct declaration	2010-12-02 17:45:12 -03:00
Alvaro Herrera	7f4a7af2fd	Silence compiler	2010-12-02 17:45:12 -03:00
Heikki Linnakangas	bf9aa490db	Refactor the pg_dump zlib code from pg_backup_custom.c to a separate file, to make it easier to reuse that code. There are no user-visible changes. This is in preparation for the patch to add a new archive format, a directory, to perform a custom-like dump but with each table being dumped to a separate file (that in turn is a prerequisite for parallel pg_dump). This also makes it easier to add new compression methods in the future, and makes the pg_backup_custom.c code easier to read, when the compression-related code is factored out. Joachim Wieland, with heavy editorialization by me.	2010-12-02 21:39:03 +02:00
Tom Lane	225f0aa3df	Prevent inlining a SQL function with multiple OUT parameters. There were corner cases in which the planner would attempt to inline such a function, which would result in a failure at runtime due to loss of information about exactly what the result record type is. Fix by disabling inlining when the function's recorded result type is RECORD. There might be some sub-cases where inlining could still be allowed, but this is a simple and backpatchable fix, so leave refinements for another day. Per bug #5777 from Nate Carson. Back-patch to all supported branches. 8.1 happens to avoid a core-dump here, but it still does the wrong thing.	2010-12-01 00:53:18 -05:00
Tom Lane	c0b5fac701	Simplify and speed up mapping of index opfamilies to pathkeys. Formerly we looked up the operators associated with each index (caching them in relcache) and then the planner looked up the btree opfamily containing such operators in order to build the btree-centric pathkey representation that describes the index's sort order. This is quite pointless for btree indexes: we might as well just use the index's opfamily information directly. That saves syscache lookup cycles during planning, and furthermore allows us to eliminate the relcache's caching of operators altogether, which may help in reducing backend startup time. I added code to plancat.c to perform the same type of double lookup on-the-fly if it's ever faced with a non-btree amcanorder index AM. If such a thing actually becomes interesting for production, we should replace that logic with some more-direct method for identifying the corresponding btree opfamily; but it's not worth spending effort on now. There is considerably more to do pursuant to my recent proposal to get rid of sort-operator-based representations of sort orderings, but this patch grabs some of the low-hanging fruit. I'll look at the remainder of that work after the current commitfest.	2010-11-29 12:30:43 -05:00
Simon Riggs	ed78384acd	Move call to GetTopTransactionId() earlier in LockAcquire(), removing an infrequently occurring race condition in Hot Standby. An xid must be assigned before a lock appears in shared memory, rather than immediately after, else GetRunningTransactionLocks() may see InvalidTransactionId, causing assertion failures during lock processing on standby. Bug report and diagnosis by Fujii Masao, fix by me.	2010-11-29 01:08:02 +00:00
Bruce Momjian	1f48290a9d	In libpq/Makefile, use OBJS += as a way to break up long link lines into something that can be documented.	2010-11-27 11:03:23 -05:00
Tom Lane	49cd8a3f81	On further testing, PQping also needs an explicit check for AUTH_REQ. The pg_fe_sendauth code might fail if it can't handle the authentication request message type --- if so, ping should still say the server is up.	2010-11-27 02:11:45 -05:00
Tom Lane	db96e1ccfc	Rewrite PQping to be more like what we agreed to last week. Basically, we want to distinguish all cases where the connection was not made from those where it was. A convenient proxy for this is to see if we got a message with a SQLSTATE code back from the postmaster. This presumes that the postmaster will always send us a SQLSTATE in a failure message, which is true for 7.4 and later postmasters in every case except fork failure. (We could possibly complicate the postmaster code to do something about that, but it seems not worth the trouble, especially since pg_ctl's response for that case should be to keep waiting anyway.) If we did get a SQLSTATE from the postmaster, there are basically only two cases, as per last week's discussion: ERRCODE_CANNOT_CONNECT_NOW and everything else. Any other error code implies that the postmaster is in principle willing to accept connections, it just didn't like or couldn't handle this particular request. We want to make a special case for ERRCODE_CANNOT_CONNECT_NOW so that "pg_ctl start -w" knows it should keep waiting. In passing, pick names for the enum constants that are a tad less likely to present collision hazards in future.	2010-11-27 01:30:34 -05:00
Tom Lane	be3b666eb8	Clean up IPv4 vs IPv6 bogosity in connectFailureMessage(). Newly added code was supposing that "struct sockaddr_in" applies to IPv6.	2010-11-26 19:16:39 -05:00
Tom Lane	3840bc0847	Fix portability issues in new src/port/inet_net_ntop.c file. 1. Don't #include postgres.h in a frontend build. 2. Don't assume that the backend's symbol PGSQL_AF_INET6 has anything to do with the constant that will be used by system library functions (because, in point of fact, it usually doesn't). Fortunately, PGSQL_AF_INET is equal to AF_INET, so we can just cater for both sets of values in one case construct without fear of conflict.	2010-11-26 18:00:26 -05:00
Robert Haas	55109313f9	Add more ALTER <object> .. SET SCHEMA commands. This adds support for changing the schema of a conversion, operator, operator class, operator family, text search configuration, text search dictionary, text search parser, or text search template. Dimitri Fontaine, with assorted corrections and other kibitzing.	2010-11-26 17:31:54 -05:00
Tom Lane	1d9a0abec1	Remove bogus use of PGDLLIMPORT. That macro should be attached to extern declarations, not actual definitions of variables.	2010-11-26 17:05:29 -05:00
Bruce Momjian	e6e38b4ac2	Add inet_net_ntop.c as needed by MSVC, per Magnus.	2010-11-26 14:39:13 -05:00
Bruce Momjian	f2eba413db	Use conn->raddr consistently for non-connect libpq error reporting.	2010-11-26 13:26:13 -05:00
Bruce Momjian	bad8277f13	Update comment that says we only report last libpq connection failure, per Peter.	2010-11-26 11:52:03 -05:00
Bruce Momjian	ed51bd4968	Use only addr_cur when reporting connection failures in libpq.	2010-11-26 11:49:35 -05:00
Bruce Momjian	4f6deef2fb	Abandon use of Makefile variables in libpq/Makefile because MSVC scrapes the OBJS lines from that file. Cleanup where possible.	2010-11-26 11:10:26 -05:00
Bruce Momjian	a9b02ec654	In libpq/Makefile, merge PERM_PGPORT and OPT_PGPORT into a single Makefile variable PGPORT, for clarity.	2010-11-26 10:22:09 -05:00
Bruce Momjian	5f4b3d750b	Improve pg_ctl "cannot connect" spacing, per Tom, and wording.	2010-11-26 10:04:18 -05:00
Bruce Momjian	4646e0cef7	Improve pg_ctl "cannot connect" warning, per suggestion from Magnus.	2010-11-25 14:38:20 -05:00
Bruce Momjian	742ac738c3	For libpq/Makefile OPT_PGPORT, remove .o extension after we test configure's LIBOBJS. Should fix buildfarm failures.	2010-11-25 13:19:31 -05:00
Bruce Momjian	afd7d9adca	Add PQping and PQpingParams to libpq to allow detection of the server's status, including a status where the server is running but refuses a postgres connection. Have pg_ctl use this new function. This fixes the case where pg_ctl reports that the server is not running (cannot connect) but in fact it is running.	2010-11-25 13:09:38 -05:00
Bruce Momjian	212a1c7b0b	Fix getaddrinfo() in pgport to use proper parameters, as detected by Win32 buildfarm members.	2010-11-25 12:56:59 -05:00
Bruce Momjian	c6978ecd6f	Restructure how libpq includes external C files, for clarity.	2010-11-25 12:51:40 -05:00
Robert Haas	cc1ed40d57	Object access hook framework, with post-creation hook. After a SQL object is created, we provide an opportunity for security or logging plugins to get control; for example, a security label provider could use this to assign an initial security label to newly created objects. The basic infrastructure is (hopefully) reusable for other types of events that might require similar treatment. KaiGai Kohei, with minor adjustments.	2010-11-25 11:50:13 -05:00
Robert Haas	2d1e426650	Add inet_net_ntop.c to .gitignore.	2010-11-25 00:12:25 -05:00
Robert Haas	c2281ac87c	Remove belt-and-suspenders guards against buffer pin leaks. Forcibly releasing all leftover buffer pins should be unnecessary now that we have a robust ResourceOwner mechanism, and it significantly increases the cost of process shutdown. Instead, in an assert-enabled build, assert that no pins are held; in a non-assert-enabled build, do nothing.	2010-11-25 00:06:46 -05:00
Bruce Momjian	58dfb07b5d	Properly add new inet_net_ntop file to libpq Makefile.	2010-11-24 21:58:47 -05:00
Bruce Momjian	ba11258ccb	When reporting the server as not responding, if the hostname was supplied, also print the IP address. This allows IPv4 and IPv6 failures to be distinguished. Also useful when a hostname resolves to multiple IP addresses. Also, remove use of inet_ntoa() and use our own inet_net_ntop() in all places, including in libpq, because it is thread-safe.	2010-11-24 17:04:19 -05:00
Tom Lane	725d52d0c2	Create the system catalog infrastructure needed for KNNGIST. This commit adds columns amoppurpose and amopsortfamily to pg_amop, and column amcanorderbyop to pg_am. For the moment all the entries in amcanorderbyop are "false", since the underlying support isn't there yet. Also, extend the CREATE OPERATOR CLASS/ALTER OPERATOR FAMILY commands with [ FOR SEARCH \| FOR ORDER BY sort_operator_family ] clauses to allow the new columns of pg_amop to be populated, and create pg_dump support for dumping that information. I also added some documentation, although it's perhaps a bit premature given that the feature doesn't do anything useful yet. Teodor Sigaev, Robert Haas, Tom Lane	2010-11-24 14:22:17 -05:00
Peter Eisentraut	f2a4278330	Propagate ALTER TYPE operations to typed tables. This adds RESTRICT/CASCADE flags to ALTER TYPE ... ADD/DROP/ALTER/RENAME ATTRIBUTE to control whether to alter typed tables as well.	2010-11-23 22:50:17 +02:00
Peter Eisentraut	fc946c39ae	Remove useless whitespace at end of lines	2010-11-23 22:34:55 +02:00
Robert Haas	44475e782f	Centralize some ALTER <whatever> .. SET SCHEMA checks. Any flavor of ALTER <whatever> .. SET SCHEMA fails if (1) the object is already in the new schema, (2) either the old or new schema is a temp schema, or (3) either the old or new schema is the TOAST schema. Extracted from a patch by Dimitri Fontaine, with additional hacking by me.	2010-11-22 19:53:34 -05:00
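
Note on commit 663fc32e26 above: its message describes replacing repeated searches over all TOC entries with reverse-dependency lists built in a single O(N) pass. The following is only a rough sketch of that general idea in Python, not the pg_restore code itself. The function names and the dictionary-based representation are assumptions made for illustration (the commit itself stores DumpId arrays in C), and every id mentioned as a dependency is assumed to also appear as a key.

```python
from collections import defaultdict


def build_reverse_deps(deps):
    """Map each id to the ids that depend on it (hypothetical helper).

    `deps` maps an entry id to the list of ids it depends on. Each
    dependency edge is visited once, so the cost is linear in the
    total number of edges.
    """
    rev = defaultdict(list)
    for entry, needed in deps.items():
        for dep in needed:
            rev[dep].append(entry)
    return rev


def restore_order(deps):
    """Return an order in which every entry follows its dependencies."""
    rev = build_reverse_deps(deps)
    # Number of not-yet-restored dependencies per entry.
    pending = {entry: len(needed) for entry, needed in deps.items()}
    ready = [entry for entry, count in pending.items() if count == 0]
    order = []
    while ready:
        done = ready.pop()
        order.append(done)
        # Only entries that actually depend on `done` are touched,
        # instead of rescanning every entry after each completion.
        for waiter in rev[done]:
            pending[waiter] -= 1
            if pending[waiter] == 0:
                ready.append(waiter)
    return order
```

Without the reverse lists, each completed entry would require a scan over all remaining entries to find the ones waiting on it, which is where the quadratic cost mentioned in the commit message comes from.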

21956 Commits