postgresql

Commit Graph

Author	SHA1	Message	Date
Tom Lane	fa98a86f65	Tweak the code in a couple of places to try to deliver more user-friendly error messages when a single COPY line is too long for us to handle. Per example from Johann Spies.	2007-05-28 16:43:24 +00:00
Tom Lane	274dfdb513	Tweak clean_encoding_name() API to avoid need to cast away const. Kris Jurka	2007-04-16 18:50:49 +00:00
Tatsuo Ishii	6041b92238	Make JOHAB client only encoding per discussions in pgsql-hackers "Server-side support of all encodings" around 2007/3/26. initdb required.	2007-04-15 10:56:30 +00:00
Tatsuo Ishii	bf47e3e419	Fix description how to create conversion function.	2007-04-15 10:49:26 +00:00
Bruce Momjian	b8f856512e	Fix typo in Makefile. Marko Kreen	2007-03-27 14:29:51 +00:00
Bruce Momjian	9dd3ec6c3b	Remove advertising clause from Berkeley BSD-licensed files, per instructions from Berkeley.	2007-03-26 21:44:11 +00:00
Tatsuo Ishii	a6fbd2f12a	Fix pg_wchar_table's maxmblen field of EUC_CN, EUC_TW, MULE_INTERNAL and GB18030. patches from ITAGAKI Takahiro.	2007-03-26 11:15:13 +00:00
Tatsuo Ishii	75c6519ff6	Add new encoding EUC_JIS_2004 and SHIFT_JIS_2004, along with new conversions among EUC_JIS_2004, SHIFT_JIS_2004 and UTF-8. catalog version has been bump up.	2007-03-25 11:56:04 +00:00
Tatsuo Ishii	4c35ec53a9	Allow 4 bytes UTF-8 (UCS-4 range 00010000-001FFFFF) This is necessary to support JIS X 0213 <--> UTF-8 conversion.	2007-03-23 13:51:30 +00:00
Tom Lane	234a02b2a8	Replace direct assignments to VARATT_SIZEP(x) with SET_VARSIZE(x, len). Get rid of VARATT_SIZE and VARATT_DATA, which were simply redundant with VARSIZE and VARDATA, and as a consequence almost no code was using the longer names. Rename the length fields of struct varlena and various derived structures to catch anyplace that was accessing them directly; and clean up various places so caught. In itself this patch doesn't change any behavior at all, but it is necessary infrastructure if we hope to play any games with the representation of varlena headers. Greg Stark and Tom Lane	2007-02-27 23:48:10 +00:00
Peter Eisentraut	c138b966d4	Replace useless uses of := by = in makefiles.	2007-02-09 15:56:00 +00:00
Tom Lane	0887fa1117	Get pg_utf_mblen(), pg_utf2wchar_with_len(), and utf2ucs() all on the same page about the maximum UTF8 sequence length we support (4 bytes since 8.1, 3 before that). pg_utf2wchar_with_len never got updated to support 4-byte characters at all, and in any case had a buffer-overrun risk in that it could produce multiple pg_wchars from what mblen claims to be just one UTF8 character. The only reason we don't have a major security hole is that most callers allocate worst-case output buffers; the sole exception in released versions appears to be pre-8.2 iwchareq() (ie, ILIKE), which can be crashed due to zeroing out its return address --- but AFAICS that can't be exploited for anything more than a crash, due to inability to control what gets written there. Per report from James Russell and Michael Fuhr. Pre-8.1 the risk is much less, but I still think pg_utf2wchar_with_len's behavior given an incomplete final character risks buffer overrun, so back-patch that logic change anyway. This patch also makes sure that UTF8 sequences exceeding the supported length (whichever it is) are consistently treated as error cases, rather than being treated like a valid shorter sequence in some places.	2007-01-24 17:12:17 +00:00
Peter Eisentraut	2cc01004c6	Remove remains of old depend target.	2007-01-20 17:16:17 +00:00
Bruce Momjian	29dccf5fe0	Update CVS HEAD for 2007 copyright. Back branches are typically not back-stamped for this.	2007-01-05 22:20:05 +00:00
Tom Lane	e9da20ab4d	Fix machine-dependent crash in sqlchar_to_unicode(). Get rid of bletcherous and unsafe manipulation of global encoding setting. Clean up libxml reporting mechanism a bit (it still looks like a dangling-pointer crash waiting to happen, though, not to mention being far less than sane from a localization standpoint).	2006-12-24 00:57:48 +00:00
Peter Eisentraut	8c1de5fb00	Initial SQL/XML support: xml data type and initial set of functions.	2006-12-21 16:05:16 +00:00
Peter Eisentraut	3cd318a8d1	Fix gratuitous message spelling differences	2006-11-27 15:50:55 +00:00
Peter Eisentraut	b9b4f10b5b	Message style improvements	2006-10-06 17:14:01 +00:00
Bruce Momjian	f99a569a2e	pgindent run for 8.2.	2006-10-04 00:30:14 +00:00
Bruce Momjian	a3132359fd	In new "invalid byte sequence" error hint, call it "error", not "failure".	2006-08-22 12:11:28 +00:00
Bruce Momjian	e11cab650c	Add hint for "invalid byte sequence for encoding" error message, suggesting review of client_encoding.	2006-08-22 03:30:20 +00:00
Bruce Momjian	e0522505bd	Remove 576 references of include files that were not needed.	2006-07-14 14:52:27 +00:00
Bruce Momjian	ac230e7431	Alphabetically order reference to include files, "S"-"Z".	2006-07-11 18:26:11 +00:00
Bruce Momjian	3a534ade39	Alphabetically order reference to include files, "G" - "M".	2006-07-11 17:04:13 +00:00
Bruce Momjian	399a36a75d	Prepare code to be built by MSVC: o remove many WIN32_CLIENT_ONLY defines o add WIN32_ONLY_COMPILER define o add 3rd argument to open() for portability o add include/port/win32_msvc directory for system includes Magnus Hagander	2006-06-07 22:24:46 +00:00
Tom Lane	a0ffab351e	Magic blocks don't do us any good unless we use 'em ... so install one in every shared library.	2006-05-30 22:12:16 +00:00
Tom Lane	c61a2f5841	Change the backend to reject strings containing invalidly-encoded multibyte characters in all cases. Formerly we mostly just threw warnings for invalid input, and failed to detect it at all if no encoding conversion was required. The tighter check is needed to defend against SQL-injection attacks as per CVE-2006-2313 (further details will be published after release). Embedded zero (null) bytes will be rejected as well. The checks are applied during input to the backend (receipt from client or COPY IN), so it no longer seems necessary to check in textin() and related routines; any string arriving at those functions will already have been validated. Conversion failure reporting (for characters with no equivalent in the destination encoding) has been cleaned up and made consistent while at it. Also, fix a few longstanding errors in little-used encoding conversion routines: win1251_to_iso, win866_to_iso, euc_tw_to_big5, euc_tw_to_mic, mic_to_euc_tw were all broken to varying extents. Patches by Tatsuo Ishii and Tom Lane. Thanks to Akio Ishida and Yasuo Ohgaki for identifying the security issues.	2006-05-21 20:05:21 +00:00
Bruce Momjian	f3d99d160d	Add CVS tag lines to files that were lacking them.	2006-03-11 04:38:42 +00:00
Bruce Momjian	f2f5b05655	Update copyright for 2006. Update scripts.	2006-03-05 15:59:11 +00:00
Tatsuo Ishii	b3d0442ab3	Tighten up SJIS byte sequence check. Now we reject invalid SJIS byte sequence such as "0x95 0x27". Patches from Akio Ishida. Also update copyright notice.	2006-03-04 10:57:35 +00:00
Peter Eisentraut	7f4f42fa10	Clean up CREATE FUNCTION syntax usage in contrib and elsewhere, in particular get rid of single quotes around language names and old WITH () construct.	2006-02-27 16:09:50 +00:00
Peter Eisentraut	268c1b6077	The Makefile was invoking perl scripts as ./script.pl. This fails when the script is not executable as UCS_to_most.pl is in CVS. It also won't pick up any custom setting of the perl version/location to use. This patch calls perl scripts like $(PERL) $(srcdir)/script.pl. Kris Jurka	2006-02-24 13:25:44 +00:00
Peter Eisentraut	1b658473ea	Add support for Windows codepages 1253, 1254, 1255, and 1257 and clean up a bunch of the support utilities. In src/backend/utils/mb/Unicode remove nearly duplicate copies of the UCS_to_XXX perl script and replace with one version to handle all generic files. Update the Makefile so that it knows about all the map files. This produces a slight difference in some of the map files, using a uniform naming convention and not mapping the null character. In src/backend/utils/mb/conversion_procs create a master utf8<->win codepage function like the ISO 8859 versions instead of having a separate handler for each conversion. There is an externally visible change in the name of the win1258 to utf8 conversion. According to the documentation notes, it was named incorrectly and this changes it to a standard name. Running the Unicode mapping perl scripts has shown some additional mapping changes in koi8r and iso8859-7.	2006-02-18 16:15:23 +00:00
Tom Lane	226a980bb0	Fix bug that allowed any logged-in user to SET ROLE to any other database user id (CVE-2006-0553). Also fix related bug in SET SESSION AUTHORIZATION that allows unprivileged users to crash the server, if it has been compiled with Asserts enabled. The escalation-of-privilege risk exists only in 8.1.0-8.1.2. However, the Assert-crash risk exists in all releases back to 7.3. Thanks to Akio Ishida for reporting this problem.	2006-02-12 22:32:43 +00:00
Bruce Momjian	2a5180c26e	Throw a warning rather than an error on invalid character from UTF8 to Latin1, like we do for other Latin encodings.	2006-02-12 21:15:19 +00:00
Bruce Momjian	c01999a557	Allow psql multi-line column values to align in the proper columns If the second output column value is 'a\nb', the 'b' should appear in the second display column, rather than the first column as it does now. Change libpq's PQdsplen() to return more useful values. > Note: this changes the PQdsplen function, it can now return zero or > minus one which was not possible before. It doesn't appear anyone is > actually using the functions other than psql but it is a change. The > functions are not actually documentated anywhere so it's not like we're > breaking a defined interface. The new semantics follow the Unicode > standard. BACKWARD COMPATIBLE CHANGE. The only user-visible change I saw in the regression tests is that a SELECT * on a table where all the columns have been dropped doesn't return a blank line like before. This seems like a step forward. Martijn van Oosterhout	2006-02-10 00:39:04 +00:00
Neil Conway	d3a4d63387	mbutils was previously doing some allocations, including invoking fmgr_info(), in the TopMemoryContext. I couldn't see that the code actually leaked, but in general I think it's fragile to assume that pfree'ing an FmgrInfo along with its fn_extra field is enough to reclaim all the resources allocated by fmgr_info(). I changed the code to do its allocations in a new child context of TopMemoryContext, MbProcContext. When we want to release the allocations we can just reset the context, which is cleaner.	2006-01-12 22:04:02 +00:00
Neil Conway	fb627b76cc	Cosmetic code cleanup: fix a bunch of places that used "return (expr);" rather than "return expr;" -- the latter style is used in most of the tree. I kept the parentheses when they were necessary or useful because the return expression was complex.	2006-01-11 08:43:13 +00:00
Neil Conway	762bcbdba2	Remove a confusing pair of parentheses.	2006-01-11 06:59:22 +00:00
Bruce Momjian	a2384d008a	More uses of IS_HIGHBIT_SET() macro.	2005-12-26 19:30:45 +00:00
Bruce Momjian	261114a23f	I have added these macros to c.h: #define HIGHBIT (0x80) #define IS_HIGHBIT_SET(ch) ((unsigned char)(ch) & HIGHBIT) and removed CSIGNBIT and mapped it uses to HIGHBIT. I have also added uses for IS_HIGHBIT_SET where appropriate. This change is purely for code clarity.	2005-12-25 02:14:19 +00:00
Bruce Momjian	d8a8183456	Formatting cleanups.	2005-12-24 17:19:40 +00:00
Bruce Momjian	0658a6a634	Formatting cleanup.	2005-12-24 16:49:48 +00:00
Tatsuo Ishii	804f6b8fc9	Fix long standing Asian multibyte charsets bug. See: Subject: [HACKERS] bugs with certain Asian multibyte charsets From: Tatsuo Ishii <ishii@sraoss.co.jp> To: pgsql-hackers@postgresql.org Date: Sat, 24 Dec 2005 18:25:33 +0900 (JST) for more details/	2005-12-24 09:35:36 +00:00
Tatsuo Ishii	dcc7da8d5e	Fix for rearranging encoding id ISO-8859-5 to ISO-8859-8. Also make the code more robust by searching for target encoding in the internal charset map. Problem reported by Sagi Bashari on 2005/12/21. See "[BUGS] BUG #2120: Crash when doing UTF8<->ISO_8859_8 encoding conversion" on pgsql-bugs list for more details.	2005-12-23 02:11:02 +00:00
Peter Eisentraut	a29c04a541	Allow installation into directories containing spaces in the name.	2005-12-09 21:19:36 +00:00
Bruce Momjian	436a2956d8	Re-run pgindent, fixing a problem where comment lines after a blank comment line where output as too long, and update typedefs for /lib directory. Also fix case where identifiers were used as variable names in the backend, but as typedefs in ecpg (favor the backend for indenting). Backpatch to 8.1.X.	2005-11-22 18:17:34 +00:00
Peter Eisentraut	07bb9f086b	Message corrections	2005-10-29 00:31:52 +00:00
Bruce Momjian	1dc3498251	Standard pgindent run for 8.1.	2005-10-15 02:49:52 +00:00
Tom Lane	8889685555	Suppress signed-vs-unsigned-char warnings.	2005-09-24 17:53:28 +00:00

1 2 3 4 5

228 Commits