postgresql/src/backend/parser
Peter Eisentraut 2991ac5fc9 Add SQL functions for Unicode normalization
This adds SQL expressions NORMALIZE() and IS NORMALIZED to convert and
check Unicode normal forms, per SQL standard.

To support fast IS NORMALIZED tests, we pull in a new data file
DerivedNormalizationProps.txt from Unicode and build a lookup table
from that, using techniques similar to ones already used for other
Unicode data.  make update-unicode will keep it up to date.  We only
build and use these tables for the NFC and NFKC forms, because they
are too big for NFD and NFKD and the improvement is not significant
enough there.

Reviewed-by: Daniel Verite <daniel@manitou-mail.org>
Reviewed-by: Andreas Karlsson <andreas@proxel.se>
Discussion: https://www.postgresql.org/message-id/flat/c1909f27-c269-2ed9-12f8-3ab72c8caf7a@2ndquadrant.com
2020-04-02 08:56:27 +02:00
..
.gitignore Convert cvsignore to gitignore, and add .gitignore for build targets. 2010-09-22 12:57:04 +02:00
Makefile Split all OBJS style lines in makefiles into one-line-per-entry style. 2019-11-05 14:41:07 -08:00
README Move keywords.c/kwlookup.c into src/common/. 2016-03-23 20:22:08 -04:00
analyze.c Fill in extraUpdatedCols in logical replication 2020-02-17 15:20:57 +01:00
check_keywords.pl Update copyrights for 2020 2020-01-01 12:21:45 -05:00
gram.y Add SQL functions for Unicode normalization 2020-04-02 08:56:27 +02:00
parse_agg.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
parse_clause.c Reconsider the representation of join alias Vars. 2020-01-09 11:56:59 -05:00
parse_coerce.c Go back to returning int from ereport auxiliary functions. 2020-03-25 11:57:36 -04:00
parse_collate.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
parse_cte.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
parse_enr.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
parse_expr.c Make parser rely more heavily on the ParseNamespaceItem data structure. 2020-01-02 11:29:01 -05:00
parse_func.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
parse_node.c Go back to returning int from ereport auxiliary functions. 2020-03-25 11:57:36 -04:00
parse_oper.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
parse_param.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00
parse_relation.c Reconsider the representation of join alias Vars. 2020-01-09 11:56:59 -05:00
parse_target.c Reconsider the representation of join alias Vars. 2020-01-09 11:56:59 -05:00
parse_type.c Introduce macros for typalign and typstorage constants. 2020-03-04 10:34:25 -05:00
parse_utilcmd.c Implement operator class parameters 2020-03-30 19:17:23 +03:00
parser.c Allow Unicode escapes in any server encoding, not only UTF-8. 2020-03-06 14:17:43 -05:00
scan.l Go back to returning int from ereport auxiliary functions. 2020-03-25 11:57:36 -04:00
scansup.c Update copyrights for 2020 2020-01-01 12:21:45 -05:00

README

src/backend/parser/README

Parser
======

This directory does more than tokenize and parse SQL queries.  It also
creates Query structures for the various complex queries that are passed
to the optimizer and then executor.

parser.c	things start here
scan.l		break query into tokens
scansup.c	handle escapes in input strings
gram.y		parse the tokens and produce a "raw" parse tree
analyze.c	top level of parse analysis for optimizable queries
parse_agg.c	handle aggregates, like SUM(col1),  AVG(col2), ...
parse_clause.c	handle clauses like WHERE, ORDER BY, GROUP BY, ...
parse_coerce.c	handle coercing expressions to different data types
parse_collate.c	assign collation information in completed expressions
parse_cte.c	handle Common Table Expressions (WITH clauses)
parse_expr.c	handle expressions like col, col + 3, x = 3 or x = 4
parse_func.c	handle functions, table.column and column identifiers
parse_node.c	create nodes for various structures
parse_oper.c	handle operators in expressions
parse_param.c	handle Params (for the cases used in the core backend)
parse_relation.c support routines for tables and column handling
parse_target.c	handle the result list of the query
parse_type.c	support routines for data type handling
parse_utilcmd.c	parse analysis for utility commands (done at execution time)

See also src/common/keywords.c, which contains the table of standard
keywords and the keyword lookup function.  We separated that out because
various frontend code wants to use it too.