From 182de3bf87f46f5c3f321a304ec2e39112a972b9 Mon Sep 17 00:00:00 2001 From: Jeff Davis Date: Mon, 19 Feb 2024 15:21:39 -0800 Subject: [PATCH v18 1/6] Documentation update for Standard Collations. --- doc/src/sgml/charset.sgml | 79 ++++++++++++++++++++++++--------------- 1 file changed, 48 insertions(+), 31 deletions(-) diff --git a/doc/src/sgml/charset.sgml b/doc/src/sgml/charset.sgml index 74783d148f..4422b0cc92 100644 --- a/doc/src/sgml/charset.sgml +++ b/doc/src/sgml/charset.sgml @@ -788,55 +788,72 @@ SELECT * FROM test1 ORDER BY a || b COLLATE "fr_FR"; Standard Collations - On all platforms, the collations named default, - C, and POSIX are available. Additional - collations may be available depending on operating system support. - The default collation selects the LC_COLLATE - and LC_CTYPE values specified at database creation time. - The C and POSIX collations both specify - traditional C behavior, in which only the ASCII letters - A through Z - are treated as letters, and sorting is done strictly by character - code byte values. - - - - - The C and POSIX locales may behave - differently depending on the database encoding. - - - - - Additionally, two SQL standard collation names are available: + On all platforms, the collations following collations are supported: - unicode + UNICODE - This collation sorts using the Unicode Collation Algorithm with the - Default Unicode Collation Element Table. It is available in all - encodings. ICU support is required to use this collation. (This - collation has the same behavior as the ICU root locale; see .) - ucs_basic + UCS_BASIC - This collation sorts by Unicode code point. It is only available for - encoding UTF8. (This collation has the same - behavior as the libc locale specification C in - UTF8 encoding.) + This SQL standard collation sorts by Unicode code point. Behavior is + efficient and stable across all versions. Case mapping and pattern + matching behavior is based on simple ASCII semantics. It is only + available for encoding UTF8. (This collation has + the same behavior as the libc locale specification + C in UTF8 encoding.) + + + + + + C (equivalent to POSIX) + + + The C and POSIX collations both + specify traditional C behavior, in which only the ASCII + letters A through + Z are treated as letters, and + sorting is done strictly by character code byte values. Case mapping + and pattern matching behavior is based on simple ASCII + semantics. Behavior is efficient and stable across all versions for a + given database encoding, but behavior may vary between different + database encodings. + + + + + + default + + + The default collation selects the locale specified + at database creation time. + + + Additional collations may be available depending on operating system + support. The efficiency and stability of these additional collations + depend on the collation provider, the provider version, and the locale. + -- 2.34.1