Commit ca1860c3 authored by Alexander Barkov's avatar Alexander Barkov

Updating charset doc files.

Thanks to Paul for preparing the up-to-date files
reflecting 4.1 changes.
parent 585d205d
This directory holds configuration files which allow MySQL to work with This directory holds configuration files that enable MySQL to work with
different character sets. It contains: different character sets. It contains:
*.conf charset_name.xml
Each conf file contains four tables which describe character types, Each charset_name.xml file contains information for a simple character
set. The information in the file describes character types,
lower- and upper-case equivalencies and sorting orders for the lower- and upper-case equivalencies and sorting orders for the
character values in the set. character values in the set.
Index Index.xml
The Index file lists all of the available charset configurations. The Index.xml file lists all of the available charset configurations,
including collations.
Each charset is paired with a number. The number is stored Each collation must have a unique number. The number is stored
IN THE DATABASE TABLE FILES and must not be changed. Always IN THE DATABASE TABLE FILES and must not be changed.
add new character sets to the end of the list, so that the
numbers of the other character sets will not be changed. The max-id attribute of the <charsets> element must be set to
the largest collation number.
Compiled in or configuration file? Compiled in or configuration file?
When should a character set be compiled in to MySQL's string library When should a character set be compiled in to MySQL's string library
(libmystrings), and when should it be placed in a configuration (libmystrings), and when should it be placed in a charset_name.xml
file? configuration file?
If the character set requires the strcoll functions or is a If the character set requires the strcoll functions or is a
multi-byte character set, it MUST be compiled in to the string multi-byte character set, it MUST be compiled in to the string
library. If it does not require these functions, it should be library. If it does not require these functions, it should be
placed in a configuration file. placed in a charset_name.xml configuration file.
If the character set uses any one of the strcoll functions, it If the character set uses any one of the strcoll functions, it
must define all of them. Likewise, if the set uses one of the must define all of them. Likewise, if the set uses one of the
...@@ -30,11 +33,7 @@ Compiled in or configuration file? ...@@ -30,11 +33,7 @@ Compiled in or configuration file?
more information on how to add a complex character set to MySQL. more information on how to add a complex character set to MySQL.
Syntax of configuration files Syntax of configuration files
The syntax is very simple. Comments start with a '#' character and The syntax is very simple. Words in <map> array elements are
proceed to the end of the line. Words are separated by arbitrary separated by arbitrary amounts of whitespace. Each word must be a
amounts of whitespace. number in hexadecimal format. The ctype array has 257 words; the
other arrays (lower, upper, etc.) take up 256 words each after that.
For the character set configuration files, every word must be a
number in hexadecimal format. The ctype array takes up the first
257 words; the to_lower, to_upper and sort_order arrays take up 256
words each after that.
This diff is collapsed.
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment