[Contents] [Previous Chapter] [Next Chapter] [Index] [Help]


3  Locales

The Tru64 UNIX operating system supports different Chinese locales for different countries and areas. These include Taiwan, People's Republic of China (PRC), and Hong Kong. Table 3-1 shows the valid Chinese locales with different countries, codesets, and collating sequences.

Note

zh_TW is an alias of zh_TW.eucTW, and zh_CN is an alias of zh_CN.dechanzi.

Table 3-1: Chinese Locales

Codeset

Locale

Collation Sequence

DEC Hanyuzh_TW
zh_TW.dechanyu
zh_TW.dechanyu@radical
zh_TW.dechanyu@stroke
zh_TW.dechanyu@chuyin
zh_HK.dechanyu
Internal code
Internal code
Radical
Stroke
Chuyin (Phonetic)
Internal code
Taiwanese EUCzh_TW.eucTW
zh_TW.eucTW@radical
zh_TW.eucTW@stroke
zh_TW.eucTW@chuyin
zh_HK.eucTW
Internal code
Radical
Stroke
Chuyin (Phonetic)
Internal code

Big-5

zh_TW.big5
zh_TW.big5@radical
zh_TW.big5@stroke
zh_TW.big5@chuyin
zh_HK.big5

Internal code
Radical
Stroke
Chuyin (Phonetic)
Internal code

DEC Hanzizh_CN
zh_CN.dechanzi
zh_CN.dechanzi@radical
zh_CN.dechanzi@stroke
zh_CN.dechanzi@pinyin
zh_HK.dechanzi
Internal code
Internal code
Radical
Stroke
Pinyin (Phonetic)
Internal code
GBKzh_CN.GBKInternal code (however, all characters in GB 2312 character set are ordered first, followed by GBK extension characters)
GB18030zh_CN.GB18030Internal code (however, all characters in GB 2312 character set are ordered first, followed by GBK characters not already ordered, followed by GB18030 characters not already ordered)

UTF-8

zh_CN.UTF-8
zh_TW.UTF-8
zh_HK.UTF-8

zh_CN.GBK
zh_TW.dechanyu
zh_HK.dechanyu

The collation sequence for the GBK locale is further refined so that characters in the GB 2312 character set are ordered first followed by GBK extension characters.

The collation sequence for the GB18030 locale is further refined so that characters in the GB 2312 character set are ordered first, followed by GBK characters not already ordered, followed by GB18030 characters not already ordered.

Locales that support the same country and codeset are basically the same. The radical, stroke, pinyin, and chuyin modifiers after the at (@) sign specify different criterion for collation and sorting. Moreover, the characters defined in character set standards have collating precedence over user-defined characters, which, in turn, have precedence over undefined or reserved characters.

The dechanyu, eucTW, and dechanzi locales support locale names with @ucs4 modifiers. The @ucs4 modifier indicates that UCS-4 is used as the internal processing code. However, these locales are the same as the locales without the @ucs4 modifier. The @ucs4 subset is provided for backward compatibility and may be removed in the future. See l10n_intro(5) for more information.

If you are using DECwindows Motif, you can select the locale through the Language Menu of Session Manager. If you are using CDE, you can select the locale using the language menu on the CDE login screen. If you are superuser, you can use the SYSMAN I18NCONFIG utility to select a locale and define it as the system default.

The applicable locales are shown in Table 3-2.

Table 3-2: Chinese Language Names

Locale

Language Name

zh_TW

Traditional Chinese Taiwan

zh_TW.dechanyu

Traditional Chinese Taiwan (DEC Hanyu)

zh_TW.eucTW

Traditional Chinese Taiwan (EUC)

zh_TW.big5

Traditional Chinese Taiwan (Big5)

zh_TW.UTF-8

Traditional Chinese Taiwan (Unicode)

zh_HK.dechanyu

Traditional Chinese Hong Kong (DEC Hanyu)

zh_HK.dechanzi

Simplified Chinese Hong Kong (DEC Hanzi)

zh_HK.eucTW

Traditional Chinese Hong Kong (EUC)

zh_HK.big5

Traditional Chinese Hong Kong (Big5)

zh_HK.UTF-8

Traditional Chinese Hong Kong (Unicode)

zh_CN

Simplified Chinese PRC

zh_CN.dechanzi

Simplified Chinese PRC (DEC Hanzi)

zh_CN.GBK

Simplified Chinese PRC (GBK)

zh_CN.GB18030

Simplified Chinese PRC (GB18030)

zh_CN.UTF-8

Simplified Chinese PRC (Unicode)

Note

For DECwindows Motif and CDE, the locale modifier is ignored.


[Contents] [Previous Chapter] [Next Chapter] [Index] [Help]