Index Click this button to go to the index for this section.


Chinese(5)

NAME

Chinese, chinese - Introduction to Chinese language support

DESCRIPTION

There are two kinds of written Chinese characters, Traditional Chinese and Simplified Chinese. The former is widely used in Hong Kong (HK) and Taiwan, while the latter is more widely used in the People's Republic of China (PRC) and Singapore. Codesets There are several coded character sets (codesets) available to support each of the two written Chinese languages. For each Chinese language, the following list notes the supported codesets and the strings that represent those codesets in the names of locales, converters, and other kinds of system files: · Codesets for Traditional Chinese: DEC Hanyu, dechanyu For more information, see dechanyu(5). Taiwanese EUC (Extended UNIX Code), eucTW For more information, see eucTW(5). Big-5, big5 For more information, see big5(5). Shift Big-5, sbig5 For more information, see sbig5(5). The Shift Big-5 codeset is supported for codeset conversion only; it is not used in locales or for both input and output operations. TeleCode, telecode For more information, see telecode(5). The Telecode codeset is supported for codeset conversion only; it is not used in locales or for both input and output operations. · Codesets for Simplified Chinese: DEC Hanzi, dechanzi For more information, see dechanzi(5). Locales The following list groups supported locales according to language, country or territory, and codeset. Each locale supports one collation sequence, which is specified following the locale name. · For Traditional Chinese, Taiwan: Locales using DEC Hanyu: zh_TW.dechanyu Collation using internal code order zh_TW.dechanyu@radical Collation by radical zh_TW.dechanyu@stroke Collation by stroke zh_TW.dechanyu Chuyin (phonetic) collation Locales using Taiwanese EUC: zh_TW.eucTW Collation using internal code order zh_TW.eucTW@radical Collation by radical zh_TW.eucTW@stroke Collation by stroke zh_TW.eucTW@chuyin Chuyin (phonetic) collation Locales using Big-5: zh_TW.big5 Collation using internal code order zh_TW.big5@radical Collation by radical zh_TW.big5@stroke Collation by stroke zh_TW.big5@chuyin Chuyin (phonetic) collation · For Traditional Chinese, Hong Kong: Locales using DEC Hanyu: zh_HK.dechanyu Collation using internal code order Locales using Taiwanese EUC: zh_HK.eucTW Collation using internal code order Locales using Big-5: zh_HK.big5 Collation using internal code order · For Simplified Chinese, Hong Kong: Locales using DEC Hanzi: zh_HK.dechanzi Collation using internal code order · For Simplified Chinese, PRC: Locales using DEC Hanzi: zh_CN.dechanzi Collation using internal code order zh_CN.dechanzi@radical Collation by radical zh_CN.dechanzi@stroke Collation by stroke zh_CN.dechanzi@pinyin Pinyin (Phonetic) collation The asort(1) reference page contains more information on the collation sequences used for Asian languages. You can use the locale command (see locale(1)) to display the names of locales installed on your system. See i18n_intro(5) for information on setting locale. In a windows environment, you also need to set the session language. The way you do this depends on which windows environment you are using: · In the Common Desktop environment (CDE), use the Language menu accessed from the login window Options button. · In the DECwindows environment, use the Language Option dialog box invoked from the Session Manager's Options menu. Keyboards, Input Servers, and Input Methods The operating system supports the following Chinese keyboards: · For Traditional Chinese: LK201-D LK401-D · For Simplified Chinese: LK201-C LK401-C See the keyboard(5) reference page for information about loading keyboard mapping tables (keymaps) for keyboards. For operation within the DECwindows environment, the operating system provides the following input servers: · For Traditional Chinese: dxhanyuim See dxhanyuim(1X) for more information. · For Simplified Chinese: dxhanziim See dxhanziim(1X) for more information. These input servers can be started in either of the following ways: · In the DECwindows Motif environment, choose Hanyu IM (for Traditional Chinese) or Hanzi IM (for Simplified Chinese) from the Applications menu. · At the system prompt, enter one of the following commands: For Traditional Chinese: /usr/bin/X11/dxhanyuim & For Simplified Chinese: /usr/bin/X11/dxhanziim & The operating system supports the following Chinese input methods, listed by name under the written language with which the methods are used. Next to the name is the function key used to switch to that input method when the window for the input server is active. · For Traditional Chinese: (These input methods are enabled when the dxhanyuim server is running.) Internal code, F8 Phonetic, F10 Phrase, F9 Quick Tsang-Chi, F7 Symbol Tsang-Chi, F6 Note that there are no locales to support the Shift Big-5 and Telecode character sets, so these characters cannot be input directly. · For Simplified Chinese: (These input methods are enabled when the dxhanziim server is running.) 5-stroke, F6 5-Shape, F10 Pin-Yin, or Phonetic, F8 Qu-Wei or Row-Column in GB2312-80, F7 Telex Code, F9 Phrase Input, F5 For the following terminals or keyboard settings, you can toggle between the English input mode and Chinese input mode by using a particular key or key sequence: · For either Traditional Chinese on a VT382-D terminal keyboard or Simplified Chinese on a VT382-C terminal keyboard, press Compose. · In the DECwindows environment when LK201 is the keyboard setting, hold down the Compose key while pressing the Space bar (Compose+Space). · In the DECwindows environment when LK401 is the keyboard setting, use either Compose+Space or Shift+Space. The preceding key sequences are defaults. You can override them by setting other keys to perform the same function. Chinese Terminals The operating system supports the VT382-D and the VT382-C terminals for Traditional Chinese and Simplified Chinese, respectively. Running DECwindows Applications X or Motif applications require non-ASCII fonts to display Chinese characters. Therefore, you must set the font path appropriately before starting an application that displays Chinese characters. An application can find Chinese fonts in either of the following directories: · /usr/i18n/lib/X11/fonts/decwin/75dpi, for low resolution display · /usr/i18n/lib/X11/fonts/decwin/100dpi, for high resolution display Before you start a Chinese application, use the following command to check the font path: % xset q If one of the directories in the preceding list is not in the font path, the following example shows how to add the directory. You can substitute 100dp for 75dpi if you want high resolution display. % xset +fp /usr/i18n/lib/X11/decwin/75dpi/ % xset fp rehash After ensuring that the font path is set correctly and that the appropriate input server is running (see the section on input devices, servers, and methods), you can use the following steps to start a Chinese application: 1. If you have not already made the desired language setting, then: a. Choose Language from the Option menu. b. Select the Chinese language that you want the application to use. The language options are grouped in the following list according to whether they are an implementation of Simplified Chinese or Traditional Chinese. The options are grouped this way to indicate which input server they require. · For Simplified Chinese: Chinese China This option is equivalent to Chinese China (DEC Hanzi). Chinese China (DEC Hanzi) Chinese Hong Kong (DEC Hanzi) All Simplified Chinese language options require the dxhanziim input server. · For Traditional Chinese: Chinese Hong Kong (Big-5) Chinese Hong Kong (EUC Taiwan) Chinese Hong Kong (DEC Hanyu) Chinese Taiwan This option is equivalent to Chinese Taiwan (DEC Hanyu). Chinese Taiwan (Big-5) Chinese Taiwan (DEC Hanyu) Chinese Taiwan (EUC) All Traditional Chinese language options require the dxhanyuim input server. c. Press the OK button. 2. Choose the application you want to start from the Application menu. Printers The operating system supports the following Chinese printers, grouped by language. The associated print filter is noted in parentheses following the printer name. · For Traditional Chinese: CP382-D (cp382dof) The CP382-D is a Chinese dot matrix printer. DEClaser 1152 (dl152wrof) The DEClaser 1152 is a PostScript printer that uses the font- faulting technique to download fonts. DEClaser 5100, model LN09X-HD (dl5100wrof) The DEClaser 5100 is a PostScript printer that uses a built-in font disk. Others The operating system also supports text printers that have built- in Traditional Chinese fonts and text printers to which Traditional Chinese fonts can be downloaded. · For Simplified Chinese: LA88-C (la88cof) The LA88-C is a Chinese dot matrix printer. LA380-CB (la380cbof) The LA380-CB is a Chinese graphic line printer. DEClaser 1152 (dl1152wrof) The DEClaser 1152 is a PostScript printer that uses the font- faulting technique to download fonts. DEClaser 5100, model LN09-HD, (dl5100wrof) The DEClaser 5100 is a PostScript printer that uses a built-in font disk. For more information on setting up and configuring these printers, refer to the i18n_printing(5) and lprsetup(8) reference pages. Codeset Conversion The following codeset converters are available: · big5_UCS-2 (Big-5 to UCS-2) · big5_UCS-4 (Big-5 to UCS-4) · big5_UTF-8 (Big-5 to UTF-8) · big5_dechanyu (Big-5 to DEC Hanyu) · big5_dechanzi (Big-5 to DEC Hanzi) · big5_eucTW (Big-5 to EUC Taiwan) · big5_sbig5 (Big-5 to Shift Big-5) · big5_telecode (Big-5 to Telecode) · dechanyu_big5 (DEC Hanyu to Big-5) · dechanyu_dechanzi (DEC Hanyu to DEC Hanzi) · dechanyu_eucTW (DEC Hanyu to EUC Taiwan) · dechanyu_telecode (DEC Hanyu to Telecode) · dechanyu_UCS-2 (DEC Hanyu to UCS-2) · dechanyu_UCS-4 (DEC Hanyu to UCS-4) · dechanyu_UTF-8 (DEC Hanyu to UTF-8) · dechanzi_big5 (DEC Hanzi to Big-5) · dechanzi_dechanyu (DEC Hanzi to DEC Hanyu) · dechanzi_eucTW (DEC Hanzi to EUC Taiwan) · dechanzi_UCS-2 (DEC Hanzi to UCS-2) · dechanzi_UCS-4 (DEC Hanzi to UCS-4) · dechanzi_UTF-8 (DEC Hanzi to UTF-8) · eucTW_big5 (EUC Taiwan to Big-5) · eucTW_dechanyu (EUC Taiwan to DEC Hanyu) · eucTW_dechanzi (EUC Taiwan to DEC Hanzi) · eucTW_sbig5 (EUC Taiwan to Shift Big-5) · eucTW_telecode (EUC Taiwan to Telecode) · eucTW_UCS-2 (EUC Taiwan to UCS-2) · eucTW_UCS-4 (EUC Taiwan to UCS-4) · eucTW_UTF-8 (EUC Taiwan to UTF-8) · sbig5_big5 (Shift Big-5 to Big-5) · sbig5_eucTW (Shift Big-5 to EUC Taiwan) · telecode_big5 (Telecode to Big-5) · telecode_dechanyu (Telecode to DEC Hanyu) · telecode_eucTW (Telecode to EUC Taiwan) · UCS-2_big5 (UCS-2 to Big-5) · UCS-2_dechanyu (UCS-2 to DEC Hanyu) · UCS-2_dechanzi (UCS-2 to DEC Hanzi) · UCS-2_eucTW (UCS-2 to EUC Taiwan) · UCS-4_big5 (UCS-4 to Big-5) · UCS-4_dechanyu (UCS-4 to DEC Hanyu) · UCS-4_dechanzi (UCS-4 to DEC Hanzi) · UTF-8_big5 (UTF-8 to Big-5) · UTF-8_dechanyu (UTF-8 to DEC Hanyu) · UTF-8_dechanzi (UTF-8 to DEC Hanzi) For information about UCS-2, UCS-4, and UTF-8, see Unicode(5). For information about using the converters that implement codeset conversion, including options and restrictions that apply to conversion of Chinese characters, see iconv_intro(5). Note There are no codeset converters available to support the Microsoft code pages for Chinese because available UNIX codesets provide identical character encoding. For Traditional Chinese, character encoding in the cp950 code page is the same as for the Big-5 (big5) codeset. For Simplified Chinese, character encoding in the cp936 code page is the same as for the DEC Hanzi (dechanzi) codeset. For information about PC code pages, see code_page(5).

EXAMPLES

This section shows the steps in the DECwindows Motif environment to set up Traditional Chinese language support, start the DECterm application, enter words in Traditional Chinese, and display Chinese text. 1. Make sure that the font path includes either the 75dpi or 100dpi directory that contains fonts for Asian languages: % xset q If neither /usr/i18n/lib/decwin/75dpi/ nor /usr/i18n/lib/decwin/75dpi/ is included in the display, add one of them to the font path. For example: % xset +fp /usr/i18n/lib/decwin/75dpi/ % xset fp rehash 2. If it is not already running, start the dxhanyuim input server: % /usr/bin/X11/dxhanyuim & 3. Choose Language from the Option menu, select Chinese Taiwan in the Language Options dialog box, and click on the OK button. 4. Create a new DECterm window by choosing DECterm from the Application menu. 5. Switch from English to the Tsang-Chi input method by entering the Compose/Space key sequence and pressing the F6 key. 6. Use the following steps to enter "Hanyu" in Traditional Chinese characters: a. Type ETLO and press the Space bar. b. Type YRMMR and press the Space bar. 7. Toggle out of the Tsang-Chi input method and select the Quick Tsang- Chi input method by entering the Compose/Space key sequence and pressing the F7 key. 8. Use the following steps to enter the Chinese words for "Quick Tsang- Chi": a. Enter YL6. b. Enter IS1. 9. Toggle out of the Quick Tsang-Chi input method and select the Internal input method by entering the Compose/Space key sequence and pressing the F9 key. 10. Use the following steps to enter the Chinese words for "Internal": a. Enter C4F9. b. Enter EEA3. 11. Toggle back to English input mode by entering the Compose/Space key sequence. 12. Enter the following command to display an example of Traditional Chinese text: % cat /usr/i18n/examples/text_example/dechanyu 13. You can also display an example of Simplified Chinese text. To do this, you must first start the dxhanziim server, change the language setting to one that specifies DEC Hanzi as the codeset, and create another DECterm window. Then, in the new window, enter the following command: % cat /usr/18n/examples/text_example/dechanzi

SEE ALSO

Commands: asort(1), locale(1), lp(1), lpr(1), dxhanyuim(1X), dxhanziim(1X), xset(1X), lpd(8), lprsetup(8) Files: printcap(4) Others: big5(5), code_page(5), dechanyu(5), dechanzi(5), eucTW(5), i18n_intro(5), i18n_printing(5), iconv_intro(5), keyboard(5), l10n_intro(5), sbig5(5), telecode(5), Unicode(5) Writing Software for the International Market