Chinese input method
Several input methods allow the use of Chinese characters with computers. Most allow selection of characters based either on their pronunciation or on their graphical shape. Phonetic input methods are easier to learn but less efficient, while graphical methods allow faster input but have a steep learning curve.
Other methods allow users to write characters directly via touchscreens, such as those found on mobile phones and tablet computers.
History
Chinese input methods predate the computer. One early attempt was the Mingkwai (Chinese: 明快; pinyin: míngkuài; Wade-Giles: ming-k'uai), an electro-mechanical Chinese typewriter invented in the 1940s by Lin Yutang, a prominent Chinese writer. It assigned thirty base shapes or strokes to different keys and adopted a new way of categorizing Chinese characters. However, the typewriter was never produced commercially, and Lin soon found himself deeply in debt.[2]
Before the 1980s, Chinese publishers hired teams of workers who selected a few thousand type pieces from an enormous Chinese character set. Chinese government agencies entered characters using a long, complicated list of Chinese telegraph codes, which assigned a different number to each character. During the early computer era, Chinese characters were categorized by their radicals or Pinyin romanization, but the results were unsatisfactory.
From the 1970s to the 1980s, large keyboards with thousands of keys were used to input Chinese. Each key was mapped to several Chinese characters. To type a character, one pressed the character key and then a selection key.[3] There were also experimental "radical keyboards" with dozens to several hundred keys. Chinese characters were decomposed into "radicals", each of which was represented by a key.[1][4][5] Unwieldy and difficult to use, these keyboards became obsolete after the introduction of the Cangjie input method, the first method to use only the standard QWERTY keyboard and make Chinese touch typing possible.[5]
In 1976, Chu Bong-Foo invented the Cangjie input method, which assigns different "roots" to each key on a standard computer keyboard. With this method, for example, the character 日 is assigned to the A key and 月 to the B key. Typing them together produces the character 明 ("bright").
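The root-to-key idea can be sketched as a plain table lookup. The snippet below is a minimal illustration, not an actual Cangjie implementation: it contains only the three entries mentioned above (日, "Ri", on A; 月, "Yue", on B; 明, "Ming", typed as A then B), and the names `ROOT_KEYS`, `CODE_TABLE`, `encode` and `lookup` are hypothetical.

```python
# Toy Cangjie-style lookup. Only the three entries named in the text are
# included; a real code table covers tens of thousands of characters.
ROOT_KEYS = {"日": "A", "月": "B"}                  # root -> key (from the text)
CODE_TABLE = {"A": "日", "B": "月", "AB": "明"}     # key sequence -> character


def encode(roots):
    """Return the key sequence for a list of roots, e.g. ["日", "月"] -> "AB"."""
    return "".join(ROOT_KEYS[root] for root in roots)


def lookup(keys):
    """Return the character for a key sequence, or None if it is unknown."""
    return CODE_TABLE.get(keys.upper())


if __name__ == "__main__":
    keys = encode(["日", "月"])
    print(keys, lookup(keys))
```

Because each key sequence maps to very few characters, a shape-based method of this kind rarely needs the candidate-selection step that phonetic methods depend on.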
Despite its steeper learning curve, this method remains popular in Chinese communities that use traditional Chinese characters, such as Hong Kong and Taiwan. It allows very precise input, so users who are familiar with its fairly complicated rules can type quickly and efficiently. It was the first method that allowed users to enter more than a hundred Chinese characters per minute. Its popularity is also helped by its omnipresence on traditional Chinese computer systems: Chu gave up his patent in 1982, stating that the method should be a cultural asset. Developers of Chinese systems can therefore adopt it freely, and users can expect to find it on devices with Chinese support.[6][7] Cangjie input programs supporting a large CJK character set have also been developed.[8][9][10]
All methods have their strengths and weaknesses. The pinyin method can be learned rapidly, but its maximum input rate is limited. The Wubi method takes longer to learn, but expert typists can enter text much more rapidly with it than with phonetic methods. However, Wubi is proprietary, and a version of it became freely available only after its inventor lost a patent lawsuit in 1997.[11]
Due to these complexities, there is no "standard" method.
By 1989, bopomofo and pinyin were available for the IBM PC.[12] In mainland China, pinyin methods such as Sogou Pinyin and Google Pinyin are the most popular. In Taiwan, Cangjie, Dayi, Boshiamy, and bopomofo predominate. In Hong Kong and Macau, Cangjie is most often taught in schools, while a few schools teach the CKC Chinese Input System.[13]
Other methods include handwriting recognition, OCR and speech recognition. The computer must first be "trained" before handwriting or speech recognition is used; that is, a new user puts the system into a special "learning mode" so that it can learn to identify their handwriting or speech patterns. These methods are used less frequently than keyboard-based input methods and suffer from relatively high error rates, especially when used without proper "training", though many users consider the higher error rates an acceptable trade-off.
Categories
Phonetic-based
The user enters pronunciations that are converted into the relevant Chinese characters. Because homophones are common in Chinese, the user must select the desired character from a list of candidates. Modern systems, such as Sogou Pinyin and Google Pinyin, predict the desired characters based on context and user preferences. For example, if one enters the sounds jicheng, the software will type 繼承 (to inherit), but if jichengche is entered, 計程車 (taxi) will appear.
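The jicheng / jichengche behaviour can be approximated with a greedy longest-match lookup over a phrase lexicon. This is only a sketch under stated assumptions: the two-entry `LEXICON` and the `convert` function are hypothetical, and commercial IMEs such as Sogou Pinyin or Google Pinyin are not documented here as working this way (they typically rely on statistical language models).

```python
# Toy pinyin-to-characters converter using greedy longest-match.
# LEXICON holds only the two phrases from the example in the text.
LEXICON = {
    "jicheng": "繼承",       # to inherit
    "jichengche": "計程車",  # taxi
}


def convert(pinyin, lexicon=LEXICON):
    """Convert a pinyin string, preferring the longest phrase at each position."""
    output = []
    pos = 0
    while pos < len(pinyin):
        # Try the longest remaining substring first, then shrink.
        for end in range(len(pinyin), pos, -1):
            phrase = lexicon.get(pinyin[pos:end])
            if phrase is not None:
                output.append(phrase)
                pos = end
                break
        else:
            # No lexicon entry starts here: pass the letter through unchanged.
            output.append(pinyin[pos])
            pos += 1
    return "".join(output)


if __name__ == "__main__":
    print(convert("jicheng"))
    print(convert("jichengche"))
```

Preferring the longest match is what lets the extra syllable "che" change the whole result instead of being appended to 繼承.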
The variety of Chinese dialects complicates phonetic input. Phonetic methods are mainly based on standard pinyin in mainland China, Zhuyin/Bopomofo in Taiwan, and Jyutping in Hong Kong. Input methods based on other varieties of Chinese, such as Hakka or Minnan, also exist.
While phonetic systems are easy to learn, choosing the appropriate Chinese characters slows typing. Most users report a typing speed of fifty characters per minute, though some reach over one hundred per minute.[14] In addition to predictive input based on previous conversions, some phonetic IMEs (input method editors) let users create custom dictionary entries for frequently used characters and phrases, potentially lowering the number of keystrokes required to produce them.
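Candidate selection, learning from previous conversions, and custom dictionary entries can be sketched together in a few lines. Everything here is an illustrative assumption rather than the behaviour of any particular IME: the `CandidateRanker` class, its method names, the four-character "shi" homophone list, and the abbreviation `zwsrf` (for zhong wen shu ru fa, 中文输入法) are all made up for the example.

```python
from collections import Counter


class CandidateRanker:
    """Toy candidate list that learns user picks and supports custom entries."""

    def __init__(self, lexicon):
        self.lexicon = {code: list(chars) for code, chars in lexicon.items()}
        self.picks = Counter()   # (code, candidate) -> times chosen
        self.custom = {}         # user-defined code -> phrase

    def add_custom(self, code, phrase):
        """Register a user dictionary entry, e.g. an abbreviation for a phrase."""
        self.custom[code] = phrase

    def candidates(self, code):
        """Return candidates, most frequently chosen first (ties keep lexicon order)."""
        ranked = sorted(self.lexicon.get(code, []),
                        key=lambda cand: -self.picks[(code, cand)])
        if code in self.custom:
            ranked.insert(0, self.custom[code])
        return ranked

    def choose(self, code, candidate):
        """Record that the user selected this candidate for this code."""
        self.picks[(code, candidate)] += 1


if __name__ == "__main__":
    ime = CandidateRanker({"shi": ["是", "事", "十", "市"]})
    ime.choose("shi", "十")           # the user picks the third homophone
    print(ime.candidates("shi"))      # the picked character now ranks first
    ime.add_custom("zwsrf", "中文输入法")
    print(ime.candidates("zwsrf"))
```

The custom entry shows the keystroke saving mentioned above: five letters produce a five-character phrase that would otherwise need the full pinyin plus a selection step.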
Shuangpin
Shuangpin (双拼; 雙拼), literally "dual spell", is a stenographic phonetic input method based on Hanyu Pinyin that reduces the number of keystrokes for one Chinese character to two by assigning every vowel and consonant composed of more than one letter to a specific key. In most Shuangpin layout schemes, such as Xiaohe, Microsoft 2003 and Ziranma, the most frequently used vowels are placed on the middle row, reducing the risk of repetitive strain injury.
Shuangpin is supported by many pinyin input programs, including QQ, Microsoft Bing Pinyin, Sogou Pinyin and Google Pinyin.
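The two-keystroke idea can be illustrated with a small encoder. The key assignments below are a hypothetical layout invented for this sketch; they are not the Xiaohe, Microsoft 2003 or Ziranma tables, and only a handful of finals are covered. Syllables without an initial consonant (such as "an") need a scheme-specific rule and are left out.

```python
# Toy Shuangpin encoder: one key for the initial, one key for the final.
# HYPOTHETICAL layout for illustration only, not any published scheme.
MULTI_LETTER_INITIALS = {"zh": "v", "ch": "i", "sh": "u"}
MULTI_LETTER_FINALS = {"ang": "h", "uang": "l", "ong": "s", "ian": "m", "ao": "c"}


def split_syllable(syllable):
    """Split a pinyin syllable into (initial, final); zero-initials unsupported."""
    for initial in MULTI_LETTER_INITIALS:
        if syllable.startswith(initial):
            return initial, syllable[len(initial):]
    return syllable[0], syllable[1:]


def shuangpin_keys(syllable):
    """Return the two keys for a syllable under the toy layout above."""
    initial, final = split_syllable(syllable)
    first = MULTI_LETTER_INITIALS.get(initial, initial)
    second = MULTI_LETTER_FINALS.get(final, final)
    if len(second) != 1:
        raise ValueError(f"final {final!r} is not in the toy layout")
    return first + second


if __name__ == "__main__":
    for syllable in ("zhuang", "shang", "tian", "ma"):
        print(syllable, "->", shuangpin_keys(syllable))
```

Under this layout the six-letter syllable "zhuang" shrinks to two keystrokes, while a syllable such as "ma" that is already two letters long is typed unchanged.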
Shape-based
- Cangjie input method
- Simplified Cangjie
- Dayi method
- Array input method (行列)
- Four-corner method
- Stroke count method
- Wubi method
- Zhengma method
- Biaoxingma method
- ZYQ method (正易全)[15]
Others
- Chinese telegraph code (中文电码)
Examples of keyboard layouts
- A typical keyboard layout for zhuyin on computers, which can be used as an input method
- A keyboard using the Wubi method
- A typical keyboard layout for the Cangjie method, which is based on the U.S. keyboard layout. Note the non-standard use of Z as the collision key.
- A typical keyboard layout for the Dayi method
- Chinese (traditional) keyboard layout, a US keyboard with Zhuyin, Cangjie and Dayi key labels, which can all be used to input Chinese characters into a computer
Software
- Microsoft IME
- Sogou Pinyin
- Google Pinyin
References
- [1] "1973 Nian Jiao Da Yan Zhi Di Yi Ge Zhong Wen Jian Pan". The memory of Hsinchu city (in Chinese). Retrieved 2022-08-25.
- [2] Zhong Wen Yu Ji Suan Ji. Archived 2003-05-13 at archive.today.
- [3] "Yi Zi Zheng Zi Jian Pan Pan Mian Zi Pai Lie". Standardization Administration of China. 1987. Retrieved 2022-08-26.
- [4] Xie Qing Jun; Huang Yong Wen; Lin Shu (1973). "Zhong Wen Zi Gen Zhi Fen Xi". Science Bulletin National Chiao-Tung University. 6 (1).
- [5] Zhu Bang Fu (1995). "San, Dian Nao Cang Jie, Tian Long, Ling Yi, Han Qia". Zhi Hui Zhi Lu. Di 3 Bu, Yan Xia (Yi Jiu Qi San-Yi Jiu Jiu Wu). Shi Bao Chu Ban.
- [6] Zhu Lin Hua (2012). "Jiao Yu Ke Ji De Zhuan Li Yu Pu Ji". Guo Jia Jiao Yu Yan Jiu Yuan Dian Zi Bao. No. 33.
- [7] Lan Li Juan (1999). "Zhu Bang Fu De Ren Wen Ke Ji Meng". Tian Xia Za Zhi. No. 219. Retrieved 2022-08-26.
- [8] "Zhong Zhou Yun Shu Ru Fa Yin Qing". Retrieved 2022-08-26.
- [9] "Cang Jie Zhi You". Retrieved 2022-08-26.
- [10] Tian Yi (2012-03-02). "Qian Zhong Shu Xian Sheng Yu [Zhong Guo Gu Dian Shu Zi Gong Cheng]". Retrieved 2022-08-26.
- [11] "Wang Yong Min Wang Ma Wu Bi Zi Xing Zhuan Li Jiu Fen An". Zhong Guo Zhi Shi Chan Quan Lu Shi Wang. 2009-05-17. Retrieved 2022-08-26.
- [12] Pournelle, Jerry (February 1989). "Ready Line Overload". BYTE. pp. 121-137. Retrieved 2024-10-08.
- [13] "Cang Jie Yi Wai De Ling Yi Ge Xuan Ze - 'Zong Heng Shu Ru Fa'". Jiao Shi Za Zhi. No. 7. 2004. Retrieved 2022-08-26.
- [14] Users' Report on Pinyin Method, Sougou BBS.
- [15] Zhang, Xiao-heng (2003). "Zheng Yi Quan: Yi Ge Dong Tai Jie Gou Bi Zu Yi Zi Bian Ma Shu Ru Fa (Towards Correctness, Easiness and Completeness: Building a Chinese Character Coding Input Method Based on Dynamic Structured Stroke Groups)". Journal of Chinese Information Processing. 17 (3): 60-66.
External links
- What Does a Chinese Keyboard Look Like?, article by Slate.com
- Overview of Input Methods, by Sebastien Bruggeman
- Zhong Wen Shu Ru Fa Shi Jie, Chinese input method news
- The engineering daring that led to the first Chinese personal computer: With 1,000s of Chinese characters and limited memory, inventors of the Sinotype III had to push the limits of early machines, by Tom Mullaney, June 29, 2021, techcrunch.com
- How intensive modding ushered in China's computer revolution: Early Chinese engineers needed to constantly push against the boundaries of 'alphabetic order', by Tom Mullaney, October 24, 2021, techcrunch.com
- The computer pioneer who built modern China, by Leila McNeill, 19 February 2020, BBC