http://dx.doi.org/10.14369/jkmc.2019.32.1.105

A Character Shape Encoding Method to Input Chinese Characters in Old Documents  

Kim, Kiwang (Pusan National University School of Korean Medicine)
Publication Information
Journal of Korean Medical Classics / v.32, no.1, 2019, pp. 105-116
Abstract
Objectives: Ancient classical literature contains many rare Chinese characters, so-called Byeokja (僻字), and frequently includes characters that are not registered in Unicode as well as variant characters that cannot be found in current font sets. To make all possible Chinese characters, including these, usable as units of information exchange, this study proposes a method for encoding the morphological information of Chinese characters according to fixed rules.
Methods: The proposed method encodes the connections between the nodules that constitute a Chinese character together with the coordinates of those nodules. It adds rules for expressing curves, for expressing the aspect ratio of a character, for minimizing the number of coordinate lines, and for expressing how character components are aggregated.
Results: The proposed method can generate codes of a certain length by extracting only the information that expresses the morphological configuration of a character.
Conclusions: The proposed character encoding method can be used to distinguish Byeokja, new Chinese characters, and variant characters that differ only slightly in their strokes, and to store and search for them.
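The idea of encoding nodule coordinates together with the connections between nodules can be illustrated with a minimal sketch. This is not the paper's actual code format: the function name `encode_shape`, the integer grid, the separators, and the node ordering rule are all assumptions made for illustration, and the rules for curves, aspect ratios, coordinate-line minimization, and component aggregation are left out.

```python
from typing import List, Tuple

Node = Tuple[int, int]   # hypothetical (x, y) position on a small integer grid
Edge = Tuple[int, int]   # pair of indices into the node list (one stroke segment)


def encode_shape(nodes: List[Node], edges: List[Edge]) -> str:
    """Serialize nodules and their connections into one canonical string.

    Assumed layout (illustrative only): "x,y;x,y;...|a-b;a-b;..."
    """
    # Order nodes top-to-bottom, then left-to-right, so that the same
    # shape yields the same code regardless of input order.
    order = sorted(range(len(nodes)), key=lambda i: (nodes[i][1], nodes[i][0]))
    rank = {old: new for new, old in enumerate(order)}

    coord_part = ";".join(f"{nodes[i][0]},{nodes[i][1]}" for i in order)

    # Renumber edge endpoints to the canonical node order and sort them.
    canon_edges = sorted(tuple(sorted((rank[a], rank[b]))) for a, b in edges)
    edge_part = ";".join(f"{a}-{b}" for a, b in canon_edges)

    return f"{coord_part}|{edge_part}"


# A cross shape similar to 十: one horizontal and one vertical stroke.
cross = encode_shape([(0, 1), (2, 1), (1, 0), (1, 2)], [(0, 1), (2, 3)])
print(cross)
```

Because the code depends only on coordinates and connections, two glyphs that differ by a single shifted nodule produce different strings, which is the property needed to tell close variants apart.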
Keywords
Chinese characters; old classics; encoding; shape-based code