대한전자공학회논문지 (Journal of the Korean Institute of Telematics and Electronics)
- 제25권9호
- /
- Pages.1091-1101
- /
- 1988
- /
- 1016-135X(pISSN)
한국어 문서로부터 문자분리 및 도형추출에 관한 연구
A Study on the Korean Character Segmentation and Picture Extraction from a Document
초록
In this paper, a method to segment each character and extract figure from Korean documents is proposed. At first, each character string is extracted by means of iterative horizontal propagation, shrink algorithm and run-length algorithm. Individual character region is extracted by iterative horizontal and vertical manipulation. Next, characters of right pitch are searched. Each character is segmented by the position information. Overlapped character is segmented on the ground of the width of already extracted character. The rest are extracted as special characters of half pitch. Using 9 data input in the form of 840 X 600 from Korean monthly magazine, experiment was simulated. Extraction rate of character is 100%, and that of individual character is 98%. Judging from these results, efficiency on extracting character region and segmenting individual character is proved.
키워드