An Efficient Method to Extract Units of Manchu Characters

Snowberger, Aaron Daniel;Lee, Choong Ho;

Proceedings of the Korean Institute of Information and Commucation Sciences Conference (한국정보통신학회:학술대회논문집)

2021.05a
/
Pages.617-619
/
2021

The Korea Institute of Information and Commucation Engineering (한국정보통신학회)

An Efficient Method to Extract Units of Manchu Characters

만주 글자의 단위를 추출하는 효율적인 방법

Snowberger, Aaron Daniel (Graduate School of Hanbat National University) ;
Lee, Choong Ho (Graduate School of Hanbat National University)

스노우버거 아론 다니엘 (한밭대학교) ;
이충호 (한밭대학교)

Published : 2021.05.03

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Since Manchu characters are written vertically and are connected without spaces within a word, a preprocessing process is required to separate the character area and the units that make up the characters before recognizing the characters. In this paper, we describe a preprocessing method that extracts the character area and cuts off the unit of the character. Unlike existing research that presupposes a method of recognizing each word or character unit, or recognizing the remaining part after removing the stem of a continuous character, this method cuts the character into each recognizable unit. It can be applied to the method of recognizing letters by combining the units. Through an experiment, the effectiveness of this method was verified.

만주 문자는 세로로 씌여지며 한 단어 안에서는 띄어쓰기 없이 이어져 있기 때문에 문자를 인식하기 전에 글자영역 분리와 글자를 이루는 단위를 분리해 내는 전처리과정이 필요하다. 본 논문에서는 글자영역을 추출하고 글자의 단위를 끊어내는 전처리 방법을 기술한다. 기존 연구가 단어별 또는 문자단위로 인식하는 방법을 전제로 하거나, 이어져 있는 글자의 줄기를 없앤 후 남는 부분으로 인식하는 것과 달리, 본 방법은 인식 가능한 단위별로 글자를 끊어낸 다음 그 단위의 합성으로 글자를 인식하는 방법에 적용할 수 있다. 실험을 통하여 본 방법의 유효성을 검증하였다.

Proceedings of the Korean Institute of Information and Commucation Sciences Conference (한국정보통신학회:학술대회논문집)

An Efficient Method to Extract Units of Manchu Characters

만주 글자의 단위를 추출하는 효율적인 방법

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)