Search | Korea Science

Handwritten Korean Character Segmentation using Background thinning (배경 세선화를 이용한 한글 필기체 글자 단위 분할)

서원택;조범준
- Proceedings of the Korean Information Science Society Conference
- /
- 2004.04b
- /
- pp.823-825
- /
- 2004
본 연구에서는 필기체 한글의 글자단위의 분할을 위해 배경 세선화(Background thinning)라는 방법을 제안한다. 배경 세선화 방법은 글자와 글자 사이에 존재하는 배경의 정보를 세선화 처리하여 필기체 한글에서 많이 발생할 수 있는 중첩(Overlap)글자와 연결(Touched)글자를 서로 분할하는데 효과적인 성능을 보였다. 배경 세선화를 이용하여 글자를 분할하는 방법은 인식과정의 판단을 필요하지 않은 외적분할 방법으로 빠른 속도의 분할 성능을 보였다. 이 방법은 특히, 중첩된 글자의 분할에 탁월한 성능을 보였을 뿐만 아니라, 연결된 글자에 대해서도 좋은 성능을 보였다.
PDF

A Study on Character Recognition using Wavelet Transformation and Moment (웨이브릿 변환과 모멘트를 이용한 문자인식에 관한 연구)

Cho, Meen-Hwan
- Journal of the Korea Society of Computer and Information
- /
- v.15 no.10
- /
- pp.49-57
- /
- 2010
In this thesis, We studied on hand-written character recognition, that characters entered into a digital input device and remove noise and separating character elements using preprocessing. And processed character images has done thinning and 3-level wavelet transform for making normalized image and reducing image data. The structural method among the numerical Hangul recognition methods are suitable for recognition of printed or hand-written characters because it is usefull method deal with distortion. so that method are applied to separating elements and analysing texture. The results show that recognition by analysing texture is easily distinguished with respect to consonants. But hand-written characters are tend to decreasing successful recognition rate for the difficulty of extraction process of the starting point, of interconnection of each elements, of mis-recognition from vanishing at the thinning process, and complexity of character combinations. Some characters associated with the separation process is more complicated and sometime impossible to separating elements. However, analysis texture of the proposed character recognition with the exception of the complex handwritten is aware of the character.
https://doi.org/10.9708/jksci.2010.15.10.049 인용 PDF KSCI

A study on the Recognition of Hand-written Characters and Arabic numbers by Neural Networks (신경회로망을 이용한 필기체 한글 자모음 및 숫자인식에 관한 연구)

Oh, Dong-Su;Lee, Eun-Un;Yoo, Jae-Guen;Nam, Moon-Hyun
- Proceedings of the KIEE Conference
- /
- 1991.07a
- /
- pp.900-904
- /
- 1991
In this paper, our study for the recognition of Hand-written Korean characters, Arabic numbers and alphabets by neural netwoks. This System extracts feature of character by using the MESH feature point of handwritten character, Arabic numbers and alphabets. To reduce the input image data, features are extracted from each input images. A MLP(multi-layer perceptron) with one hidden layer was trained with a modified BEP(back error propagation) algorithm. This method extracts feature sets of the characters directly from the scanner and can enhance computation speed without using the special preprocesses such as size normalization, smoothing, and thinning.
PDF

Word Segmentation in Handwritten Korean Text Lines based on GAP Clustering (GAP 군집화에 기반한 필기 한글 단어 분리)

Jeong, Seon-Hwa;Kim, Soo-Hyung
- Journal of KIISE:Software and Applications
- /
- v.27 no.6
- /
- pp.660-667
- /
- 2000
In this paper, a word segmentation method for handwritten Korean text line images is proposed. The method uses gap information to segment words in line images, where the gap is defined as a white run obtained after vertical projection of line images. Each gap is assigned to one of inter-word gap and inter-character gap based on gap distance. We take up three distance measures which have been proposed for the word segmentation of handwritten English text line images. Then we test three clustering techniques to detect the best combination of gap metrics and classification techniques for Korean text line images. The experiment has been done with 305 text line images extracted manually from live mail pieces. The experimental result demonstrates the superiority of BB(Bounding Box) distance measure and sequential clustering approach, in which the cumulative word segmentation accuracy up to the third hypothesis is 88.52%. Given a line image, the processing time is about 0.05 second.
PDF

Online Handwritten Digit Recognition by Smith-Waterman Alignment (Smith-Waterman 정렬 알고리즘을 이용한 온라인 필기체 숫자인식)

Mun, Won-Ho;Choi, Yeon-Seok;Lee, Sang-Geol;Cha, Eui-Young
- Journal of the Korea Society of Computer and Information
- /
- v.16 no.9
- /
- pp.27-33
- /
- 2011
In this paper, we propose an efficient on-line handwritten digit recognition base on Convex-Concave curves feature which is extracted by a chain code sequence using Smith-Waterman alignment algorithm. The time sequential signal from mouse movement on the writing pad is described as a sequence of consecutive points on the x-y plane. So, we can create data-set which are successive and time-sequential pixel position data by preprocessing. Data preprocessed is used for Convex-Concave curves feature extraction. This feature is scale-, translation-, and rotation-invariant. The extracted specific feature is fed to a Smith-Waterman alignment algorithm, which in turn classifies it as one of the nine digits. In comparison with backpropagation neural network, Smith-Waterman alignment has the more outstanding performance.
https://doi.org/10.9708/jksci.2011.16.9.027 인용 PDF KSCI

Partially Connected Multi-Layer Perceptrons and their Combination for Off-line Handwritten Hangul Recognition (오프라인 필기체 전표용 한글 인식을 위한 부분 연결 다층 신경망과 결합)

백영목;임길택;진성일
- Journal of the Korean Institute of Telematics and Electronics C
- /
- v.36C no.4
- /
- pp.87-94
- /
- 1999
This paper presents a study on the off-line handwritten Hangul (Korean) character recognition using the partially connected neural network (PCNN), which is based on partial connections between the input receptive fields and the hidden nodes. The hidden nodes of three PCNNs have ten receptive fields and different input feature sets. And we introduce modular partially connected neural network (MPCNN), The MPCNN combines three PCNNs with a merging network. The learning scheme of the proposed networks is composed of two steps: PCNN learning step and the merging step of combining three PCNN s. In the merging step, another merging PCNN network is introduced and trained by regarding the hidden output of each PCNN as a new input feature vector. The performance of the proposed classifier is verified on the recognition of 18 off-line handwritten Hangul characters widely used in business cards in Korea.
PDF

Extraction of Skeletons from Handwritten Hangul Characters using Shape Decomposition (모양 분해를 이용한 필기 한글 문자의 골격선 추출)

Hong, Ki-Cheon;Oh, Il-Seok
- Journal of KIISE:Software and Applications
- /
- v.27 no.6
- /
- pp.583-594
- /
- 2000
The thinning process which is commonly used in extracting skeletons from handwritten Hangul characters has a problem of distorting the original pattern shapes. This paper proposes a method of skeleton extraction using a shape decomposition algorithm. We decompose the character pattern into a set of near convex parts using a shape decomposition algorithm. From the shape-decomposed pattern, we detect the joint parts and extract the skeletons from the parts incident to the joint parts. Then the skeletons not incident to the joint parts are extracted. Finally, the process of skeleton extension is performed to ensure the connectivity. We setup five criteria for the comparison of quality of skeletons extracted by our method and the thinning based method. The comparison shows the superiority of our method in terms of several criteria.
PDF

An Approach to Segmentation of Address Strings of unconstrained handwritten Hangul using Run-Length Code (Rum-Length code를 이용한 제약없이 쓰여진 한글 필기체 주소열 분할)

Kim, Gyeonghwan;Yoon, Jason-J
- Journal of KIISE:Software and Applications
- /
- v.28 no.11
- /
- pp.813-821
- /
- 2001
While recognition of isolated units of writing, such as a character or a word, has been extensively studied, emphasis on the segmentation itself has been lacking. In this paper we propose an active segmentation method for handwritten Hangul address strings based on the Run-length code. A slant correction algorithm, which is considered as an important preprocessing step for the segmentation, is presented. Three fundamental candidate estimation functions are introduced to detect the clues on touching points, and the classification of touching types is attempted depending on the structural peculiarity of Hangul. Our experiments show segmentation performance of 88.2% on touching characters with minimal over-segmentation.
PDF

A Study on the Design of OMCR(Optical Mark and Character Reader) System based on Image Processing (영상처리방식에 의한 OMCR 시스템 설계에 관한 연구)

이기돈;김우성
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.18 no.9
- /
- pp.1358-1367
- /
- 1993
In this paper, OMR system based on image processing is developed which improve the performance of conventional OMR system based on line-scan method. Based on this OMR system, real-time OCR system which recognizes alphanumerics is also developed. We propose the OMCR system which recognize the mark and numerals at the same time. Besides, we improve the input system using constrained 7-segment type handwritten numeral instead of mark to solve the problem caused by miswriting the mark. In summary, we verified the reai-time recognition performance of developed OMCR system using application form for admission, answer sheet for college entrance examination and receipt sheet.
PDF

Handwritten Korean Amounts Recognition in Bank Slips using Rule Information (규칙 정보를 이용한 은행 전표 상의 필기 한글 금액 인식)

Jee, Tae-Chang;Lee, Hyun-Jin;Kim, Eun-Jin;Lee, Yill-Byung
- The Transactions of the Korea Information Processing Society
- /
- v.7 no.8
- /
- pp.2400-2410
- /
- 2000
Many researches on recognition of Korean characters have been undertaken. But while the majority are done on Korean character recognition, tasks for developing document recognition system have seldom been challenged. In this paper, I designed a recognizer of Korean courtesy amounts to improve error correction in recognized character string. From the very first step of Korean character recognition, we face the enormous scale of data. We have 2350 characters in Korean. Almost the previous researches tried to recognize about 1000 frequently-used characters, but the recognition rates show under 80%. Therefore using these kinds of recognizers is not efficient, so we designed a statistical multiple recognizer which recognize 16 Korean characters used in courtesy amounts. By using multiple recognizer, we can prevent an increase of errors. For the Postprocessor of Korean courtesy amounts, we use the properties of Korean character strings. There are syntactic rules in character strings of Korean courtesy amounts. By using this property, we can correct errors in Korean courtesy amounts. This kind of error correction is restricted only to the Korean characters representing the unit of the amounts. The first candidate of Korean character recognizer show !!i.49% of recognition rate and up to the fourth candidate show 99.72%. For Korean character string which is postprocessed, recognizer of Korean courtesy amounts show 96.42% of reliability. In this paper, we suggest a method to improve the reliability of Korean courtesy amounts recognition by using the Korean character recognizer which recognize limited numbers of characters and the postprocessor which correct the errors in Korean character strings.
PDF

Search Result 103, Processing Time 0.018 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)