Search | Korea Science

An On-line Speech and Character Combined Recognition System for Multimodal Interfaces (멀티모달 인터페이스를 위한 음성 및 문자 공용 인식시스템의 구현)

석수영;김민정;김광수;정호열;정현열
- Journal of Korea Multimedia Society
- /
- v.6 no.2
- /
- pp.216-223
- /
- 2003
In this paper, we present SCCRS(Speech and Character Combined Recognition System) for speaker /writer independent. on-line multimodal interfaces. In general, it has been known that the CHMM(Continuous Hidden Markov Mode] ) is very useful method for speech recognition and on-line character recognition, respectively. In the proposed method, the same CHMM is applied to both speech and character recognition, so as to construct a combined system. For such a purpose, 115 CHMM having 3 states and 9 transitions are constructed using MLE(Maximum Likelihood Estimation) algorithm. Different features are extracted for speech and character recognition: MFCC(Mel Frequency Cepstrum Coefficient) Is used for speech in the preprocessing, while position parameter is utilized for cursive character At recognition step, the proposed SCCRS employs OPDP (One Pass Dynamic Programming), so as to be a practical combined recognition system. Experimental results show that the recognition rates for voice phoneme, voice word, cursive character grapheme, and cursive character word are 51.65%, 88.6%, 85.3%, and 85.6%, respectively, when not using any language models. It demonstrates the efficiency of the proposed system.
PDF

Extraction text-region's pixel on caption of video (동영상에 삽입된 자막 내 문자영역화소추출)

An, Kwon-Jae;Kim, Gye-Young
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2011.01a
- /
- pp.43-45
- /
- 2011
본 논문은 동영상 내 삽입된 자막을 문자인식이 가능하도록 문자영역을 이루는 화소를 추출하는 방법을 제안한다. 최초 자막영상을 통계학적 방법을 이용하여 색상극성을 결정한다. 이 후 색상극성에 따른 잡음제거 방법을 명암값기반과 형태학적기반으로 달리한다. 제안된 방법은 각 색상결정에 따른 적합한 잡음제거를 수행함으로서 추출된 화소들이 이루는 문자영역의 영상을 이용하여 문자인식을 수행하였을 때 기존방법보다 높은 문자인식률을 보였다.
PDF

An implementation of the mixed type character recognition system using combNET (CombNET 신경망을 이용한 혼용 문서 인식 시스템의 구현)

최재혁;손영우;남궁재찬
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.21 no.12
- /
- pp.3265-3276
- /
- 1996
The studies of document recongnition have been focused mainly on Korean documents. But most of documents composed of Korean and other characters. So, in this paper, we propose the document recognition system that can recognize the multi-size, multi font and mixed type characters. We have utilized a large scale network model, "CombNET" which consists of a 4 layered network with combstructure. And we propose recognition method that can recognize characters without discrimination of character type. The first layer constitutes a Kohonen's SOFM network which quantizes an input feature vector space into several sub-spaces and the following 2-4 layers constitutes BP network modules which classify input data in each sub-space into specified catagories. An experimental result demonstrated the usefulness of this approach with the recognition rates of 95.6% for the training data. For the mixed type character documents we obtained the recognition rates of 92.6% and recognition speed of 10.3 characters per second.
PDF

An Implementation Method of the Character Recognizer for the Sorting Rate Improvement of an Automatic Postal Envelope Sorting Machine (우편물 자동구분기의 구분율 향상을 위한 문자인식기의 구현 방법)

Lim, Kil-Taek;Jeong, Seon-Hwa;Jang, Seung-Ick;Kim, Ho-Yon
- Journal of Korea Society of Industrial Information Systems
- /
- v.12 no.4
- /
- pp.15-24
- /
- 2007
The recognition of postal address images is indispensable for the automatic sorting of postal envelopes. The process of the address image recognition is composed of three steps-address image preprocessing, character recognition, address interpretation. The extracted character images from the preprocessing step are forwarded to the character recognition step, in which multiple candidate characters with reliability scores are obtained for each character image extracted. aracters with reliability scores are obtained for each character image extracted. Utilizing those character candidates with scores, we obtain the final valid address for the input envelope image through the address interpretation step. The envelope sorting rate depends on the performance of all three steps, among which character recognition step could be said to be very important. The good character recognizer would be the one which could produce valid candidates with very reliable scores to help the address interpretation step go easy. In this paper, we propose the method of generating character candidates with reliable recognition scores. We utilize the existing MLP(multilayered perceptrons) neural network of the address recognition system in the current automatic postal envelope sorters, as the classifier for the each image from the preprocessing step. The MLP is well known to be one of the best classifiers in terms of processing speed and recognition rate. The false alarm problem, however, might be occurred in recognition results, which made the address interpretation hard. To make address interpretation easy and improve the envelope sorting rate, we propose promising methods to reestimate the recognition score (confidence) of the existing MLP classifier: the generation method of the statistical recognition properties of the classifier and the method of the combination of the MLP and the subspace classifier which roles as a reestimator of the confidence. To confirm the superiority of the proposed method, we have used the character images of the real postal envelopes from the sorters in the post office. The experimental results show that the proposed method produces high reliability in terms of error and rejection for individual characters and non-characters.
PDF

An Overview of Hangul Handwritten Image Database PE92 (한글 필기체 영상 데이터베이스 PE92의 소개)

Kim, D.H.;Bang, S.Y.
- Annual Conference on Human and Language Technology
- /
- 1992.10a
- /
- pp.567-575
- /
- 1992
한글 문자인식 시스템을 개발하기 앞서 생각해야 할 것이 인식실험에 사용될 문자 데이타를 수집하는 것이다. 이 논문에서는 연구 개발자들에게 문자인식 실험에 필요한 충분한 데이타를 제공하며 필기체 문자 데이타를 표준화하여 문자인식 시스템 상호간의 성능을 객관적으로 평가하기 위하여 한글 필기체 문자 데이터베이스 PE92를 개발하였다. 여기서는 PE92 데이타베이스의 소개로서 먼저 PE92를 수집하는데 있어 고려사항들, 즉 필기자, 수집문자의 수, 수집용지의 규격, 데이타베이스의 저장, 데이타의 압축에 대하여 알아본다. 다음 PE92 데이타베이스의 규격을 알아본다.
PDF

Simple Frame Marker: Implementation of In-Marker Image and Character Recognition and Tracking Method (심플 프레임 마커: 마커 내부 이미지 및 문자 패턴의 인식 및 추적 기법 구현)

Kim, Hye-Jin;Woo, Woon-Tack
- 한국HCI학회:학술대회논문집
- /
- 2009.02a
- /
- pp.558-561
- /
- 2009
In this paper, we propose Simple Frame Marker(SFMarker) to support recognition of characters and images included in a marker in augmented reality. If characters are inserted inside of marker and are recognised using Optical Character Recognition(OCR), it doesn't need marker learning process before an execution. It also reduces visual disturbance compared to 2D barcode marker due to familarity of characters. Therefore, proposed SFMarker distinguishes Square SFMarker that embeds images from Rectangle SFMarker with characters according to ratio of marker and applies different recognition algorithms. Also, in order to reduce preprocessing of character recognition, SFMarker inserts direction information in border of marker and extracts it to execute character recognition fast and correctly. Finally, since the character recognition for every frame slows down tracking speed, we increase the speed of recognition process using the result of character recognition in previous frame when frame difference is low.
PDF

Postal Envelope Image Recognition System for Postal Automation (서장 우편물 자동처리를 위한 우편영상 인식 시스템)

Kim, Ho-Yon;Lim, Kil-Taek;Kim, Doo-Sik;Nam, Yun-Seok
- The KIPS Transactions:PartB
- /
- v.10B no.4
- /
- pp.429-442
- /
- 2003
In this paper, we describe an address image recognition system for automatic processing of standard- size letter mail. The inputs to the system are gray-level mail piece images and the outputs are delivery point codes with which a delivery sequence of carrier can be generated. The system includes five main modules; destination address block location, text line separation, character segmentation, character recognition and finally address interpretation. The destination address block is extracted on the basis of experimental knowledge and the line separation and character segmentation is done through the analysis of connected components and vortical runs. For recognizing characters, we developed MLP-based recognizers and dynamical programming technique for interpretation. Since each module has been implemented in an independent way, the system has a benefit that the optimization of each module is relatively easy. We have done the experiment with live mail piece images directly sampled from mail sorting machine in Yuseong post office. The experimental results prove the feasibility of our system.
https://doi.org/10.3745/KIPSTB.2003.10B.4.429 인용 PDF KSCI

A Development of Unicode-based Multi-lingual Namecard Recognizer (Unicode 기반 다국어 명함인식기 개발)

Jang, Dong-Hyeub;Lee, Jae-Hong
- The KIPS Transactions:PartB
- /
- v.16B no.2
- /
- pp.117-122
- /
- 2009
We developed a multi-lingual namecard recognizer for building up a global client management systems. At first, we created the Unicode-based character image database for character recognition and learning of multi languages, and applied many color image processing techniques to get more correct data for namecard images which were acquired by various input devices. And by applying multi-layer perceptron neural network, individual character recognition applied for language types, and post-processing utilizing keyword databases made for individual languages, we increased a recognition rate for multi-lingual namecards.
https://doi.org/10.3745/KIPSTB.2009.16-B.2.117 인용 PDF KSCI

Character Recognition of Vehicle Number Plate Using Feature Based Neural Network (특징 추출에 기반한 신경망 시스템을 이용한 차량 번호판 문자인식)

이현숙;김희승
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.10b
- /
- pp.383-385
- /
- 2000
차량 번호판 문자영상으로부터 여러 가지 특징 추출 방법을 조합하여 입력특징소를 재구성하고, 신경망을 이용하여 문자를 인식한다. 속도 개선을 위해 특별한 전처리 과정없이 이치화와 크기 정규화만을 수행한 후 그물망 방법과 BLT 방법, 정규화된 투영값 특정 방법을 조합하여 입력특징소를 구성한다. 본 연구에서는 숫자 인식에서 그물망 방법과 BLT 방법을 이용하여 잡음으로 인한 유사 문자의 오인식을 해결하였고, 문자 인식에서는 정규화된 투영값 특징을 이용하여 문자의 유형을 분류한 후 자소를 개별적으로 인식하였다. 이로써 모음 인식 경우에 중요한 역할을 하는 작은 획의 영역에 BLT 방법을 사용함으로 기존 연구에서의 모음 오인식 문제를 해결하였다.
PDF

Using PCA Object Analysis Character Region Detection (PCA와 객체 분석을 통한 문자영역 추출)

김강석;강민경;김철기;차의영
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.04b
- /
- pp.568-570
- /
- 2000
이 시스템은 '신발공장 라인'에서 신발 밑창 생산품을 자동적으로 측정하는 것이다. 즉 문자인식 기법으로 인식된 치수와 컴퓨터 비전에 의해 측정된 길이를 비교하여 불량품을 분류한다. 이 논문에서는 이 중 문자영역 추출에 대한 연구를 하였다. 우리가 인식하려고 g는 밑창제품의 양각된 문자의 경우는 배경과 거의 같은 밝기 값을 가지므로 하나의 임계치로 분리 불가능하며 따라서 인쇄된 문자를 인식하는 경우에와 같은 일반적인 방법으로는 양각된 문자영역을 추출하기는 쉽지 않다. 여기에서는 임계값을 달리한 에지 검출 결과에 레이블링 과정을 거친 후 객체로 인식하여 그 각각의 객체의 구성 성분을 PCA 및 기타 방법을 이용하여 해당 객체가 문자인지 아닌지를 판별하는 방법을 썼다. 이 방법의 장점으로는 다양한 환경, 물체의 색깔, 밝기가 달라져도 공통적으로 적용할 수 있는 장점을 지닌다.
PDF

Search Result 1,164, Processing Time 0.03 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)