Search | Korea Science

A Study on Korean 4-connected Digit Recognition Using Demi-syllable Context-dependent Models (반음절 문맥종속 모델을 이용한 한국어 4 연숫자음 인식에 관한 연구)

이기영;최성호;이호영;배명진
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.3
- /
- pp.175-181
- /
- 2003
Because a word of Korean digits is a syllable and deeply coarticulatied in connected digits, some recognition models based on demisyllables have been proposed by researchers. However, they could not show an excellent recognition results yet. This paper proposes a recognition model based on extended and context-dependent demisyllables, such as a tri-demisyllable like a tri-phone, for the Korean 4-connected digits recognition. For experiments, we use a toolkit of HTK 3.0 for building this model of continuous HMMs using training Korean connected digits from SiTEC database and for recognizing unknown ones. The results show that the recognition rate is 92% and this model has an ability to improve the recognition performance of Korean connected digits.
PDF KSCI

Car License Plate Extraction Based on Detection of Numeral Regions (숫자 영역 탐색에 기반한 자동차 번호판 추출)

Lee, Duk-Ryong;Oh, Il-Seok
- The Journal of The Korea Institute of Intelligent Transport Systems
- /
- v.7 no.1
- /
- pp.59-67
- /
- 2008
In this paper we propose an algorithm to extract the license plate regions from Korean car images. The idea of this paper is that we first find the four digits in the input car image and then segment the plate region using the digit information. Out method has advantage of segmenting simultaneously the plate regions and four digits regions. The first step finds and groups the connected components with proper sizes as candidate digits. The second step applies an serial alignment condition to find out probable 4-digits. In the third step, we recognize the candidate digits and assign the confidence values to each of them. The final step extracts the license plate region which has the highest confidence value. We used the Perfect Metrics classification algorithm to estimate the confidence. In our experiment, we got 97.23% and 95.45% correct detection rates, 0.09% and 0.11% false detection rates for 4,600 daytime and 264 nighttime images, respectively.
PDF

A Study on 7-Connected Digits Speech Recognition using SCHMM (SCHMM 기반 7연속 숫자음 인식에 관한 연구)

Kim Se Yong;Jung Hui Seok;Kang Chul Ho
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.127-130
- /
- 2002
본 연구에서는 우리말 연속 숫자음 인식에서 본래의 숫자음을 변이 시키는 주된 요인인 연음현상에 대한 인식을 높이기 위해 별도의 연음부분의 레퍼런스를 작성하여 매칭 시키는 방식을 제안한다 또한 단모음으로 이루어진 /2/와 /5/의 연속된 음에 대하여도 레퍼런스를 작성하였다. 제안한 방식에 의하여 전체적으로 $1.4\%$정도 인식률이 상승됨을 볼 수 있다. 특히 발성 목록중 /82/, /62/, /31/, /15/, /75/ 등의 연음과 /226/, /755/등과 같이 모음의 연속된 발성이 포함된 숫자 열에서 제안된 방식이 인식률에 영향을 미치는 것을 볼 수가 있었다. 이는 연음에서 발생하는 오류가 연속 숫자음에 많은 영향을 미치는 것을 알 수 있다. 그 외에 /22/, /55/등과 같이 단모음으로 이루어진 숫자음의 연속 발성 또한 인식률을 저하시키는데 한 요인으로 작용함으로서 이에 대한 레퍼런스도 작성하여 인식률이 상승되는 것을 볼 수 있었다.
PDF

Korean Continuous Speech Recognition Using Discrete Duration Control Continuous HMM (이산 지속시간제어 연속분포 HMM을 이용한 연속 음성 인식)

Lee, Jong-Jin;Kim, Soo-Hoon;Hur, Kang-In
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.1
- /
- pp.81-89
- /
- 1995
In this paper, we report the continuous speech recognition system using the continuous HMM with discrete duration control and the regression coefficients. Also, we do recognition experiment using One Pass DP method(for 25 sentences of robot control commands) with finite state automata context control. In the experiment for 4 connected spoken digits, the recognition rates are $93.8\%$ when the discrete duration control and the regression coefficients are included, and $80.7\%$ when they are not included. In the experiment for 25 sentences of the robot control commands, the recognition rate are $90.9\%$ when FSN is not included and $98.4\%$ when FSN is included.
PDF

A Study on Spoken Digits Analysis and Recognition (숫자음 분석과 인식에 관한 연구)

김득수;황철준
- Journal of Korea Society of Industrial Information Systems
- /
- v.6 no.3
- /
- pp.107-114
- /
- 2001
This paper describes Connected Digit Recognition with Considering Acoustic Feature in Korea. The recognition rate of connected digit is usually lower than word recognition. Therefore, speech feature parameter and acoustic feature are employed to make robust model for digit, and we could confirm the effect of Considering. Acoustic Feature throughout the experience of recognition. We used KLE 4 connected digit as database and 19 continuous distributed HMM as PLUs(Phoneme Like Units) using phonetical rules. For recognition experience, we have tested two cases. The first case, we used usual method like using Mel-Cepstrum and Regressive Coefficient for constructing phoneme model. The second case, we used expanded feature parameter and acoustic feature for constructing phoneme model. In both case, we employed OPDP(One Pass Dynamic Programming) and FSA(Finite State Automata) for recognition tests. When appling FSN for recognition, we applied various acoustic features. As the result, we could get 55.4% recognition rate for Mel-Cepstrum, and 67.4% for Mel-Cepstrum and Regressive Coefficient. Also, we could get 74.3% recognition rate for expanded feature parameter, and 75.4% for applying acoustic feature. Since, the case of applying acoustic feature got better result than former method, we could make certain that suggested method is effective for connected digit recognition in korean.
PDF

Recognition of Passport MRZ Information Using Combined Neural Networks (결합 신경망을 이용한 여권 MRZ 정보 인식)

Kim, Jinho
- Journal of Korea Society of Digital Industry and Information Management
- /
- v.15 no.4
- /
- pp.149-157
- /
- 2019
In case of reading passport using a smart phone in contrast with a dedicated passport reading system, MRZ(Machine Readable Zone) character recognition can be hard when the character strokes were broken, touched or blurred according to the lighting condition, and the position and size of MRZ character lines were varied due to the camera distance and angle. In this paper, the effective recognition algorithm of the passport MRZ information using a combined neural network recognizer of CNN(Convolutional Neural Network) and ANN( Artificial Neural Network), is proposed under the various sized and skewed passport images. The MRZ line detection using connected component analysis algorithm and the skew correction using perspective transform algorithm are also designed in order to achieve effective character segmentation results. Each of the MRZ field recognition results is verified by using five check digits for deciding whether retrying the recognition process of passport MRZ information or not. After we implement the proposed recognition algorithm of passport MRZ information, the excellent recognition performance of the passport MRZ information was obtained in the experimental results for PC off-line mode and smart phone on-line mode.
https://doi.org/10.17662/ksdim.2019.15.4.149 인용 PDF KSCI

Speech Data Collection for korean Speech Recognition (한국어 음성인식을 위한 음성 데이터 수집)

Park, Jong-Ryeal;Kwon, Oh-Wook;Kim, Do-Yeong;Choi, In-Jeong;Jeong, Ho-Young;Un, Chong-Kwan
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.4
- /
- pp.74-81
- /
- 1995
This paper describes the development of speech databases for the Korean language which were constructed at Communications Research Laboratory in KAIST. The procedure and environment to construct the speech database are presented in detail, and the phonetic and linguistic properties of the databases are presented. the databases were intended for use in designing and evaluating speech recognition algorithms. The databases consist of five different sets of speech contents : trade-related continuous speech with 3,000 words, variable-length connected digits, phoneme-balanced 75 isolated words, 500 isolated Korean provincial names, and Korean A-set words.
PDF

Search Result 7, Processing Time 0.019 seconds

A Study on Korean 4-connected Digit Recognition Using Demi-syllable Context-dependent Models (반음절 문맥종속 모델을 이용한 한국어 4 연숫자음 인식에 관한 연구)

Car License Plate Extraction Based on Detection of Numeral Regions (숫자 영역 탐색에 기반한 자동차 번호판 추출)

A Study on 7-Connected Digits Speech Recognition using SCHMM (SCHMM 기반 7연속 숫자음 인식에 관한 연구)

Korean Continuous Speech Recognition Using Discrete Duration Control Continuous HMM (이산 지속시간제어 연속분포 HMM을 이용한 연속 음성 인식)

A Study on Spoken Digits Analysis and Recognition (숫자음 분석과 인식에 관한 연구)

Recognition of Passport MRZ Information Using Combined Neural Networks (결합 신경망을 이용한 여권 MRZ 정보 인식)

Speech Data Collection for korean Speech Recognition (한국어 음성인식을 위한 음성 데이터 수집)

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)