• 제목/요약/키워드: Korean digit recognition

검색결과 138건 처리시간 0.026초

채널보상기법 및 특징파라미터에 따른 한국어 연속숫자음 전화음성의 인식성능 비교 (Comparison of the recognition performance of Korean connected digit telephone speech depending on channel compensation methods and feature parameters)

  • 정성윤;김민성;손종목;배건성;김상훈
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2002년도 11월 학술대회지
    • /
    • pp.201-204
    • /
    • 2002
  • As a preliminary study for improving recognition performance of the connected digit telephone speech, we investigate feature parameters as well as channel compensation methods of telephone speech. The CMN and RTCN are examined for telephone channel compensation, and the MFCC, DWFBA, SSC and their delta-features are examined as feature parameters. Recognition experiments with database we collected show that in feature level DWFBA is better than MFCC and for channel compensation RTCN is better than CMN. The DWFBA+Delta_ Mel-SSC feature shows the highest recognition rate.

  • PDF

음소경계 정보를 이용한 한국어 숫자음 인식에 관한 연구 (A Study on Korean Digit Recognition by Using Phoneme Boundary Information)

  • 최관묵;임동철;이행세
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 2001년도 추계학술발표대회 논문집 제20권 2호
    • /
    • pp.117-120
    • /
    • 2001
  • Recognition rate of Korean digit is lower than that of other words because it is composed of similar phonemes. In this paper, a new method is proposed for the improvement of recognition rate by using the phoneme boundary information. In addition, the proposed method rarely increase cost because phoneme boundary is found by using simple method. We experimented with speech data of one man and then obtained results of enhanced speech recognition rate.

  • PDF

계층적인 가버 특징들과 베이지안 망을 이용한 필기체 숫자인식 (Hierarchical Gabor Feature and Bayesian Network for Handwritten Digit Recognition)

  • 성재모;방승양
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제31권1호
    • /
    • pp.1-7
    • /
    • 2004
  • 본 논문에서는 필기체 숫자인식을 위해서 계층적으로 서로 다른 레벨의 정보를 표현할 수 있는 구조화된 특징들의 추출 방법과 특징들 사이에 의존도를 이용하여 분류하는 베이지안 망을 제안한다. 이러한 계층적 특징들을 추출하기 위해서 레벨 단위로 가버 필터들을 정의하고, FLD(Fisher Linear Discriminant) 척도를 이용하여 최적화된 가버 필터들을 선택한다. 계층적 가버 특징들은 최적화된 가버 특징들을 이용하여 추출되며, 하위 레벨일수록 더욱 국부적인 정보를 표현한다. 추출된 계층적 가버 특징들의 분류성능 향상을 위해서 가버 특징들 사이의 계층적 의존도를 이용하는 베이지안 망을 생성한다. 본 논문에서 제안하는 방법은 naive Bayesian 분류기, k-nearest neighbor 분류기, 그리고 신경망 분류기들과 함께 필기체 숫자인식에 적용되어 계층적 가버 특징들의 효율성과 계층적 의존도를 이용하는 베이지안 망은 분류성능을 향상시킬 수 있다는 것을 보여준다.

2단 회귀신경망의 숫자음 인식에관한 연구 (A study on the spoken digit recognition performance of the Two-Stage recurrent neural network)

  • 안점영
    • 한국통신학회논문지
    • /
    • 제25권3B호
    • /
    • pp.565-569
    • /
    • 2000
  • We compose the two-stage recurrent neural network that returns both signals of a hidden and an output layer to the hidden layer. It is tested on the basis of syllables for Korean spoken digit from /gong/to /gu. For these experiments, we adjust the neuron number of the hidden layer, the predictive order of input data and self-recurrent coefficient of the decision state layer. By the experimental results, the recognition rate of this neural network is between 91% and 97.5% in the speaker-dependent case and between 80.75% and 92% in the speaker-independent case. In the speaker-dependent case, this network shows an equivalent recognition performance to Jordan and Elman network but in the speaker-independent case, it does improved performance.

  • PDF

Chip 구현을 위한 IDMLP 신경 회로망의 개발과 음성인식에 대한 응용 (The Development of IDMLP Neural Network for the Chip Implementation and it's Application to Speech Recognition)

  • 김신진;박정운;정호선
    • 전자공학회논문지B
    • /
    • 제28B권5호
    • /
    • pp.394-403
    • /
    • 1991
  • This paper described the development of input driven multilayer perceptron(IDMLP) neural network and it's application to the Korean spoken digit recognition. The IDMPLP neural network used here and the learning algorithm for this network was proposed newly. In this model, weight value is integer and transfer function in the neuron is hard limit function. According to the result of the network learning for the some kinds of input data, the number of network layers is one or more by the difficulties of classifying the inputs. We tested the recognition of binaried data for the spoken digit 0 to 9 by means of the proposed network. The experimental results are 100% and 96% for the learning data and test data, respectively.

  • PDF

스펙트럼사상학습을 이용한 잡음환경에서의 한국어숫자음인식 (Korean Digit Recognition Under Noise Environment Using Spectral Mapping Training)

  • 이기영
    • 한국음향학회지
    • /
    • 제13권3호
    • /
    • pp.25-32
    • /
    • 1994
  • 본 연구에서는 정적지도적응알고리즘을 기초로 한 스펙트럼사상학습을 이용하여 잡음환경에서의 한국어숫자음인식방법을 제시하였다. 제시한 인식방법에서 잡음이 섞인 음성스펙트럼 공간을 잡음이 없는 음성스펙트럼 공간으로 사상한 결과, 잡음이 섞인 음성스펙트럼의 왜곡이 개선되어 잡음처리를 행하지 않은 기존의 VQ(vector quantizaton)와 DTW(dynamic time warping)를 이용한 방법보다 높은 인식율을 얻을 수 있었으며 , 0 dB의 SNR 레벨에서도 기존방법의 인식율을 10배 정도 향상시키므로써, 스펙트럼사상학습이 잡음환경의 음성에 대한 인식성능을 향상시킬 수 있는 방법임을 확인하였다.

  • PDF

필기체 숫자인식을 위한 병렬 자구성 계층 신경회로망 (Parallel, self-organizing, hierarchical neural networks for handwritten digit recognition)

  • 방극준;조남신;강창언;홍대식
    • 전자공학회논문지B
    • /
    • 제33B권7호
    • /
    • pp.173-182
    • /
    • 1996
  • In this paper, we propose the parallel, self-organizing, hierarchical neural netowrks as a handwritten digit recognition system. This system can absorb the various shape variations of handwritten digits by using the different methods of extracting the features in each stage neural network (SNN) of the PSHNN, and can reduce training time by using the single layer neural network as the SNN, and can obtain high rate of correct recognition by using the certainty area in all the output nodes individually. experiments have been performed with NIST database. In which we use 21, 315 digits (10, 625 digits for training and 10,663 digits for testing). The results show that the correct rate is 97.48% the error rate is 1.72% and the reject rate is 0.78%.

  • PDF

신경망 회로를 이용한 필기체 숫자 인식에 관할 연구 (A Study Of Handwritten Digit Recognition By Neural Network Trained With The Back-Propagation Algorithm Using Generalized Delta Rule)

  • 이규한;정진현
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1999년도 하계학술대회 논문집 G
    • /
    • pp.2932-2934
    • /
    • 1999
  • In this paper, a scheme for recognition of handwritten digits using a multilayer neural network trained with the back-propagation algorithm using generalized delta rule is proposed. The neural network is trained with hand written digit data of different writers and different styles. One of the purpose of the work with neural networks is the minimization of the mean square error(MSE) between actual output and desired one. The back-propagation algorithm is an efficient and very classical method. The back-propagation algorithm for training the weights in a multilayer net uses the steepest descent minimization procedure and the sigmoid threshold function. As an error rate is reduced, recognition rate is improved. Therefore we propose a method that is reduced an error rate.

  • PDF

Lightweight CNN based Meter Digit Recognition

  • Sharma, Akshay Kumar;Kim, Kyung Ki
    • 센서학회지
    • /
    • 제30권1호
    • /
    • pp.15-19
    • /
    • 2021
  • Image processing is one of the major techniques that are used for computer vision. Nowadays, researchers are using machine learning and deep learning for the aforementioned task. In recent years, digit recognition tasks, i.e., automatic meter recognition approach using electric or water meters, have been studied several times. However, two major issues arise when we talk about previous studies: first, the use of the deep learning technique, which includes a large number of parameters that increase the computational cost and consume more power; and second, recent studies are limited to the detection of digits and not storing or providing detected digits to a database or mobile applications. This paper proposes a system that can detect the digital number of meter readings using a lightweight deep neural network (DNN) for low power consumption and send those digits to an Android mobile application in real-time to store them and make life easy. The proposed lightweight DNN is computationally inexpensive and exhibits accuracy similar to those of conventional DNNs.

Zerinke 모멘트와 신경망을 이용한 온라인 필기체 숫자 인식 (Recognition of Online Handwritten Digit using Zernike Moment and Neural Network)

  • 문원호;최연석;차의영
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국해양정보통신학회 2010년도 춘계학술대회
    • /
    • pp.205-208
    • /
    • 2010
  • 본 논문에서는 Zernike 모멘트와 backpropagation신경망을 이용한 온라인 필기체 숫자 인식 방법을 소개한다. 마우스로 통해 입력된 숫자 정보는 전처리를 통해 시간에 순서적이고, 연속적인 좌표 정보로 변환된다. 전처리된 입력 좌표는 Zernike 모멘트(moment)와 각도 특징(angulation feature)을 이용하여 각 숫자가 가지는 고유의 특징을 만들어 낸다. 이러한 특징은 크기, 모양, 틀어진 정도에 상관없이 항상 일정한 성질을 가진다. 제안된 방법으로 추출된 특징은 패턴 구분을 위해 back propagation 신경망의 입력으로 사용된다. 본 논문은 200개의 필기체 숫자 데이터베이스를 이용하여 실험을 한 결과, 제시된 방법은 적은 학습데이터만으로 학습이 가능할 뿐만 아니라 좋은 인식률을 보여준다.

  • PDF