통합 검색 | Korea Science

한국의 지하철역명을 위한 음소 기반의 음성인식에 관한 연구 (A Study on Speech Recognition based on Phoneme for Korean Subway Station Names)

김범승;김순협
- 한국철도학회논문집
- /
- 제14권3호
- /
- pp.228-233
- /
- 2011
본 논문에서는 한국의 지하철역명을 위하여 음운론적 특성을 반영한 음소 기반의 음성인식 구현에 관한 방법을 제시하였다. 한국의 지하철역명의 음소 기반의 음성인식을 위하여 사용되는 최적의 유사음소 단위(PLU: Phoneme-Likely Unit)를 선정하기 위하여 네 가지의 Case 별로 PLU set과 음운 현상을 고려한 발음사전을 구성하여 인식률을 평가하였다. 적용된 유사음소 단위의 경우 초성과 종성 자음의 인식 단위 구분 및 음운 현상을 반영한 경우 트라이폰 모델에서 최적의 인식률(97.74%)을 보임을 알 수 있었다.
https://doi.org/10.7782/JKSR.2011.14.3.228 인용 PDF KSCI

음성 자료에 대한 규칙 기반 Named Entity 인식 (Rule-based Named Entity (NE) Recognition from Speech)

김지환
- 대한음성학회지:말소리
- /
- 제58호
- /
- pp.45-66
- /
- 2006
In this paper, a rule-based (transformation-based) NE recognition system is proposed. This system uses Brill's rule inference approach. The performance of the rule-based system and IdentiFinder, one of most successful stochastic systems, are compared. In the baseline case (no punctuation and no capitalisation), both systems show almost equal performance. They also have similar performance in the case of additional information such as punctuation, capitalisation and name lists. The performances of both systems degrade linearly with the number of speech recognition errors, and their rates of degradation are almost equal. These results show that automatic rule inference is a viable alternative to the HMM-based approach to NE recognition, but it retains the advantages of a rule-based approach.
PDF

HMM 부모델을 이용한 단어 인식에 관한 연구 (A Study on Word Recognition using sub-model based Hidden Markov Model)

신원호
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 1994년도 제11회 음성통신 및 신호처리 워크샵 논문집 (SCAS 11권 1호)
- /
- pp.395-398
- /
- 1994
In this paper the word recognition using sub-model based Hidden Markov Model was studied. Phoneme models were composed of 61 phonemes in therms of Korean language pronunciation characteristic. Using this, word model was maded by serial concatenation. But, in case of this phoneme concatenation, the second and the third phoneme of syllable are overlapped in distribution at the same time. So considering this, the method that combines the second and the third phoneme to one model was proposed. And to prevent the increase in number of model, similar phonemes were combined to one, and finially, 57 models were created. In experiment proper model structure of sub-model was searched for, and recognition results were compared. So similar recognition results were maded, and overall recognition rates were increased in case of using parameter tying method.
PDF

거리 척도에 따른 PCA/LDA기반의 얼굴 인식 성능 분석 (A Performance Analysis of the Face Recognition Based on PCA/LDA on Distance Measures)

송영준;김영길;안재형
- 한국산학기술학회논문지
- /
- 제6권3호
- /
- pp.249-254
- /
- 2005
본 논문은 얼굴인식에서 사용되고 있는 PCA/LDA 방식의 유사도 측정 방식에 따른 인식 성능을 비교 분석하였다. 총 14가지의 거리 척도를 ORL 얼굴 데이터베이스에 적용하였으며, PCA와 PCA/LDA로 나누어 성능 비교를 하였다. PCA의 경우에는 맨하튼 거리, Weighted SSE 거리의 인식률이 좋지만, PCA/LDA인 경우에는 Angle-based 거리, Modified SSE거리에 대한 인식률이 좋음이 확인되었다. 또한 PCA보다 PCA/LDA의 경우 유사도 비교 차원의 수를 줄이면서 높은 인식률을 유지할 수 있어, PCA/LDA와 Angle-based 거리 척도를 적용하여 얼굴인식을 할 경우 계산의 경제성과 인식률에서 높은 경쟁력을 갖출 수 있다.
PDF

Speech Recognition using MSHMM based on Fuzzy Concept

Ann, Tae-Ock
- The Journal of the Acoustical Society of Korea
- /
- 제16권2E호
- /
- pp.55-61
- /
- 1997
This paper proposes a MSHMM(Multi-Section Hidden Markov Model) recognition method based on Fuzzy Concept, as a method on the speech recognition of speaker-independent. In this recognition method, training data are divided into several section and multi-observation sequences given proper probabilities by fuzzy rule according to order of short distance from MSVQ codebook per each section are obtained. Thereafter, the HMM per each section using this multi-observation sequences is generated, and in case of recognition, a word that has the most highest probability is selected as a recognized word. In this paper, other experiments to compare with the results of these experiments are implemented by the various conventional recognition methods(DP, MSVQ, DMS, general HMM) under the same data. Through results of all-round experiment, it is proved that the proposed MSHMM based on fuzzy concept is superior to DP method, MSVQ method, DMS model and general HMM model in recognition rate and computational time, and does not decreases recognition rate as 92.91% in spite of increment of speaker number.
PDF

HMM-Based Automatic Speech Recognition using EMG Signal

Lee Ki-Seung
- 대한의용생체공학회:의공학회지
- /
- 제27권3호
- /
- pp.101-109
- /
- 2006
It has been known that there is strong relationship between human voices and the movements of the articulatory facial muscles. In this paper, we utilize this knowledge to implement an automatic speech recognition scheme which uses solely surface electromyogram (EMG) signals. The EMG signals were acquired from three articulatory facial muscles. Preliminary, 10 Korean digits were used as recognition variables. The various feature parameters including filter bank outputs, linear predictive coefficients and cepstrum coefficients were evaluated to find the appropriate parameters for EMG-based speech recognition. The sequence of the EMG signals for each word is modelled by a hidden Markov model (HMM) framework. A continuous word recognition approach was investigated in this work. Hence, the model for each word is obtained by concatenating the subword models and the embedded re-estimation techniques were employed in the training stage. The findings indicate that such a system may have a capacity to recognize speech signals with an accuracy of up to 90%, in case when mel-filter bank output was used as the feature parameters for recognition.
https://doi.org/10.9718/JBER.2006.27.3.101 인용 PDF KSCI

피지에 기초를 둔 HMM을 이용한 음성 인식 (Speech Recognition Using HMM Based on Fuzzy)

안태옥;김순협
- 전자공학회논문지B
- /
- 제28B권12호
- /
- pp.68-74
- /
- 1991
This paper proposes a HMM model based on fuzzy, as a method on the speech recognition of speaker-independent. In this recognition method, multi-observation sequences which give proper probabilities by fuzzy rule according to order of short distance from VQ codebook are obtained. Thereafter, the HMM model using this multi-observation sequences is generated, and in case of recognition, a word that has the most highest probability is selected as a recognized word. The vocabularies for recognition experiment are 146 DDD are names, and the feature parameter is 10S0thT LPC cepstrum coefficients. Besides the speech recognition experiments of proposed model, for comparison with it, we perform the experiments by DP, MSVQ and general HMM under same condition and data. Through the experiment results, it is proved that HMM model using fuzzy proposed in this paper is superior to DP method, MSVQ and general HMM model in recognition rate and computational time.
PDF

The Learning of the Neural Network Using Hadamard Transform

Katayama, Hiromu;Tsuruta, Shinchi;Nakao, Tomohiro;Harada, Hisamochi;Konishi, Ryosuke
- 한국지능시스템학회:학술대회논문집
- /
- 한국퍼지및지능시스템학회 1993년도 Fifth International Fuzzy Systems Association World Congress 93
- /
- pp.1125-1128
- /
- 1993
We propose the new method about the neural-based pattern recognition by using Hadamard transform for the improvement of learning speed, stability and flexibility of network. We can obtain the spatial feature of pattern by Hadamard transformed pattern. We carried out an experiment to estimate the effect of Hadamard transform. We tried the learning of numeric patterns, and tried the pattern recognition with noisy pattern. As a result, the learning times of the network for the 'Hadamard' case is smaller than that of usual case. And the recognition rate of the network for the 'Hadamard' case is higher than that of usual case, too.
PDF

Multi-view Human Recognition based on Face and Gait Features Detection

Nguyen, Anh Viet;Yu, He Xiao;Shin, Jae-Ho;Park, Sang-Yun;Lee, Eung-Joo
- 한국멀티미디어학회논문지
- /
- 제11권12호
- /
- pp.1676-1687
- /
- 2008
In this paper, we proposed a new multi-view human recognition method based on face and gait features detection algorithm. For getting the position of moving object, we used the different of two consecutive frames. And then, base on the extracted object, the first important characteristic, walking direction, will be determined by using the contour of head and shoulder region. If this individual appears in camera with frontal direction, we will use the face features for recognition. The face detection technique is based on the combination of skin color and Haar-like feature whereas eigen-images and PCA are used in the recognition stage. In the other case, if the walking direction is frontal view, gait features will be used. To evaluate the effect of this proposed and compare with another method, we also present some simulation results which are performed in indoor and outdoor environment. Experimental result shows that the proposed algorithm has better recognition efficiency than the conventional sing]e view recognition method.
PDF

후각 센서를 이용한 냄새 인식 및 실감형 멀티미디어 표현 기술 (A technology of realistic multi-media display and odor recognition using olfactory sensors)

이현구;노용완
- 디지털산업정보학회논문지
- /
- 제6권4호
- /
- pp.33-43
- /
- 2010
In this paper, we propose a floral scent recognition using odor sensors and a odor display using odor distribution system. Proposed odor recognition has method of correlation coefficient between sensors that select optimal sensors in floral scent recognition system of selective multi-sensors. Proposed floral scent recognition system consists of four module such as floral scent acquisition module, optimal sensor decision module, entropy-based floral scent detection module, and floral scent recognition module. Odor distribution system consists of generation module of distribution information, control module of distribution, output module of distribution. We applied to floral scent recognition for performance evaluation of proposed sensors decision method. As a result, application of proposed method with floral scent recognition obtained recognition rate of 95.67% case of using 16 sensors while applied floral scent recognition system of proposed sensor decision method confirmed recognition rate of 96% using only 8 sensors. Also, we applied to odor display of proposed method and obtained 3.18 thorough MOS experimentation.
https://doi.org/10.17662/ksdim.2010.6.4.033 인용 PDF KSCI

검색결과 602건 처리시간 0.028초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)