Search | Korea Science

Constructing a Noise-Robust Speech Recognition System using Acoustic and Visual Information (청각 및 시가 정보를 이용한 강인한 음성 인식 시스템의 구현)

Lee, Jong-Seok;Park, Cheol-Hoon
- Journal of Institute of Control, Robotics and Systems
- /
- v.13 no.8
- /
- pp.719-725
- /
- 2007
In this paper, we present an audio-visual speech recognition system for noise-robust human-computer interaction. Unlike usual speech recognition systems, our system utilizes the visual signal containing speakers' lip movements along with the acoustic signal to obtain robust speech recognition performance against environmental noise. The procedures of acoustic speech processing, visual speech processing, and audio-visual integration are described in detail. Experimental results demonstrate the constructed system significantly enhances the recognition performance in noisy circumstances compared to acoustic-only recognition by using the complementary nature of the two signals.
https://doi.org/10.5302/J.ICROS.2007.13.8.719 인용 PDF KSCI

A Development of Robust Underwater Sound Signal Recognition Algorithm for Acoustic Releaser (Acoustic releaser 제어를 위한 강인한 수중음향신호 인식 알고리즘의 개발)

김영진;허경무
- Journal of the Institute of Electronics Engineers of Korea SC
- /
- v.41 no.3
- /
- pp.33-38
- /
- 2004
In this paper we presents a underwater sound recognition algorithm by which we can identify the sound signal without the influence of disturbances due to underwater environmental changes. The proposed method provides a means suitable for acoustic releaser which require low power dissipation and long-time underwater operation. We demonstrate its ability of securing stability and fast sound recognition through both numerical and experimental methods.
PDF KSCI

Detection and Classification of Defect Signals from Rotator by AE Signal Pattern Recognition (AE 신호 형상 인식법에 의한 회전체의 신호 검출 및 분류 연구)

Kim, Ku-Young;Lee, Kang-Yong;Kim, Hee-Soo;Lee, Hyun
- Journal of the Korean Society for Railway
- /
- v.4 no.3
- /
- pp.79-86
- /
- 2001
The signal pattern recognition method by acoustic emission signal is applied to detect and classify the defects of a journal bearing in a power plant. AE signals of main defects such as overheating, wear and corrosion are obtained from a small scale model. To detect and classify the defects, AE signal pattern recognition program is developed. As the classification methods, the wavelet transformation analysis, the frequency domain analysis and time domain analysis are used. Among three analyses, the wavelet transformation analysis is most effective to detect and classify the defects of the journal bearing..
PDF

The Classification of Tool Wear States Using Pattern Recognition Technique (패턴인식기법을 이용한 공구마멸상태의 분류)

Lee, Jong-Hang;Lee, Sang-Jo
- Transactions of the Korean Society of Mechanical Engineers
- /
- v.17 no.7 s.94
- /
- pp.1783-1793
- /
- 1993
Pattern recognition technique using fuzzy c-means algorithm and multilayer perceptron was applied to classify tool wear states in turning. The tool wear states were categorized into the three regions 'Initial', 'Normal', 'Severe' wear. The root mean square(RMS) value of acoustic emission(AE) and current signal was used for the classification of tool wear states. The simulation results showed that a fuzzy c-means algorithm was better than the conventional pattern recognition techniques for classifying ambiguous informations. And normalized RMS signal can provide good results for classifying tool wear. In addition, a fuzzy c-means algorithm(success rate for tool wear classification : 87%) is more efficient than the multilayer perceptron(success rate for tool wear classification : 70%).
https://doi.org/10.22634/KSME.1993.17.7.1783 인용 PDF

A Development of Underwater Sound Signal Recognition Algorithm for Acoustic Releaser in the Seafloor (심해저용 원격 착탈 시스템 제어를 위한 수중음향신호 인식 알고리즘의 개발)

김영진;우종식;조영준;허경무
- Journal of Institute of Control, Robotics and Systems
- /
- v.10 no.5
- /
- pp.421-427
- /
- 2004
In order to exploit underwater resources successfully, the first step would be a marine environmental research and exploration in the seafloor. Generally one sets up a long-term underwater experimental unit in the seafloor and retrieves the unit later after a certain period time. Essential to these applications is the reliable teleoperation and telemetering of the unit. In this paper we presents a robust underwater sound recognition algorithm by which we can identify the sound signal without the influence of disturbances due to underwater environmental changes. The proposed method provides a means suitable for the acoustic releaser which requires low power dissipation and long-time underwater operation. We demonstrate its ability of securing stability and fast sound recognition through simulation methods.
https://doi.org/10.5302/J.ICROS.2004.10.5.421 인용 PDF KSCI

Speech Recognition Performance Improvement using Gamma-tone Feature Extraction Acoustic Model (감마톤 특징 추출 음향 모델을 이용한 음성 인식 성능 향상)

Ahn, Chan-Shik;Choi, Ki-Ho
- Journal of Digital Convergence
- /
- v.11 no.7
- /
- pp.209-214
- /
- 2013
Improve the recognition performance of speech recognition systems as a method for recognizing human listening skills were incorporated into the system. In noisy environments by separating the speech signal and noise, select the desired speech signal. but In terms of practical performance of speech recognition systems are factors. According to recognized environmental changes due to noise speech detection is not accurate and learning model does not match. In this paper, to improve the speech recognition feature extraction using gamma tone and learning model using acoustic model was proposed. The proposed method the feature extraction using auditory scene analysis for human auditory perception was reflected In the process of learning models for recognition. For performance evaluation in noisy environments, -10dB, -5dB noise in the signal was performed to remove 3.12dB, 2.04dB SNR improvement in performance was confirmed.
https://doi.org/10.14400/JDPM.2013.11.7.209 인용 PDF

The Vocabulary Recognition Optimize using Acoustic and Lexical Search (음향학적 및 언어적 탐색을 이용한 어휘 인식 최적화)

Ahn, Chan-Shik;Oh, Sang-Yeob
- Journal of Korea Multimedia Society
- /
- v.13 no.4
- /
- pp.496-503
- /
- 2010
Speech recognition system is developed of standalone, In case of a mobile terminal using that low recognition rate represent because of limitation of memory size and audio compression. This study suggest vocabulary recognition highest performance improvement system for separate acoustic search and lexical search. Acoustic search is carry out in mobile terminal, lexical search is carry out in server processing system. feature vector of speech signal extract using GMM a phoneme execution, recognition a phoneme list transmission server using Lexical Tree Search algorithm lexical search recognition execution. System performance as a result of represent vocabulary dependence recognition rate of 98.01%, vocabulary independence recognition rate of 97.71%, represent recognition speed of 1.58 second.
PDF KSCI

Acoustic Signal Classifier Design using Dictionary Learning (딕셔너리 러닝을 이용한 음파 신호 분류기 설계)

Park, Sung Min;Sah, Sung Jin;Oh, Kwang Myung;Lee, Hui Sung
- Journal of Auto-vehicle Safety Association
- /
- v.8 no.1
- /
- pp.19-25
- /
- 2016
As new car technology is developing, temporal interaction is needed in automotive. Rhythmic pattern is one of the practical examples of temporal interaction in vehicle. To recognize rhythmic pattern and its input medium, dictionary learning is applicable algorithm. In this paper, performance and memory requirement of the learning algorithm is tested and is sufficiently good for use this acoustic sound.
https://doi.org/10.22680/kasa.2016.8.1.019 인용 PDF

Vocabulary Recognition Retrieval Optimized System using MLHF Model (MLHF 모델을 적용한 어휘 인식 탐색 최적화 시스템)

Ahn, Chan-Shik;Oh, Sang-Yeob
- Journal of the Korea Society of Computer and Information
- /
- v.14 no.10
- /
- pp.217-223
- /
- 2009
Vocabulary recognition system of Mobile terminal is executed statistical method for vocabulary recognition and used statistical grammar recognition system using N-gram. If limit arithmetic processing capacity in memory of vocabulary to grow then vocabulary recognition algorithm complicated and need a large scale search space and many processing time on account of impossible to process. This study suggest vocabulary recognition optimize using MLHF System. MLHF separate acoustic search and lexical search system using FLaVoR. Acoustic search feature vector of speech signal extract using HMM, lexical search recognition execution using Levenshtein distance algorithm. System performance as a result of represent vocabulary dependence recognition rate of 98.63%, vocabulary independence recognition rate of 97.91%, represent recognition speed of 1.61 second.
https://doi.org/10.9708/jksci.2009.14.10.217 인용 PDF

Combining multi-task autoencoder with Wasserstein generative adversarial networks for improving speech recognition performance (음성인식 성능 개선을 위한 다중작업 오토인코더와 와설스타인식 생성적 적대 신경망의 결합)

Kao, Chao Yuan;Ko, Hanseok
- The Journal of the Acoustical Society of Korea
- /
- v.38 no.6
- /
- pp.670-677
- /
- 2019
As the presence of background noise in acoustic signal degrades the performance of speech or acoustic event recognition, it is still challenging to extract noise-robust acoustic features from noisy signal. In this paper, we propose a combined structure of Wasserstein Generative Adversarial Network (WGAN) and MultiTask AutoEncoder (MTAE) as deep learning architecture that integrates the strength of MTAE and WGAN respectively such that it estimates not only noise but also speech features from noisy acoustic source. The proposed MTAE-WGAN structure is used to estimate speech signal and the residual noise by employing a gradient penalty and a weight initialization method for Leaky Rectified Linear Unit (LReLU) and Parametric ReLU (PReLU). The proposed MTAE-WGAN structure with the adopted gradient penalty loss function enhances the speech features and subsequently achieve substantial Phoneme Error Rate (PER) improvements over the stand-alone Deep Denoising Autoencoder (DDAE), MTAE, Redundant Convolutional Encoder-Decoder (R-CED) and Recurrent MTAE (RMTAE) models for robust speech recognition.
https://doi.org/10.7776/ASK.2019.38.6.670 인용 PDF KSCI

Search Result 71, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)