Search | Korea Science

On Wavelet Transform Based Feature Extraction for Speech Recognition Application

Kim, Jae-Gil
- The Journal of the Acoustical Society of Korea
- /
- v.17 no.2E
- /
- pp.31-37
- /
- 1998
This paper proposes a feature extraction method using wavelet transform for speech recognition. Speech recognition system generally carries out the recognition task based on speech features which are usually obtained via time-frequency representations such as Short-Time Fourier Transform (STFT) and Linear Predictive Coding(LPC). In some respects these methods may not be suitable for representing highly complex speech characteristics. They map the speech features with same may not frequency resolutions at all frequencies. Wavelet transform overcomes some of these limitations. Wavelet transform captures signal with fine time resolutions at high frequencies and fine frequency resolutions at low frequencies, which may present a significant advantage when analyzing highly localized speech events. Based on this motivation, this paper investigates the effectiveness of wavelet transform for feature extraction of wavelet transform for feature extraction focused on enhancing speech recognition. The proposed method is implemented using Sampled Continuous Wavelet Transform (SCWT) and its performance is tested on a speaker-independent isolated word recognizer that discerns 50 Korean words. In particular, the effect of mother wavelet employed and number of voices per octave on the performance of proposed method is investigated. Also the influence on the size of mother wavelet on the performance of proposed method is discussed. Throughout the experiments, the performance of proposed method is discussed. Throughout the experiments, the performance of proposed method is compared with the most prevalent conventional method, MFCC (Mel0frequency Cepstral Coefficient). The experiments show that the recognition performance of the proposed method is better than that of MFCC. But the improvement is marginal while, due to the dimensionality increase, the computational loads of proposed method is substantially greater than that of MFCC.
PDF

A study on the vowel extraction from the word using the neural network (신경망을 이용한 단어에서 모음추출에 관한 연구)

이택준;김윤중
- Proceedings of the Korea Society for Industrial Systems Conference
- /
- 2003.11a
- /
- pp.721-727
- /
- 2003
This study designed and implemented a system to extract of vowel from a word. The system is comprised of a voice feature extraction module and a neutral network module. The voice feature extraction module use a LPC(Linear Prediction Coefficient) model to extract a voice feature from a word. The neutral network module is comprised of a learning module and voice recognition module. The learning module sets up a learning pattern and builds up a neutral network to learn. Using the information of a learned neutral network, a voice recognition module extracts a vowel from a word. A neutral network was made to learn selected vowels(a, eo, o, e, i) to test the performance of a implemented vowel extraction recognition machine. Through this experiment, could confirm that speech recognition module extract of vowel from 4 words.
PDF

Feature Extraction Based on Speech Attractors in the Reconstructed Phase Space for Automatic Speech Recognition Systems

Shekofteh, Yasser;Almasganj, Farshad
- ETRI Journal
- /
- v.35 no.1
- /
- pp.100-108
- /
- 2013
In this paper, a feature extraction (FE) method is proposed that is comparable to the traditional FE methods used in automatic speech recognition systems. Unlike the conventional spectral-based FE methods, the proposed method evaluates the similarities between an embedded speech signal and a set of predefined speech attractor models in the reconstructed phase space (RPS) domain. In the first step, a set of Gaussian mixture models is trained to represent the speech attractors in the RPS. Next, for a new input speech frame, a posterior-probability-based feature vector is evaluated, which represents the similarity between the embedded frame and the learned speech attractors. We conduct experiments for a speech recognition task utilizing a toolkit based on hidden Markov models, over FARSDAT, a well-known Persian speech corpus. Through the proposed FE method, we gain 3.11% absolute phoneme error rate improvement in comparison to the baseline system, which exploits the mel-frequency cepstral coefficient FE method.
https://doi.org/10.4218/etrij.13.0112.0074 인용 PDF KSCI

Ultrasonic Pattern Recognition of Welding Defects Using the Chaotic Feature Extraction (카오스 특징 추출에 의한 용접 결함의 초음파 형상 인식)

Lee, Won;Yoon, In-Sik;Lee, Byung-Chae
- Journal of the Korean Society for Precision Engineering
- /
- v.15 no.6
- /
- pp.167-174
- /
- 1998
The ultrasonic test is recognized for its significance as a non-destructive testing method to detect volume defects such as porosity and incomplete penetration which reduce strength in the weld zone. This paper illustrates the defect detection in the weld zone of ferritic carbon steel using ultrasonic wave and the evaluation of pattern recognition by chaotic feature extraction using time series signal of detected defects as data. Shown in the time series data were that the time delay was 4 and the embedding dimension was 6 which indicate the geometric dimension of the subject system and the extent of information correlation. Based on fractal dimension and lyapunov exponent in quantitative chaotic feature extraction, feature value of 2.15, 0.47 is presented for porosity and 2.24, 0.51 for incomplete penetration The precision rate of the pattern recognition is enhanced with these values on the total waveform of defect signal in the weld zone. Therefore, we think that the ultrasonic pattern recognition method of weld zone defects of ferritic carbon steel by ultrasonic-chaotic feature extraction proposed in this paper can boost precision rate further than the existing method applying only partial waveform.
PDF

A Novel Recognition Algorithm Based on Holder Coefficient Theory and Interval Gray Relation Classifier

Li, Jingchao
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.9 no.11
- /
- pp.4573-4584
- /
- 2015
The traditional feature extraction algorithms for recognition of communication signals can hardly realize the balance between computational complexity and signals' interclass gathered degrees. They can hardly achieve high recognition rate at low SNR conditions. To solve this problem, a novel feature extraction algorithm based on Holder coefficient was proposed, which has the advantages of low computational complexity and good interclass gathered degree even at low SNR conditions. In this research, the selection methods of parameters and distribution properties of the extracted features regarding Holder coefficient theory were firstly explored, and then interval gray relation algorithm with improved adaptive weight was adopted to verify the effectiveness of the extracted features. Compared with traditional algorithms, the proposed algorithm can more accurately recognize signals at low SNR conditions. Simulation results show that Holder coefficient based features are stable and have good interclass gathered degree, and interval gray relation classifier with adaptive weight can achieve the recognition rate up to 87% even at the SNR of -5dB.
https://doi.org/10.3837/tiis.2015.11.018 인용 PDF KSCI KPUBS HTML

Development of Robust-to-Rotation Iris Feature Extraction Algorithms For Embedded System (임베디드 시스템을 위한 회전에 강인한 홍채특징 추출 알고리즘 개발)

Kim, Shik
- The Journal of Information Technology
- /
- v.12 no.4
- /
- pp.25-32
- /
- 2009
Iris recognition is a biometric technology which can identify a person using the iris pattern. It is important for the iris recognition system to extract the feature which is invariant to changes in iris patterns. Those changes can be occurred by the influence of lights, changes in the size of the pupil, and head tilting. This paper is appropriate for the embedded environment using local gradient histogram embedded system using iris feature extraction methods have implement. The proposed method enables high-speed feature extraction and feature comparison because it requires no additional processing to obtain the rotation invariance, and shows comparable performance to the well-known previous methods.
PDF

A study on automatic wear debris recognition by using particle feature extraction (입자 유형별 형상추출에 의한 마모입자 자동인식에 관한 연구)

;;;Grigoriev, A.Y.
- Proceedings of the Korean Society of Tribologists and Lubrication Engineers Conference
- /
- 1998.04a
- /
- pp.314-320
- /
- 1998
Wear debris morphology is closely related to the wear mode and mechanism occured. Image recognition of wear debris is, therefore, a powerful tool in wear monitoring. But it has usually required expert's experience and the results could be too subjective. Development of automatic tools for wear debris recognition is needed to solve this problem. In this work, an algorithm for automatic wear debris recognition was suggested and implemented by PC base software. The presented method defined a characteristic 3-dimensional feature space where typical types of wear debris were separately located by the knowledge-based system and compared the similarity of object wear debris concerned. The 3-dimensional feature space was obtained from multiple feature vectors by using a multi-dimensional scaling technique. The results showed that the presented automatic wear debris recognition was satisfactory in many cases application.
PDF

A Study on Automatic wear Debris Recognition by using Particle Feature Extraction (입자 유형별 형상추출에 의한 마모입자 자동인식에 관한 연구)

;;;A. Y. Grigoriev
- Tribology and Lubricants
- /
- v.15 no.2
- /
- pp.206-211
- /
- 1999
Wear debris morphology is closely related to the wear mode and mechanism occured. Image recognition of wear debris is, therefore, a powerful tool in wear monitoring. But it has usually required expert's experience and the results could be too subjective. Development of automatic tools for wear debris recognition is needed to solve this problem. In this work, an algorithm for automatic wear debris recognition was suggested and implemented by PC base software. The presented method defined a characteristic 3-dimensional feature space where typical types of wear debris were separately located by the knowledge-based system and compared the similarity of object wear debris concerned. The 3-dimensional feature space was obtained from multiple feature vectors by using a multi-dimensional scaling technique. The results showed that the presented automatic wear debris recognition was satisfactory in many cases application.
https://doi.org/10.9725/kstle.1999.15.2.206 인용 PDF

Pattern recognition of time series data based on the chaotic feature extracrtion (카오스 특징 추출에 의한 시계열 신호의 패턴인식)

이호섭;공성곤
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 1996.10a
- /
- pp.294-297
- /
- 1996
This paper proposes the method to recognize of time series data based on the chaotic feature extraction. Features extract from time series data using the chaotic time series data analysis and the pattern recognition process is using a neural network classifier. In experiment, EEG(electroencephalograph) signals are extracted features by correlation dimension and Lyapunov experiments, and these features are classified by multilayer perceptron neural networks. Proposed chaotic feature extraction enhances recognition results from chaotic time series data.
PDF

Parts-Based Feature Extraction of Spectrum of Speech Signal Using Non-Negative Matrix Factorization

Park, Jeong-Won;Kim, Chang-Keun;Lee, Kwang-Seok;Koh, Si-Young;Hur, Kang-In
- Journal of information and communication convergence engineering
- /
- v.1 no.4
- /
- pp.209-212
- /
- 2003
In this paper, we proposed new speech feature parameter through parts-based feature extraction of speech spectrum using Non-Negative Matrix Factorization (NMF). NMF can effectively reduce dimension for multi-dimensional data through matrix factorization under the non-negativity constraints, and dimensionally reduced data should be presented parts-based features of input data. For speech feature extraction, we applied Mel-scaled filter bank outputs to inputs of NMF, than used outputs of NMF for inputs of speech recognizer. From recognition experiment result, we could confirm that proposed feature parameter is superior in recognition performance than mel frequency cepstral coefficient (MFCC) that is used generally.
PDF KSCI

Search Result 820, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)