Browse > Article

Feature Parameter Extraction and Speech Recognition Using Matrix Factorization  

Lee Kwang-Seok (진주산업대학교 전자공학과)
Hur Kang-In (동아대학교 전자공학과)
Abstract
In this paper, we propose new speech feature parameter using the Matrix Factorization for appearance part-based features of speech spectrum. The proposed parameter represents effective dimensional reduced data from multi-dimensional feature data through matrix factorization procedure under all of the matrix elements are the non-negative constraint. Reduced feature data presents p art-based features of input data. We verify about usefulness of NMF(Non-Negative Matrix Factorization) algorithm for speech feature extraction applying feature parameter that is got using NMF in Mel-scaled filter bank output. According to recognition experiment results, we confirm that proposed feature parameter is superior to MFCC(Mel-Frequency Cepstral Coefficient) in recognition performance that is used generally.
Keywords
Speech Feature Extraction; Non-Negative Matrix Factorization; Mel-scaled Filter Bank;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Daniel D. Lee and H. Sebastian Seung, 'Learning the parts of objects by non-negative matrix factorization,' Nature vol. 401, Oct. 21,1999
2 H. Y. Choi, S. J. Choi, 'Learning the Sparse Codes of Speeches via Non-Negative Matrix Factorization,' CVPR, 2002
3 Hoyer. P. O, 'Non-Negative Sparse Coding,' Neural Networks for Signal Processing, 2002, Proceddings of the 2002 12th IEEE Workshop on, pp. 557-565, 2002
4 S. Tsuge, M. Shishibori, S. Kurojwa, K. Kita, 'Dimensionally Reduction Using Non-Negative Matrix Factorization for Information Retreval,' Systems, Man and Cybermetics, 2001 IEEE International Conference on, vol. 2, pp. 960-965, 2001
5 Simon Haykim, 'Neural Networks a Comprehensive Foundation,' Prentice Hall, 1999
6 D. Guillamet, B. Schiele, J. Vitria, 'Analyzing non-negative matrix factorization for image classification,'Pattern Recognition, 2002, Proceedings, 16th International Conference on, vol. 2, pp. 116-119, 11-15 Aug. 2002
7 L. R. Rabiner, R. W. Schafer, 'Digital Processing of Speech Signals,' Prentice Hall, 1993
8 L. R. Rabiner, B. H. Juang, 'Fundamentals of Speech Recognition,' Prentice hall, 1999
9 Sven Behnke, 'Discovering hierarchical speech features using convolutional non-negative matrix factorization,' IJCNN'03, vol. 4, pp.2758-2763, 2003-10-14
10 Daniel D. Lee, H. Sebastian Seung, 'Algorithms for Non-Negative matrix Factorization,' in Advances in Neural Information Procedding System 13, T. K. Leen, T. G. Dietterich, and V. Tresp, Eds., 2001