DOI QR코드

DOI QR Code

Japanese Vowel Sound Classification Using Fuzzy Inference System

  • Phitakwinai, Suwannee (Computer Engineering Department, Faculty of Engineering, Chiang Mai University) ;
  • Sawada, Hideyuki (Department of Intelligent Mechanical Systems Engineering, Faculty of Engineering, Kagawa University) ;
  • Auephanwiriyakul, Sansanee (Computer Engineering Department, Faculty of Engineering, Chiang Mai University) ;
  • Theera-Umpon, Nipon (Biomedical Engineering Center, Chiang Mai University)
  • Received : 2014.01.07
  • Accepted : 2014.02.22
  • Published : 2014.03.31

Abstract

An automatic speech recognition system is one of the popular research problems. There are many research groups working in this field for different language including Japanese. Japanese vowel recognition is one of important parts in the Japanese speech recognition system. The vowel classification system with the Mamdani fuzzy inference system was developed in this research. We tested our system on the blind test data set collected from one male native Japanese speaker and four male non-native Japanese speakers. All subjects in the blind test data set were not the same subjects in the training data set. We found out that the classification rate from the training data set is 95.0 %. In the speaker-independent experiments, the classification rate from the native speaker is around 70.0 %, whereas that from the non-native speakers is around 80.5 %.

Keywords

References

  1. L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition, Prentice Hall, New Jersey, 1993.
  2. D. G. Kimber, M. A. Bush, and G. N. Tajchman, "Speaker-independent vowel classification using Hidden Markov Models and LVQ2," International Conference on Acoustics, Speech, and Signal Processing, pp. 497-500, 1990.
  3. T. Harczos, G. Szepannek, A. Katai, and F. Klefenz, "An auditory model based vowel classification," IEEE Biomedical Circuits and Systems Conference, pp. 69-72, 2006.
  4. A. Sadeghian, H. R. Dajani, and A. D. C. Chan, "Classification of English vowels using speech evoked potentials," 33rd Annual International Conference of the IEEE EMBS, pp. 5000-5003, 2011.
  5. J. Hillenbrand and R. T. Gayvert, "Vowel classification based on fundamental frequency and formant frequencies," Journal of Speech and Hearing Research, Vol. 36, pp. 694-700, 1993. https://doi.org/10.1044/jshr.3604.694
  6. P. M. Paulraj, S. B. Yaacob, A. Nazri, and S. Kumar, "Classification of vowel sounds using MFCC and feed forward neural network," 5th International Colloquium on Signal Processing & Its Application, pp. 59-62, 2009.
  7. K. Daqrouq, K. Y. Al Azzawi, "Arabic vowels recognition based on wavelet average framing linear prediction coding and neural network," Speech Communication, Vol. 55, pp. 641-652, 2013. https://doi.org/10.1016/j.specom.2013.01.002
  8. C.-T. Hsieh, E. Lai, and Y.-C. Wang, "Distributed fuzzy rules for preprocessing of speech segmentation with genetic algorithm," IEEE International Conference on Fuzzy Systems, pp. 427-431, 1997.
  9. Z. Yessenbayev, M. Karabalayeva, and A. Sharipbayev, "Formant analysis and mathematical model of Kazakh vowels," 14th International Conference on Computer Modelling and Simulation, pp. 427-431, 2012.
  10. A. Zourmand, and T. H. Nong, "Vowel classification of children's speech using fundamental formant frequencies," 4th International Conference on Computational Intelligence, Modelling and Simulation, pp. 282-287, 2012.
  11. N. Theera-Umpon, S. Chansareewittaya, and S. Auephanwiriyakul, "Phoneme and Tonal Accent Recognition for Thai Speech," Expert Systems With Applications, Vol. 38, No. 10, pp. 13254-13259, 2011. https://doi.org/10.1016/j.eswa.2011.04.142
  12. S. Karungaru, T. Kamei, M. Fujiwara, and N. Akamatsu, "Vowel recognition using Akamatsu integral and Differential Transforms," International Journal of Advanced Intelligence, Vol. 1, No. 1, pp. 125-140, 2009.
  13. M. Hisagi, T. Saitoh, and R. Konishi, "Analysis of efficient feature for Japanese vowel recognition," International Symposium of Intelligent Signal Processing and Communication Systems, 2006.
  14. T. Murakami, K. Maruyama, N. Minematsu, and K. Hirose, "Japanese vowel recognition using external structure of speech," IEEE Workshop on Automatic Speech Recognition and understanding, pp. 33-36, 2005.
  15. T. Murakami, K. Maruyama, N. Minematsu, and K. Hirose, "Japanese vowel recognition based on structural representation of speech," Annual Conference of International Speech Communication Association, pp. 1261-1264, 2005.
  16. A. V. Oppenheim and R. W. Schafer, Discrete-time Signal Processing, Prentice Hall, New Jersey, 1989.
  17. G. J. Klir and B. Yuan, Fuzzy Sets and Fuzzy Logic: Theory and Applications, Prentice Hall, Upper Saddle River, New Jersey, 1995.