Browse > Article
http://dx.doi.org/10.13067/JKIECS.2013.8.5.665

Application of Preemphasis FIR Filtering To Speech Detection and Phoneme Segmentation  

Lee, Chang-Young (동서대학교 산업경영공학과)
Publication Information
The Journal of the Korea institute of electronic communication sciences / v.8, no.5, 2013 , pp. 665-670 More about this Journal
Abstract
In this paper, we propose a new method of speech detection and phoneme segmentation. We investigate the effect of applying preemphasis FIR filtering on the speech signal before the usual speech detection that utilizes the energy profile for discriminating signals from background noise. By this procedure, only the speech section of low energy and frequency becomes distinct in energy profile. It is verified experimentally that the silence/speech boundary becomes sharper by applying the filtering compared to the conventional method. By applications of this procedure, phoneme segmentation is also found to be much facilitated.
Keywords
Speech Detection; Speech Processing; Preemphasis Filtering; FIR Filtering; Phoneme Segmentation;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 L.R. Rabiner & B. Juang, "Fundamentals of Speech Recognition," Prentice Hall, pp. 112- 117, 1993.
2 J.-C. Wang, J.-F. Wang, & Y. Weng, "Chip design of MFCC extraction for speech recognition," The VLSI Journal, Vol. 32, pp. 111-131, 2002.   DOI   ScienceOn
3 L.R. Rabiner & B. Juang, "Fundamentals of Speech Recognition," Prentice Hall, pp. 30-37, 1993.
4 S. Kajita, K. Takeda, & F. Itakura, "Spectral weighting of SBCOR for noise robust speech recognition," ICASSP '98, Vol. 2, pp. 621-624, 1998.
5 D.C. Costa, G.A.M. Lopes, C.A.B. Mello, & H.O. Viana, "Speech and phoneme segmentation under noisy environment through spectrogram image analysis," IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 1017-1022, 2012.
6 G. Kaplan, "Words Into Action I," IEEE Spectrum, Vol. 17, pp. 22-26, 1980.
7 Beom-joon Kim, "Service Quality Criteria for Voice Services over a WiBro Network," The Journal of the Korea Institute of Electronic Communication Sciences, Vol. 6, No. 6, pp. 823-829, 2011.   과학기술학회마을
8 Myoung-ku Kang, "A Study on the Design of Multimedia Service Platform on Wireless Intelligent Technology," The Journal of the Korea Institute of Electronic Communication Sciences, Vol. 4, No. 1, pp. 24-30, 2009.   과학기술학회마을
9 Jae-duck Yoo, Hong-tae Park, Hyun-sik Shin, & Yun-ho Shin, "A Study of the Communication Infrastructure Construction for u-City in Korea," The Journal of the Korea Institute of Electronic Communication Sciences, Vol. 1, No. 2, pp. 127-135, 2006.   과학기술학회마을
10 Y. Chang, S. Hung, N. Wang, & B. Lin, "CSR: A Cloud-Assisted Speech Recognition Service for Personal Mobile Device," International Conference on Parallel Processing (ICPP), pp. 305-314. 2011.
11 J.E. Flood & D.I. Urquhart-Pullen, "Timeassignment speech interpolation in timecompression- multiplex transmission," Proceedings of the Institution of Electrical Engineers, Vol. 111, No. 4, pp. 675-683, 1964.   DOI
12 J.G. Wilpon, L.R. Rabiner, & T.B. Martin, "An improved word-detection algorithm for telephone- quality speech incorporating both syn tactic and semantic constraints," AT&T Tech. J., Vol. 63, No. 3, pp. 479-498, 1984.
13 L.R. Rabiner & B. Juang, "Fundamentals of Speech Recognition," Prentice Hall, pp. 143- 149, 1993.
14 T. Kristjansson, B. Frey, L. Deng, & A. Acero, "Towards non-stationary model-based noise adaptation for large vocabulary speech recognition," ICASSP '01, Vol. 1, pp. 337-340, 2001.
15 J.R. Deller, J.G. Proakis, & J.H.L. Hansen, "Discrete-Time Processing of Speech Signals," Macmillan, New York, pp. 246-251, 1994.