Browse > Article
http://dx.doi.org/10.4218/etrij.16.0115.0926

Performance Evaluation of Novel AMDF-Based Pitch Detection Scheme  

Kumar, Sandeep (Department of Electronics & Telecommunication Engineering, Rungta College of Engineering and Technology)
Publication Information
ETRI Journal / v.38, no.3, 2016 , pp. 425-434 More about this Journal
Abstract
A novel average magnitude difference function (AMDF)-based pitch detection scheme (PDS) is proposed to achieve better performance in speech quality. A performance evaluation of the proposed PDS is carried out through both a simulation and a real-time implementation of a speech analysis-synthesis system. The parameters used to compare the performance of the proposed PDS with that of PDSs that are based on either a cepstrum, an autocorrelation function (ACF), an AMDF, or circular AMDF (CAMDF) methods are as follows: percentage gross pitch error (%GPE); a subjective listening test; an objective speech quality assessment; a speech intelligibility test; a synthesized speech waveform; computation time; and memory consumption. The proposed PDS results in lower %GPE and better synthesized speech quality and intelligibility for different speech signals as compared to the cepstrum-, ACF-, AMDF-, and CAMDF-based PDSs. The computational time of the proposed PDS is also less than that for the cepstrum-, ACF-, and CAMDF-based PDSs. Moreover, the total memory consumed by the proposed PDS is less than that for the ACF- and cepstrum-based PDSs.
Keywords
DSP; pitch detection; autocorrelation; average magnitude difference function;
Citations & Related Records
연도 인용수 순위
  • Reference
1 X.D. Mei, J. Pan, and S.-H. Sun, "Efficient Algorithm for Speech Pitch Estimation," Proc. Int. Symp. Intell. Multimedia, Video Speech Process., Hong Kong, China, May 2-4, 2001, pp. 421-424.
2 M.J. Ross et al., "Average Magnitude Difference Function Pitch Extractor," IEEE Trans. Acoust., Speech, Signal Process., vol. 22, no. 5, Oct. 1974, pp. 353-362.   DOI
3 S. Kumar, S.K. Singh, and S. Bhattacharya, "Performance Evaluation of a ACF-AMDF Based Pitch Detection Scheme in Real Time," Int. J. Speech Technol., vol. 18, no. 4, Dec. 2015, pp. 521-527.   DOI
4 F. Wang and P. Yip, "Cepstrum Analysis Using Discrete Trigonometric Transforms," IEEE Trans. Acoust., Speech, Signal Process., vol. 39, no. 2, Feb. 1991, pp. 538-541.   DOI
5 H. Huang and J. Pan, "Speech Pitch Determination Based on Huang-Hilbert Transform," Signal Process., vol. 86, no. 4, Apr. 2006, pp. 792-803.   DOI
6 S. Kumar et al., "Performance Evaluation of a Wavelet-Based Pitch Detection Scheme," Int. J. Speech Technol., vol. 16, no. 4, Dec. 2013, pp. 431-417.   DOI
7 S. Kadambe and G.F. Boudreaux-Bartels, "Application of the Wavelet Transform for Pitch Detection of Speech Signals," IEEE Trans. Inf. Theory, vol. 38, no. 2, Mar. 1992, pp. 917-924.   DOI
8 L. Hui, B.-Q. Dai, and L. Wei, "A Pitch Detection Algorithm Based on AMDF and ACF," IEEE Int. Conf. Acoust., Speech Signal Process., Toulouse, France, May 14-19, 2006, pp. 377-380.
9 R. Cai, S. Shi, and Y. Zhu, "A Modified Pitch Detection Meth Based on Wavelet Transform," Int. Conf. Multimedia Inf. Technol., Kaifeng, China, Apr. 2010, pp. 246-249.
10 W. Zhang, G. Xu, and Y. Wang, "Pitch Estimation Based on Circular AMDF," IEEE Int. Conf. Acoust., Speech Signal Process., Orlando, FL, USA, May13-17, 2002, pp. I.341-I.344.
11 S. Kumar, S. Bhattacharya, and P. Patel, "A New Pitch Detection Scheme Based on ACF and AMDF," Int. Conf. Adv. Commun. Contr. Comput. Technol., Ramanathapuram, India, May 18-10, 2014, pp. 1235-1240.
12 S. Bhattacharya, S.K. Singh, and T. Abhinav, "Performance Evaluation of LPC and Cepstral Speech Coder in Simulation and in Real Time," Int. Conf. Recent Adv. Inf. Technol., Dhanbad, India, Mar. 15-17, 2012, pp. 826-831.
13 G. Pirker et al., "A Pitch Tracking Corpus with Evaluation on Multi-pitch Tracking Scenario," Interspeech, Florence, Italy, Aug. 27-31, 2011, pp. 1509-1512.
14 Y. Hu and P. Loizou, "Subjective Evaluation and Comparison of Speech Enhancement Algorithms," Speech Commun., July 2007, vol. 49, pp. 588-601.   DOI
15 F. Plante, G.F. Meyer, and W.A. Ainsworth, "A Pitch Extraction Reference Database," European Conf. Speech Commun. Technol., Madrid, Spain, Sept. 18-21, 1995, pp. 837-840.
16 ITU-T P.862, Perceptual Evaluation of Speech Quality (PESQ), June 2004.
17 J.R. Deller, J.H.L. Hansen, and J.G. Proakis, "Discrete-Time Processing of Speech Signal," Piscataway, NJ, USA: John Wiley & Sons, 2000, pp. 570-579.
18 MATLAB Online Help, Accessed Aug. 12, 2012. http://www.mathworks.in/help/toolbox/simulink/ug/f0-7640.html
19 Breakpoint Help, Accessed July 10, 2012. http://processors.wiki.ti.com/index.php/Breakpoint