DOI QR코드

DOI QR Code

Performance Evaluation of Novel AMDF-Based Pitch Detection Scheme

  • Kumar, Sandeep (Department of Electronics & Telecommunication Engineering, Rungta College of Engineering and Technology)
  • Received : 2015.10.20
  • Accepted : 2016.01.20
  • Published : 2016.06.01

Abstract

A novel average magnitude difference function (AMDF)-based pitch detection scheme (PDS) is proposed to achieve better performance in speech quality. A performance evaluation of the proposed PDS is carried out through both a simulation and a real-time implementation of a speech analysis-synthesis system. The parameters used to compare the performance of the proposed PDS with that of PDSs that are based on either a cepstrum, an autocorrelation function (ACF), an AMDF, or circular AMDF (CAMDF) methods are as follows: percentage gross pitch error (%GPE); a subjective listening test; an objective speech quality assessment; a speech intelligibility test; a synthesized speech waveform; computation time; and memory consumption. The proposed PDS results in lower %GPE and better synthesized speech quality and intelligibility for different speech signals as compared to the cepstrum-, ACF-, AMDF-, and CAMDF-based PDSs. The computational time of the proposed PDS is also less than that for the cepstrum-, ACF-, and CAMDF-based PDSs. Moreover, the total memory consumed by the proposed PDS is less than that for the ACF- and cepstrum-based PDSs.

Keywords

References

  1. X.D. Mei, J. Pan, and S.-H. Sun, "Efficient Algorithm for Speech Pitch Estimation," Proc. Int. Symp. Intell. Multimedia, Video Speech Process., Hong Kong, China, May 2-4, 2001, pp. 421-424.
  2. M.J. Ross et al., "Average Magnitude Difference Function Pitch Extractor," IEEE Trans. Acoust., Speech, Signal Process., vol. 22, no. 5, Oct. 1974, pp. 353-362. https://doi.org/10.1109/TASSP.1974.1162598
  3. S. Kumar, S.K. Singh, and S. Bhattacharya, "Performance Evaluation of a ACF-AMDF Based Pitch Detection Scheme in Real Time," Int. J. Speech Technol., vol. 18, no. 4, Dec. 2015, pp. 521-527. https://doi.org/10.1007/s10772-015-9296-2
  4. F. Wang and P. Yip, "Cepstrum Analysis Using Discrete Trigonometric Transforms," IEEE Trans. Acoust., Speech, Signal Process., vol. 39, no. 2, Feb. 1991, pp. 538-541. https://doi.org/10.1109/78.80852
  5. H. Huang and J. Pan, "Speech Pitch Determination Based on Huang-Hilbert Transform," Signal Process., vol. 86, no. 4, Apr. 2006, pp. 792-803. https://doi.org/10.1016/j.sigpro.2005.06.011
  6. S. Kumar et al., "Performance Evaluation of a Wavelet-Based Pitch Detection Scheme," Int. J. Speech Technol., vol. 16, no. 4, Dec. 2013, pp. 431-417. https://doi.org/10.1007/s10772-013-9194-4
  7. S. Kadambe and G.F. Boudreaux-Bartels, "Application of the Wavelet Transform for Pitch Detection of Speech Signals," IEEE Trans. Inf. Theory, vol. 38, no. 2, Mar. 1992, pp. 917-924. https://doi.org/10.1109/18.119752
  8. L. Hui, B.-Q. Dai, and L. Wei, "A Pitch Detection Algorithm Based on AMDF and ACF," IEEE Int. Conf. Acoust., Speech Signal Process., Toulouse, France, May 14-19, 2006, pp. 377-380.
  9. R. Cai, S. Shi, and Y. Zhu, "A Modified Pitch Detection Meth Based on Wavelet Transform," Int. Conf. Multimedia Inf. Technol., Kaifeng, China, Apr. 2010, pp. 246-249.
  10. W. Zhang, G. Xu, and Y. Wang, "Pitch Estimation Based on Circular AMDF," IEEE Int. Conf. Acoust., Speech Signal Process., Orlando, FL, USA, May13-17, 2002, pp. I.341-I.344.
  11. S. Kumar, S. Bhattacharya, and P. Patel, "A New Pitch Detection Scheme Based on ACF and AMDF," Int. Conf. Adv. Commun. Contr. Comput. Technol., Ramanathapuram, India, May 18-10, 2014, pp. 1235-1240.
  12. S. Bhattacharya, S.K. Singh, and T. Abhinav, "Performance Evaluation of LPC and Cepstral Speech Coder in Simulation and in Real Time," Int. Conf. Recent Adv. Inf. Technol., Dhanbad, India, Mar. 15-17, 2012, pp. 826-831.
  13. G. Pirker et al., "A Pitch Tracking Corpus with Evaluation on Multi-pitch Tracking Scenario," Interspeech, Florence, Italy, Aug. 27-31, 2011, pp. 1509-1512.
  14. Y. Hu and P. Loizou, "Subjective Evaluation and Comparison of Speech Enhancement Algorithms," Speech Commun., July 2007, vol. 49, pp. 588-601. https://doi.org/10.1016/j.specom.2006.12.006
  15. F. Plante, G.F. Meyer, and W.A. Ainsworth, "A Pitch Extraction Reference Database," European Conf. Speech Commun. Technol., Madrid, Spain, Sept. 18-21, 1995, pp. 837-840.
  16. ITU-T P.862, Perceptual Evaluation of Speech Quality (PESQ), June 2004.
  17. J.R. Deller, J.H.L. Hansen, and J.G. Proakis, "Discrete-Time Processing of Speech Signal," Piscataway, NJ, USA: John Wiley & Sons, 2000, pp. 570-579.
  18. MATLAB Online Help, Accessed Aug. 12, 2012. http://www.mathworks.in/help/toolbox/simulink/ug/f0-7640.html
  19. Breakpoint Help, Accessed July 10, 2012. http://processors.wiki.ti.com/index.php/Breakpoint

Cited by

  1. An Approach to Improve Generation of Association Rules in Order to Be Used in Recommenders : vol.13, pp.4, 2017, https://doi.org/10.4018/ijdwm.2017100101
  2. Analysis and Evaluation of a Framework for Sampling Database in Recommenders : vol.26, pp.1, 2018, https://doi.org/10.4018/jgim.2018010103
  3. The Role of the Internet of Things in the Improvement and Expansion of Business : vol.30, pp.3, 2016, https://doi.org/10.4018/joeuc.2018070102
  4. Comparative performance evaluation of MMSE-based speech enhancement techniques through simulation and real-time implementation vol.21, pp.4, 2016, https://doi.org/10.1007/s10772-018-09567-5
  5. A Signal Period Detection Algorithm Based on Morphological Self-Complementary Top-Hat Transform and AMDF vol.10, pp.1, 2016, https://doi.org/10.3390/info10010024
  6. Real-time implementation and performance evaluation of speech classifiers in speech analysis-synthesis vol.43, pp.1, 2016, https://doi.org/10.4218/etrij.2019-0364