DOI QR코드

DOI QR Code

A Study of Peak Finding Algorithms for the Autocorrelation Function of Speech Signal

  • So, Shin-Ae (Dept. Korean Language and Literature, Soongsil University) ;
  • Lee, Kang-Hee (Dept. of Digital Media, Soongsil University) ;
  • You, Kwang-Bock (School of Electronic Engineering, Soongsil University) ;
  • Lim, Ha-Young (School of Electronic Engineering, Soongsil University) ;
  • Park, Ji Su (School of Electronic Engineering, Soongsil University)
  • Received : 2016.11.01
  • Accepted : 2016.12.28
  • Published : 2016.12.31

Abstract

In this paper, the peak finding algorithms corresponding to the Autocorrelation Function (ACF), which are widely exploited for detecting the pitch of voiced signal, are proposed. According to various researchers, it is well known fact that the estimation of fundamental frequency (F0) in speech signal is not only very important task but quite difficult mission. The proposed algorithms, presented in this paper, are implemented by using many characteristics - such as monotonic increasing function - of ACF function. Thus, the proposed algorithms may be able to estimate both reliable and correct the fundamental frequency as long as the autocorrelation function of speech signal is accurate. Since the proposed algorithms may reduce the computational complexity it can be applied to the real-time processing. The speech data, is composed of Korean emotion expressed words, is used for evaluation of their performance. The pitches are measured to compare the performance of proposed algorithms.

Keywords

References

  1. Savitha S Upadhya, "Pitch Detection in Time and Frequency Domain," ICCICT, 2012.
  2. E. Verteletskaya and B. Simak, "Performance Evaluation of Pitch Detection Algorithms," http://access.feld.cvut.cz/service.php
  3. A. Leon-Garcia, Probability, Statistics, and Random Processes for Electrical Engineering, 3rd Edition, Prentice Hall, 2009.
  4. A.M. Kondoz, Digital Speech (Coding for Low Bit Rate Communications Systems), John Wiley & Sons Ltd., 1994.
  5. L.R. Rabiner and R.W. Schafer, Theory and Applications of Digital Speech Processing, 1st edition, Prentice Hall, 2011
  6. J.R. Deller, JR., J.G. Proakis, and J.H.L. Hansen, Discrete-Time Processing of Speech Signals, Macmillan, 1993
  7. Myron J. Ross, Harry L. Shaffer, A. Cohen, R. Freudberg, and H. Manley, "Average Magnitude Difference Function Pitch Extractor," IEEE Trans. On ASSP, Vol., ASSP-22, No. 5, October 1974.
  8. Lee, Ho-Young, The Structure of Korean Prosody, PhD dissertation, University of London, 1990.
  9. Jun, Sun-Ah, The phonetics and phonology of Korean prosody, Ph.D. dissertation, Ohio State University, 1993.
  10. Jun, Sun-Ah, K-ToBI labelling conventions(version3.1), UCLA Working Papers in Phonetics 99, pp.149-173, 2000.
  11. Jin-Koo Ji and Sung-il Yun, Design and Implementation of Speaker Verification System Using Voice, Journal of the Korea Society of Computer & Information, Vol. 5, No. 3, pp. 91-98, 2000.
  12. Celia Shahnaz, Wei-Ping Zhu, and M. Omair Ahmad," Pitch Estimation Based on a Harmonic Sinusoidal Autocorrelation Model and a Time-Domain Matching Scheme," IEEE Transactions on Audio, Speech, and Language Processing, Vol. 20, No. 1, pp. 322-335, 2012. https://doi.org/10.1109/TASL.2011.2161579
  13. Seo Weon Heo, Don Hur, and Hyungsuk Kim, "Optimization of the autocorrelation weighting function for the time-domain calculation of spectral centroids," IEEE Transactions on Ultrasonics, Ferroelectrics, and Frequency Control, Vol. 62, No. 3, pp. 421-427, 2015. https://doi.org/10.1109/TUFFC.2014.006416
  14. Sebastian Kraft and Udo Zolzer, "Polyphonic Pitch Detection by Matching Spectral and Autocorrelation Peaks," 2015 EUSIPCO
  15. Jun Gao and Dan Xu, "Noise-robust Pitch Detection Algorithm Based on AMDF with Clustering Analysis Picking Peaks," 2016 IEEE Information Technology, Networking, Electronic and Automation Control Conference