DOI QR코드

DOI QR Code

Voice Boosting Filter Design in Frequency Domain for Relief of Husky Voice

쉰목소리 완화를 위한 주파수 영역 음성 강조 필터 설계

  • Kim, Hyuntae (Dept. of Multimedia Eng., Dongeui University) ;
  • Lee, Sanghyeop (Dept. of Digital Media Eng., Graduate School, Dongeui University)
  • Received : 2016.10.06
  • Accepted : 2016.11.22
  • Published : 2016.12.30

Abstract

The people who complain of pain due to voice causes such as vocal cord nodules is increasing year by year. If the voice is changed, it is possible to give to colleagues discomfort or inconvenience during conversation. In this paper, we propose a way to reduce discomfort by improving the husky voice during the conversation. A VBF (voice boosting filter) is firstly designed to improve the husky voices. This filter may further emphasize the formant frequency components than the frequency components around the formant frequency, because the value is relatively greater than the other frequency. And a fixed-point type DSP chipset, TMS320F2812 is applied to the system, the operating frequency is 150MHz. The system was implemented as a compact for use as a portable, its size is $2.5cm{\times}10cm$. Through the test using three husky voices with some type of statement, it was satisfactory in processing speed and sound quality improvement.

Keywords

References

  1. 2013 National Health Insurance Statistical Yearbook in Korea, 2014. (Health Insurance Review & Assessment Service, Gangwon-Do 26465, Korea)
  2. S. Han, S. Kim, J. Kim, and C. Kwon, "A Preliminary Study on Correlation between Voice Characteristics and Speech Features," Journal of the Korean Society of Speech Sciences, Vol. 3, No. 4, pp. 85-91, 2011.
  3. D.Y. Choi, S.M. Choi, G.C. Lim, and S.Y. Nam, "Usefulness of Voice Handicap Index in Patients with Hoarseness," Korean Journal of Otorhinolaryngology-Head and Neck Surgery, Vol. 45, No. 7, pp. 706-10, 2002.
  4. W. Lee, S. Wang, K. Chon, S. Kwon, K. Jeon, S. Kim, et al., "Laryngeal Cancer Screening using Cepstral Parameters," The Journal of the Korean Society of Logopedics and Phoniatrics, Vol. 14, No. 2, pp. 110-116, 2003.
  5. H. Kim, Y. Chung, and K. Bae, "A Robust Speech Recognition Method Combining the Model Compensation Method with the Speech Enhancement Algorithm," Speech Sciences, Vol. 14, No. 2, pp. 115-126, 2007.
  6. G. Lee, J.H. Lee, J. Cho, and M.N. Kim, "Adaptive Noise Canceller for Speech Enhancement Using 2-D Binary Mask," Journal of The Korean Multimedia Society, Vol. 19, No. 7, pp. 1127-1136, 2016. https://doi.org/10.9717/kmms.2016.19.7.1127
  7. LoG Filter, http://academic.mu.edu/phys/ matthysd/web226/Lab02.htm, (accessed Jul., 4, 2016).
  8. R.L. Joshi and T.R. Fischer, "Comparison of Generalized Gaussian and Laplacian Modeling in DCT Image Coding," IEEE Signal Processing Letters, Vol. 2, Issue 5, pp. 81-82, 1995. https://doi.org/10.1109/97.386283
  9. J. Choi, "Formant Enhancement Algorithm of Speech Using Auditory Filter," Journal of Korean Institute of Information Technology, Vol. 11, No. 7, pp. 173-178, 2013.
  10. TMS320F2812 Digital Signal Processors Data Manual, Texas Instruments, Literature Number: SPRS174T April 2001-Revised May 2012.