Development of Voice Activity Detection Algorithm for Elderly Voice based on the Higher Order Differential Energy Operator

고차 미분에너지 기반 노인 음성에서의 음성 구간 검출 알고리즘 연구

  • Lee, JiYeoun (Department of Biomedical Engineering, Jungwon University)
  • 이지연 (중원대학교 의료공학과)
  • Received : 2016.09.29
  • Accepted : 2016.11.20
  • Published : 2016.11.28

Abstract

Elderly voices contain considerable noise caused by age-related physiological changes in respiration, phonation, and resonance. As a result, convergence health-care devices that are driven by elderly voices, such as speech recognition, synthesis, and analysis software, perform worse. Research is therefore needed so that health-care devices can be operated reliably with elderly voices. In this study, a voice activity detection algorithm for elderly voices was developed using the existing symmetric higher-order differential energy operator (SHODEO), taking elderly voice noise into account, and was compared with methods based on the autocorrelation function (ACF) and the average magnitude difference function (AMDF). The SHODEO-based method was confirmed to detect voice intervals in elderly voices better than the ACF and AMDF methods. Applying the proposed algorithm to voice interfaces for the elderly is expected to improve their access to smart devices and, further, to support performance improvements and new development of convergence wearable devices for the elderly.
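As a rough illustration of energy-operator-based voice activity detection, the following Python sketch may help. The abstract does not give the SHODEO definition, so the sketch substitutes the second-order Teager-Kaiser energy operator, which is the second-order special case of the higher-order differential energy operators; it is not the paper's algorithm. The function names, frame length, hop size, and threshold ratio are all assumptions chosen for the example.

```python
import numpy as np


def teager_energy(x):
    """Teager-Kaiser energy operator: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    Used here as a stand-in for SHODEO (assumption); the two end samples,
    which lack a neighbour, are left at zero.
    """
    x = np.asarray(x, dtype=float)
    psi = np.zeros_like(x)
    psi[1:-1] = x[1:-1] ** 2 - x[:-2] * x[2:]
    return psi


def energy_vad(x, frame_len=256, hop=128, threshold_ratio=0.1):
    """Return one boolean per frame: True where the frame is flagged as voice.

    A frame is flagged when its mean absolute operator output exceeds
    threshold_ratio times the largest frame score (illustrative rule only).
    """
    psi = np.abs(teager_energy(x))
    n_frames = 1 + max(0, (len(psi) - frame_len) // hop)
    scores = np.array(
        [psi[i * hop : i * hop + frame_len].mean() for i in range(n_frames)]
    )
    return scores > threshold_ratio * scores.max()


# Synthetic demo: 0.5 s silence, 0.5 s of a 200 Hz tone, 0.5 s silence at 8 kHz.
fs = 8000
t = np.arange(fs // 2) / fs
tone = 0.8 * np.sin(2 * np.pi * 200 * t)
signal = np.concatenate([np.zeros(fs // 2), tone, np.zeros(fs // 2)])
flags = energy_vad(signal)
```

For a pure sinusoid the operator output is constant, which is why a simple relative threshold separates the tone from silence in this demo. Real elderly speech with breath and phonation noise would need a noise-adaptive threshold and smoothing across frames, neither of which is attempted here.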
