Study for Extraction of Stable Vocal Features and Definition of the Features

  • Received : 2011.09.14
  • Reviewed : 2011.12.06
  • Published : 2011.12.31

Abstract

Objectives : Some vocal features are sensitive to variation in a subject's utterance. In this paper, we propose a method for selecting reliable variables from a range of vocal features, including frequency derivative features, frequency band ratios, the intensities of five vowels, and the intensity of a sentence.

Methods : The coefficient of variation (CV) was used as the index of reliability for each voice variable. Because the distributions of a few features are not Gaussian but are skewed to the right or left, we transformed those features by taking the logarithm or the square root. We also explain and analyze the definitions of the variables that are suitable for representing vocal properties.

Results : We recorded the vowels and the sentence five times in the morning and five times in the afternoon of the same day, for a total of ten recordings from each of six subjects (three males and three females). We then analyzed the CVs of each subject's voice features to identify stable features with sufficient repeatability, and selected the features whose CV was below 20% for all six subjects. As a result, 92 stable variables were extracted from the 222 features, which included all the transformed variables.

Conclusions : Voice features, extracted as meaningful physical quantities, can be widely used in traditional Korean medicine or Western medicine to classify the four constitution types and to recognize a person's health condition. Stable voice variables can therefore be useful in u-Healthcare systems for personalized medicine and for improving diagnostic accuracy.
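
The CV-based selection described in the abstract can be illustrated with a minimal Python sketch. It assumes the CV is the sample standard deviation divided by the absolute mean, expressed in percent, and that each subject's recordings are stored as a 10 x n_features array. The function names, the data layout, and the synthetic numbers are hypothetical and are not taken from the paper; the log and square-root transformations are omitted because the abstract does not state which features received which transform.

```python
import numpy as np


def coefficient_of_variation(x):
    """Return the CV in percent: sample std (ddof=1) divided by |mean|."""
    x = np.asarray(x, dtype=float)
    return 100.0 * np.std(x, ddof=1) / abs(np.mean(x))


def select_stable_features(recordings, threshold=20.0):
    """Return indices of features whose CV is below `threshold` for every subject.

    recordings: dict mapping a subject id to an array of shape
                (n_recordings, n_features). This layout is an assumption.
    """
    cvs = np.array([
        100.0 * np.std(r, axis=0, ddof=1) / np.abs(np.mean(r, axis=0))
        for r in recordings.values()
    ])  # shape: (n_subjects, n_features)
    return np.flatnonzero(np.all(cvs < threshold, axis=0))


# Synthetic illustration only (not data from the study):
# feature 0 varies little across ten recordings, feature 1 varies a lot.
base = np.array([100.0, 101.0, 99.0, 100.5, 99.5, 100.0, 101.5, 98.5, 100.0, 100.0])
noisy = np.array([1.0, 9.0, 2.0, 8.0, 1.5, 9.5, 2.5, 7.5, 1.0, 9.0])
recordings = {
    f"subject_{i}": np.column_stack([base + i, noisy + i]) for i in range(6)
}

stable = select_stable_features(recordings, threshold=20.0)
print("stable feature indices:", stable)
```

Requiring the threshold to hold for every subject, rather than on average, mirrors the abstract's criterion that a feature must have a CV below 20% for all six subjects.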
