• Title/Summary/Keyword: voice frequency

Search Result 546, Processing Time 0.027 seconds

On a Study of Detecting First Formant Using Autocorrelation Method (자기상관법을 이용한 제 1 포만트 검출법에 관한 연구)

  • 강은영;민소연;배명진
    • Proceedings of the IEEK Conference
    • /
    • 2001.06d
    • /
    • pp.285-288
    • /
    • 2001
  • In the speech analysis, to estimate formant center frequencies exactly is very important. If we know formant frequencies, we can expect which pronunciation is uttered. Generally, the magnitude of first formant frequency in voiced speech is 10dB more than other formant frequency. So, the shape of voice signal in time domain is affected by mainly first formant. Therefore we can get first formant frequency roughly by using ZCR(Zero Cross Rate). In this paper, we proposed the improvement method to get first formant frequency by using ZCR. We did autocorrelation before getting ZCR. This procedure makes voice signal smooth so, first formant in voice signal is emphasized. As a result of this method, we got more exact ZCR and first formant frequency. Conventional method of formant estimate is done in frequency domain but proposed method is done in time domain. So, this is very simple.

  • PDF

Voice range differences in vowels by voice classification among male students of popular music vocals (대중가요 보컬 전공 남학생의 성종에 따른 모음 간 음역 차이)

  • Il-Song Ji;Jaeock Kim
    • Phonetics and Speech Sciences
    • /
    • v.16 no.2
    • /
    • pp.37-47
    • /
    • 2024
  • This study was conducted on 27 male students majoring in or preparing for popular music vocals to determine whether they were aware of their voice classification and vocal range. Additionally, differences in the fundamental frequency and average speaking fundamental frequency were compared among the voice classifications. Moreover, considering that they may differ in their ability to produce high frequencies depending on the vowel, differences in voice ranges among the cardinal vowels, /a/, /i/, and /u/, were examined, and differences in voice ranges between vowels were compared by voice classification. The results showed that more than half of the male students majoring in or preparing for popular music vocals were not accurately aware of their voice types. In addition, statistically significant differences were found in the maximum fundamental frequency and frequency range among vowels, indicating differences in the voice range that can be produced depending on the vowel type. In particular, the voice range decreased in the following order: /a/>/u/>/i/. This suggests that while the vowel /a/ is easier to articulate in the high register compared to other vowels, vowels /u/ and /i/ as high vowels involve narrowing of the oral cavity due to the raised position of the tongue, accompanied by raising of the larynx, resulting in a decrease in voice range and difficulty in vocalizing in the high register.

Prevalence of Voice Disorders and Characteristics of Korean Voice Handicap Index in the Elderly (노인 음성장애 출현율 및 음성장애지수 특성)

  • Song, Yun-Kyung
    • Phonetics and Speech Sciences
    • /
    • v.4 no.3
    • /
    • pp.151-159
    • /
    • 2012
  • The purpose of this study is to evaluate the prevalence of voice disorders and the Korean voice handicap index in the elderly. For this study, 169 elderly performed two types of questionnaires and vowel /a/ prolongation. Self-reported voice symptoms and the Korean voice handicap index were analyzed and acoustic voice evaluation was performed by MDVP. The results showed that the prevalence of voice disorders in the elderly are significantly higher than that of adults in self-reports. In acoustic evaluation, 32.2% of the male elderly and 40.9% of the female elderly exceeded the thresholds of Jitter (%), Shimmer (%) and NHR. In addition, Korean voice handicap index scores of the female elderly are significantly higher than those of female adults. These findings indicate the high frequency of voice disorders in the elderly and the need to focus on this group. Additional studies on the voice related quality of life for the elderly are needed.

Acoustic Characteristics of the Voices of Korean Normal Adults by Gender on MDVP (성별에 따른 한국 정상 성인 음성의 음향학적 평가 기준치)

  • Kim, Jae-Ock
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.147-157
    • /
    • 2009
  • The purpose of the study is to develop the normal voice database and to analyze the acoustic characteristics of Korean adults' voices by gender using MDVP. Eight categories in the 34 parameters of MDVP were analyzed in the voices of 170 Korean normal adults taken from /a/ vowel. Among them, Fundamental Frequency Parameters and Frequency Perturbation Parameters were significantly different by gender. In addition, Fundamental Frequency Parameters of our data were remarkably different from the data suggested in the MDVP program which currently used in clinics. Therefore, the data obtained from the current study can be effectively used for the diagnosis of voice disorders of Korean adults as the standard parameter values of MDVP.

  • PDF

The Effect of Auditory Condition on Voice Parameter of Teacher (청각 환경이 교사의 음성 파라미터에 미치는 영향)

  • Lee Ju-Young;Baek Kwang-Hyun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.25 no.5
    • /
    • pp.207-212
    • /
    • 2006
  • The purpose of this study was to compare voice parameters in auditory conditions (normal/noise/music) between a teacher group and a control group. Results of statistical analysis showed that the teacher group had higher jitter (%) and shimmer (%) values than the control group. It indicated that the teacher group had larger variations in pitch and dynamic of their voice. In the teacher group, the voice under noisy condition showed a higher value of fundamental frequency than that under normal condition. though its fundamental frequency did not show any significant difference between the noisy condition and the musical condition. In the control group, however, although the voice under noisy condition also showed a higher value of fundamental frequency than that under normal condition, its fundamental frequency was significantly different between the noisy condition and the musical condition.

A Study on the Correlation Between Sasang Constitution and Sound Characteristics Used Harmonics and Formant Bandwidth (Harmonics(배음)와 Formant Bandwidth(포먼트 폭)를 이용한 음성특성(音聲特性)과 사상체질간(四象體質間)의 상관성(相關性) 연구(硏究))

  • Park, Sung-Jin;Kim, Dal-Rae
    • Journal of Sasang Constitutional Medicine
    • /
    • v.16 no.1
    • /
    • pp.61-73
    • /
    • 2004
  • This study was prepared to investigate the correlation between Sasang constitutional groups and voice characteristics using voice analysis system(in this study, CSL). I focused on the voice characteristics in terms of harmonics, Formant frequency and Formant Bandwidth. The subjects were 71 males. I classified them into three groups, that is Soeumin group, Soyangin group and Taeumin group. The classification method of Constitution used two ways, QSCCII(Questionnarie for the Sasang Constitution Classification II) and Interview with a specialist in Sasang Constitution. So 71 people were categorized into 31 Soeumin(people), 18 Soyangin(people) and 22 Taeumin(people). Pitch is approximately similar to the fundamental frequency(F0) in voices. Shimmer in dB gives an evaluation of the period-to-period variability of the peak-to-peak amplitude within the analyzed voice sample. FFT(Fast Fourier Transform) method in CSL can display sampled voices into harmonics. H1 is the first peak and h2 is the second peak in the harmonics. The amplitude difference of h1 and h2(h1-h2) can be explained as the speaker's phonation type, And Formant frequency and bandwidth can be explained as the speaker's vocal tract. So I checked the harmonics and Formant frequency and Bandwidth as the voice parameters. First I have captured /e/ voices from all subjects using microphone. And then I analyzed /e/ voices with CSL. Power Spectrum and Formant History is the menu in the CSL which can display harmonics and Formant frequency and bandwidth. The results about the correlation between Sasang Constitutional Groups and voice parameters are as follows; 1. There is no significant amplitude difference of harmonics(h1-h2) among three groups. 2. There is the significant difference between Soeumin Group and Soyangin Group in Formant Frequency 1 and Formant Bandwidth 1(p<0.05). Any other parameters have no significance. I assume that Soyangin Group has clearer and brighter voice than Soeumin Group according to the Formant Bandwidth difference. And I think its result has coincidence with the context of "Dongyi Suse Bowon" and "Sasangimhejinam".

  • PDF

Design and Implementation of IVR Server Using VoiceXML (VoiceXML을 이용한 IVR 서버 설계 및 구현)

  • Lee, Chang-Ho;Jang, Won-Jo;Kang, Sun-Mee
    • Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.47-59
    • /
    • 2002
  • A new brilliant service using human-voice and DTMF (Dual Tone Multi Frequency) technique is expected nowadays in order to obtain valuable information on the internet more easily. VoiceXML (Voice eXtensible Markup Language) is the right choice that makes the new service possible. In this paper, the design and implementation of IVR (Interactive Voice Response) server using VoiceXML is described, where it connects with internet and IVR server efficiently. IVR server using VoiceXML is composed of two groups: VoiceXML document handling and VoiceXML execution. Scenario part of IVR server corresponds to VoiceXML document, the execution is performed by VoiceXML execution.

  • PDF

Comparative Study on the Acoustic Characteristics of the Korean Vowel /a/ before and after LMS (후두미세수술 전후 /아/의 음향적 특성 비교)

  • Hwang, Yeon-Sin;Seong, Cheol-Jae
    • MALSORI
    • /
    • no.67
    • /
    • pp.33-60
    • /
    • 2008
  • The aim of this study is to show the differences in acoustic parameters between a pathological voice /a/ caused by vocal polyp and a normal voice /a/ produced after LMS (Laryngeal Microscopic Surgery). It was expected that voices of two kinds could be analyzed effectively in terms of HNR in specific frequency bands than in all frequency bands. For this study, 10 patients' voice were recorded before and after LMS and then were manipulated in terms of four acoustic parameter. It was found out that (a) frequency bands of 500Hz in the range of 1,000Hz to 4,000Hz were very useful to obtain HNR values; (b) frequency bands in the range of 1,248Hz to 5,500Hz on a log scale were very useful to obtain HNR values; (c) F0 dropped after LMS but not significantly; (d) the bandwidth of the second formant (B2) decreased significantly after LMS, while that of the first formant (B1) decreased after LMS but not significantly.

  • PDF

컴퓨터 합성음성 경보의 주관적 위급도 정량화

  • 박경수;장필식;이경태
    • Proceedings of the ESK Conference
    • /
    • 1997.10a
    • /
    • pp.339-345
    • /
    • 1997
  • This paper presents an experimental study of te relationship between sound parameters of synthesized voice warning and perceived (psychoacoustic) urgency. Twenty four subjcts participated in two experimental sessions to evaluate and quantify the effects of te voice parameters. Experiments showed that speech rate, fundamental frequency, fundamental frequency contour types and voice types have clear and consistent effect on perceived urgency. The results of these experiments can be applied to the improvement of existing auditory warning systems and the design of new systems.

  • PDF

Detection of Pathological Voice Using Linear Discriminant Analysis

  • Lee, Ji-Yeoun;Jeong, Sang-Bae;Choi, Hong-Shik;Hahn, Min-Soo
    • MALSORI
    • /
    • no.64
    • /
    • pp.77-88
    • /
    • 2007
  • Nowadays, mel-frequency cesptral coefficients (MFCCs) and Gaussian mixture models (GMMs) are used for the pathological voice detection. This paper suggests a method to improve the performance of the pathological/normal voice classification based on the MFCC-based GMM. We analyze the characteristics of the mel frequency-based filterbank energies using the fisher discriminant ratio (FDR). And the feature vectors through the linear discriminant analysis (LDA) transformation of the filterbank energies (FBE) and the MFCCs are implemented. An accuracy is measured by the GMM classifier. This paper shows that the FBE LDA-based GMM is a sufficiently distinct method for the pathological/normal voice classification, with a 96.6% classification performance rate. The proposed method shows better performance than the MFCC-based GMM with noticeable improvement of 54.05% in terms of error reduction.

  • PDF