• Title/Summary/Keyword: voice parameter

Comparisons of voice quality parameter values measured with MDVP, Praat, and TF32 (MDVP, Praat, TF32에 따른 음향학적 측정치에 대한 비교)

  • Ko, Hye-Ju;Woo, Mee-Ryung;Choi, Yaelin
    • Phonetics and Speech Sciences / v.12 no.3 / pp.73-83 / 2020
  • Measured values may differ between Multi-Dimensional Voice Program (MDVP), Praat, and Time-Frequency Analysis software (TF32), all of which are widely used in voice quality analysis, due to differences in the algorithms used in each analyzer. Therefore, this study aimed to compare the values of parameters of normal voice measured with each analyzer. After tokens of the vowel sound /a/ were collected from 35 normal adult subjects (19 male and 16 female), they were analyzed with MDVP, Praat, and TF32. The mean values obtained from Praat for jitter variables (J local, J abs, J rap, and J ppq), shimmer variables (S local, S dB, and S apq), and noise-to-harmonics ratio (NHR) were significantly lower than those from MDVP in both males and females (p<.01). The mean values of J local, J abs, and S local were significantly lower in the order MDVP, Praat, and TF32 in both genders. In conclusion, the measured values differed across voice analyzers due to the differences in the algorithms each analyzer uses. Therefore, it is important for clinicians to analyze pathologic voice after understanding the normal criteria used by each analyzer when they use a voice analyzer in clinical practice.
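
The jitter and shimmer variables compared in this abstract are perturbation measures over consecutive glottal cycles. The following is a minimal Python sketch of common textbook definitions of local jitter, absolute jitter, local shimmer, and shimmer in dB. It assumes the cycle periods and peak amplitudes have already been extracted. The function names are hypothetical, and the sketch does not reproduce the algorithms of MDVP, Praat, or TF32, whose differing cycle detection is one plausible reason their values diverge.

```python
import math

def _mean_abs_diff(values):
    """Mean absolute difference between consecutive values."""
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return sum(diffs) / len(diffs)

def jitter_abs(periods):
    """Absolute jitter: mean |T[i+1] - T[i]|, in the unit of the periods."""
    return _mean_abs_diff(periods)

def jitter_local(periods):
    """Local jitter: absolute jitter divided by the mean period (a ratio)."""
    return _mean_abs_diff(periods) / (sum(periods) / len(periods))

def shimmer_local(amplitudes):
    """Local shimmer: mean |A[i+1] - A[i]| divided by the mean amplitude."""
    return _mean_abs_diff(amplitudes) / (sum(amplitudes) / len(amplitudes))

def shimmer_db(amplitudes):
    """Shimmer in dB: mean |20 * log10(A[i+1] / A[i])|."""
    ratios = [abs(20 * math.log10(b / a))
              for a, b in zip(amplitudes, amplitudes[1:])]
    return sum(ratios) / len(ratios)

# Assumed example data: four cycle periods (seconds) and peak amplitudes.
periods = [0.010, 0.011, 0.010, 0.011]
amplitudes = [1.0, 0.9, 1.0, 0.9]
print(f"jitter (local): {100 * jitter_local(periods):.2f} %")
print(f"shimmer (dB):   {shimmer_db(amplitudes):.3f} dB")
```

Multiplying the local measures by 100 gives the percentage form in which analyzers usually report them.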

Correlation Analysis Between Vocal Fold Vibration and Voice Signal Analysis Parameter by Water Temperature (수온에 따른 성대 진동과 음성신호 분석 요소간의 상관성 분석)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The Journal of Korean Institute of Communications and Information Sciences / v.37 no.4C / pp.347-353 / 2012
  • In this paper, we carried out experiments to analyze how the vocal cords are affected by changes in water temperature. In particular, we aim to design a voice measurement system that extracts meaningful vibration patterns of the vocal cords according to the temperature of the drinking water. To this end, we measured voice analysis parameters that reflect vocal cord vibration after subjects drank water at eight temperature steps from $0^{\circ}C$ to $70^{\circ}C$ in $10^{\circ}C$ intervals. The experiments showed that after drinking water at $30^{\circ}C{\sim}40^{\circ}C$, vocal cord vibration stabilized and pronunciation accuracy improved. We conclude that water at $30^{\circ}C{\sim}40^{\circ}C$ has a beneficial effect on the vocal cords.

How to Express Emotion: Role of Prosody and Voice Quality Parameters (감정 표현 방법: 운율과 음질의 역할)

  • Lee, Sang-Min;Lee, Ho-Joon
    • Journal of the Korea Society of Computer and Information / v.19 no.11 / pp.159-166 / 2014
  • In this paper, we examine the role of emotional acoustic cues, including both prosody and voice quality parameters, in modifying the sense of a word. To extract the prosody and voice quality parameters, we used 60 pieces of speech data spoken by six speakers in five different emotional states. We analyzed eight different emotional acoustic cues and used a discriminant analysis technique to find the dominant sequence of acoustic cues. As a result, we found that anger is closely related to intensity level and 2nd formant bandwidth range; joy is related to the position of the 2nd and 3rd formant values and to intensity level; sadness is strongly related only to prosody cues such as intensity level and pitch level; and fear is related to pitch level and to the 2nd formant value with its bandwidth range. These findings can be used as a guideline for fine-tuning an emotional spoken language generation system, because these distinct sequences of acoustic cues reveal the subtle characteristics of each emotional state.

On a Detection of the ZCR-Parameter for Higher Formants of Speech Signals (음성신호의 상위 포만트에 대한 ZCR-파라미터 검출에 관한 연구)

  • 유건수
    • Proceedings of the Acoustical Society of Korea Conference / 1992.06a / pp.49-53 / 1992
  • In many applications such as speech analysis, speech coding, and speech recognition, the voiced-unvoiced decision must be made correctly for efficient processing. One of the parameters used for the voiced-unvoiced decision is the zero-crossing rate. However, the information in the higher formants of speech signals has not been represented as a zero-crossing rate.
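
The zero-crossing rate used for the voiced-unvoiced decision can be illustrated with a short Python sketch. It counts sign changes per sample transition in a frame and labels the frame voiced when the rate is low. The function names, the 8 kHz sampling rate, and the 0.1 threshold are illustrative assumptions, not values from the paper, and the sketch does not implement the higher-formant ZCR parameter the paper proposes.

```python
import math

def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)

def is_voiced(frame, threshold=0.1):
    """Assumed rule: a low zero-crossing rate suggests voiced speech."""
    return zero_crossing_rate(frame) < threshold

# Synthetic frames at an assumed 8 kHz sampling rate (400 samples = 50 ms).
fs = 8000
low = [math.sin(2 * math.pi * 100 * n / fs + 0.1) for n in range(400)]
high = [math.sin(2 * math.pi * 3000 * n / fs + 0.1) for n in range(400)]
print(is_voiced(low), is_voiced(high))
```

In practice the zero-crossing rate is usually combined with short-time energy, since a single threshold is fragile in noise.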

Design and Implementation of Voice Quality Management System by using MGCP parameter in VoIP Service (MGCP Parameter를 이용한 VoIP서비스 음성품질 관리 시스템 설계 및 구현)

  • 류내원;황부현
    • Proceedings of the Korean Information Science Society Conference / 2004.10c / pp.325-327 / 2004
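  • VoIP is a foundational technology not only for integrating voice and data but also for next-generation networks, and it is used in many services such as Internet telephony (IP telephony), video conferencing, and messenger services. Voice quality is the most important aspect of providing such VoIP services, so technology for measuring and managing it is essential. Until now, the only option has been direct measurement with quality-measurement equipment. This study uses parameter values from MGCP, an IETF standard protocol for VoIP, to calculate the R factor (G.107), the ITU-T voice quality metric, and designs and implements a system that centrally manages the voice quality of the actual calls made by all terminals and users.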
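
As a rough illustration of the R factor mentioned here, the ITU-T G.107 E-model combines impairment terms as R = Ro - Is - Id - Ie,eff + A and defines a mapping from R to an estimated MOS. The Python sketch below shows only that arithmetic. The abstract does not say how the MGCP parameters are turned into the individual impairment terms, so the function names and the example value of R are assumptions.

```python
def r_factor(ro, i_s, i_d, ie_eff, a=0.0):
    """E-model rating: R = Ro - Is - Id - Ie,eff + A."""
    return ro - i_s - i_d - ie_eff + a

def r_to_mos(r):
    """G.107 mapping from the R factor to an estimated MOS."""
    if r <= 0:
        return 1.0
    if r >= 100:
        return 4.5
    return 1 + 0.035 * r + r * (r - 60) * (100 - r) * 7e-6

# Example with an assumed rating of R = 93.2.
print(f"MOS = {r_to_mos(93.2):.2f}")
```

A central management system of the kind described could apply such a mapping to each reported call and flag calls whose estimated MOS falls below an operator-chosen limit.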

A study on speech training aids for Deafs (청각장애자용 발음훈련기기 개발에 관한 연구)

  • Ahn, Sang-Pil;Lee, Jae-Hyuk;Yoon, Tae-Sung;Park, Sang-Hui
    • Proceedings of the KIEE Conference / 1990.07a / pp.47-50 / 1990
  • Deaf people cannot speak as clearly as hearing people because they lack auditory feedback on their own pronunciation; therefore, speech training is required. In this study, fundamental frequency, intensity, formant frequencies, a vocal tract graphic, and the vocal tract area function, all extracted from the speech signal, are used as feature parameters. An AR model, whose coefficients are extracted using inverse filtering, is used as the speech generation model. To connect the vocal tract graphic with the speech parameters, articulation distances and articulation distance functions in 15 selected intervals are determined from the extracted vocal tract areas and formant frequencies.
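
One common way to estimate the coefficients of an AR (linear prediction) model is the autocorrelation method solved with the Levinson-Durbin recursion. The abstract does not state which estimation procedure the authors used, so the Python sketch below is an illustrative stand-in rather than their method, and the function names are hypothetical.

```python
def autocorr(x, max_lag):
    """Autocorrelation r[k] = sum_i x[i] * x[i + k] for k = 0..max_lag."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k))
            for k in range(max_lag + 1)]

def levinson_durbin(r, order):
    """Solve the normal equations for the predictor
    x[n] ~ a[1]*x[n-1] + ... + a[p]*x[n-p].
    Returns (coefficients a[1..p], final prediction error)."""
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        # Reflection coefficient for model order i.
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= (1 - k * k)
    return a[1:], err

# Assumed example: autocorrelation of a first-order process with pole 0.5.
coeffs, err = levinson_durbin([1.0, 0.5, 0.25], order=2)
print(coeffs, err)
```

Passing the speech frame through the inverse filter A(z) = 1 - sum of a[k] z^-k then yields the prediction residual.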

Speech Enhancement Algorithm Based on Teager Energy and Speech Absence Probability in Noisy Environments (잡음환경에서 Teager 에너지와 음성부재확률 기반의 음성향상 알고리즘)

  • Park, Yun-Sik;An, Hong-Sub;Lee, Sang-Min
    • Journal of the Institute of Electronics Engineers of Korea SP / v.49 no.3 / pp.81-88 / 2012
  • In this paper, we propose a novel speech enhancement algorithm for effective noise suppression in various noisy environments. To improve the decision between speech and noise segments, the proposed method uses a local speech absence probability (LSAP, local SAP) based on the Teager energy (TE) of noisy speech, instead of the conventional LSAP, as the feature parameter for voice activity detection (VAD) in each frequency subband. In addition, the method uses the global SAP (GSAP) derived in each frame as a weighting parameter that modifies the adopted TE operator to improve its performance. The proposed algorithm is evaluated with objective tests under various environments and yields better results than conventional methods.
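
The Teager energy underlying this method is commonly computed with the discrete Teager energy operator, psi[x(n)] = x(n)^2 - x(n-1) * x(n+1). The Python sketch below shows only that operator on a synthetic tone. It does not reproduce the paper's LSAP or GSAP computations, and the function name and test signal are assumptions.

```python
import math

def teager_energy(x):
    """Discrete Teager energy for the interior samples of x:
    psi[n] = x[n]**2 - x[n-1] * x[n+1]."""
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]

# For a pure tone A*cos(w*n) the operator is constant: A**2 * sin(w)**2.
A, w = 2.0, 0.3
tone = [A * math.cos(w * n) for n in range(50)]
te = teager_energy(tone)
print(te[0], (A ** 2) * math.sin(w) ** 2)
```

Because the output grows with both amplitude and frequency, the operator responds differently to speech and to many noise types than plain squared amplitude does, which is the property a TE-based feature relies on.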

A Study on the Change Parameter Analysis of Articulator by Intake the C8H10O2H4 (C8H10O2H4 섭취량에 의한 조음기관의 변화 요소 분석 연구)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The Journal of Korean Institute of Communications and Information Sciences / v.36 no.1B / pp.93-100 / 2011
  • In modern society, people frequently drink coffee, whether under stress at work or as a favorite beverage as interest in leisure activities grows. The taste of coffee, which has captured modern palates, depends on various processing methods such as the blending of beans. However, most coffee contains $C_8H_{10}O_2N_4$, which affects various parts of the body. This $C_8H_{10}O_2N_4$ is caffeine, the main component of coffee. In this paper, we therefore analyze how the articulators are affected as the intake of $C_8H_{10}O_2N_4$ increases, with a cup of coffee containing 250 mg. To this end, we gradually increased the amount of $C_8H_{10}O_2N_4$ intake in experiments with thirty men in their 20s. We then studied jitter, formants, and spectrum among the voice analysis parameters to assess the effects on the articulators.

A Study of Acoustic Analysis in the Chinese' Korean Language Learners (중국인 한국어 학습자 음성의 음향학적 특성 연구)

  • Kim, Hyun-Ji;You, Jae-Yeon
    • Phonetics and Speech Sciences / v.2 no.3 / pp.75-80 / 2010
  • The present research investigated voice characteristics across genders and nationalities by measuring the acoustic parameter values of Korean and Chinese students. Sound Forge was used to collect voice samples, and Praat was used to measure and analyze jitter, shimmer, NHR, $sF_0$, and pitch range. The results are as follows. First, during prolongation of the vowels, there was no significant difference in $F_0$ between Korean and Chinese males or between Korean and Chinese females, although Korean males and females had higher $F_0$ values than Chinese males and females. Second, during sentence reading, there was no significant difference in $sF_0$ between Korean and Chinese males, but there was a significant difference in $sF_0$ between the female groups. Third, during sentence reading, the pitch range of Korean males was significantly narrower than that of Korean and Chinese females, who had a wider pitch range. Fourth, jitter in the five vowels /a, i, u, e, o/ was higher in Chinese than in Korean subjects, and in the vowels /a, e, u/ it was significantly higher in females than in males. Fifth, shimmer in the vowels /a, e, u/ was significantly higher in Chinese than in Korean subjects. Finally, NHR in the vowels /a, u, o/ was significantly higher in Chinese than in Korean subjects.

End-to-end Transmission Performance of VoIP Traffics based on Mobility Pattern over MANET with IDS (IDS가 있는 MANET에서 이동패턴에 기반한 VoIP 트래픽의 종단간 전송성능)

  • Kim, Young-Dong
    • The Journal of the Korea institute of electronic communication sciences / v.9 no.7 / pp.773-778 / 2014
  • An IDS (Intrusion Detection System) can be used as a countermeasure against blackhole attacks, which degrade transmission performance through malicious intrusion into the routing function of networks. In this paper, the effect of an IDS on transmission performance is analyzed for different mobility patterns in a MANET (Mobile Ad-hoc Network), and a suggestion for an effective countermeasure is considered. Computer simulation based on NS-2 is used for the performance analysis, and VoIP (Voice over Internet Protocol) is chosen as the application service for measuring performance. MOS (Mean Opinion Score), call connection ratio, and end-to-end delay are used as performance parameters.