• 제목/요약/키워드: fundamental frequency of speech

검색결과 203건 처리시간 0.022초

발화조건에 따른 기본주파수 및 음성강도 변동의 특징 (Variance characteristics of speaking fundamental frequency and vocal intensity depending on utterance conditions)

  • 이무경
    • 말소리와 음성과학
    • /
    • 제4권1호
    • /
    • pp.111-118
    • /
    • 2012
  • The purpose of this study was to characterize and determine variances of speaking fundamental frequency and vocal intensity depending on gender and three utterance conditions (spontaneous speech, reading, and counting). A total of 65 undergraduate students (32 male students, 33 female students) attending universities in Daegu, South Korea participated in this study. The subjects were all in their 20s. This study used KayPENTAX's Visi-Pitch IV (Model 3950) to measure the variances of speaking fundamental frequency (SFF0) and vocal intensity (VI). As a result, this study came to the following conclusions. First, it was found that both males and females showed no significant difference in SFF0 and vocal intensity among three utterance conditions. Second, this study sought to analyze differences in the variances of SFF0 between males and females. As a result, it was found that females showed significantly higher levels of four measured variances (SFF0 $SD^{**}$, SFF0 $range^{***}$, Min $SFF0^{***}$ and Max $SFF0^{***}$) than males on spontaneous speech. However, it was found that there was no significant difference between males and females in SFF0 range on reading or in SFF0 SD and SFF0 range on counting. It was found that there was no significant difference between males and females in the level of measured variances of vocal intensity depending on utterance conditions. Finally, this study made a comparison and analysis on differences in the variances of SFF0 and vocal intensity among utterance conditions. As a result, it was found that all the measured variances of SFF0 in males were most significantly reduced depending upon spontaneous speech which was followed by reading and counting respectively (SFF0 SD: p<.001, SFF0 range: p<.05, Max SFF0: p<.05). Females however, show no significant difference in the measured variances of SFF0 depending upon three utterance conditions. It was also found that the measured variances of vocal intensity in females were most significantly reduced depending on spontaneous speech that was followed by reading and counting (VI SD: p<.001, VI range: p<.001, Min VI: p<.01 Max VI: p<.05), while males showed no significant difference in the measured variances of vocal intensity depending on three utterance conditions. In sum, these findings suggest that variances of SFF0 in males are affected by three utterance conditions, while variances of vocal intensity in females are affected by three utterance conditions.

F-ratio of Speaker Variability in Emotional Speech

  • Yi, So-Pae
    • 음성과학
    • /
    • 제15권1호
    • /
    • pp.63-72
    • /
    • 2008
  • Various acoustic features were extracted and analyzed to estimate the inter- and intra-speaker variability of emotional speech. Tokens of vowel /a/ from sentences spoken with different modes of emotion (sadness, neutral, happiness, fear and anger) were analyzed. All of the acoustic features (fundamental frequency, spectral slope, HNR, H1-A1 and formant frequency) indicated greater contribution to inter- than intra-speaker variability across all emotions. Each acoustic feature of speech signal showed a different degree of contribution to speaker discrimination in different emotional modes. Sadness and neutral indicated greater speaker discrimination than other emotional modes (happiness, fear, anger in descending order of F-ratio). In other words, the speaker specificity was better represented in sadness and neutral than in happiness, fear and anger with any of the acoustic features.

  • PDF

청각 장애인용 통합형 발음 훈련 기기의 개발 (Development of Integrated Speech Training Aids for Hearing Impaired)

  • 박상희;김동준
    • 대한의용생체공학회:의공학회지
    • /
    • 제13권4호
    • /
    • pp.275-284
    • /
    • 1992
  • Development of Integrated Speech Training Aids for Hearing Impaired In this study, a spepch lralnlng aids that can do real-time display of vocal tract shape and other speech parameters together in a single system is implemenLed and self-training program for this system is developed. To estimate vocal tract shape, speech production process is assumed to be AR model. Through LPC analysis, vocal tract shape, intensity, and log spcclrum are calculated. And, fundamental frequency and nasality are measured using vibration sensors.

  • PDF

후두근전적출술과 Provox 삽입술 후 기관식도발성에 관한 연구 (The Analysis of Tracheoesophageal Voice after Near-Total Laryngectomy and Implantation of Provox Prosthesis)

  • 최인자;노영수;김진환;안회영
    • 대한후두음성언어의학회지
    • /
    • 제15권2호
    • /
    • pp.141-144
    • /
    • 2004
  • Background and Objectives : To compare acoustic, aerodynamic analysis of voice and intelligibility score in patients with near-total laryngectomy and implantation of Provox prothesis. Material and Methods : In order to evaluate the voice characteristics, acoustic, aerodynamic parameter and speech intelligibility were measured in 5 patients after near-total laryngectomy, 5 patients after implantation of Provox prosthesis with total bility were measured in 5 patients after near-total laryngectomy, 5 patients after implantation of Provox prosthesis with total laryngectomy and 10 adults normal speaker. Acoustic analysis was carried out using CSL and aerodynamic analysis was carried out using Aerophon II. Speech sample was recorded and 10 listener was scored for speech intelligibility using a percentage of words correctly identified. Results. Fundamental frequency($F_0$), intensity, jitter, shimmer, maximal phonation time(MPT), subglottic air pressure were used for parameters for voice analysis. There were no significant difference between two group except on fundamental frequency and shimmer. The fundamental frequency was higher in patients with near-total laryngectomy and shimmer was higher in patients after implantation of Provox prosthesis with total laryngectomy. In addition, speech intelligibility was no significant difference between two groups. Conclusion : This results confirm that near-total laryngectomy and implantation of Provox prosthesis provides good voice rehabilitation.

  • PDF

고조파 분석에 의한 음성신호의 피치 검출 (Pitch Extraction of Speech Signals by the Harmonics analysis)

  • 김기희;최정아;배명진;안수길
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1987년도 전기.전자공학 학술대회 논문집(II)
    • /
    • pp.1610-1614
    • /
    • 1987
  • The harmonies of the fundamental frequency in speech signal make a minute line spectrum in frequency domain. In this paper, we propose a new algorithm to detect a pitch interval in voiced sound based on the fact that the number of harmonies can represent the period of the pitch in the time domain.

  • PDF

Shimmer Change According to Fundamental Frequency Variation of Korean Normal Adults

  • Pyo, Hwa-Young;Sim, Hyun-Sub
    • 음성과학
    • /
    • 제10권1호
    • /
    • pp.143-152
    • /
    • 2003
  • The present study was performed to investigate change in shimmer according to $F_{0}$ variation precisely, and to offer suggestions for a clinical application. The analysis for the present study was done by the fundamental frequency ($F_{0}$) and shimmer measurement results of the previous 120 Korean normal adults' voice study of Pyo et al. (2002), used three vowels, /i/, /a/, /and /u/. Through the analysis of 60 female samples from the previous study, we found that $F_{0}$ of the vowels was the highest in /u/, and the lowest in /a/, but, on the contrary, shimmer was highest in /a/and lowest in /u/. Thirty of 60 subjects showed such an inverse relationship between $F_{0}$ and shimmer, as a whole. In the vowel /a/, 47 of 60 subjects showed the increased $F_{0}$ and decreased shimmer, in /i/, 32 subjects, and in /u/, 33 subjects showed the same results. The decrease in shimmer means the improvement of voice quality, so by these results, we expect to answer the question why the patients with spasmodic dysphonia can improve their voice quality with increased pitched voice production.

  • PDF

스펙트로그램을 이용한 근위축성측삭경화증 여성 화자의 모음 포먼트, 음성강도, 기본주파수의 변화 (Characteristics of Vowel Formants, Voice Intensity, and Fundamental Frequency of Female with Amyotrophic Lateral Sclerosis using Spectrograms)

  • 변해원
    • 한국융합학회논문지
    • /
    • 제10권9호
    • /
    • pp.193-198
    • /
    • 2019
  • 본 연구는 근위축성측삭경화증(amyotrophic lateral sclerosis, ALS)으로 진단된 여성을 대상으로 음향음성학적 스펙트로그램 분석을 이용하여 11개월 동안 모음과 이중모음의 포먼트 변화(vowel formant variation)를 분석하였다. 검사어는 단모음 /a, i, u/와 이중모음 /h + ja + da/, /h + wi + da/, /h +ɰi+ da/를 이용하였다. 발화자료는 'Alvin' 프로그램을 이용하여 모니터에 제시된 단어읽기과제를 통해 수집되었고, 녹음환경은 nyquist frequency는 5,500Hz, sampling rate는 11,000Hz으로 설정하였다. 녹음자료는 스펙트로그램을 이용하여 강도, 음도와 이중모음의 포먼트를 분석하였다. 분석결과, ALS의 진행과정에서 기본주파수와 강도가 저하되었고, 단모음에서의 포먼트 변화보다는 이중모음의 포먼트 기울기의 감소가 특징으로 확인되었다. 이 결과는 병의 진행에 따른 ALS의 모음왜곡이 혀와 턱의 협응력 감소에 기인함을 시사한다.

한국어 발화음성에서 중점단어 탐색을 위한 기본주파수에 대한 연구 (A Study of Fundamental Frequency for Focused Word Spotting in Spoken Korean)

  • 권순일;박지형;박능수
    • 정보처리학회논문지B
    • /
    • 제15B권6호
    • /
    • pp.595-602
    • /
    • 2008
  • 각 문장 별 중점단어는 발화음성을 인식하고 그 의미를 이해하는데 도움을 준다. 발화된 음성신호로부터 중점단어를 탐색할 수 있는 방법을 찾기 위한 노력의 일환으로 실험을 통하여 문장 내에서 중점단어와 그 외의 단어들의 기본주파수의 평균과 분산, 그리고 평균 에너지를 분석해 보았다. 한국어로 된 100개의 발화문장의 음성데이터를 가지고 실험을 한 결과 중점단어는 그 외의 단어들에 비해 대부분 상대적으로 높은 기본주파수의 평균값을 나타내거나 상대적으로 높은 기본주파수의 분산 값을 나타냈다. 이 연구 결과를 이용하면 한국어의 구어문장에서 운율적 특성을 알 수 있을 뿐만 아니라, 자연어 처리를 이용한 핵심어를 추출하는 데에도 도움이 될 것이다.

정현파 모델을 이용한 2.4kbps 음성부호화 알고리즘 (2.4kbps Speech Coding Algorithm Using the Sinusoidal Model)

  • 백성기;배건성
    • 한국통신학회논문지
    • /
    • 제27권3A호
    • /
    • pp.196-204
    • /
    • 2002
  • STC(Sinusoidal Transform Coding) 방식은 주파수 영역에서 음성신호의 스펙트럼 피크치들을 정현파로 모델링하여 합성하는 음성부호화 방식을 말한다. 저전송률 STC 방식에서는 스펙트럼의 모든 피크를 이용하는 대신, 기본 주파수와 고조파에 해당하는 스펙트럼 포락선에서의 크기와 그때의 위상을 이용하여 음성을 합성한다. 본 논문에서는 정현파 모델에 기반한 2.4kbps 음성부호화 알고리즘을 제안한다. 피치정보는 모든 스펙트럼 피크를 사용한 합성음과 선택된 주파수와 고조파를 이용한 합성음과의 평균자승에러를 이용하여 추정하고, 위상정보는 여기신호 펄스의 시작시기를 나타내는 onset time과 성도 모델 전달함수의 위상을 이용하여 얻는다. 크기정보는 SEEVOC 알고리즘과 선형예측계수를 이용하여 추정한다. 실험결과, 합성음의 스펙트럼 특성은 원음성의 포만트 정보를 대부분 가지고 있으며, 위상정보도 원음성의 위상을 잘 따라감을 확인하였다. 합성음의 음질평가를 위해서 informal한 MOS(Mean Opinion Score) 테스트를 시행하였으며, 2.0kbps의 HVXC와 비교하여 대체적으로 MOS 3.1 이상의 음질을 얻을 수 있었다.

내전형 연축성 발성장애의 연속 발화 특성 (Characteristics of Connected Speech in ADSD)

  • 황연신;김재옥;최홍식
    • 말소리와 음성과학
    • /
    • 제1권1호
    • /
    • pp.93-98
    • /
    • 2009
  • The aim of this study was to investigate voice characteristics of adductive spasmodic dysphonia(ADSD) by measuring electroglottal and acoustic examination at the sentence level. The clinical records of 86 ADSD female patients (age group of $20{\sim}50$ years) and the control records of 86 normal females (age group of $20{\sim}40$ years) were recorded by speech studio(Laryngograph Ltd., UK). An independent t-test was used to compare ADSD and normal group. Results were as follows. (1) Fundamental frequency($F_0$) was significantly decreased in ADSD compared with normal group. (2) Irregularity of frequency and closed quotient(CQ) was significantly increased in ADSD compared with normal group. (3) Voiceless duration increased and voiced duration was significantly decreased in ADSD compared with normal group. (4) Fricative duration was increased in ADSD compared with normal group but it wasn't significant. In conclusion, strained, tight and choked voice shows an increase of CQ, tremor voice shows an increase of irregularity of frequency and less feminine voice shows decrease of $F_0$. Increase of voiceless duration and fricative duration and decrease of voiced duration related with diminution speech intelligibility.

  • PDF