• Title/Abstract/Keywords: Vocal Tract

172 search results (processing time: 0.024 s)

청탁의 음성학적 의미 (Phonetic meaning of clarity and turbidity)

  • 박한상
    • 말소리와 음성과학 / Vol. 9, No. 4 / pp.77-89 / 2017
  • This study investigates the phonetic meaning of clarity and turbidity (淸濁), a distinction that has been used in psychoacoustics, musicology, and linguistics in both the East and the West. To clarify this meaning, the study conducts three perception tests. First, 34 subjects made a forced choice between Clear and Turbid for each of 5 pure tones and 5 complex tones ranging from A2 to A6 in octave steps. Second, they made the same choice for each of 25 pure tones and 25 complex tones ranging from A2 to A4 in semitone steps. Third, they made the same choice for 8 vowels differing in formant and fundamental frequencies. Results showed that there is a certain range of tones that is perceived as clear, that clarity level increases as fundamental frequency increases, and that pure tones have a higher level of clarity than complex ones at the same fundamental frequency. Results also showed that vocal tract resonance enhances clarity level on the whole, and that lower vowels have a higher level of clarity than higher ones. This study is significant in that it demonstrates that clarity level is proportional to fundamental frequency and the first formant frequency, all else being equal.
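The tone ranges in the first two tests can be reproduced with the equal-temperament formula f = f_A4 · 2^(n/12). The sketch below is only an illustration: it assumes the common A4 = 440 Hz tuning, which the abstract does not state, and the function name is hypothetical.

```python
def equal_tempered_hz(semitones_from_a4, a4_hz=440.0):
    """Frequency of the note n semitones above (or below) A4.

    a4_hz = 440.0 is an assumed tuning reference, not taken from the abstract.
    """
    return a4_hz * 2.0 ** (semitones_from_a4 / 12.0)

# Test 1: A2 to A6 in octave steps (A2 is 24 semitones below A4).
octave_tones = [equal_tempered_hz(12 * k) for k in range(-2, 3)]

# Test 2: A2 to A4 in semitone steps.
semitone_tones = [equal_tempered_hz(n) for n in range(-24, 1)]

print(len(octave_tones), len(semitone_tones))
```

The list lengths match the 5 and 25 tones per condition reported in the abstract.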

입술 트릴의 방법에 따른 음향학적 및 전기성문파형검사 측정치 비교 (A comparison of acoustic & electroglottographic measures according to voiced lip trill methods)

  • 이승진;이광용;임재열;최홍식
    • 말소리와 음성과학 / Vol. 9, No. 4 / pp.107-114 / 2017
  • The purpose of the current study was to compare selected acoustic and electroglottographic measures (closed quotient, pitch, and loudness) among vowel phonation, the traditional voiced lip trill ($VLT_T$), and the modified voiced lip trill ($VLT_M$). A total of 21 participants without voice complaints produced 4-second samples using each phonation method. Results indicated that the mean closed quotient of $VLT_M$ was higher than that of vowel phonation and $VLT_T$, while its range and standard deviation were higher than those of vowel phonation. The mean, range, standard deviation, and maximum of the pitch measures of $VLT_M$ were higher than those of vowel phonation. Lastly, the mean and maximum loudness of $VLT_M$ were higher than those of $VLT_T$. In conclusion, the current data indicate that $VLT_M$ could be used as a training method for singing or as a strategy to facilitate the generalization effect of voice therapy. The results also point to the need for further study of the long-term effect of the $VLT_M$ training method. Clinical implications are discussed.

켑스트럼 분석에 의한 파형부호화의 피치변경에 관한 연구 (On a Pitch Change of the Waveform Coding by the Cepstrum Analysis of Speech Waveforms)

  • 배명진;이미숙
    • 한국음향학회지 / Vol. 11, No. 4 / pp.14-21 / 1992
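  • Among speech synthesis techniques, waveform coding is widely used as an analysis-by-synthesis method because of its superior sound quality. However, because it stores the waveform itself after removing only the redundancy, without separating the characteristics of the excitation source and the vocal tract, it is difficult to use for synthesis by rule. This paper proposes a cepstrum analysis method that can adjust the pitch period of speech waveforms stored with linear PCM coding, yielding a new pitch-change method that alters the pitch period without separating the excitation source from the waveform. As a result, synthesis by rule can be carried out with waveform-coding synthesis, which has excellent sound quality.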
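The abstract does not describe the pitch-change algorithm itself, so the following is only a sketch of the standard building block it relies on: locating the pitch period as the peak of the real cepstrum. It is not the authors' method. The naive DFT, the function names, the search range, and the synthetic impulse-train frame are all assumptions made for illustration; a practical implementation would use an FFT library.

```python
import cmath
import math

def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform; adequate for short frames."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
                  for t in range(n))
        out.append(acc / n if inverse else acc)
    return out

def real_cepstrum(frame, eps=1e-9):
    """Inverse DFT of the log magnitude spectrum (eps avoids log(0))."""
    log_mag = [math.log(abs(v) + eps) for v in dft(frame)]
    return [v.real for v in dft(log_mag, inverse=True)]

def pitch_period(frame, fs, f0_min=80.0, f0_max=400.0):
    """Quefrency (in samples) of the largest cepstral peak in the F0 range."""
    ceps = real_cepstrum(frame)
    lo, hi = int(fs / f0_max), int(fs / f0_min)
    return max(range(lo, hi + 1), key=lambda q: ceps[q])

# Hypothetical test frame: an impulse train with a 64-sample period at 8 kHz.
fs = 8000
frame = [1.0 if n % 64 == 0 else 0.0 for n in range(256)]
period = pitch_period(frame, fs)
print(period, fs / period)  # pitch period in samples, F0 in Hz
```

In the cepstral domain the slowly varying vocal tract envelope sits at low quefrencies and the excitation periodicity appears as an isolated peak, which is why the pitch period can be located without an explicit source-filter decomposition.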

목소리 특성의 청취 평가에 기초한 사상체질과 음성 특징의 상관관계 분석 (Analysis of the Relationship Between Sasang Constitutional Groups and Speech Features Based on a Listening Evaluation of Voice Characteristics)

  • 권철홍;김종열;김근호;장준수
    • 말소리와 음성과학 / Vol. 4, No. 4 / pp.71-77 / 2012
  • Sasang constitution experts use voice characteristics as an auxiliary measure for deciding a person's constitutional group. This study aims to establish a relationship between speech features and the constitutional groups through subjective listening evaluation of voice characteristics. A speech database was constructed from 841 speakers whose constitutional groups had already been diagnosed by Sasang constitution experts. Speech features related to the speech source and the vocal tract filter were extracted from five vowels and one sentence. Speech features that were statistically significant for classifying the groups were analyzed using SPSS. The features that contributed to constitution classification were speaking rate, Energy, A1, A2, A3, H1, H2, H4, and CPP for males in their 20s; F0_mean, CPP, SPI, HNR, Shimmer, Energy, A1, A2, A3, H1, H2, and H4 for females in their 20s; Energy, A1, A2, A3, H1, H2, H4, and CPP for males in their 60s; and Jitter, HNR, CPP, and SPI for females in their 60s. Experimental results show that speech technology is useful in classifying constitutional groups.

후두적출자의 음성재활 - 기관식도천자법 이외의 방법 - (Voice Rehabilitation Other than Tracheo - Esophageal Shunt Method -)

  • 김영호
    • 대한후두음성언어의학회지 / Vol. 19, No. 1 / pp.28-30 / 2008
  • The problem of voice restoration after total laryngectomy has existed ever since Billroth's first total laryngectomy in 1873. Since then, efforts to restore the voice have sought to divert tracheal air to the pharynx to produce voice, which led to the tracheo-esophageal shunt voice currently in use. With an intact pharyngoesophagus, however, there are two basic options for speech rehabilitation: the artificial larynx and esophageal voice. The artificial larynx is an electrically driven buzzer or sound transducer; its most common type is placed against a supple point on the patient's neck and introduces a mechanical sound into the tissues and air spaces of the neck. This sound, emanating from the mouth, is articulated by the intact structures of the remaining vocal tract into understandable speech. Esophageal voice is a commonly recommended method for alaryngeal speech rehabilitation, and it is produced by regurgitating the air stored in the esophagus. Successful esophageal voice is preferable to the artificial larynx, but most patients adopt only one of these methods according to their needs and how feasible it is for them to learn.

Palatal Cancer환자의 Obturator 장착전후 모음의 음향학적 특성과 말 명료도에 관한 연구 (The Study on the Acoustical Characteristics and Speech Intelligibility of Vowels Produced by the Maxillectomized Patients before and after Obturator-Wearing)

  • 최성희;정문규;김호중;표화영;심현섭;최홍식
    • 대한후두음성언어의학회지 / Vol. 10, No. 2 / pp.140-148 / 1999
  • An obturator is a prosthetic rehabilitation approach for restoring the shape and function of the defective maxilla in patients with a palatal defect. The obturator can change the shape of the vocal tract and nasality, but few reports on the effects of this change have been presented. The authors therefore performed an experimental study to compare the sizes of the vowel triangles produced by maxillectomized patients before and after obturator wearing and to consider how much improvement in speech intelligibility can be expected from wearing an obturator. Eight patients who had undergone total maxillectomy due to palatal cancer participated as subjects. They produced 5 vowels (/a/, /i/, /u/, /e/, /o/) before and after obturator wearing. The formants of the vowels were analyzed with the CSL spectrogram, and speech intelligibility was judged by 8 normal listeners. The frequencies of the first and second formants showed no significant difference before and after wearing, but the sizes of the vowel triangles, which are related to speech intelligibility, differed significantly: the vowel triangle after wearing was larger than that before wearing. Before wearing, /i/ had the lowest speech intelligibility score among the vowels. After wearing, scores increased on the whole, especially for /a/, but the intelligibility of /u/ decreased.
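The size of a vowel triangle is usually taken as the area spanned by three corner vowels in the F1-F2 plane. As a minimal sketch, assuming the corners are /a/, /i/, and /u/ (the abstract does not say which vowels define the triangle) and using made-up formant values rather than data from the study, the area can be computed with the shoelace formula:

```python
def vowel_triangle_area(a, i, u):
    """Area (Hz^2) of the triangle whose corners are (F1, F2) pairs.

    Uses the shoelace formula; the corner choice /a/, /i/, /u/ is an assumption.
    """
    (x1, y1), (x2, y2), (x3, y3) = a, i, u
    return abs(x1 * (y2 - y3) + x2 * (y3 - y1) + x3 * (y1 - y2)) / 2.0

# Hypothetical (F1, F2) values in Hz, for illustration only.
before = vowel_triangle_area(a=(700, 1200), i=(350, 2000), u=(400, 1000))
after = vowel_triangle_area(a=(800, 1300), i=(300, 2300), u=(350, 900))
print(before, after)  # 155000.0 325000.0
```

A larger area means the corner vowels are further apart acoustically, which is the sense in which the abstract relates triangle size to intelligibility.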

한국어에서의 성인과 유아의 음성 인식 비교 (Comparison of Adult and Child's Speech Recognition of Korean)

  • 유재권;이경미
    • 한국콘텐츠학회논문지 / Vol. 11, No. 5 / pp.138-147 / 2011
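  • A survey of the current state of speech databases in Korea shows that no speech database tailored to young children has been built. An analysis of studies abroad shows that child speech databases have been built for a variety of languages. Because adult speech and child speech differ linguistically, children need a speech database suited to them. To examine the differences between adult and child speech in Korean, speech recognition for children and adults was compared using HMMs. The comparison was carried out by gender, by age, and with and without vocal tract length normalization. The paper shows that, in Korean, child speech was recognized at a markedly higher rate by a recognizer built for child speech than by one built for adult speech, and that applying vocal tract length normalization helps improve the recognition rate.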
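Vocal tract length normalization is commonly implemented as a warping of the frequency axis before feature extraction. The abstract does not say which warping function the authors used, so the piecewise-linear form below is an assumption, as are the function name, the cutoff ratio, and the example warping factor.

```python
def vtln_warp(freq_hz, alpha, nyquist_hz=8000.0, cutoff_ratio=0.8):
    """Piecewise-linear frequency warp (one common VTLN form, assumed here).

    Frequencies up to the cutoff are scaled by alpha; above it, a second
    linear segment maps the Nyquist frequency onto itself so the warped
    axis still covers the full band.
    """
    if not 0.0 <= freq_hz <= nyquist_hz:
        raise ValueError("freq_hz must lie in [0, nyquist_hz]")
    if alpha <= 0.0 or alpha * cutoff_ratio >= 1.0:
        raise ValueError("alpha must satisfy 0 < alpha * cutoff_ratio < 1")
    f_cut = cutoff_ratio * nyquist_hz
    if freq_hz <= f_cut:
        return alpha * freq_hz
    slope = (nyquist_hz - alpha * f_cut) / (nyquist_hz - f_cut)
    return alpha * f_cut + slope * (freq_hz - f_cut)

# Hypothetical warping factor; in practice alpha is estimated per speaker.
warped = [vtln_warp(f, alpha=1.1) for f in (1000.0, 4000.0, 8000.0)]
print(warped)
```

In a recognizer the warp would be applied to the filterbank center frequencies, with alpha typically chosen per speaker by a search over a small grid of candidate values.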

인두피판술 전.후의 공명파라미터의 비교: 예비연구 (A Comparison of Resonance Parameters before and after Pharyngeal Flap Surgery:A Preliminary Report)

  • 강영애;강낙헌;이태용;성철재
    • 말소리와 음성과학 / Vol. 1, No. 3 / pp.133-144 / 2009
  • Pharyngeal flap surgery changes the space and shape of the oral cavity and vocal tract, and these changes alter resonance. The purpose of this study was to determine the most reliable and valuable parameters for evaluating hypernasality, using the speech of two patients before and after pharyngeal flap surgery. Each patient was asked to clearly speak the vowels /a/, /i/, /u/, /e/, /o/ for voice recording. Nine parameters were measured for each vowel: Formant (F1, F2, F3), Bandwidth (BW1, BW2, BW3), LPC energy slope ($\Delta$ |A2-A1/F2-F1|), and Band Energy (0-500 Hz, 500-1000 Hz). In discriminant analyses of the acoustic parameters, the vowels /a/ and /e/ were not significant, but the vowels /i/, /u/, and /o/ separated the two conditions effectively, with recognition scores of 95%, 100%, and 100%, respectively. The results showed that F2, BW3, and LPC slope are more important parameters than the others. Finally, a least-squares fit showed a relation between the perceptual evaluation score and the LPC energy slope.

음향음성학 파라메터를 이용한 이중모음의 분류 (Classification of Diphthongs using Acoustic Phonetic Parameters)

  • 이석명;최정윤
    • 한국음향학회지 / Vol. 32, No. 2 / pp.167-173 / 2013
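  • This paper studies acoustic-phonetic parameters for classifying diphthongs. The parameters are based on features that appear when speech is produced through the vocal tract: vowel duration, energy trajectory, and formant differences, selected by analysis of variance (ANOVA). On the TIMIT database, an experiment that distinguished only monophthongs from diphthongs yielded a balanced error rate (BER) of 17.8%, and experiments that classified /aw/, /ay/, and /oy/ against monophthongs yielded error rates of 32.9%, 29.9%, and 20.2%, respectively. In an additional experiment, combining the acoustic-phonetic parameters with MFCCs, which are widely used in speech recognition, was also found to improve performance.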
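The balanced error rate averages the error rates of the two classes, so it is not dominated by the larger class (monophthongs greatly outnumber diphthongs in running speech). The sketch below uses the usual definition; the function name and the counts are made up for illustration and are not results from the paper.

```python
def balanced_error_rate(tp, fn, tn, fp):
    """Mean of the per-class error rates for a two-class problem.

    tp/fn: positive-class items classified correctly / incorrectly.
    tn/fp: negative-class items classified correctly / incorrectly.
    """
    miss_rate = fn / (tp + fn)         # error rate on the positive class
    false_alarm_rate = fp / (tn + fp)  # error rate on the negative class
    return 0.5 * (miss_rate + false_alarm_rate)

# Hypothetical counts: 100 diphthongs (positive), 1000 monophthongs (negative).
ber = balanced_error_rate(tp=80, fn=20, tn=900, fp=100)
print(round(ber, 3))  # 0.15
```

With these counts the plain error rate would be 120/1100, about 10.9%, which understates how often the rarer class is missed.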

화자인식을 위한 퍼지상관차원 제안 (A Proposition of the Fuzzy Correlation Dimension for Speaker Recognition)

  • 유병욱;김창석;박현숙
    • 전자공학회논문지S / Vol. 36S, No. 1 / pp.115-122 / 1999
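  • This paper confirms that the speech signal is a chaotic signal and analyzes the correlation dimension for use as a speaker recognition parameter. To improve speaker identification and recognition, the strange attractor of speech, which represents an individual's vocal tract characteristics very well, is constructed, and the fuzzy correlation dimension is proposed by applying fuzzy similarity to the correlation dimension. The fuzzy correlation dimension estimates the correlation among the points that make up the attractor with a fuzzy correlation integral and finds the dimension value at which the fuzzy correlation exponent converges as the embedding dimension increases, thereby absorbing the variation between the reference-pattern attractor and the test-pattern attractor. The validity of the fuzzy correlation dimension as a speaker recognition parameter was examined by estimating distances based on the mean identification error for each speaker and reference pattern.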
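The fuzzy variant is specific to the paper and is not defined in the abstract. For orientation only, the sketch below shows the classical (non-fuzzy) ingredients it builds on: delay embedding of a signal into an attractor and the Grassberger-Procaccia correlation sum, whose log-log slope estimates the correlation dimension. All function names and parameter values are assumptions.

```python
import math
from itertools import combinations

def delay_embed(series, dim, delay):
    """Reconstruct attractor points (x[i], x[i+delay], ...) from a scalar series."""
    n_points = len(series) - (dim - 1) * delay
    return [tuple(series[i + j * delay] for j in range(dim))
            for i in range(n_points)]

def correlation_sum(points, radius):
    """Fraction of point pairs closer than radius (needs at least two points)."""
    pairs = list(combinations(points, 2))
    close = sum(1 for p, q in pairs if math.dist(p, q) < radius)
    return close / len(pairs)

def correlation_exponent(points, r_small, r_large):
    """Slope of log C(r) against log r between two radii.

    Both correlation sums must be non-zero for the logarithm to be defined.
    """
    c_small = correlation_sum(points, r_small)
    c_large = correlation_sum(points, r_large)
    return math.log(c_large / c_small) / math.log(r_large / r_small)

# Toy series for illustration; real use would embed a speech frame.
points = delay_embed([1, 2, 3, 4, 5], dim=2, delay=1)
print(points)
print(correlation_sum(points, 1.5))
```

In the classical procedure the exponent is computed for increasing embedding dimensions, and the value at which it stops growing is taken as the correlation dimension; the paper replaces the hard distance threshold with a fuzzy similarity.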
