• Title/Summary/Keyword: formants

Search Result 148, Processing Time 0.026 seconds

An Experimental Study of Korean Dialectal Speech (한국어 방언 음성의 실험적 연구)

  • Kim, Hyun-Gi;Choi, Young-Sook;Kim, Deok-Su
    • Speech Sciences
    • /
    • v.13 no.3
    • /
    • pp.49-65
    • /
    • 2006
  • Recently, several theories on the digital speech signal processing expanded the communication boundary between human beings and machines drastically. The aim of this study is to collect dialectal speech in Korea on a large scale and to establish a digital speech data base in order to provide the data base for further research on the Korean dialectal and the creation of value-added network. 528 informants across the country participated in this study. Acoustic characteristics of vowels and consonants are analyzed by Power spectrum and Spectrogram of CSL. Test words were made on the picture cards and letter cards which contained each vowel and each consonant in the initial position of words. Plot formants were depicted on a vowel chart and transitions of diphthongs were compared according to dialectal speech. Spectral times, VOT, VD, and TD were measured on a Spectrogram for stop consonants, and fricative frequency, intensity, and lateral formants (LF1, LF2, LF3) for fricative consonants. Nasal formants (NF1, NF2, NF3) were analyzed for different nasalities of nasal consonants. The acoustic characteristics of dialectal speech showed that young generation speakers did not show distinction between close-mid /e/ and open-mid$/\epsilon/$. The diphthongs /we/ and /wj/ showed simple vowels or diphthongs depending to dialect speech. The sibilant sound /s/ showed the aspiration preceded to fricative noise. Lateral /l/ realized variant /r/ in Kyungsang dialectal speech. The duration of nasal consonants in Chungchong dialectal speech were the longest among the dialects.

  • PDF

Voice Color Conversion Based on the Formants and Spectrum Tilt Modification (포먼트 이동과 스펙트럼 기울기의 변환을 이용한 음색 변환)

  • Son Song-Young;Hahn Min-Soo
    • MALSORI
    • /
    • no.45
    • /
    • pp.63-77
    • /
    • 2003
  • The purpose of voice color conversion is to change the speaker identity perceived from the speech signal. In this paper, we propose a new voice color conversion algorithm through the formant shifting and the spectrum-tilt modification in the frequency domain. The basic idea of this technique is to convert the positions of source formants into those of target speaker's formants through interpolation and decimation and to modify the spectrum-tilt by utilizing the information of both speakers' spectrum envelops. The LPC spectrum is adopted to evaluate the position of formant and the information of spectrum-tilt. Our algorithm enables us to convert the speaker identity rather successfully while maintaining good speech quality, since it modifies speech waveforms directly in the frequency domain.

  • PDF

An Analysis of the Vowel Formants of the Young versus Old Speakers in the Buckeye Corpus (벅아이 코퍼스에서의 연령별 모음 포먼트 분석)

  • Km, Ji-Eun;Yoon, Kyuchul
    • Phonetics and Speech Sciences
    • /
    • v.4 no.4
    • /
    • pp.29-35
    • /
    • 2012
  • The purpose of this study was to measure the first two vowel formants of the forty male and female speakers (twenty young vs. old male speakers and twenty young vs. old female speakers) from the Buckeye Corpus of Conversational Speech and to examine the vowel formant changes across two generations (younger vs. older). The results indicated that the vowel space of the younger generation (in their thirties or less) shifted to the lower left position compared to those of the older generation (in their forties or more) in both male and female speakers. When the results were compared to those of Peterson & Barney (1952), it appears that differences can be found in the size of the vowel spaces through time.

A Comparative Study on the Effects of Age on the Vowel Formants of the Korean Corpus of Spontaneous Speech (한국어 자연발화 음성코퍼스의 연령별 모음 포먼트 비교 연구)

  • Kim, Soonok;Yoon, Kyuchul
    • Phonetics and Speech Sciences
    • /
    • v.7 no.3
    • /
    • pp.65-72
    • /
    • 2015
  • The purpose of this study is to extract the first two vowel formant frequencies of the forty speakers from the Seoul corpus[8] and to compare them by the age and sex. The results showed that the vowel formants showed similar patterns between male and female speakers. All the vowels in each age group and all the age groups in each vowel had main effects on either of the formant frequencies. Whereas in English, the vowel space of the older age group moved slightly to the upper right side relative to the younger group, the location of the vowel spaces of the Korean vowels were not as consistent.

A Study on Estimation of Formants and Articulatory Motion Trajectories using RLSL Adaptive Linear Prediction Filter (RLSL 적응선형예측필터를 이용한 형성음 및 조음운동궤적 추정에 관한 연구)

  • 김동준;송영수
    • Journal of Biomedical Engineering Research
    • /
    • v.14 no.1
    • /
    • pp.1-8
    • /
    • 1993
  • In this study, the extractions of formants and articulatory motion trajectories for Korean complex vowels are performed by using the RLSL adaptive linear prediction filter. This enables us to extract accurate spectrum in transition of speech signal. This study shows that the RLSL algorithm is superior to the Levinson algorithm, specially in transition part of speech.

  • PDF

A Study on Acoustical Properties of Soprano′s Singing (소프라노의 성악 발성에 대한 음향학적 특징 연구)

  • 임동철;문소연;이행세
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.5
    • /
    • pp.60-64
    • /
    • 2000
  • This paper studies the relation between the Fundamental Frequency (F0) and the formants of simple vowels in the Korean language sung by sopranos. It is hewn that, in soprano singing, the F0 of a vowel affects its formants. For this reason the formants of simple vowels sung by sopranos must be considered in all over the soprano singing range. We recorded the five simple vowel sounds /a/, /e/, /i/, /o/, and /u/ sung by five professional sopranos from A3 (220.0Hz) to A5 (880.0Hz) in the major scale and compared the formants of the sung vowels with those of spoken vowels. We observed that F1 and F2 of sung vowels were stable in low F0 (lower than B4) but in high F0 (higher than B4), F1 and F2 lost their stabilities. In the case of /a/, /o/, and /u/, the slope of the F1-F2 graph was about 2.6, and those of the F0-F2 and F0-Fl graphs were 2.2-2.5 and 0.7-1.0, respectively. And as the F0 increases, the F1 and F2 of sung vowels /a/, /e/, /i/, /o/, and /u/ were almost the same. At A5, the Fl and F2 of five sung vowels had the same values. This results suggest that the relation between the F0 and the formants be used to synthesize soprano's singing vowels.

  • PDF

On a Performance Improvement of Speaker Recognition by using the Auditory Characteristics of Speech (음성의 청각특성을 이용한 화자식별시스템의 성능향상에 관한 연구)

  • 이윤주;오세영배재옥배명진
    • Proceedings of the IEEK Conference
    • /
    • 1998.10a
    • /
    • pp.1223-1226
    • /
    • 1998
  • The pre-emephasis filter as the conventional method emphasizes all components of high frequency that reflects the speaker characteristics. However this filter don't show the auditory characteristics of speaker's speech. In order to emphasize the perceptual characteristics, we propose the speaker recognition system that uses the perceptual weighting as the preprocessor because the Auditory characteristic of human is sensitive to the formant peaks. This filter has the characteristcs that both deemphasizes the low-formants and emphasizes the high formants. As a result of the proposed method, we improve the total recognition rate 1.7% better than the conventional method.

  • PDF

The Study of Voice Perception with Formant Analysis of Two Myna Bird's Voice Imitation (구관조 음성모방의 음향학적 분석을 통한 음성인식에 대한 고찰)

  • Lee, Ok-Bun;Jeong, Ok-Ran
    • Speech Sciences
    • /
    • v.12 no.2
    • /
    • pp.121-128
    • /
    • 2005
  • This study was an attempt to determine acoustic characteristics in myna bird's notes. Two myna birds' sounds imitating a normal male voice in his late 20's were sampled and analyzed. The analyses included the mean values of F1, F2, F3 and pitch contours. The results were as follows; First, there was a significan difference in the mean values of F1, F2, and F3 in isolatd vowel /a/ and /i/ between the myna birds' sounds and the human voice. However, there was no apparent difference in pitch contour of their formants. Second, there was a difference in pitch contour of their formants in their sentence ('hn-nyung-ha-se-yo?' meaning 'How are you?') production. Namely, the myna birds' pitch contour was located higher than that of the human's.

  • PDF

A Study on the Influence of Korean Regional Dialects to English Vowel Pronunciation and Correction (영어 모음 발음에 미치는 한국어 지역 방언의 영향과 발음 수정에 대한 연구)

  • Kim, Ji-Eun
    • Phonetics and Speech Sciences
    • /
    • v.5 no.2
    • /
    • pp.81-90
    • /
    • 2013
  • The purposes of this study are to: (1) Compare the vowel production of English front vowels produced by Korean speakers using regional dialects and; (2) Investigate and compare the effectiveness of pronunciation training for each regional dialect group. To test these objectives, the English front vowels produced by five Youngnam dialect male speakers, five Youngnam dialect female speakers, five Kangwon dialect male speakers, and five Kangwon dialect female speakers were scrutinized. These dialect groups' vowel formants and length of English front vowels were evaluated, and the post-pronunciation training values were compared with those of pre-training values. The results indicate that pronunciation training is more effective for Youngnam dialect speakers, whilst both dialect groups have more success mastering the pronunciation of /${\varepsilon}$/ over /${\ae}$/.

The Formant Frequency Differences of English Vowels as a Function of Stress and its Applications on Vowel Pronunciation Training (강세에 따른 영어 모음의 포먼트 변이와 모음 발음 교육에의 응용)

  • Kim, Ji-Eun;Yoon, Kyuchul
    • Phonetics and Speech Sciences
    • /
    • v.5 no.2
    • /
    • pp.53-58
    • /
    • 2013
  • The purpose of this study is to compare the first two vowel formants of the stressed and unstressed English vowels produced by ten young males (in their twenties and thirties) and ten old males (in their forties or fifties) from the Buckeye Corpus of Conversational Speech. The results indicate that the stressed and unstressed vowels, /i/ and $/{\ae}/$ in particular, from the two groups are different in their formant frequencies. In addition, the vowel space of the unstressed vowels is somewhat smaller than that of the stressed vowels. Specifically, the range of the second formant of the unstressed vowels and that of the first formant of the unstressed front vowels were compressed. The findings from this study can be applied to the pronunciation training for the Korean learners of English vowels. We propose that teachers of English pay attention to the stress patterns of English vowels as well as their formant frequencies.