• Title/Summary/Keyword: formant characteristics

A Study on Extraction of Vocal Tract Characteristic After Canceling the Vocal Cord Property Using the Line Spectrum Pairs (선형 스펙트럼쌍을 이용한 성문특성이 제거된 성도특성 추출법에 관한 연구)

  • 민소연;장경아;배명진
    • The Journal of the Acoustical Society of Korea / v.21 no.7 / pp.665-670 / 2002
  • The most common form of pre-emphasis is y(n) = s(n) - A·s(n-1), where A typically lies between 0.9 and 1.0 for voiced signals. This value reflects the degree of pre-emphasis and equals R(1)/R(0) in the conventional method. This paper proposes a new flattening method to compensate for the weakened high-frequency components caused by the vocal cord characteristic. We used the interval information of the LSPs to estimate the formant frequencies. After the slope and inverse slope are obtained by linear interpolation among the formant frequencies, the flattening process follows. Experimental results show that the proposed method effectively flattens the weakened high-frequency components. That is, the flattening characteristics were improved by using the LSP interval information as the flattening factor in the process that compensates for the weakened high-frequency components.
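
The conventional pre-emphasis described in this abstract can be sketched as follows. This is a minimal illustration of y(n) = s(n) - A·s(n-1) with A = R(1)/R(0), not the LSP-based flattening method the paper proposes. The function names `autocorr` and `pre_emphasis` are hypothetical, and the sketch assumes s(-1) = 0 and an autocorrelation computed over the whole input.

```python
def autocorr(signal, lag):
    """Autocorrelation R(lag) = sum over n of s(n) * s(n - lag)."""
    return sum(signal[n] * signal[n - lag] for n in range(lag, len(signal)))


def pre_emphasis(signal, coeff=None):
    """Apply y(n) = s(n) - A * s(n-1).

    If coeff is None, A is set to R(1)/R(0), the conventional choice
    mentioned in the abstract. Assumes s(-1) = 0 for the first sample.
    """
    if not signal:
        return [], coeff
    if coeff is None:
        r0 = autocorr(signal, 0)
        # Fall back to A = 0 for an all-zero signal (an assumption of this sketch).
        coeff = autocorr(signal, 1) / r0 if r0 > 0 else 0.0
    out = [signal[0]]
    for n in range(1, len(signal)):
        out.append(signal[n] - coeff * signal[n - 1])
    return out, coeff
```

Calling `pre_emphasis(samples)` returns the filtered samples together with the value of A that was used, so a fixed value such as 0.95 can be compared against the adaptive R(1)/R(0) choice.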

Improvement of Synthetic Speech Quality using a New Spectral Smoothing Technique (새로운 스펙트럼 완만화에 의한 합성 음질 개선)

  • 장효종;최형일
    • Journal of KIISE:Software and Applications / v.30 no.11 / pp.1037-1043 / 2003
  • This paper describes a speech synthesis technique that uses the diphone as the unit phoneme. Speech synthesis is basically accomplished by concatenating unit phonemes, and its major problem is discontinuity at the junction between unit phonemes. To solve this problem, this paper proposes a new spectral smoothing technique that reflects not only formant trajectories but also the distribution characteristics of the spectrum and human auditory characteristics. That is, the proposed technique decides the amount and extent of smoothing by considering human auditory characteristics at the junction of unit phonemes, and then performs spectral smoothing using weights calculated along the time axis at the border of two diphones. The proposed technique reduces the discontinuity and minimizes the distortion caused by spectral smoothing. For performance evaluation, we tested five hundred diphones extracted from twenty sentences, using ETRI Voice DB samples and individually self-recorded samples.
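
The idea of smoothing with weights computed along the time axis at a diphone border can be illustrated with a generic sketch. It is not the paper's algorithm: the abstract does not give the weighting rule or how the smoothing extent is derived from auditory characteristics, so this example assumes a fixed `extent` (in frames) and linearly decaying weights. The function name `smooth_boundary` and the frame representation (a list of spectral values per frame) are hypothetical.

```python
def smooth_boundary(left, right, extent):
    """Reduce the spectral jump between two concatenated units.

    left, right: lists of frames; each frame is a list of spectral values.
    extent: number of frames on each side to adjust (assumed fixed here).
    Each adjusted frame is shifted toward the midpoint of the two border
    frames, with a weight that decays linearly with distance from the border.
    """
    new_left = [list(frame) for frame in left]
    new_right = [list(frame) for frame in right]
    if not left or not right or extent < 1:
        return new_left, new_right

    # Half of the jump between the last left frame and the first right frame.
    half_gap = [(r - l) / 2.0 for l, r in zip(left[-1], right[0])]

    for d in range(min(extent, len(left))):
        weight = 1.0 - d / extent          # 1 at the border, smaller further away
        idx = len(left) - 1 - d
        new_left[idx] = [v + weight * g for v, g in zip(left[idx], half_gap)]

    for d in range(min(extent, len(right))):
        weight = 1.0 - d / extent
        new_right[d] = [v - weight * g for v, g in zip(right[d], half_gap)]

    return new_left, new_right
```

With this rule the two border frames meet at their midpoint, and frames further than `extent` from the border are left unchanged, which limits the distortion introduced by smoothing.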

Long Term Average Spectrum Characteristics of Speaking Voice of Western Operatic Singers (Long Term Average Spectrum을 이용한 성악가들의 Speaking Voice 분석)

  • Lee, Kyung-Chul;Hong, Seok-Jin;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.15 no.2 / pp.122-127 / 2004
  • Background and Objectives: Many studies have described and analyzed the singer's formant, and it has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. A similar phenomenon produced by trained singers in their speech led some authors to examine the speaker's ring. This study was designed to analyze the speaking voice of singers and the speaker's ring. Materials and Methods: Ten tenors, fifteen baritones, fifteen sopranos, and ten mezzo-sopranos attending the department of vocal music at a music college were chosen for this study. Fifteen male and fifteen female untrained normal speakers were chosen as the control group. Each subject was asked to produce a sustained spoken vowel /ah/ for at least five seconds and to read the sentence 'Kaeul'. The sound data were analyzed using the Fast Fourier Transform (FFT)-based power spectrum and the long term average (LTA) power spectrum, computed with the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test in the Statistical Package for Social Sciences (SPSS). Results: For the LTA power spectrum of the /ah/ sound, a significant increase was seen in the 2,500-3,500 Hz region (p<0.01) in the four trained singer groups compared with the untrained speaker group, and a significant increase was seen in the 9,000-10,000 Hz region (p<0.01) in the soprano group. Similarly, for the sentence 'Kaeul', there was a significant increase in energy in the 2,500-3,500 Hz region (p<0.01) in the tenor, baritone, and mezzo-soprano groups compared with the untrained speaker group, and a significant increase in all frequency regions (p<0.01) in the soprano group. Conclusions: The LTA power spectrum suggests that the trained singer groups show greater energy concentration in the 'singer's formant' region in the speaking voice, and the authors believe this region to be the 'speaker's ring'. Further research is needed on the effect of singing training on the resonance of the speaking voice.

General Acoustical Characteristics of Pansori Singing Voice (판소리 발성의 전반적인 음향학적 특징)

  • Moon Seung-Jae
    • MALSORI / no.42 / pp.15-24 / 2001
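  • To investigate the characteristics of pansori, the voices of eight master singers were analyzed. In all of them, sounds were found that were aperiodic despite being voiced. This phenomenon was attributed to very high subglottal air pressure. These aperiodic voiced sounds also appeared in the master singers' ordinary conversation, from which it could be inferred that the phenomenon results from permanent changes in the vocal folds. In addition, the vibrato in pansori was confirmed to have a much longer period and a much wider range than that of Western opera. Furthermore, all of the master singers showed very high energy in the high-frequency region, which distinguishes their phonation from that of ordinary speakers, and some master singers in particular showed unusually strong harmonics just below 1000 Hz, in contrast to the so-called singer's formant of Western music.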

Experimental Phonetic Study of Yanjin Sino-Korean Dialect (연변 조선족 방언 음성의 실험적 연구)

  • Kim, Hyun-Gi
    • Phonetics and Speech Sciences / v.1 no.1 / pp.47-52 / 2009
  • The speech of Sino-Koreans has evolved for geopolitical reasons since 1945. The aim of this study is to collect Yanji dialectal speech and to compare it with South Korean dialectal speech. Twenty Yanbian University students participated as informants. Acoustic speech information was analyzed using the Multi-Speech Windows Vista version. The dialectal speech characteristics of Yanji Sino-Korean showed the posterior vowel /ɑ/ and neutralization of the mid vowel /o/ between /o/ and /ɔ/. Lenis stops showed a tendency toward glottalization based on VOT values. Sibilants contained aspiration following constriction, and the lateral /l/ was realized as the approximant /r/.

A Study on the Phonetic Parameters Used on the Voice Imitation (모방의 대상이 되는 음성적 특성에 관한 연구)

  • Park Jihye;Shin Jiyoung;Kang Sunmee
    • Proceedings of the KSPS conference / 2003.05a / pp.187-190 / 2003
  • The purpose of this paper is to investigate the phonetic parameters used in voice imitation. First of all, the fundamental frequency is imitated effectively. Distinctive prosodic patterns are used repeatedly in voice imitation. Speaking rate is used to a particular degree when the target speaker has an extraordinary speaking rate. Formant frequency is also imitated in various ways. In sum, the distinctive characteristics perceived by the listener are used in voice imitation.

An Study on the Correlation between Sound Characteristics and Sasang Constitution by CSL (CSL을 통한 음향특성과 사상체질간의 상관성 연구)

  • Shin, Mi-ran;Kim, Dal-lae
    • Journal of Sasang Constitutional Medicine / v.11 no.1 / pp.137-157 / 1999
  • The purpose of this study is to aid the classification of Sasang Constitution through its correlation with sound characteristics. The study was carried out under the assumption that Sasang Constitution correlates with the sound spectrogram. The following results were obtained by comparing and analyzing the correlation between the sound spectrogram and Sasang Constitution. 1. In the survey, Soeumin described their voice as low-toned, smooth, and quiet; Soyangin described their voice as high, clear, fast, and random in speech; Taeumin described their voice as low, thick, and muddy. 2. Taeyangin was significantly slow compared with the others in the time taken to read the composition. Taeyangin was significantly slow compared with the others in Formant frequency 1. Taeyangin was significantly discriminated from Soeumin in Formant frequency 5. Taeyangin was significantly low compared with the others in Bandwidth 2. Soeumin was significantly low compared with Taeyangin in Pitch Maximum and in Pitch Maximum minus Pitch Minimum. Taeyangin was significantly high compared with the others in Energy mean. 3. In the list of specification, the discrimination rate was higher than that obtained with the lists of 13 in the results of the multi-dimensional 4-class minimum-distance method. In the results of one-way ANOVA and discriminant analysis in SPSS/PC+, the discrimination rate for the three dispositions excluding Soyangin was higher than that for all four dispositions. With CART, the estimated rate of Sasang Constitution discrimination was higher than with any other method. These results suggest that there is a correlation between the sound spectrogram and Sasang Constitution, and that classification through sound spectrogram analysis can serve as an auxiliary method for making Sasang Constitution classification more objective.

On a Performance Improvement of Speaker Recognition by using the Auditory Characteristics of Speech (음성의 청각특성을 이용한 화자식별시스템의 성능향상에 관한 연구)

  • 이윤주;오세영;배재옥;배명진
    • Proceedings of the IEEK Conference / 1998.10a / pp.1223-1226 / 1998
  • The pre-emphasis filter, as the conventional method, emphasizes all high-frequency components that reflect speaker characteristics. However, this filter does not reflect the auditory characteristics of the speaker's speech. To emphasize the perceptual characteristics, we propose a speaker recognition system that uses perceptual weighting as the preprocessor, because human hearing is sensitive to the formant peaks. This filter both de-emphasizes the low formants and emphasizes the high formants. With the proposed method, the total recognition rate improved by 1.7% over the conventional method.

Characteristics of the Korean speakers' voice under easy Korean, difficult Korean and English reading situations (한국인의 쉬운 한국어, 어려운 한국어, 영어 읽기 상황에서의 음성 특성)

  • Kim, Ji-Eun
    • Phonetics and Speech Sciences / v.8 no.1 / pp.1-7 / 2016
  • The purpose of this study is to identify the acoustic characteristics of the voice under stressful and relaxed conditions. Ten male undergraduate students participated in this study and produced the vowels /a/, /e/, and /i/ (아, 에, 이) in English reading and difficult Korean reading under stressful conditions, and in easy Korean reading under relaxed conditions. F0, jitter, shimmer, NHR, F1, F2, and F3 values were then measured and analyzed. The results demonstrate that the speech parameters related to stress are jitter, shimmer, and NHR, in that these values are lower in relaxed situations (easy Korean reading) than in stressful situations (English and difficult Korean reading). This study will serve as a foundation for verifying that the analysis of acoustic characteristics can be a quantitative tool for measuring stress levels.
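
Jitter and shimmer, two of the parameters reported above, are commonly computed as cycle-to-cycle perturbation measures. The sketch below uses the widely used "local" definition (mean absolute difference between consecutive cycles, divided by the mean, as a percentage). The abstract does not say which definition or software the study used, so this is an assumption, and the function name `local_perturbation_percent` is hypothetical.

```python
def local_perturbation_percent(values):
    """Mean absolute difference of consecutive values, relative to their mean (%).

    Pass glottal cycle durations to get local jitter, or per-cycle peak
    amplitudes to get local shimmer. Assumes the cycles were already
    extracted by a pitch-marking step that is not shown here.
    """
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    if mean_value == 0:
        raise ValueError("mean of values is zero")
    return 100.0 * mean_diff / mean_value
```

A perfectly periodic voice gives 0% for both measures; larger values indicate more cycle-to-cycle irregularity in period (jitter) or amplitude (shimmer).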

A comparative study of the acoustic characteristics of the vowel /a/ between children with spastic and dyskinetic cerebral palsy (경직형과 불수의운동형 뇌성마비아동의 /아/ 모음 음향학적 비교)

  • Jeong, Pil Yeon;Sim, Hyun Sub
    • Phonetics and Speech Sciences / v.12 no.1 / pp.65-74 / 2020
  • This study aims to compare the acoustic characteristics of vowel phonation in children with spastic and dyskinetic cerebral palsy (CP). Thirty-four children aged 4-12 years with CP participated in the study (spastic 26, dyskinetic 8). Voice samples for the acoustic analysis were extracted from a sustained vowel /a/. All acoustic measures were made using Praat. Group differences were compared with an independent t-test, or with the Welch-Aspin test if the equal-variance assumption was not met. The results of this study are as follows. First, maximum phonation time (MPT) was significantly shorter for the dyskinetic CP group than for the spastic CP group. Second, shimmer percent was significantly higher in the dyskinetic CP group than in the spastic CP group. Lastly, there were no significant group differences in either the first formant or the second formant. These findings indicate that children with dyskinetic CP have poorer respiratory capacity and poorer laryngeal function than children with spastic CP. On the other hand, both groups have a comparable ability to articulate the vowel /a/. The results of the present study help speech-language pathologists identify the speech motor control ability of children with the two types of CP (spastic and dyskinetic) and help in making an intervention plan associated with a specific type of CP.
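
For readers unfamiliar with the unequal-variance comparison mentioned in this abstract, the following is a minimal sketch of the Welch statistic and its Welch-Satterthwaite degrees of freedom. It is a textbook formulation, not the study's analysis script; the function name `welch_t` is hypothetical, and the p-value step (which needs the t distribution) is left out.

```python
import math
import statistics


def welch_t(sample_a, sample_b):
    """Return (t, df) for Welch's unequal-variance two-sample comparison.

    t  = (mean_a - mean_b) / sqrt(var_a/n_a + var_b/n_b)
    df = Welch-Satterthwaite approximation.
    Uses sample variances (n - 1 denominator); each group needs >= 2 values.
    """
    n_a, n_b = len(sample_a), len(sample_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least two observations")
    se_a = statistics.variance(sample_a) / n_a
    se_b = statistics.variance(sample_b) / n_b
    if se_a + se_b == 0:
        raise ValueError("both groups have zero variance")
    t = (statistics.mean(sample_a) - statistics.mean(sample_b)) / math.sqrt(se_a + se_b)
    df = (se_a + se_b) ** 2 / (se_a ** 2 / (n_a - 1) + se_b ** 2 / (n_b - 1))
    return t, df
```

Because the group sizes in the study are unequal (26 and 8), the degrees of freedom from this approximation will generally be smaller than the pooled value of n_a + n_b - 2, which makes the test more conservative when variances differ.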