• Title/Summary/Keyword: formant characteristics

Acoustic Characteristics of Nasal Consonants and the Change of Nasalance according to the Sites of Nasal Obstruction (비폐색 부위에 따른 비강자음의 음향학적 특성 및 비음도의 변화)

  • 손영익;정유석;이은경;정원호
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.9 no.1 / pp.27-31 / 1998
  • Nasal sounds include nasalized vowels and nasal consonants. The nasal cavity is important for the acoustics of nasal sounds. Evaluating the effects of site-specific nasal obstruction on nasal sounds helps us understand the importance of nasal geometry for nasal sounds and predict voice changes after nasal surgery. This study was designed to analyze the changes in nasality and in the formant characteristics of nasal sounds produced by obstructing different sites around the ostiomeatal unit (OMU). Ten adult male and female volunteers participated. The nasal formants and bandwidths of the nasal consonant /n/ were measured under various conditions of nasal obstruction. The nasalance of the Rabbit, Baby, and Mama passages was compared across conditions. The nasalance of all passages decreased when the anterior portion of the OMU was obstructed. The center frequency of the first nasal formant (NF1) of /n/ decreased in the order of anterior and inferior obstruction. The bandwidth of NF1 decreased in female subjects with anterior obstruction. The anterior portion of the OMU is the most critical site for changes in nasality and in the acoustics of nasal consonants. When the anterior portion of the OMU is obstructed, the shift of NF1 to a lower frequency and the narrowing of the NF1 bandwidth are the major acoustic changes of the nasal consonant /n/.
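
The abstract above does not spell out how nasalance is computed. As a minimal sketch, assuming the common definition (nasal acoustic energy as a percentage of the combined nasal and oral energy), it could be computed as follows. The function name and the sample values are illustrative and do not come from the study.

```python
def nasalance_percent(nasal_energy: float, oral_energy: float) -> float:
    """Nasal share of the total (nasal + oral) acoustic energy, in percent.

    Assumed definition; the abstract itself gives no formula.
    """
    total = nasal_energy + oral_energy
    if total <= 0:
        raise ValueError("total acoustic energy must be positive")
    return 100.0 * nasal_energy / total


# Hypothetical energies in arbitrary units, not measured data.
example = nasalance_percent(nasal_energy=1.0, oral_energy=3.0)
```

Under this definition, reducing the nasal energy while the oral energy stays the same lowers the score, which matches the direction of the change reported for anterior obstruction.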

A Study of Acoustic Masking Effect from Formant Enhancement in Digital Hearing Aid (디지털 보청기에서의 포먼트 강조에 의한 마스킹 효과 연구)

  • Jeon, Yu-Yong;Kil, Se-Kee;Yoon, Kwang-Sub;Lee, Sang-Min
    • Journal of the Institute of Electronics Engineers of Korea SC / v.45 no.5 / pp.13-20 / 2008
  • Although digital hearing aid algorithms have been developed to compensate for hearing loss and to help hearing-impaired people communicate with others, digital hearing aid users still complain of difficulty understanding speech. One reason could be that the quality of speech delivered through a digital hearing aid is insufficient for understanding because of feedback, residual noise, and so on. Another is the masking effect among formants, which lowers sound quality. In this study, we measured the masking characteristics of normal-hearing listeners and hearing-impaired listeners with presbyacusis to confirm the masking effect within speech itself. The experiment consisted of five tests: a pure tone test, a speech reception threshold (SRT) test, a word recognition score (WRS) test, a pure tone masking test, and a speech masking test. In the speech masking test, each speech set contained 25 speech items. The log likelihood ratio (LLR) was introduced to evaluate the distortion of each speech item objectively. As a result, speech perception decreased as the amount of formant enhancement increased. The enhanced speech items within a speech set had statistically similar LLR values, but speech perception differed. This means that the acoustic masking effect, rather than distortion, influences speech perception. Indeed, frequency analysis of the speech items that listeners could not identify correctly showed a level difference of about 35 dB between the first and second formants, which is similar to the result of the pure tone masking test (normal-hearing subjects: 36.36 dB; hearing-impaired subjects: 32.86 dB). The characteristics of the masking effect differ between normal-hearing and hearing-impaired listeners. It is therefore necessary to check the characteristics of the masking effect before a hearing aid is worn and to apply these characteristics to fitting.
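
The abstract above names the log likelihood ratio (LLR) but gives no formula. The sketch below assumes the LPC-based definition commonly used in speech quality evaluation: LLR = log(a_p R_c a_p^T / a_c R_c a_c^T), where a_c and a_p are the LPC coefficient vectors of the clean and processed frames and R_c is the autocorrelation matrix of the clean frame. The LPC order, the synthetic signals, and all function names are assumptions for illustration; the study's actual settings are not stated.

```python
import math
import random


def autocorr(x, order):
    """Autocorrelation R(0..order) of one frame."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(order + 1)]


def lpc(r, order):
    """Levinson-Durbin recursion; returns [1, a1, ..., ap] of the prediction-error filter."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient
        prev = a[:]
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a


def quad_form(a, r):
    """a R a^T, with R the Toeplitz matrix built from the autocorrelation r."""
    p = len(a)
    return sum(a[i] * a[j] * r[abs(i - j)] for i in range(p) for j in range(p))


def llr(clean, processed, order=10):
    """Log likelihood ratio of a processed frame relative to a clean frame."""
    r_c = autocorr(clean, order)
    a_c = lpc(r_c, order)
    a_p = lpc(autocorr(processed, order), order)
    return math.log(quad_form(a_p, r_c) / quad_form(a_c, r_c))


# Synthetic stand-ins for speech (hypothetical, not data from the study):
# a low-pass noise frame and a high-frequency-boosted copy of it.
rng = random.Random(0)
clean = [0.0]
for _ in range(399):
    clean.append(0.9 * clean[-1] + rng.gauss(0.0, 1.0))
processed = [clean[0]] + [clean[n] - 0.95 * clean[n - 1] for n in range(1, len(clean))]

llr_same = llr(clean, clean)      # identical frames give zero distortion
llr_diff = llr(clean, processed)  # a changed spectral envelope gives a positive value
```

Because the clean frame's own LPC coefficients minimize the denominator's quadratic form, the ratio is never below 1 and the LLR is never negative. Larger values indicate a larger change in the spectral envelope.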

A Study on the Estimation of Glottal Spectrum Slope Using the LSP (Line Spectrum Pairs) (LSP를 이용한 성문 스펙트럼 기울기 추정에 관한 연구)

  • Min, So-Yeon;Jang, Kyung-A
    • Speech Sciences / v.12 no.4 / pp.43-52 / 2005
  • The common form of the pre-emphasis filter is $H(z) = 1 - az^{-1}$, where $a$ typically lies between 0.9 and 1.0 for voiced signals. This value reflects the degree of filtering and equals $R(1)/R(0)$ in the autocorrelation method. This paper proposes a new flattening algorithm to compensate for the weakened high-frequency components caused by vocal cord characteristics. We used the interval information of the LSP to estimate the formant frequencies. After the slope and inverse slope are obtained by linear interpolation among the formant frequencies, the flattening process follows. Experimental results show that the proposed algorithm effectively flattens the weakened high-frequency components. That is, the flattening characteristics could be improved by using the interval information of the LSP as the flattening factor in the process that compensates for the weakened high-frequency components.
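
As a minimal sketch of the conventional filter described in the abstract above, the code below applies $H(z) = 1 - az^{-1}$ in the time domain as y[n] = x[n] - a·x[n-1], with the coefficient taken either as a fixed value or as the autocorrelation ratio R(1)/R(0). It does not implement the paper's LSP-based flattening algorithm. The function names are illustrative, and treating the sample before the first one as zero is an assumption.

```python
def autocorr_lag(x, lag):
    """Autocorrelation R(lag) of a frame."""
    return sum(x[i] * x[i + lag] for i in range(len(x) - lag))


def optimal_coefficient(x):
    """Pre-emphasis coefficient a = R(1)/R(0); returns 0.0 for an all-zero frame."""
    r0 = autocorr_lag(x, 0)
    return autocorr_lag(x, 1) / r0 if r0 > 0 else 0.0


def pre_emphasize(x, a):
    """Apply y[n] = x[n] - a * x[n-1], treating x[-1] as 0 (assumption)."""
    previous = [0.0] + list(x[:-1])
    return [xn - a * xp for xn, xp in zip(x, previous)]


# Toy frame (hypothetical): a constant signal is strongly low-pass.
frame = [1.0, 1.0, 1.0, 1.0]
a_opt = optimal_coefficient(frame)        # R(1)/R(0) = 3/4
emphasized = pre_emphasize(frame, a_opt)  # the constant part is attenuated
fixed = pre_emphasize(frame, 0.95)        # fixed coefficient near unity
```

The closer R(1)/R(0) is to 1, the more the frame's energy sits at low frequencies and the more strongly the filter boosts the high-frequency end.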

A Study on a New Pre-emphasis Method Using the Short-Term Energy Difference of Speech Signal (음성 신호의 다구간 에너지 차를 이용한 새로운 프리엠퍼시스 방법에 관한 연구)

  • Kim, Dong-Jun;Kim, Ju-Lee
    • The Transactions of the Korean Institute of Electrical Engineers D / v.50 no.12 / pp.590-596 / 2001
  • Pre-emphasis is an essential process in speech signal processing. Two methods are widely used: the typical method, which uses a fixed value near unity, and the optimal method, which uses the autocorrelation ratio of the signal. This study proposes a new pre-emphasis method using the short-term energy difference of the speech signal, which can effectively compensate for the glottal source and lip radiation characteristics. Using the proposed pre-emphasis, speech analysis such as spectrum estimation and formant detection was performed, and the results were compared with those of the two conventional pre-emphasis methods. Speech analysis of five single vowels showed that the proposed method enhanced the spectral shapes, gave nearly constant formant frequencies, and avoided the overlapping of two adjacent formants. Comparison with FFT spectra verified these results and showed the accuracy of the proposed method. The computational complexity of the proposed method was reduced to about 50% of that of the optimal method.

Characteristics of English Vowels Spoken by Koreans (한국인 영어 모음의 특징)

  • Koo, Hee-San
    • Speech Sciences / v.7 no.3 / pp.99-108 / 2000
  • The purpose of this experimental study was to investigate the characteristics of English vowels as spoken by Korean speakers. Ten English monosyllabic words were each spoken six times by six male college students who were born and raised in Seoul. Formant frequencies were measured from sound spectrograms made with the PC Quirer. Results showed that the Korean speakers pronounced /i/ and /I/, /u/ and /$\upsilon$/, and /$\varepsilon$/ and /${\ae}$/ similarly, respectively. It seems that Korean speakers cannot differentiate tense vowels (/i/, /u/) from lax vowels (/I/, /$\upsilon$/) or pronounce low vowels such as /${\ae}$/, /a/, and /c/ clearly. Korean speakers need to practice the correct movements of the jaw, tongue, and lips when pronouncing English vowels.

SPECTRAL CHARACTERISTICS OF RESONANCE DISORDERS IN SUBMUCOSAL TYPE CLEFT PALATE PATIENTS (점막하 구개열 환자 공명장애의 스펙트럼 특성 연구)

  • Kim, Hyun-Chul;Leem, Dae-Ho;Baek, Jin-A;Shin, Hyo-Keun;Kim, Oh-Hwan;Kim, Hyun-Ki
    • Maxillofacial Plastic and Reconstructive Surgery / v.28 no.4 / pp.310-319 / 2006
  • Submucosal cleft palate is a subtype of cleft palate. It is very difficult to detect, because the palate appears normal on examination. In fact, however, there is an abnormal union of the palatal muscles in these patients. Because of late detection, treatment for patients with submucosal cleft palate, such as surgery or speech therapy, is usually delayed. Some patients visited our hospital because of a speech disorder despite a normal intraoral appearance. After precise intraoral examination, we found submucosal cleft palate. We evaluated the speech of these patients before and after surgery. In this study, we sought to identify the objective characteristics of patients with submucosal cleft palate in comparison with normal subjects and patients with complete cleft palate. The experimental groups were 10 patients with submucosal cleft palate and 10 patients with complete cleft palate who underwent surgery at our hospital. The controls were 10 normal subjects. The speech samples used in this study were five simple vowels. Using the CSL program, we evaluated formants and bandwidths. We analyzed the spectral characteristics of the speech signals of the three groups before and after surgery. In most cases, the formant values were higher in the experimental groups (the complete cleft palate group and the submucosal cleft palate group) than in the controls. The differences between the experimental groups and the controls were small for /a/, /i/, and /e/ and large for /o/ and /u/. After surgery, the formant values decreased in the experimental groups. In bandwidth, there were no significant differences between the experimental groups and the controls.

Voice Changes after Uvulopalatopharyngoplasty (구개수구개인두성형술 이후의 음성변화)

  • 손영익;김선일;윤영선;추광철;정원호
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.9 no.1 / pp.22-26 / 1998
  • Uvulopalatopharyngoplasty (UPPP) is one of the most popular surgical procedures for the treatment of obstructive sleep apnea syndrome (OSAS) occurring at the level of the oropharynx. However, voice changes after UPPP have been a challenging issue for professional voice users, because even minor changes in voice quality or articulation may be critical to professional singers, teachers, and so on. Several acoustic changes after UPPP have been proposed. However, to the authors' knowledge, there is no report on voice changes after UPPP in Korean. We measured the first, second, and third formant frequencies of /a/, /i/, and /u/ phonations, and the nasalance of the Rabbit, Baby, and Mama passages, in 20 adult male patients who had undergone UPPP. These parameters were measured preoperatively and at 1 month and 3 months after the operation. Patients were asked to report any subjective voice changes at the postoperative visits. The third formant (F3) of the /u/ phonation was significantly reduced at the 1-month postoperative measurement. The nasalance of the Mama passage was significantly increased at the 3-month postoperative measurement. No patient complained of subjective changes in voice quality, timbre, articulation, or speech. Even though there were no subjective complaints about postoperative voice changes, the significant changes in the formant characteristics of a certain vowel and in nasality after UPPP require clinicians to be more cautious and careful in deciding on UPPP for professional voice users.

Long Term Average Spectrum Characteristics of Head and Chest Register Sounds of Western Operatic Singers : Extended Study (성악다들의 목소리에 대한 Long Term Average Spectrum 분석 -$2^{nd}$ Singer's Formant의 존재 가능성에 대하여-)

  • Ban, Jae-Ho;Kwon, Young-Kyung;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.15 no.1 / pp.31-36 / 2004
  • Background and Objectives : It has been shown that the epilaryngeal tube in the human airway is responsible for vocal ring, or the singer's formant. In a previous study, the authors showed that in trained tenors, besides the conventional singer's formant in the region of 2,500 Hz, another energy peak was observed in the region of 8,000 Hz. This peak was interpreted as the second resonance of the epilarynx tube. Singers in other voice categories who produce vocal ring are assumed to have the same peak, but no measurements have yet been made. Materials and Methods : Fifteen tenors, fourteen baritones, seven sopranos, and five mezzo-sopranos attending the department of vocal music of a music college, all of whom could reliably produce the head and chest registers, were chosen for this study. Each subject was asked to produce an /ah/ sound for at least three seconds for the head register sound (tenors: G4, baritones: E4, sopranos: F5, mezzo-sopranos: C5) and for the chest register sound (tenors: C3, baritones: D3, sopranos: D4, mezzo-sopranos: A3). The sound data were analyzed using the Fast Fourier Transform (FFT)-based power spectrum and the long-term average (LTA) power spectrum, computed with the FFT algorithm of the Computerized Speech Lab (CSL, Kay Elemetrics, Model 4300B, USA). Statistical analysis was performed using the Mann-Whitney test of the Statistical Package for Social Sciences (SPSS). Results : For head register sounds, a significant increase was seen in the 2,200-3,400 Hz region (p<0.05) and the ... Similar to the head register sounds, for the chest register sounds there was a significant increase in energy in the four trained singer groups compared with the untrained group in the 2,200-3,100 Hz region (p<0.05) and the 7,800-8,400 Hz region (p<0.05). Conclusions : When good vocal production was made for the head and chest registers, an energy peak was observed near 2,500 Hz, a frequency already known as the "singer's formant", in all subjects in the study group. Another region of increased energy, which had not been noticed previously, was observed around 8,000 Hz. The authors believe this region to be the second singer's formant.

The characteristics of soprano students' voice related to the vocal methods (발성방법에 따른 소프라노 성악도의 음성 특성)

  • Kim, Jungtaek;Seong, Cheoljae
    • Phonetics and Speech Sciences / v.9 no.3 / pp.75-83 / 2017
  • The purpose of this study is to find clues to the risk of voice disorders in soprano students. The subjects were 17 soprano students and 18 general (female) students. The phonation of the vowels /a/, /i/, and /u/ at the C4 and F4 notes was recorded in each group. Then, only the soprano students recorded their classical vocalization containing vibrato. Formants, formant energy, bandwidth, VAI (vowel area index), VSA (vowel space area), and L/H ratio were analyzed. There was a significant difference in F3: the singers' F3 was measured around 3 kHz, which appears to be about 400 Hz higher than that of the general students. However, there was no significant difference in L/H ratio between the soprano students and the general students. There was also a significant difference in F3 between the soprano students' two vocalization methods: F3 in classical vocalization was about 200 Hz higher than in sustained phonation. The vocal tract was adjusted and the vowel space changed, but F3 energy, which is the index of the singer's formant, did not differ significantly by phonation method. The L/H ratio, which can be a direct indicator of vocal effort, did not differ by phonation method and decreased in all phonation methods as the pitch increased. The C4 and F4 pitches are lower than the singing range of a soprano. When the pitch changes, vocal effort increases as it does in general students, which will be an indicator of the risk of vocalization. This will be a clue to the vocalization of the immature soprano student.
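
The abstract above lists the vowel space area (VSA) among its measures without stating how it was computed. As a minimal sketch, assuming the common triangular VSA over the corner vowels /a/, /i/, and /u/ in the F1-F2 plane, the area follows from the shoelace formula. The formant values below are hypothetical placeholders, not data from the study, and the function name is illustrative.

```python
def vowel_space_area(formants):
    """Triangular vowel space area (Hz^2) from (F1, F2) pairs for /a/, /i/, /u/.

    Assumes the three-vowel (triangle) definition; uses the shoelace formula.
    """
    (x1, y1), (x2, y2), (x3, y3) = formants["a"], formants["i"], formants["u"]
    return 0.5 * abs(x1 * (y2 - y3) + x2 * (y3 - y1) + x3 * (y1 - y2))


# Hypothetical (F1, F2) values in Hz for one speaker.
example_formants = {
    "i": (300.0, 2300.0),
    "a": (800.0, 1300.0),
    "u": (350.0, 800.0),
}
area = vowel_space_area(example_formants)
```

A larger area means the corner vowels lie farther apart in the F1-F2 plane, so a change in vocal tract adjustment between phonation methods shows up as a change in this value.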

$F_2$ Formant Frequency Characteristics of the Aging Male and Female Speakers (한국어 모음에서 연령증가에 따른 제2음형대의 변화양상)

  • 김찬우;차흥억;장일환;김선태;오승철;석윤식;이영숙
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.10 no.2 / pp.119-123 / 1999
  • Background and Objectives : Conditions such as muscle atrophy, stretching of the strap muscles, and continued craniofacial growth have been cited as contributing to the changes observed in vocal tract structure and function in elderly speakers. The purpose of the present study is to compare $F_1$ and $F_2$ frequency levels in elderly and young adult male and female speakers producing a series of vowels ranging from high-front to low-back placement. Material and Methods : The subjects were two groups of young adults (10 males, 10 females; mean age 21 years, range 19-24 years) and two groups of elderly speakers (10 males, 10 females; mean age 67 years, range 60-84 years). Each subject was judged by a speech pathologist to be a speaker of unimpaired standard Korean. The headphone was positioned 2 cm from the speaker's lips. Each speaker sustained the five vowels for 5 s. Formant frequency measures were obtained by linear predictive coding analysis in the CSL Model 4300B (Kay co). Results : Repeated measures ANOVA procedures were completed on the $F_1$ and $F_2$ data for the male and female speakers. $F_2$ formant frequency levels proved to be significantly lower for elderly speakers. Conclusions : We presume that the $F_2$ vocal cavity (from the point of tongue constriction to the lips) lengthens in elderly speakers. Research designed to observe dynamic speech production more directly will be needed.