• Title/Summary/Keyword: speech quality

The Effect of Vocal Function Exercise on Voice Improvement in Patients with Vocal Nodules (성대 기능 훈련이 성대결절 환자의 음성개선에 미치는 효과)

  • Lim, Hye-Jin; Kim, Jeong-Kyu; Kwon, Do-Ha; Park, Jun-Young
    • Phonetics and Speech Sciences / v.1 no.2 / pp.37-42 / 2009
  • The purpose of the present study was to determine the effect of a management program known as vocal function exercise (VFE) on voice quality. The typical VFE protocol was modified for patients with vocal nodules by controlling voice intensity and relaxing the vocal folds, in order to address the hyperfunctional problems of standard VFE. Eight female subjects aged 28 to 54 who had been diagnosed with vocal nodules took part in the study. The patients performed VFE once a week for eight weeks. The program consisted of voice hygiene, respiratory training, phonation training, and glide training. The subjects' voices were analyzed before and after therapy in terms of acoustic measures, maximum phonation time (MPT), GRBAS, and the voice handicap index (VHI). Among the acoustic parameters, fundamental frequency ($F_0$) increased significantly, shimmer decreased markedly, and the noise-to-harmonic ratio (NHR) was clearly lower. MPT also increased significantly. The GRBAS scale indicated significant improvement in grade, roughness, and strain. The VHI indicated significant improvement in the emotional domain. In conclusion, VFE was effective in improving voice quality in patients with vocal nodules.

The Assessment on the Sound Quality of Reduced Frequency Selectivity of Hearing Impaired People (난청인의 주파수 선택도 둔화현상이 음질에 미치는 영향 평가)

  • An, Hong-Sub; Park, Gyu-Seok; Jeon, Yu-Yong; Song, Young-Rok; Lee, Sang-Min
    • The Transactions of The Korean Institute of Electrical Engineers / v.60 no.6 / pp.1196-1203 / 2011
  • Reduced frequency selectivity is a typical phenomenon of sensorineural hearing loss. In this paper, we compared two methods for modeling the reduced frequency selectivity of hearing-impaired people. The two models were built using an LPC (linear prediction coding) algorithm and a bandwidth control algorithm based on the ERB (equivalent rectangular bandwidth) of the auditory filter, respectively. To compare the effectiveness of the two models, we compared PESQ (perceptual evaluation of speech quality) and LLR (log likelihood ratio) results using 36 two-syllable Korean words. To verify the effect under noisy conditions, we mixed white noise and babble noise with the speech words at SNRs of 0 dB and -3 dB. The results confirmed that the PESQ score of the bandwidth control algorithm was higher than that of the LPC algorithm, whereas the LLR score of the LPC algorithm was lower than that of the bandwidth control algorithm. This means that both the nonlinearity and the widened auditory filter characteristics caused by reduced frequency selectivity are reflected better by the bandwidth control algorithm than by the LPC algorithm.
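
The noise conditions in the abstract above (noise mixed with speech at 0 dB and -3 dB SNR) can be illustrated with a minimal sketch. The abstract does not describe the authors' mixing procedure, so everything here is an assumption: the function names `mix_at_snr` and `measured_snr_db` are hypothetical, a sine wave stands in for a spoken word, and only white noise is generated (babble noise would have to come from a recording).

```python
import math
import random


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so that the speech-to-noise power ratio equals `snr_db`,
    then add it to `speech`. Returns (mixed, scaled_noise)."""
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")
    p_speech = sum(s * s for s in speech) / len(speech)
    p_noise = sum(n * n for n in noise) / len(noise)
    if p_speech == 0 or p_noise == 0:
        raise ValueError("signals must have non-zero power")
    # Target noise power is P_speech / 10^(SNR/10); gain is an amplitude factor.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))
    scaled = [gain * n for n in noise]
    mixed = [s + n for s, n in zip(speech, scaled)]
    return mixed, scaled


def measured_snr_db(speech, noise):
    """SNR in dB computed from the average power of each signal."""
    p_speech = sum(s * s for s in speech) / len(speech)
    p_noise = sum(n * n for n in noise) / len(noise)
    return 10 * math.log10(p_speech / p_noise)


# Stand-in signals (assumptions): a 200 Hz tone sampled at 16 kHz and white noise.
rng = random.Random(0)
speech = [math.sin(2 * math.pi * 200 * t / 16000) for t in range(16000)]
white = [rng.gauss(0.0, 1.0) for _ in range(16000)]

for target in (0, -3):
    _, scaled = mix_at_snr(speech, white, target)
    print(target, measured_snr_db(speech, scaled))
```

At -3 dB the scaled noise carries about twice the power of the speech, which is why the condition is harder than 0 dB.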

A Study on the Method of Assessing Spatial Speech Transmission Quality as an Indicator of Room Acoustics -Concentrated on the Articulation Test under Variable Ambient Noise- (건축 음향의 실내 청취조건 평가방법에 관한 연구-변동외부소음하의 명료도시험에 관하여-)

  • Han, Myung-Ho; Lee, Tae-Gang; Oh, Yang-Ki; Kim, Sun-Woo
    • The Journal of the Acoustical Society of Korea / v.10 no.1 / pp.5-11 / 1991
  • The articulation test is a good predictor of spatial speech transmission quality. As has been done for many other languages, an articulation testing method using the Korean language was proposed in 1989 and was shown to be a valid indicator in rooms with static background noise. In this paper, the testing method is examined under variable noise conditions. According to an experiment performed in 26 classrooms with variable background noise, the proposed articulation testing method using the Korean language remains valid under variable conditions.

Voice quality of normal elderly people after a 3oz water-swallow test: An acoustic analysis (3온스 물 삼킴검사 이후 정상 노년층의 음질 변화: 음향학적 분석)

  • Lee, Sol Hee; Choi, Hong-Shik; Choi, Seong-Hee; Kim, HyangHee
    • Phonetics and Speech Sciences / v.10 no.2 / pp.69-76 / 2018
  • The elderly are at increased risk of developing dysphagia due to aging and illness. The aim of the current study was to analyze acoustically the change in the voice quality of normal elderly people after a 3oz water-swallow test. Subjects included a group of 60 normal elderly people (age: mean $\pm$ SD $= 76.9 \pm 6.66$) and 60 healthy young adults (age: mean $\pm$ SD $= 25.1 \pm 2.36$). Every participant produced a five-second /a/ phonation before and after swallowing, and two two-second sections extracted from each phonation were analyzed using the MDVP (Multi-Dimensional Voice Program). The elderly group showed a post-swallowing increase in the acoustic parameters related to fundamental frequency, fundamental frequency variation, amplitude variation, and noise in both two-second sections. The younger group, however, showed an increase only in frequency-related acoustic parameters (i.e., STD) in the first two-second section. The significant changes in the post-swallowing parameter values might indicate temporary irregularities in pitch and amplitude, along with a higher amount of noise in the voice. The results could be attributed to water residue on the vocal folds and in the vocal tract, as well as to a deterioration of motor and sensory functions caused by the anatomical and physiological changes that result from aging.

The Effect on Intervention Program and Auditory-Perceptual Discrimination Feature of Postlingual Cochlear Implant Adults about Pathological Voice (병리적 음성에 대한 언어습득 이후 인공와우이식 성인의 청지각적 변별특성과 중재 프로그램의 효과)

  • Bae, Inho; Kim, Geunhyo; Lee, Yeonwoo; Park, Heejune; Kim, Jindong; Lee, Ilwoo; Kwon, Soonbok
    • Phonetics and Speech Sciences / v.7 no.2 / pp.9-17 / 2015
  • In the present study, we investigated the auditory-perceptual ability of postlingually deaf adults with cochlear implants (CI) to recognize voice quality, and proposed a training program to improve within-subject reliability. A prospective case-control study was conducted with 7 postlingually deaf adults who had received CI surgery and 10 normal-hearing controls. The pre-test, post-test, and training program used parameters of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) with pathological voice samples presented using Alvin. In the pre-post comparison used to monitor improvement in listeners' internal reliability through the training program, there were statistically significant differences for both test and group. The difference in internal reliability between pre-test and post-test was statistically significant in the normal-hearing group but not in the CI group. The present study found that CI adults were less able to perceive voice quality than the normal-hearing group. In addition, the training program improved pitch and loudness in CI adults.

Comparative Studies on the Self Voice Assessment of Voice Disorder Patients and the Hearer Voice Assessment of a Comparative Group of normal subjects (음성장애인의 자가음성평가와 정상음성인의 청자음성평가 특성 비교)

  • Lee, Yu-Jin; Hwang, Young-Jin
    • Phonetics and Speech Sciences / v.4 no.2 / pp.105-114 / 2012
  • This paper discusses the difference between the self-assessment of voice by people with voice disorders and the hearer assessment of voice by a comparison group of normal subjects. The study was conducted on 25 subjects with voice disorders and 32 hearers from the comparison group. The results are as follows. First, in K-VHI and VHI-H, the hearers in the comparison group perceived the voice disorders as more serious than the voice disorder group did in all sub-domains. Likewise, in K-VQOL and VRQOL-H, the hearers in the comparison group perceived the voice disorders as more serious than the voice disorder group did in all sub-domains. Second, the hearer assessment by the comparison group showed no gender difference in the perceived severity of voice disorder issues. Third, in the emotional aspects of VHI-H, hearers who were professional voice users perceived the voice disorders as more serious than other hearers did. In VRQOL-H, however, there was no difference between professional voice users and others.

Performance Enhancement of SBC for Voice Signal Using Adaptive Postfiltering at the Medium Bit Rate (중간 전송율에서 적응 포스트 필터링을 이용한 음성용 SBC의 성능 향상)

  • 김원구; 이남걸; 윤대희; 차일환
    • The Journal of Korean Institute of Communications and Information Sciences / v.17 no.2 / pp.121-131 / 1992
  • In this paper, three methods are studied to enhance the performance of SBC (Sub-Band Coding) schemes for voice signals at medium bit rates between 12 kbps and 16 kbps, and adaptive postfiltering using human auditory characteristics is performed at the decoder output. First, a GQMF (Generalized Quadrature Mirror Filter) is used instead of a QMF (Quadrature Mirror Filter) to obtain better performance. Second, by adaptively allocating bits to each sub-band, speech quality is enhanced and variable rate coding becomes possible. Third, a comparison study of coder performance using APCM (Adaptive Pulse Code Modulation) and ADPCM (Adaptive Differential Pulse Code Modulation) indicates that SB-APCM performs better than the other. Adaptive postfiltering at the decoder output enhances the quality of the coded speech. The two proposed postfiltering methods reduce the noise sufficiently at the cost of only a low computational load.

Shimmer Change According to Fundamental Frequency Variation of Korean Normal Adults

  • Pyo, Hwa-Young; Sim, Hyun-Sub
    • Speech Sciences / v.10 no.1 / pp.143-152 / 2003
  • The present study was performed to investigate precisely how shimmer changes with $F_{0}$ variation, and to offer suggestions for clinical application. The analysis used the fundamental frequency ($F_{0}$) and shimmer measurements from the previous study of 120 normal Korean adults' voices by Pyo et al. (2002), which used three vowels: /i/, /a/, and /u/. Through the analysis of 60 female samples from the previous study, we found that $F_{0}$ was highest in /u/ and lowest in /a/, whereas shimmer was highest in /a/ and lowest in /u/. Thirty of the 60 subjects showed such an inverse relationship between $F_{0}$ and shimmer overall. In the vowel /a/, 47 of 60 subjects showed increased $F_{0}$ and decreased shimmer; in /i/, 32 subjects and, in /u/, 33 subjects showed the same result. A decrease in shimmer indicates an improvement in voice quality, so these results may help answer the question of why patients with spasmodic dysphonia can improve their voice quality by producing voice at a higher pitch.
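
For readers unfamiliar with the measure, shimmer quantifies cycle-to-cycle variation in amplitude. The sketch below computes one common variant, local shimmer, as the mean absolute difference between consecutive cycle peak amplitudes divided by the mean amplitude. This is an assumption for illustration only: the abstract does not say which shimmer definition or software was used, and the function name `local_shimmer_percent` and the amplitude values are hypothetical.

```python
def local_shimmer_percent(amplitudes):
    """Local shimmer in percent: mean absolute difference between consecutive
    cycle peak amplitudes, divided by the mean amplitude."""
    if len(amplitudes) < 2:
        raise ValueError("need at least two cycle amplitudes")
    diffs = [abs(b - a) for a, b in zip(amplitudes, amplitudes[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_amp = sum(amplitudes) / len(amplitudes)
    if mean_amp == 0:
        raise ValueError("mean amplitude must be non-zero")
    return 100.0 * mean_diff / mean_amp


# Hypothetical peak amplitudes for five consecutive glottal cycles.
steady = [1.00, 1.01, 0.99, 1.00, 1.01]
unsteady = [1.00, 1.10, 0.90, 1.05, 0.95]

print(local_shimmer_percent(steady))
print(local_shimmer_percent(unsteady))
```

The steadier sequence yields the lower value, which matches the abstract's reading of decreased shimmer as improved voice quality.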

Effects of age of L2 acquisition and L2 experience on the production of English vowels by Korean speakers

  • Eunhae Oh; Eunyoung Shin
    • Phonetics and Speech Sciences / v.15 no.3 / pp.9-16 / 2023
  • The current study investigated the influence of age of L2 acquisition (AOA) and length of residence (LOR) in the L2-speaking country on the production of voicing-conditioned vowel duration and spectral qualities in English by Korean learners. The primary aim was to explore how language-specific phonetic features are acquired as a function of age of onset and L2 experience. Analyses of archived corpus data produced by 45 native speakers of Korean showed that, regardless of AOA or LOR, Korean learners used absolute vowel duration as a salient correlate of the voicing contrast in English. The accuracy of relative vowel duration was influenced more by onset age than by L2 experience, suggesting that being exposed to English at an early age may benefit the acquisition of the temporal dimension. On the other hand, the spectral characteristics of English vowels were more consistently influenced by L2 experience, indicating that immersive experience in the L2-speaking environment is likely to improve the accurate production of vowel quality. The distinct influences of onset age and L2 experience on specific phonetic cues in L2 vowel production provide insight into the intricate relationship between the two factors in the manifestation of L2 phonological knowledge.

Break Predicting Methods Using Phonetic Symbols Combined with Accents Information in a Japanese Speech Synthesizer (일본어 합성기에서 악센트 정보가 결합된 발음기호를 이용한 Break 예측 방법)

  • Na, Deok-Su; Lee, Jong-Seok; Kim, Jong-Kuk; Bae, Myung-Jin
    • MALSORI / no.62 / pp.69-84 / 2007
  • Japanese is a language with intonation indicated by relative differences in pitch height; accentual phrases (APs) are placed according to changes in accent, and a break occurs at an AP boundary. Although breaks can be predicted with J-ToBI using a rule-based or statistical approach, exact prediction is very difficult because of the flexibility of break placement. This paper therefore proposes a method that enhances the quality of synthesized speech by reducing errors in predicting break indices (BI). The method uses a new definition of phonetic symbols that combines the phonetic values of Japanese words with accent information. Since a stream of the defined phonetic symbols includes information on changes in intonation, the BI can be predicted easily by dividing the intonation phrase (IP) into several APs. In an experiment, the accuracy of break generation was 98%, and the proposed method contributed to enhancing the naturalness of the synthesized speech.
