• Title/Summary/Keyword: voice change

Search Result 360, Processing Time 0.029 seconds

Change in acoustic characteristics of voice quality and speech fluency with aging (노화에 따른 음질과 구어 유창성의 음향학적 특성 변화)

  • Hee-June Park;Jin Park
    • Phonetics and Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.45-51
    • /
    • 2023
  • Voice issues such as voice weakness that arise with age can have social and emotional impacts, potentially leading to feelings of isolation and depression. This study aimed to investigate the changes in acoustic characteristics resulting from aging, focusing on voice quality and spoken fluency. To this end, tasks involving sustained vowel phonation and paragraph reading were recorded for 20 elderly and 20 young participants. Voice-quality-related variables, including F0, jitter, shimmer, and Cepstral Peak Prominence (CPP) values, were analyzed along with speech-fluency-related variables, such as average syllable duration (ASD), articulation rate (AR), and speech rate (SR). The results showed that in voice quality-related measurements, F0 was higher for the elderly and voice quality was diminished, as indicated by increased jitter, shimmer, and lower CPP levels. Speech fluency analysis also demonstrated that the elderly spoke more slowly, as indicated by all ASD, AR, and SR measurements. Correlation analysis between voice quality and speech fluency showed a significant relationship between shimmer and CPP values and between ASD and SR values. This suggests that changes in spoken fluency can be identified early by measuring the variations in voice quality. This study further highlights the reciprocal relationship between voice quality and spoken fluency, emphasizing that deterioration in one can affect the other.

An Economic Analysis of Flat Pricing for Unlimited Voice Calls : Necessary Conditions and MNO's Strategy (음성무제한 요금제경쟁의 경제적 분석 : 무제한요금제 도입 필요조건과 통신사의 선택)

  • Kim, Weonseek
    • Journal of Information Technology Services
    • /
    • v.12 no.3
    • /
    • pp.111-126
    • /
    • 2013
  • As the gaps become narrower in interconnection fee and volume rate, the MNOs began to introduce flat pricing for unlimited voice traffic competitively in Korea wireless telecommunication market : 'unlimited talks within intra-network' by the 1st operator, followed by the 3rd operator's 'unlimited talks over all networks'. As a result, subscribers tip in toward the third ranked operator and could bring a substantial change to steadfast market structure over the last decade in Korea. This paper aims to develop a simple economic model to analyze competition with flat pricing for unlimited voice traffic, and to check whether the pricing can be appropriate for the MNOs. The results show that MNOs already step in the necessary conditions to launch flat pricing for voice traffic. It also predicts that the MNOs compete with unlimited talk over all networks and set a single fee in an equilibrium. At present, the MNOs run virtually identical pricing for unlimited talk over all networks, considering their differentiation with respect to service quality, coverage and brand preference.

Comparison of Voice Characteristics Before and After High-Caffeine Intake (고카페인 섭취 전·후 음성 특성 비교)

  • Lee, Areum;Kim, Eunyun;Yoo, Hyunji;Choi, Yaelin
    • Phonetics and Speech Sciences
    • /
    • v.7 no.4
    • /
    • pp.59-65
    • /
    • 2015
  • This study was conducted to identify the differences in voice characteristic variables before and after taking a certain amount of high-caffeine. Linear PCM-M10 Recorder (SONY) was used for the recorder and basic frequency of the voice (Fo), frequency fluctuation rate (jitter), amplitude fluctuation rate (shimmer) and Signal-to-Noise Ratio (SNR) were measured using TF-32(University of Wisconsin-Madison, USA). First, prolonged phonation analysis results of /ah/ by male subjects showed the shimmer values after taking high-caffeine increased statistically significantly(p<.05) compared with before the intake and SNR values significantly decreased. (p<.05). On the other hand, female subjects didn't show any statistically significant differences in all variables. Second, male subjects showed statistically significant increased shimmer values after the intake compared with before the intake at /ah/ of syllable 'na' and /ah/ in 'ra' in 'autumn' paragraph (p<.05), and jitter values significantly increased at /ah/ in 'ah' (p<.05). However, female subjects didn't show any statistically significant differences in all variables. Results of this study showed that high-caffeine intake more affects male subjects than female subjects. In male subjects, shimmer and SNR changed at vowel prolonged phonation, /ah/, and study results showed that shimmer and SNR in 'Autumn' paragraph /na/, /ra/ and jitter in /ah/ could be identified as the variables to show the voice change.

Comparative Study of Pre and Postoperative Voice and Image Analysis in Unilateral Vocal Cord Paralysis and Vocal Polyp (편측 성대마비와 성대폴립 환자의 수술 전후 음성검사와 이미지 화상분석의 상관관계에 대한 객관적 비교연구)

  • 김시찬;정유삼;홍정표;오정석;최홍식
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.11 no.1
    • /
    • pp.20-27
    • /
    • 2000
  • To determine what is the change of pre and postoperative voice and image analysis parameters and correlations between them, videostroboscopy was analyzed in each 18 patients with unilateral vocal cord paralyses or vocal polyps before and after the surgery from November, 1996 to April, 1999. The correlation between acoustic and aerodynamic parameters was investigated. The software-Videolink and $\pi$-View(Mediface Co, Seoul, Korea)-was used in a quantitative analysis. In unilateral vocal cord paralysis, the glottic angle is well correlated with maximum phonation time, jitter and shimmer preoperatively. The postoperative glottic angle is also correlated with preoperative maximum phonation time. In patients with the vocal polyp, the chink is postoperatively decreased, but the size of the chink and the polyp is not correlated with pre and postoperative voice analysis parameters. These findings reveal that glottic an and vocal fold angle are good indicators of e postoperative glottic configuration in unilateral vocal cord paralysis. Vocal fold ratio is also a useful indicator that represents the length of vocal folds. We consider that the computerized analysis through videostroboscopy is one of objective diagnostic methods in many voice disorders if we can measure a distance between the telelaryngoscope and vocal folds.

  • PDF

Comparison of the Surgical Results in Mutational Dysphonia between Unilateral Shortening of Thyroid Cartilage Method and Bilateral Shortening of Thyroid Cartilage Method in Type III Thyroplasty (변성발성장애의 제3형 갑상연골성형술시 갑상연골익의 편측절제술과 양측절제술과의 치료성적 비교)

  • 최홍식;김세헌;김영호;이익호;김광문
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.7 no.1
    • /
    • pp.61-68
    • /
    • 1996
  • Failure to change from the higher pitched voice of preadolescence to the lower pitched voice of adolescence and adulthood is called "mutational dysphonia" The voice is weak, thin, breathy, hoarse, and mono-pitched. If the voice theraphy was failed, surgery to lower vocal pitch which is refered to thyroplasty type III, is indicated. We compared the post-op acoustic parameters with pre-op data in unilateral antero-posterior shortening of the thyroid cartilage method and bilateral antero-posterior shortening of the thyroid cartilage method each other. Bilateral antero-posterior shortening of the thyroid cartilage method shows significant drop of fundamental frequency and speaking fundamental frequency statistically than unilateral shortening method. There was no significant differences in Jitter, Shimmer, SNR, MFR and other psychoacoustic analysiss parameters between two groups. These data shows that unequal tension of the vocal cord in uilateral antero-posterior shortening of the thyroid cartilage method does not control the pitch effectively so bilatreal shortening method in Type III thyroplasty is recommandable procedure in surgery of the mutational dysphonia.

  • PDF

A Study on Pitch Perception of Normal Korean (한국 성인 음성의 음도인식에 관한 연구)

  • Jeong, Ok-Ran;Kim, Hyung-Soon;Kim, Young-Tae;Sub, Jang-Su
    • Speech Sciences
    • /
    • v.1
    • /
    • pp.315-323
    • /
    • 1997
  • This study attempts to determine the fundamental frequency level of male and female voices that Koreans perceive as normal. Seventy-three college students majoring in Speech Pathology participated in the study on a voluntary basis. The subjects listened to a male voice with fundamental frequency of 60 Hz, 80 Hz, 100 Hz, 120 Hz, 140 Hz, 160 Hz, 180 Hz, and 200 Hz, and a female voice with fundamental frequency of 140 Hz, 160 Hz, 180 Hz, 200 Hz, 220 Hz, 240 Hz, 260 Hz, and 280 Hz. The PSOLA (Pitch Synchronous Overlap). method and harmonic modeling method of speech signal were used to change pitch in the 20 Hz interval. The voices were presented in a random order to prevent listener bias. The results were as follows; Firstly, $46.6\%$ judged male voice with 120 Hz as normal, and $19.2\%$ judged 140 Hz as normal, and another $19.2\%$ judged 160 Hz as normal. Secondly, $50.7\%$ perceived female voice with 220 Hz as normal, and $32.9\%\;and\;30.1\%$ responded to 200 Hz and 240 Hz, respectively. The problems and recommendations for a future investigation are discussed.

  • PDF

Correlation Analysis of Between Paranasal Sinuses and Formant Frequency According to External Stimulation (외부 자극에 따른 부비동과 포먼트주파수와의 상관성 분석)

  • Kim, Bong-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.17 no.8
    • /
    • pp.1955-1961
    • /
    • 2013
  • Paranasal sinuses of the empty space is filled with air that exists in the bones in the face. However, the pus becomes inflamed paranasal sinuses sinusitis onset brings the voice of change, and complained of headaches and lethargy. Therefore, in this paper, paranasal sinuses related diseases to predict voice analysis parameter as measured by changes in paranasal sinuses through external stimuli is investigated and carried out a study to analysis the function consisting of the frontal sinus, ethmoid sinus, maxillary sinus, sphenoid sinus. From this, cold pack stimulation in the paranasal sinus area for stimulation before and after voice was performed by measuring formant frequency and external stimuli through correlation analysis of the mutual impact on paranasal sinuses were analyzed.

Design and Implementation of Context-aware Application on Smartphone Using Speech Recognizer

  • Kim, Kyuseok
    • Journal of Advanced Information Technology and Convergence
    • /
    • v.10 no.2
    • /
    • pp.49-59
    • /
    • 2020
  • As technologies have been developing, our lives are getting easier. Today we are surrounded by the new technologies such as AI and IoT. Moreover, the word, "smart" is a very broad one because we are trying to change our daily environment into smart one by using those technologies. For example, the traditional workplaces have changed into smart offices. Since the 3rd industrial revolution, we have used the touch interface to operate the machines. In the 4th industrial revolution, however, we are trying adding the speech recognition module to the machines to operate them by giving voice commands. Today many of the things are communicated with human by voice commands. Many of them are called AI things and they do tasks which users request and do tasks more than what users request. In the 4th industrial revolution, we use smartphones all the time every day from the morning to the night. For this reason, the privacy using phone is not guaranteed sometimes. For example, the caller's voice can be heard through the phone speaker when accepting a call. So, it is needed to protect privacy on smartphone and it should work automatically according to the user context. In this aspect, this paper proposes a method to adjust the voice volume for call to protect privacy on smartphone according to the user context.

Visual Voice Activity Detection and Adaptive Threshold Estimation for Speech Recognition (음성인식기 성능 향상을 위한 영상기반 음성구간 검출 및 적응적 문턱값 추정)

  • Song, Taeyup;Lee, Kyungsun;Kim, Sung Soo;Lee, Jae-Won;Ko, Hanseok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.34 no.4
    • /
    • pp.321-327
    • /
    • 2015
  • In this paper, we propose an algorithm for achieving robust Visual Voice Activity Detection (VVAD) for enhanced speech recognition. In conventional VVAD algorithms, the motion of lip region is found by applying an optical flow or Chaos inspired measures for detecting visual speech frames. The optical flow-based VVAD is difficult to be adopted to driving scenarios due to its computational complexity. While invariant to illumination changes, Chaos theory based VVAD method is sensitive to motion translations caused by driver's head movements. The proposed Local Variance Histogram (LVH) is robust to the pixel intensity changes from both illumination change and translation change. Hence, for improved performance in environmental changes, we adopt the novel threshold estimation using total variance change. In the experimental results, the proposed VVAD algorithm achieves robustness in various driving situations.