• Title/Summary/Keyword: voice characteristics

Search Result 616, Processing Time 0.023 seconds

Vocal acoustic characteristics of speakers with depression (우울증 화자 음성의 음향음성학적 특성)

  • Baek, Yeon-Sook;Kim, Se-Joo;Kim, Eun-Yeon;Choi, Yae-Lin
    • Phonetics and Speech Sciences
    • /
    • v.4 no.1
    • /
    • pp.91-98
    • /
    • 2012
  • The purposes of this paper is to study the characteristics of compared to the speakers voice without depression and speakers with depression, and to propose a objective method for the measurement of the therapeutic effects as well as for diagnostics of depression based on the characteristics. The voice samples obtained from 11 female speakers with depression, aged from 20 to 40, diagnosed as having major depressive disorder by an psychiatrist were compared with those from 12 normal controls with matched sex, age, height, weight, education, smoking, and drinking. The voice samples are taken by a portable digital recorder(TASCAM DR-07, Japan) and analysed using the MDVP(Multi-Dimentional Voice Program) software module from CSL(Computerized Speech Lab, kay elemetrics, co, model 4100). The result of the investigation are as following. First, the average speaking fundamental frequency and loudness range of the speakers with depression group was statistically significantly lower than that of the control group. The pitch range of the control group was rather higher than that of the speakers with depression group, but without statistical significance. Overall speech rates have no statistical difference between two groups. Second, the average speaking fundamental frequency and loudness range have statistically significant negative correlation with Beck Depression Inventory, i. e. more severe depression exhibits lower average speaking fundamental frequency and loudness range. Other vocal parameters such as pitch range and overall speech rate have no statistically meaningful correlations with Beck Depression Inventory.

Acoustic Characteristics of Normal Healthy Koreans with Advancing Age (노령화에 따른 건강한 정상 성인의 음향음성학적 특성 비교)

  • Kim, Sun-Woo;Kim, Hyang-Hee;Park, Eun-Sook;Choi, Hong-Shik
    • Phonetics and Speech Sciences
    • /
    • v.2 no.4
    • /
    • pp.19-28
    • /
    • 2010
  • The purpose of this study was to increase the current understanding of the acoustic characteristics of voices with advancing age. The relationship between age-related changes in body physiology and certain acoustic characteristics of voice was studied in a sample of 80 men representing four chronological age groupings (20-29, 50-59, 60-69, 70-79) who were all of good physical condition. Each subject was asked to phonate the vowel /a/, /i/, and /u/ for as long as possible at comfortable frequency and intensity level and read the sentence. A promising voice analysis program (Multi-Dimensional Voice $Program^{TM}$) was used to measure the fundamental frequency ($f_0$), jitter, shimmer, $f_0$ variation, peak-amplitude variation, smoothed pitch perturbation quotient, smoothed amplitude perturbation quotient, soft phonation index, $f_0$-tremor intensity index, amplitude tremor intensity index, and noise-to-harmonics ratio from the samples.

  • PDF

Clinical Characteristics of Functional Dysphonia (기능성 발성장애의 임상적 특성)

  • Suh, Woo-Jung;Hong, Young-Hye;Choi, Jong-Min;Jung, Eun-Jung;Sung, Myung-Whun;Kim, Kwang-Hyun;Kwon, Tack-Kyun
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.17 no.2
    • /
    • pp.127-132
    • /
    • 2006
  • Background and Objectives : Functional dysphonia is a voice disturbance in the absence of structural or neurologic laryngeal pathology characterized by voluntary misuse of laryngeal muscles. The present report reviews clinical characteristics of 25 patients with functional dysphonia. Materials and Method : We analyzed medical records, perceptual and acoustic analysis of voice samples, aerodynamic studies and laryngoscopy. Results : There was no sex or age predilection. Eighty four percent of patients presented sudden onset of symptoms and 76% had specific events at the onset. Most patients showed breathy or strained voice and various degree of vocal fold insufficiency with supraglottic compensatory contractions. Acoustic analysis revealed non-diagnostic, but mean flow rate was lower than normal in all cases. All patients responded to voice therapy except for 4 patients who were tort to follow up. Mean number of voice therapy sessions required to get responses is 1.9 sessions. Conclusion : We concluded that patients with functional dysphonia responded very well to short-term voice therapy and should be included in differential diagnosis in patients with dysphonia cannot be explained by structural or neurologic etiology.

  • PDF

Continuance Use Intention of Voice Commerce Using the Value-attitude-behavior Model (가치-태도-행동 모델에 기반한 음성 쇼핑 지속이용의도에 관한 연구)

  • Kim, Hyo-Jung
    • The Journal of the Korea Contents Association
    • /
    • v.22 no.5
    • /
    • pp.491-502
    • /
    • 2022
  • Voice technology allows consumers to make purchases through smart devices, and the interest in voice-driven conversational commerce has significantly expanded. In this study, we explored the continuance use intention of voice commerce, and the adoption of a value-attitude-behavior model. An online survey was conducted on 360 individuals who used an artificial intelligence assistant device in a voice commerce environment. We used Amos 23.0 and SPSS 25.0 for descriptive, confirmatory, and structural equation modeling analyses. These results indicated that functional value was the highest influencing variable on satisfaction of voice commerce, while social, emotional, and epistemic values significantly influenced it as well. Additionally, satisfaction of voice commerce significantly influenced the continuance use intention of voice commerce. These findings could help us understand the characteristics of voice commerce users and the diversity value in voice commerce environment.

Detection of Pathological Voice Using Linear Discriminant Analysis

  • Lee, Ji-Yeoun;Jeong, Sang-Bae;Choi, Hong-Shik;Hahn, Min-Soo
    • MALSORI
    • /
    • no.64
    • /
    • pp.77-88
    • /
    • 2007
  • Nowadays, mel-frequency cesptral coefficients (MFCCs) and Gaussian mixture models (GMMs) are used for the pathological voice detection. This paper suggests a method to improve the performance of the pathological/normal voice classification based on the MFCC-based GMM. We analyze the characteristics of the mel frequency-based filterbank energies using the fisher discriminant ratio (FDR). And the feature vectors through the linear discriminant analysis (LDA) transformation of the filterbank energies (FBE) and the MFCCs are implemented. An accuracy is measured by the GMM classifier. This paper shows that the FBE LDA-based GMM is a sufficiently distinct method for the pathological/normal voice classification, with a 96.6% classification performance rate. The proposed method shows better performance than the MFCC-based GMM with noticeable improvement of 54.05% in terms of error reduction.

  • PDF

A Study on the Intergrated Voice/Data transmission Algorithm characteristics on Local Area Network (유선 LAN상의 음성/데이타 혼합전송 알고리즘 특성에 관한 연구)

  • 김동일
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.1 no.2
    • /
    • pp.137-143
    • /
    • 1997
  • From now on, the network is being developed into PSTN(public switched telephone network) and PDN(public data network), that is depend on the form of data. The former one pursues sending voice, and the latter one pursues sending data. But it causes big loss of the economy and efficiency. So, ISDN, processing voice and data at same time, gives a big profit to user. To enlarge the ISDN at the narrow area, it is necessary that study to send the mixture form of voice and data in LAN environment. So, this paper proposes the algorithm about the mixture form of voice and data in ethernet and token-ring. that is widely used in these days.

  • PDF

Study on Correlation between Voice and Health Condition in the Sasang Constitution (사상 체질별 음성과 건강 수준 관련 가능성에 대한 고찰)

  • Ryu, Hyun-Hee;Lee, Si-Woo;Cho, Tai-Hyoung
    • Journal of Physiology & Pathology in Korean Medicine
    • /
    • v.26 no.2
    • /
    • pp.221-227
    • /
    • 2012
  • In this work, we investigate the correlation between health condition and voice to study the validity and value of voice diagnosis. For this purpose, we collected voices, Health index questionnaires (Short form 36, Psychological Well Being Index) and Sasang Constitution informations on 197 males at the age of twenties. Pitch, jitter, shimmer variables were analyzed by ANOVA and Pearson correlation coefficient. There were no significant correlations between pitch, jitter, shimmer and health questionnaire score in total group regardless of Sasang Constitution. However, We found tendency of correlation between shimmer variables and health questionnaire scores in Taeeumin and Soyangin. In Soeumin and Soyangin, zitter and pitch variables were found to be slightly correlated with health questionnaire scores. Our study suggests the possibility that voice might be related with both health condition and Sasang Constitution. Our finding may motivate research activities towards diverse clinical applications of voice diagnosis and studies of voice characteristics in the Sasang constitution.

Discussions on Auditory-Perceptual Evaluation Performed in Patients With Voice Disorders (음성장애 환자에서 시행되는 청지각적 평가에 대한 논의)

  • Lee, Seung Jin
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.32 no.3
    • /
    • pp.109-117
    • /
    • 2021
  • The auditory-perceptual evaluation of speech-language pathologists (SLP) in patients with voice disorders is often regarded as a touchstone in the multi-dimensional voice evaluation procedures and provides important information not available in other assessment modalities. Therefore, it is necessary for the SLPs to conduct a comprehensive and in-depth evaluation of not only voice but also the overall speech production mechanism, and they often encounter various difficulties in the evaluation process. In addition, SLPs should strive to avoid bias during the evaluation process and to maintain a wide and constant spectrum of severity for each parameter of voice quality. Lastly, it is very important for the SLPs to perform a team approach by documenting and delivering important information pertaining to auditory-perceptual characteristics in an appropriate and efficient way through close communication with the laryngologists.

Change in acoustic characteristics of voice quality and speech fluency with aging (노화에 따른 음질과 구어 유창성의 음향학적 특성 변화)

  • Hee-June Park;Jin Park
    • Phonetics and Speech Sciences
    • /
    • v.15 no.4
    • /
    • pp.45-51
    • /
    • 2023
  • Voice issues such as voice weakness that arise with age can have social and emotional impacts, potentially leading to feelings of isolation and depression. This study aimed to investigate the changes in acoustic characteristics resulting from aging, focusing on voice quality and spoken fluency. To this end, tasks involving sustained vowel phonation and paragraph reading were recorded for 20 elderly and 20 young participants. Voice-quality-related variables, including F0, jitter, shimmer, and Cepstral Peak Prominence (CPP) values, were analyzed along with speech-fluency-related variables, such as average syllable duration (ASD), articulation rate (AR), and speech rate (SR). The results showed that in voice quality-related measurements, F0 was higher for the elderly and voice quality was diminished, as indicated by increased jitter, shimmer, and lower CPP levels. Speech fluency analysis also demonstrated that the elderly spoke more slowly, as indicated by all ASD, AR, and SR measurements. Correlation analysis between voice quality and speech fluency showed a significant relationship between shimmer and CPP values and between ASD and SR values. This suggests that changes in spoken fluency can be identified early by measuring the variations in voice quality. This study further highlights the reciprocal relationship between voice quality and spoken fluency, emphasizing that deterioration in one can affect the other.

A study on traffic analysis in voice/data mixed PCS system (음성/데이타 복합서비스 PCS시스템의 트래픽 분석)

  • 김영일;진용욱
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.6
    • /
    • pp.136-148
    • /
    • 1996
  • In this paper, we analyze the traffic characteristics in microcell and macrocell overlaid PCS system which process voice and dta calls separately each others. in this system, data calls are delayed in queue when all of channels are occupied, while voice calls are bolcked in that case. For this, we calculated inter-microcell handoff area dwelling time distribution and handoff area dwelling time distribution between microcell and macrocell. We analyze traffic performance using this results. We used M/M/C/K model, and analyzed traffic performance of macrocell with handoff area variation of microcell.

  • PDF