• Title/Summary/Keyword: Voice Analysis

A Study for the Development of Korean Voice Assessment Model for the Patients with Voice Disorders: A Qualitative Study (음성장애 진단 및 평가에 관한 질적 연구: 진단 및 평가 모형 정립을 위한 기초연구)

  • Pyo, Hwa-Young;Sim, Hyun-Sub
    • Speech Sciences / v.14 no.2 / pp.7-22 / 2007
  • The purpose of this study was to develop a Korean assessment model for patients with voice disorders. Interviews were conducted with four voice therapists, and the results were analyzed using a qualitative, constant-comparative design. Three main themes emerged from the qualitative analysis, from which 10 subthemes were derived. The three main themes were 1) considerations regarding the disordered voice, 2) the status quo of instrumental and perceptual evaluation, and 3) suggestions for other voice therapists. The 10 subthemes can be summarized as follows: 1) patient-centered judgment, 2) increasing the reliability of instrumental and perceptual evaluation, and 3) voice therapists' positive participation in the assessment procedure for voice disorders.

A Cepstral Analysis of Breathy Voice with Vocal Fold Paralysis (성대마비로 인한 기식 음성에 대한 Cepstral 분석)

  • Kang, Young-Ae;Seong, Cheol-Jae
    • Phonetics and Speech Sciences / v.4 no.2 / pp.89-94 / 2012
  • The aim of this study is to investigate the usefulness of the parameters CPP (cepstral peak prominence) and LTAS (long-term average spectrum) band energy for the analysis of breathy voice with vocal fold paralysis. Thirty-four female subjects who had vocal fold paralysis after thyroidectomy participated in this study. Based on perceptual judgments by three speech pathologists and one phonetician, the subjects were divided into two groups: a breathy voice group (n = 21) and a non-breathy voice group (n = 13). A maximum sustained phonation task was used for acoustic analysis. CPP-related (i.e., mean F0, mean CPP, and mean CPPs) and LTAS-related (i.e., minimum, maximum, and mean) parameters were used, and an independent samples t-test was conducted. Regarding CPP, there were significant differences in mean CPP and mean CPPs between the groups: the values in the non-breathy voice group were higher than those in the breathy voice group. CPP can therefore be regarded as a useful parameter for breathy voice analysis in the clinic. Regarding LTAS, the energy from 0 to 2 kHz differed significantly between the groups. The minimum value of the non-breathy group was lower than that of the breathy group, whereas the maximum value of the non-breathy group was higher. The frequency band below 2 kHz seems to be related to breathy voice.
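
The CPP measure named in this abstract can be illustrated with a short sketch. This is a simplified, single-frame variant and not the procedure the authors used: the function name, the 60-300 Hz search range, the 1 ms lower bound of the regression, and the synthetic test signals are all assumptions made for illustration.

```python
import numpy as np

def cepstral_peak_prominence(frame, sample_rate, f0_min=60.0, f0_max=300.0):
    """Simplified single-frame CPP estimate in dB (illustrative assumption)."""
    frame = np.asarray(frame, dtype=float)
    windowed = frame * np.hanning(len(frame))
    # Log magnitude spectrum in dB; the small constant avoids log(0).
    log_spectrum = 20.0 * np.log10(np.abs(np.fft.fft(windowed)) + 1e-12)
    # Cepstrum: inverse transform of the log spectrum, expressed in dB.
    cepstrum_db = 20.0 * np.log10(np.abs(np.fft.ifft(log_spectrum)) + 1e-12)
    quefrency = np.arange(len(frame)) / sample_rate  # seconds
    # Straight-line trend fitted from 1 ms up to half the frame length.
    fit = slice(int(0.001 * sample_rate), len(frame) // 2)
    slope, intercept = np.polyfit(quefrency[fit], cepstrum_db[fit], 1)
    # Highest cepstral peak inside the expected pitch-period range.
    lo = int(sample_rate / f0_max)
    hi = int(sample_rate / f0_min)
    peak = lo + int(np.argmax(cepstrum_db[lo:hi + 1]))
    # CPP is the height of that peak above the trend line.
    return cepstrum_db[peak] - (slope * quefrency[peak] + intercept)

# Synthetic signals (not data from the study): a 200 Hz pulse train with a
# little noise stands in for a periodic voice, white noise for an aperiodic one.
rng = np.random.default_rng(0)
sample_rate, n = 16000, 1600
pulse_train = np.zeros(n)
pulse_train[::80] = 1.0
voiced = pulse_train + 0.01 * rng.standard_normal(n)
noise = rng.standard_normal(n)

cpp_voiced = cepstral_peak_prominence(voiced, sample_rate)
cpp_noise = cepstral_peak_prominence(noise, sample_rate)
print(cpp_voiced > cpp_noise)
```

The periodic signal yields the larger value, which matches the direction reported above: a more periodic (less breathy) voice has a more prominent cepstral peak.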

A Study of depression symptom in patients with voice disorders (음성장애환자에게서의 우울감 연구)

  • Kang, Young Ae;Koo, Bon Seok
    • Phonetics and Speech Sciences / v.7 no.2 / pp.47-54 / 2015
  • The objectives of this study are to examine the frequency of depressive symptoms in patients with voice disorders and to investigate which voice evaluation parameters are associated with depression. One hundred ninety-six patients (106 males and 90 females) who had been diagnosed with a voice disorder for the first time in their lives were selected. All patients were examined by laryngeal stroboscopy. For the depression and voice study, a personal interview, acoustic and aerodynamic analysis, the voice handicap index (VHI), the reflux symptom index (RSI), and the Beck depression index (BDI) were administered. Mild to severe BDI scores were seen in 26.2% (52 patients) of all patients. The mean BDI score of female patients was $8.8 \pm 7.5$, which was higher than that of male patients ($5.6 \pm 6.6$), and the difference was statistically significant (p<0.001). In the acoustic analysis, the sent_duration parameter was significantly higher in patients with depression than in patients without depression (p<0.05). In addition, the VHI and RSI scores were higher in patients with depression (p<0.001). Our findings suggest that the prevalence of depression in patients with voice disorders is related to female sex, speaking rate, and self-report questionnaire scores. This result can be used for a psychologically based approach to therapy.

The Utility of Perturbation, Non-linear dynamic, and Cepstrum measures of dysphonia according to Signal Typing (음성 신호 분류에 따른 장애 음성의 변동률 분석, 비선형 동적 분석, 캡스트럼 분석의 유용성)

  • Choi, Seong Hee;Choi, Chul-Hee
    • Phonetics and Speech Sciences / v.6 no.3 / pp.63-72 / 2014
  • The current study assessed the utility of the acoustic analyses most commonly used in routine clinical voice assessment, including perturbation analysis, nonlinear dynamic analysis, and spectral/cepstral analysis, based on signal typing of dysphonic voices, and investigated the clinical applicability of these methods. A total of 70 dysphonic voice samples were classified by signal type using narrowband spectrograms. The traditional parameters %jitter, %shimmer, and signal-to-noise ratio (SNR) were calculated using TF32. The correlation dimension (D2), a nonlinear dynamic parameter, and spectral/cepstral measures including mean CPP, CPP_sd, CPPf0, CPPf0_sd, L/H ratio, and L/H ratio_sd were calculated with ADSV (Analysis of Dysphonia in Speech and Voice™). Auditory-perceptual analysis was performed by two blinded speech-language pathologists using GRBAS. The results showed that the nearly periodic Type 1 signals were all from functional dysphonia, whereas Type 4 signals comprised neurogenic and organic voice disorders. Only Type 1 voice signals were reliable for perturbation analysis in this study. Significant differences related to signal type were found in all acoustic and auditory-perceptual measures. SNR, CPP, and L/H ratio values for Type 4 were significantly lower than those of the other signal types, and significantly higher %jitter and %shimmer were observed in Type 4 voice signals (p<.001). Additionally, D2 values increased significantly with increasing signal type, representing more complex and nonlinear patterns. However, D2 could not be obtained for voice signals with a high noise component associated with breathiness. In particular, CPP was more sensitive to the voice qualities 'G', 'R', and 'B' than any other acoustic measure. Thus, spectral and cepstral analyses may be applied to more severely dysphonic voices such as Type 4 signals, and CPP can be a more accurate and predictive acoustic marker for measuring voice quality and severity in dysphonia.
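
The perturbation measures in this abstract can be sketched as follows. This is a minimal illustration assuming the common "local" definition, the mean absolute difference between consecutive cycles divided by the mean. The exact formulas used by TF32 are not given in the abstract and may differ, and the function name and input values are hypothetical.

```python
def percent_perturbation(values):
    """Mean absolute cycle-to-cycle difference as a percentage of the mean.

    Pass glottal cycle periods to get %jitter, or cycle peak amplitudes
    to get %shimmer (local definition, assumed here for illustration).
    """
    values = [float(v) for v in values]
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value

# Hypothetical cycle measurements, not data from the study.
steady_periods_ms = [5.0, 5.0, 5.0, 5.0]
irregular_periods_ms = [4.0, 6.0]

jitter_steady = percent_perturbation(steady_periods_ms)
jitter_irregular = percent_perturbation(irregular_periods_ms)
print(jitter_steady, jitter_irregular)
```

Both measures depend on identifying individual cycles, which is consistent with the finding above that only nearly periodic Type 1 signals were reliable for perturbation analysis.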

Voice Analysis before and after Swallowing a Raw Egg in Professional Voice Users (직업적 음성사용자에서 날달걀 먹기 전과 후의 음성 변화)

  • Kim, Kyung-A;Kwon, Soon-Bok;Kim, Sung-Won;Lee, Hyung-Shin;Hong, Jong-Cheol;Kim, Yong-Rok;Lee, Bong-Joo;Han, Yung-Jin;Yu, Tae-Hyun;Lee, Kang-Dae
    • Speech Sciences / v.14 no.2 / pp.43-53 / 2007
  • The purpose of this study was to observe the effect of eating a raw egg on the voice quality of professional and nonprofessional voice users, and the duration of that effect. Twenty professional voice users and 20 nonprofessional voice users participated in the experiment; all had undergone stroboscopy to confirm the absence of vocal or laryngeal disease. The voice exam was performed three times: before eating a raw egg (1st period), right after eating it (2nd period), and 10 minutes later (3rd period). Using the Multi-Dimensional Voice Program, a software module of the Computerized Speech Lab 4500, as the voice analysis instrument, the authors measured F0, jitter, shimmer, noise-to-harmonic ratio (NHR), and Voice Range Profile (VRP). The results were as follows. Vocal hygiene was good in 57.5% of the total subjects and poor in 42.5%; 40% of professional voice users and 75% of nonprofessional voice users had good vocal hygiene. Vocal fatigue was present in 77.5% of the total subjects and absent in 22.5%; 95% of professional voice users and 60% of nonprofessional voice users complained of vocal fatigue. 60% of the total subjects reported a subjective vocal symptom; 65.0% of professional voice users and 70.0% of nonprofessional voice users reported a voice symptom. From these results, we suggest that eating a raw egg may improve the voice quality of professional voice users.

Identification of Voice Features for Recently Voice Fishing by Voice Analysis (음성 분석을 통한 최근 보이스피싱의 음성 특징 규명)

  • Lee, Bum Joo;Cho, Dong Uk;Jeong, Yeon Man
    • The Journal of Korean Institute of Communications and Information Sciences / v.41 no.10 / pp.1276-1283 / 2016
  • The scale of financial damage from voice phishing has not decreased despite national and social efforts to reduce it. One reason is the sophisticated, vernacular speech style that makes offenders difficult to recognize. Furthermore, young people are now increasingly being deceived, not only by the sophisticated, vernacular speech style that imitates employees of real public offices but also by the offenders' use of obtained personal information. This leads directly to financial damage among younger people, who have stronger judgment than older people. We therefore compared and analyzed the voices of voice phishing criminals and of young people of the same generation to identify voice features. The experiment, covering the period since 2011, examined pitch, pitch bandwidth, energy, speech speed, and voice color to find differences in voice characteristics between the two groups. The experimental results show a significant difference in energy and speech speed between voice phishing criminals and young people of the same generation.

A survey on the voice related needs of occupational voice users (직업적 음성사용자의 음성관련 요구 조사)

  • Lee, Eun-Jeong;Kim, Wha-Soo
    • Phonetics and Speech Sciences / v.7 no.2 / pp.39-45 / 2015
  • This research was conducted to investigate the voice-related needs of occupational voice users. The data collected from teachers (379), telemarketers (156), and therapists (50) were classified according to content using Colaizzi's inductive categorical analysis. The voice-related needs were classified into 3 major categories: 1) how to use, 2) how to care, and 3) how to be healthy. The category 'how to use' my voice was further divided into 6 sub-categories: (1) efficiently, (2) as I desired, (3) without pain (discomfort), (4) expressively, (5) phonation (methods), and (6) clear articulation. The results showed that the needs of the 3 groups of occupational voice users reflect the environments in which they have to use their voices as well as the voice characteristics that their specific listeners want.

Mediating Effect of the Attitude on the Relationship between Subjective Norms and Voice Intention (주관적 규범과 불평행동 의도의 관계에 미치는 태도의 매개 효과)

  • Kang, Jong-Heon;Pyo, Gil-Taek
    • Culinary science and hospitality research / v.13 no.2 / pp.12-21 / 2007
  • The purpose of this study was to examine the effect of subjective norms on customers' intention to engage in voice of dissatisfaction responses, the effect of subjective norms on attitude, and the mediating effect of attitude on the relationship between subjective norms and customers' intention to engage in voice of dissatisfaction responses. Simple regression analysis was used to estimate the effects of subjective norms on customers' intention to engage in voice of dissatisfaction responses and on attitude. Mediated regression analysis was used to estimate the mediating role of attitude in the effect of subjective norms on customers' intention to engage in voice of dissatisfaction responses. The results demonstrated that the inclusion of perceived behavioral control significantly improved the predictability of voice of dissatisfaction response intentions. Furthermore, the mediation analysis indicated that the influence of subjective norms was mediated by the mediator: in the context of voice behavior, the effect of subjective norms on intention was mediated by attitude.
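
The mediated regression described in this abstract can be sketched with three ordinary least-squares fits in the style of the classic Baron and Kenny steps. The abstract does not state which procedure the authors followed, so this is an assumption, as are the function names and the synthetic data, which merely stand in for subjective norms (x), attitude (m), and voice intention (y).

```python
import numpy as np

def ols(y, *predictors):
    """Least-squares coefficients, intercept first."""
    design = np.column_stack([np.ones(len(y)), *predictors])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    return coef

def mediation_effects(x, m, y):
    """Total, direct, and indirect effects of x on y through mediator m."""
    total = ols(y, x)[1]           # step 1: y regressed on x
    a = ols(m, x)[1]               # step 2: mediator regressed on x
    _, direct, b = ols(y, x, m)    # step 3: y regressed on x and mediator
    return {"total": total, "direct": direct, "indirect": a * b}

# Synthetic data for illustration only (not from the study).
rng = np.random.default_rng(1)
x = rng.standard_normal(200)
m = 0.6 * x + rng.standard_normal(200)
y = 0.5 * m + 0.1 * x + rng.standard_normal(200)

effects = mediation_effects(x, m, y)
print(effects)
```

For least-squares fits with an intercept, the total effect equals the direct effect plus the indirect effect, so a direct effect that shrinks toward zero once the mediator is included is the usual sign of mediation.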

Feasibility of Galaxy Smartphone Recording as Portable Recorder for Acoustic Analysis of Voice (음향분석에 사용할 녹음장비로 갤럭시 스마트폰 녹음기능의 유용성)

  • Yun, Mae-Hwa;Lee, Jae-Hyuk;Lee, Sang-Hyuk;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.26 no.2 / pp.104-111 / 2015
  • Background and Objectives: Acoustic analysis of voice can be strongly influenced by the quality of the voice files produced by the recording device. In clinical practice, the voice files analyzed are mostly recorded either directly by the analysis program or by a portable digital recording device. This study examined the feasibility of using Galaxy smartphone recordings for acoustic analysis of voice. Materials and Methods: Acoustic measures were compared between voice signals recorded from 30 normal speakers (15 males and 15 females) with a Galaxy smartphone, a portable digital recording device, and CSL. F0, jitter, shimmer, NHR (noise-to-harmonic ratio), and formant frequencies were analyzed with MDVP. Results: F0, jitter, shimmer, NHR, and formant frequencies did not differ significantly among the 3 devices. The intraclass correlation coefficient (ICC) was high for each of the voice perturbation measures. Conclusion: The findings indicate that the Galaxy smartphone recording system is a useful device for acoustic analysis of voice. Furthermore, the Galaxy smartphone can be applied widely, in various ways, to acoustic analysis of voice.
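
The agreement statistic mentioned in this abstract can be sketched as follows. The abstract does not say which form of ICC was computed, so the two-way, single-measurement consistency form, ICC(3,1), is an assumption here, and the function name and F0 values are hypothetical.

```python
import numpy as np

def icc_consistency(ratings):
    """ICC(3,1) for a subjects-by-devices table (assumed form)."""
    ratings = np.asarray(ratings, dtype=float)
    n, k = ratings.shape
    grand = ratings.mean()
    row_means = ratings.mean(axis=1)   # one mean per subject
    col_means = ratings.mean(axis=0)   # one mean per device
    # Between-subject mean square.
    ms_rows = k * np.sum((row_means - grand) ** 2) / (n - 1)
    # Residual mean square after removing subject and device effects.
    residual = ratings - row_means[:, None] - col_means[None, :] + grand
    ms_error = np.sum(residual ** 2) / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)

# Hypothetical F0 values in Hz for five speakers on three devices, where
# each device adds only a constant offset (not data from the study).
true_f0 = np.array([110.0, 125.0, 140.0, 210.0, 230.0])
device_offsets = np.array([0.0, 1.5, -2.0])
table = true_f0[:, None] + device_offsets[None, :]

icc = icc_consistency(table)
print(icc)
```

Because the consistency form ignores constant offsets between devices, this table gives a value of essentially 1. An absolute-agreement form would penalize such offsets.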

The Influence of Perceived Value on Continuance Use Intention in Voice Commerce Context (비대면 음성 쇼핑의 인지된 가치, 지속이용의도에 미치는 영향 관계에 관한 연구)

  • Kim, Hyo-Jung
    • Journal of Digital Convergence / v.20 no.4 / pp.225-234 / 2022
  • Voice commerce has emerged as a key channel for consumer searches and purchases. This study examines the continuance use intention of voice commerce by applying the value-based adoption model. An online survey was conducted with 470 consumers who had experience with voice commerce. Participants were users who had purchased goods or used a food delivery service in a voice commerce context. This study used SPSS 23.0 and Amos 23.0 for descriptive analysis, correlation analysis, confirmatory factor analysis, and structural equation modeling analysis. The results are as follows. First, usefulness and response accuracy significantly influenced the perceived value of voice commerce. Second, functional risk significantly influenced the perceived value of voice commerce. Third, perceived value significantly influenced the continuance use intention of voice commerce. These results enhance understanding of voice commerce users and provide insights for voice commerce service providers.