통합 검색 | Korea Science

Perturbation and Nonlinear Dynamic Analysis of Sustained Vowels in Normal and Pathological Voices

이지연;최성희;;한민수;최홍식
- 말소리와 음성과학
- /
- 제2권1호
- /
- pp.113-120
- /
- 2010
In this paper, we investigate the acoustic characteristics of sustained voices from normal subjects and patients with laryngeal pathologies. Perturbation methods (including jitter and shimmer), signal-to-noise ratio (SNR), and nonlinear dynamic methods (such as correlation dimension) are used to analyze normal and pathological voices. We find that jitter does not statistically discriminate between normal and pathological voices, but a significant difference is found for shimmer, SNR, and correlation dimension. The results suggest that nonlinear dynamic analysis may be valuable for the analysis of normal and pathological voices but perturbation analysis should be applied with caution for pathological voice analysis.
PDF

Diagnosis of Pathological Speech Signals Using Wavelet Transform

Jo, Cheol-Woo;Kim, Dae-Hyun
- 음성과학
- /
- 제4권2호
- /
- pp.17-24
- /
- 1998
In this paper a method to diagnose pathological voices using wavelet transform is sug gested. Pathological voices are collected from hospital and analyzed by the suggested method. Normal voices are collected separately and analyzed. Then the results are compared to find the differences in their characteristics. Three level wavelet transform is used. Normalized energy ratios between the levels and normalized peak-to-peak values are used as parameters. As a result, it was possible to distinguish between normal and pathological voices.
PDF

중국 성인의 음성에 관한 기본 음성 측정치 연구 (The Acoustic Study on the Voices of Chines Normal Adults)

김지채;정옥란
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2007년도 한국음성과학회 공동학술대회 발표논문집
- /
- pp.163-166
- /
- 2007
Our present study was performed to investigate acoustically the Chines normal adults' voices. 60 Chines normal adults (30 males and 30 females) of the age of 20 to 39 years oridyced systained vowel /a/ and, by analyzing them acoustically with Dr. Speech, we could get the fundamental frequency (Fo), jitter, shimmer, NNE. As results, on the average, male voices showed 1I8.1Hz in Fo, 0.186% in jitter, 1.12% in shimmer, and -13.7dB in NNE. And, female voices showed 252.4Hz in Fo, 0.186% in jitter, 0.81% in shimmer, and -1I.3dB in NNE. Every parameter except Fo showed no significant difference between male and female voices.
PDF

양성후두 질환 음성에 대한 여러 기존 피치검출 알고리즘의 성능 평가 (Performance Assessment of Several Established Pitch Detection Algorithms in Voices of Benign Vocal Fold Lesions)

장승진;최성희;김효민;최홍식;윤영로
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2007년도 하계종합학술대회 논문집
- /
- pp.407-408
- /
- 2007
Robust pitch estimation is an important study in many areas of speech processing. In voice pathology, diverse statistics extracted form pitch were commonly used to test voice quality. In this study, we compared several established pitch detection algorithms (PDAs) for verification of adequacy of the PDAs. In the database of total pathological voices of 99 and normal voices of 30, an analysis of errors related with pitch detection was evaluated between pathological and normal voices, or among the types of pathological voices such as benign vocal fold lesions; polyp, nodule, and cysts. Consequently, it is required to survey the severity of tested voice in order to obtain accurate pitch estimates.
PDF

피치 반감 배가를 유발하는 병적인 음성 분석을 위한 강인한 피치 검출 알고리즘 (Robust Pitch Detection Algorithm for Pathological Voice inducing Pitch Halving and Doubling)

장승진;최성희;김효민;최홍식;윤영로
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 2007년도 제38회 하계학술대회
- /
- pp.1797-1798
- /
- 2007
In field of voice pathology, diverse statistics extracted form pitch estimation were commonly used to assess voice quality. In this study, we proposed robust pitch detection algorithm which can estimate pitch of pathological voices in benign vocal fold lesions. we also compared our proposed algorithm with three established pitch detection algorithms; autocorrelation, simplified inverse filtering technique, and nonlinear state-space embedding methods. In the database of total pathological voices of 99 and normal voices of 30, an analysis of errors related with pitch detection was evaluated between pathological and normal voices, or among the types of pathological voices. According to the results of pitch errors, gross pitch error showed some increases in cases of pathological voices; especially excessive increase in PDA based on nonlinear time-series. In an analysis of types of pathological voices classified by aperiodicity and the degree of chaos, the more voice has aperiodic and chaotic, the more growth of pitch errors increased. Consequently, it is required to survey the severity of tested voice in order to obtain accurate pitch estimates.
PDF

Acoustic Analysis with Moving Window in Normal and Pathologic Voices

Choi, Seong-Hee;Lee, Ji-Yeoun;Jiang, Jack J.
- 말소리와 음성과학
- /
- 제2권3호
- /
- pp.165-170
- /
- 2010
In this study, the most stable portion was identified using 5% moving window during /a/ sustained phonation in normal and pathologic voice signals and the perturbation values were compared between normal and pathologic voices at the mid-point and at the most stable portion using moving window, respectively. The results revealed that some severe pathologic voice signals can be eligible for perturbation analysis by identifying the most stable portion with Err less than 10. In addition, the perturbation acoustic parameters did not differentiate the pathologic voice signals from the normal voice signals when the mid-point was selected to measure the perturbation analysis(p>0.05). However, significantly higher %shimmer and lower SNR values were observed in pathologic voices (p<0.05) when the most stable portion was selected by moving window. In conclusion, moving window could identify the most stable portion objectively which can allow toget the minimum perturbation values (%jitter, %shimmer) and maximum SNR values. Thus, moving window technique can be applicable for more reliable and accurate perturbation acoustic analysis.
PDF

성별에 따른 한국 정상 성인 음성의 음향학적 평가 기준치 (Acoustic Characteristics of the Voices of Korean Normal Adults by Gender on MDVP)

김재옥
- 말소리와 음성과학
- /
- 제1권4호
- /
- pp.147-157
- /
- 2009
The purpose of the study is to develop the normal voice database and to analyze the acoustic characteristics of Korean adults' voices by gender using MDVP. Eight categories in the 34 parameters of MDVP were analyzed in the voices of 170 Korean normal adults taken from /a/ vowel. Among them, Fundamental Frequency Parameters and Frequency Perturbation Parameters were significantly different by gender. In addition, Fundamental Frequency Parameters of our data were remarkably different from the data suggested in the MDVP program which currently used in clinics. Therefore, the data obtained from the current study can be effectively used for the diagnosis of voice disorders of Korean adults as the standard parameter values of MDVP.
PDF

한국 성인의 정상 음성에 관한 기본 음성 측정치 연구 (The Acoustic Study on the Voices of Korean Normal Adults)

표화영;심현섭;송윤경;윤영선;이은경;임성은;하현령;최홍식
- 음성과학
- /
- 제9권2호
- /
- pp.179-192
- /
- 2002
Our present study was performed to investigate acoustically the Korean normal adults' voices, with enough large number of subjects to be reliable. 120 Korean normal adults (60 males and 60 females) of the age of 20 to 39 years produced sustained three vowels, /a/, /i/, and /u/ and read a part of 'Taking a Walk' paragraph, and by analyzing them acoustically with MDVP of CSL, we could get the fundamental frequency ($F_{0}$), jitter, shimmer and NHR of sustained vowels: speaking fundamental frequency ($SF_{0}$), highest speaking frequency (SFhi), lowest speaking frequency (SFlo) of continuous speech. As results, on the average, male voices showed 118.1$\sim$122.6 Hz in $F_{0}$, 0.467$\sim$0.659% in jitter, 1.538$\sim$2.674% in shimmer, 0.117$\sim$0.114 in NHR, 120.8 Hz in $SF_{0}$, 183.2 Hz in SFhi, 82.6 Hz in SFlo. And, female voices showed 211.6∼220.3 Hz in F0, 0.678∼0.935% in jitter, 1.478∼2.582% in shimmer, 0.098∼0.114 in NHR, 217.1 Hz in $SF_{0}$, 340.9 Hz in SFhi, 136.0 Hz in SFlo. Among the 7 parameters, every parameters except shimmer showed the significant difference between male and female voices. And, when we compared the three vowels, they showed significant differences one another in shimmer and NHR of both genders, but not in $F_{0}$ of males and jitter of females.
PDF

운율이식을 통해 나타난 감정인지 양상 연구 (A Study on the Perceptual Aspects of an Emotional Voice Using Prosody Transplantation)

이서배
- 대한음성학회지:말소리
- /
- 제62호
- /
- pp.19-32
- /
- 2007
This study investigated the perception of emotional voices by transplanting some or all of the prosodic aspects, i.e. pitch, duration, and intensity, of the utterances produced with emotional voices onto those with normal voices and vice versa. Listening evaluation by 24 raters revealed that prosodic effect was greater than segmental & vocal quality effect on the preception of the emotion. The degree of influence of prosody and that of segments & vocal quality varied according to the type of emotion. As for fear, prosodic elements had far greater influence than segmental & vocal quality elements whereas segmental and vocal elements had as much effect as prosody on the perception of happy voices. Different amount of contribution to the perception of emotion was found among prosodic features with the descending order of pitch, duration and intensity. As for the length of the utterances, the perception of emotion was more effective with long utterances than with short utterances.
PDF

모방 발화의 음향음성학적 연구(3) -전문 성대 모사자의 자료를 중심으로- (An Acoustic Study on the Voice Imitation(3) - Based on a professional voice imitator′s speech -)

안병섭;박미영
- 대한음성학회지:말소리
- /
- 제52호
- /
- pp.1-14
- /
- 2004
In this study, we investigated acoustic characteristics of imitated utterances by a professional voice imitator, focusing on prosodic properties such as vowel formants and f0 distribution. To see the patterns of a voice imitation by a professional voice imitator, we compared the imitator's voice data with target speakers' voice data. The professional imitator, Mr. Bae produced utterances imitating the former President Kim's, the comedian Choi's, and the singer Bae's voices. Auditorily, the imitator was judged to imitate all the target speakers' voices successfully. However, acoustic examination showed that the imitator was better at imitating the singer Bae's voice in that the imitator's and the singer Bae's voices are more alike with respect to vowel formants and f0 distribution. We infer this is because the imitator's normal voice is very similar to the singer Bae's voice. On the other hand, the imitator's voice data showed that the patterns of vowel formants and f0 distribution found in the imitator's imitation voices of the other two target speakers were different from those of target speakers' voices.
PDF

검색결과 46건 처리시간 0.023초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)