• 제목/요약/키워드: voice frequency

검색결과 546건 처리시간 0.029초

음성합성시스템을 위한 음색제어규칙 연구 (A Study on Voice Color Control Rules for Speech Synthesis System)

  • 김진영;엄기완
    • 음성과학
    • /
    • 제2권
    • /
    • pp.25-44
    • /
    • 1997
  • When listening the various speech synthesis systems developed and being used in our country, we find that though the quality of these systems has improved, they lack naturalness. Moreover, since the voice color of these systems are limited to only one recorded speech DB, it is necessary to record another speech DB to create different voice colors. 'Voice Color' is an abstract concept that characterizes voice personality. So speech synthesis systems need a voice color control function to create various voices. The aim of this study is to examine several factors of voice color control rules for the text-to-speech system which makes natural and various voice types for the sounding of synthetic speech. In order to find such rules from natural speech, glottal source parameters and frequency characteristics of the vocal tract for several voice colors have been studied. In this paper voice colors were catalogued as: deep, sonorous, thick, soft, harsh, high tone, shrill, and weak. For the voice source model, the LF-model was used and for the frequency characteristics of vocal tract, the formant frequencies, bandwidths, and amplitudes were used. These acoustic parameters were tested through multiple regression analysis to achieve the general relation between these parameters and voice colors.

  • PDF

Dr. Speech Science의 음성합성프로그램을 이용하여 합성한 정상음성과 병적음성(Pathologic Voice)의 음향학적 분석 (Acoustic Analysis of Normal and Pathologic Voice Synthesized with Voice Synthesis Program of Dr. Speech Science)

  • 최홍식;김성수
    • 대한후두음성언어의학회지
    • /
    • 제12권2호
    • /
    • pp.115-120
    • /
    • 2001
  • In this paper, we synthesized vowel /ae/ with voice synthesis program of Dr. Speech Science, and we also synthesized pathologic vowel /ae/ by some parameters such as high frequency gain (HFG), low frequency gain(LFG), pitch flutter(PF) which represents jitter value and flutter of amplitude(FA) which represents shimmer value, and grade ranked as mild, moderate and severe respectively. And then we analysed all pathologic voice by analysis program of Dr. Speech Science. We expect that this synthesized pathologic voices are useful for understanding the parameter such as noise, jitter and shimmer and feedback effect to patient with voice disorder.

  • PDF

파킨슨증의 음성진전 : 감별진단을 위한 예비연구 (Voice Tremor in Parkinsonism : A Preliminary Study for Differential Diagnosis)

  • 최성희;김향희;이원용;최홍식
    • 음성과학
    • /
    • 제12권3호
    • /
    • pp.19-33
    • /
    • 2005
  • Tremor is a main factor of parkinsonism. Voice tremor may be the first, later or the only symptom of a neurological disease and its frequency, amplitude, and regularity may differ among the diseases of different neural subsystems. Differential diagnosis between idiopathic Parkinson's disease (IPD) and multiple system atrophy (MSA) has been difficult. This study included three groups: (1) 6 IPD patients; (2) 6 MSA patients; and (3) 20 ageand sex-matched normal controls. The MDVP (Multidimensional Voice Program) was used to analyze the sustained /a/phonation. The results were as follows: (1) frequency perturbation parameters (jitter, sPPQ, Vf0) and FTRI of tremor parameter of two patient groups were statistically different from those of the controls (p < .01); (2) measures were higher in short-term and long-term f0 and amplitude perturbation in MSA than IPD; (3) however, any acoustic parameters between IPD and MSA were not statistically different; except for the rate of frequency tremor, 4$\sim$5 Hz in IPD, 5$\sim$11 Hz in MSA and (4) the pattern of regularity for voice tremor through histogram indicated that amplitude of IPD was irregular while both f0 and amplitude of MSA were irregular. In conclusion, F0, rate of frequency tremor, and pattern of f0 regularity may be predictors for differential diagnosis. These findings might signify that voice tremor of parkinsonism was resulted from modulation of f0.

  • PDF

모방의 대상이 되는 음성적 특성에 관한 연구 (A Study on the Phonetic Parameters Used on the Voice Imitation)

  • 박지혜;신지영;강선미
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 5월 학술대회지
    • /
    • pp.187-190
    • /
    • 2003
  • The purpose of this paper is to research the phonetic parameters used on the voice imitation. First of all, the fundamental frequency is imitated effectively. Distinctive prosodic patterns are used repeatedly on the voice imitation. Speaking rate is used in special measure in case the target speaker has extraordinary speaking rate. Also formant frequency is imitated variously. In sum, distinctive characteristics perceived by listener are used on voice imitation.

  • PDF

An Enhanced Clarity of Husky Voice by Dissonant Frequency Filtering

  • Kang, Sang-Ki;Baek, Seong-Joon
    • 음성과학
    • /
    • 제12권4호
    • /
    • pp.71-76
    • /
    • 2005
  • There have been numerous studies on the enhancement of noisy speech signal. In this paper, we propose a new speech enhancement method, that is, a filtering of a dissonant frequency combined with noise suppression algorithm. The simulation results indicate that the proposed method provides a significant gain in voice clarity. Therefore if the proposed enhancement scheme is used as a pre-filter, the perceptual clarity of husky voice is greatly enhanced.

  • PDF

A Study on Stable Motion Control of Humanoid Robot with 24 Joints Based on Voice Command

  • Lee, Woo-Song;Kim, Min-Seong;Bae, Ho-Young;Jung, Yang-Keun;Jung, Young-Hwa;Shin, Gi-Soo;Park, In-Man;Han, Sung-Hyun
    • 한국산업융합학회 논문집
    • /
    • 제21권1호
    • /
    • pp.17-27
    • /
    • 2018
  • We propose a new approach to control a biped robot motion based on iterative learning of voice command for the implementation of smart factory. The real-time processing of speech signal is very important for high-speed and precise automatic voice recognition technology. Recently, voice recognition is being used for intelligent robot control, artificial life, wireless communication and IoT application. In order to extract valuable information from the speech signal, make decisions on the process, and obtain results, the data needs to be manipulated and analyzed. Basic method used for extracting the features of the voice signal is to find the Mel frequency cepstral coefficients. Mel-frequency cepstral coefficients are the coefficients that collectively represent the short-term power spectrum of a sound, based on a linear cosine transform of a log power spectrum on a nonlinear mel scale of frequency. The reliability of voice command to control of the biped robot's motion is illustrated by computer simulation and experiment for biped walking robot with 24 joint.

성악가의 성종 구분에 관한 문헌적 고찰 (Voice Classification of Trained Classic Singers)

  • 남도현;백재연;최홍식
    • 대한후두음성언어의학회지
    • /
    • 제18권1호
    • /
    • pp.56-61
    • /
    • 2007
  • Introduction: Actually classification of classic singers' voice depends on habitual judgment by voice teachers or voice trainer referring to vocal timbre, vocal range and vocal quality. Such judgments, however, may turn out to be incorrect because they are based on subjective opinions. Therefore, more objective methodology is required. Method: Foreign dissertations searched through Pub Med, along with foreign and domestic journals, were reviewed regard ing how singers' voice has been categorized. Results: Vocal range, vocal timbre, voice quality, fundamental frequency of habitual speaking, length of vocal tract, the length from cricoid cartilage to thyroid cartilage's thyroid notch and length of vocal fold, tone of passaggio as well as traditional approaches such as perceptual judgment used by professional singers have been used for categorize the voice classification. Conclusion: To optimize categorizing singers' voice, vocal range, vocal timbre, voice quality, fundamental frequency of habitual speaking, length of vocal tract, the length from cricoid cartilage to thyroid cartilage's thyroid notch and length of vocal fold, tone of passaggio may be totally recommended.

  • PDF

음성장애의 병인 집단 간 추정 발화 기본주파수 절대 오차 비교 (A comparison of the absolute error of estimated speaking fundamental frequency (AEF0) among etiological groups of voice disorders)

  • 이승진;임재열;김재옥
    • 말소리와 음성과학
    • /
    • 제15권4호
    • /
    • pp.53-60
    • /
    • 2023
  • 본 연구에서는 음성장애 환자에서 음성 범위 프로파일(voice range profile, VRP)과 말 범위 프로파일(speech range profile, SRP)을 이용한 추정 발화 기본주파수 절대 오차(absolute error of estimated speaking fundamental frequency, AEF0)를 음성장애의 병인 집단 간에 비교하여 차이를 확인하고,각 병인 집단 별로 AEF0와 관련된 변수들 간의 상관관계를 살펴보고자 하였다. 연구대상은 음성장애로 진단된 기능적(functional, FUNC), 기질적(organic, ORGAN), 신경학적(neurogenic, NEUR) 음성장애 환자군과 정상군(normal control, NC) 각 30명(남 15명, 여 15명)으로 총 120명이었다. 각 대상자로 하여금 음성, 말 범위 프로파일 과제를 수행하도록 하고 전기성문파형검사(electroglottography, EGG)를 통해 발화 기본주파수를 측정하였다. 병인 집단 간 AEF0의 비교 결과, Grade와 Severity는 병인 집단 간 차이가 없었던 반면, AEF0VRP와 AEF0SUM에서 병인 집단 간 차이가 있어 AEF0VRP는 ORGAN이 FUNC와 NC보다 높았으며, AEF0SUM은 ORGAN이 NC보다 높았다. 또한 FUNC와 NEUR에서는 AEF0가 Grade와 양의 상관관계를 보인 반면, ORGAN은 CQ(closed quotient)와 양의 상관관계가 있었다. 따라서 병인 집단에 따라 AEF0의 적용과 관련 음성 변수를 살펴보는 데 주의를 기울여야 할 것으로 보이며, 본 연구는 이러한 임상적 판단에 대한 기초 자료를 마련하는 데 일조한 것으로 여겨진다.

성대용종 환자의 음성치료 효과 (The Effect of Voice Therapy in Vocal Polyp Patients)

  • 김성태;정고은;김상윤;최승호;임길채;한주희;남순열
    • 말소리와 음성과학
    • /
    • 제1권2호
    • /
    • pp.43-49
    • /
    • 2009
  • Vocal polyps are benign phonotraumatic lesions which are traditionally treated using phonomicrosurgical techniques. In the case of hyperfunctional voice use, voice therapy is effective and results in voice improvement. However, the utility of voice therapy about vocal polyp is in great demand. The purpose of this study was to evaluate the effects of voice therapy in patients with vocal polyps. The authors reviewed the medical records of 193 patients with vocal nodules or vocal polyps, and 64 patients (31 nodules and 33 polyps) were enrolled. All of the subjects had received explanation of problems, vocal hygiene education, and been treated by the $SKMVTT^{(R)}$ (Seong-Tae Kim's multiple voice therapy technique) ranging from 4 to 16 sessions (mean: 8.6 sessions). All subjects were examined by perceptual assessment, acoustic and aerodynamic measures, and VRP (voice range profile). In perceptual assessment, patients with vocal nodules had more breathy and strained voices than the vocal polyp group. Both groups significantly reduced rough, breathy voice after voice therapy. Patients with vocal polyps had worse voice quality than patients with nodules in acoustic measures. Both groups showed reduced jitter and shimmer after voice therapy. In aerodynamic measures, MPT and Psub were increased, and MFR was reduced (p<.05). Participants' frequency range and intensity range were increased after voice therapy, but only frequency range resulted in a significant difference (p<.05). In conclusion, the therapeutic effect of voice therapy in patients with vocal nodules and polyps was demonstrated perceptually and acoustically. We can suggest that voice therapy, including advice, vocal hygiene, and $SKMVTT^{(R)}$ is a useful as an initial choice of treatment for patients with vocal polyps before considering a surgical approach.

  • PDF

성악 훈련을 받은 성악인에서의 Voice Range Profile (Voice Range Profiles of Trained Classical Singers)

  • 정성민
    • 대한후두음성언어의학회지
    • /
    • 제11권1호
    • /
    • pp.69-75
    • /
    • 2000
  • Background and Objectives : The Voice Range Profile(VRP) is a two-dimensional graphic dysplay of an individual's amplitude range as a function of total fundamental frequency range. It is designed as a maximum performance test which can be used as a general indicator of voice problems in the non-professional voice and as a sensitive indicator of problems with the professional voice. The purpose of the study is to obtain a baseline VRT for the classical professional singers and compare it with the normal nonsinger's profile. We also compared the difference of VRP between the classical professional singers who have normal vocal fold and who have vocal folds lesions without dysphonia. Materials and Methods : The VRPs were elicited. from 42 trained classical singers(Soprano 26, Mesosoprano 5, Tenor 9, Bariton 2) and 20 untrained nonsingers(female 10, male 10) using Voice Range Profile Model 4326(Kay Elemetrics USA). The mean values for phonational range with highest and lowest pitch level and range of voice intensity with maximum and minimum intensity level were compared between classical singers and nonsingers. Results and Conclusions : The frequency range and dynamic range were significantly increased for the classical singers in comparison to the nonsingers. But there was no significant difference were found for the VRP between the parts in the classical singers. The classical singers who have vocal fold lesions showed slightly decreased VRP compared to those with healthy vocal folds.

  • PDF