Search | Korea Science

VOICE SOURCE ESTIMATION USING SEQUENTIAL SVD AND EXTRACTION OF COMPOSITE SOURCE PARAMETERS USING EM ALGORITHM

Hong, Sung-Hoon;Choi, Hong-Sub;Ann, Sou-Guil
- Proceedings of the Acoustical Society of Korea Conference / 1994.06a / pp.893-898 / 1994
This paper examines how voice source estimation and modeling affect speech synthesis and coding, then proposes new estimation and modeling techniques and verifies them by computer simulation. Existing speech synthesizers are known to produce speech that sounds dull and lifeless. These problems arise because existing estimation and modeling techniques cannot provide sufficiently accurate voice parameters. We therefore propose a new voice source estimation algorithm and modeling techniques that can represent a variety of source characteristics. First, the speech samples in one pitch region are divided into four parts with different characteristics. Second, the vocal-tract parameters and voice source waveforms are estimated differently in each region using sequential SVD. Third, we propose a composite source model, a new voice source model represented by a weighted sum of pre-defined basis functions. Finally, the weights and time-shift parameters of the proposed composite source model are estimated using the EM (expectation-maximization) algorithm. Experimental results indicate that the proposed estimation and modeling methods estimate voice source waveforms more accurately and represent various source characteristics.
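As a rough illustration of the composite source model described in this abstract (a weighted sum of pre-defined basis functions, each with its own time shift), the synthesis side could be sketched as below. The abstract does not say which basis functions the authors used or how the EM step is set up, so the function name `composite_source` and the `pulse` and `ramp` bases are hypothetical placeholders; the sketch only evaluates the model for given weights and shifts and does not estimate them.

```python
def composite_source(weights, shifts, bases, length):
    """Evaluate g(n) = sum_k w_k * b_k(n - tau_k) for n = 0 .. length-1.

    weights, shifts and bases are parallel sequences; each basis is a
    function of an integer sample index. All names here are illustrative
    assumptions, not taken from the paper.
    """
    out = [0.0] * length
    for w, tau, basis in zip(weights, shifts, bases):
        for n in range(length):
            out[n] += w * basis(n - tau)
    return out


# Hypothetical basis functions, chosen only to make the example concrete.
def pulse(n):
    """Unit impulse at n = 0."""
    return 1.0 if n == 0 else 0.0


def ramp(n):
    """Short rising ramp on 0 <= n < 4, zero elsewhere."""
    return float(n) if 0 <= n < 4 else 0.0


if __name__ == "__main__":
    # A pulse delayed by 1 sample plus a half-weight ramp delayed by 2 samples.
    print(composite_source([2.0, 0.5], [1, 2], [pulse, ramp], 6))
```

In the setting the abstract describes, the weights and time shifts would be the unknowns fitted to an estimated source waveform; here they are simply supplied by hand.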

An Interdisciplinary Study of A Leaders' Voice Characteristics: Acoustical Analysis and Members' Cognition

Hahm, SangWoo;Park, Hyungwoo
- KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.12 / pp.4849-4865 / 2020
The traditional roles of leaders are to influence members and motivate them to achieve shared goals in organizations. However, leaders such as top managers and chief executive officers, in practice, do not always directly meet or influence other company members. In fact, they tend to have the greatest impact on their members through formal speeches, company procedures, and the like. As such, official speech is directly related to the motivation of company employees. In an official speech, not only the contents of the speech but also the voice characteristics of the speaker have an important influence on listeners, as the different vocal characteristics of a person can have different effects on the listener. Therefore, according to the voice characteristics of a leader, the cognition of the members may change, and the degree to which the members are influenced and motivated will differ. This study identifies how members may perceive a speech differently according to the different voice characteristics of leaders in formal speeches. Further, different perceptions of voices will influence members' cognition of the leader, for example, in how trustworthy the leader appears. The study analyzed recorded speeches of leaders and extracted features of their speaking style through digital speech signal analysis. Then, parameters were extracted and analyzed by the time domain, frequency domain, and spectrogram domain methods. We also analyzed the parameters for use in Natural Language Processing. We investigated which leaders' voice characteristics had more influence on members or were more effective for them. A person's voice characteristics can be changed. Therefore, leaders who seek to influence members in formal speeches should have effective voice characteristics to motivate followers.
https://doi.org/10.3837/tiis.2020.12.013

Acoustic properties of vowels produced by cerebral palsic adults in conversational and clear speech (뇌성마비 성인의 일상발화와 명료한 발화에서의 모음의 음향적 특성)

Ko Hyun-Ju;Kim Soo-Jin
- Proceedings of the KSPS conference / 2006.05a / pp.101-104 / 2006
The present study examined two acoustic characteristics (duration and intensity) of vowels produced by 4 adults with cerebral palsy and 4 nondisabled adults in conversational and clear speech. In this study, clear speech means that the speaker (1) slows the speech rate slightly and (2) articulates all phonemes accurately and increases vocal volume. The speech material consisted of 10 bisyllabic real words embedded in frame sentences. Temporal-acoustic analysis showed that vowels produced by both speaker groups in clear speech (here, more accurate and louder speech) were significantly longer than vowels in conversational speech. In addition, the intensity of vowels produced by the speakers with cerebral palsy was higher in clear speech than in conversational speech.

A survey on the voice related needs of occupational voice users (직업적 음성사용자의 음성관련 요구 조사)

Lee, Eun-Jeong;Kim, Wha-Soo
- Phonetics and Speech Sciences / v.7 no.2 / pp.39-45 / 2015
This research investigated the voice-related needs of occupational voice users. Data collected from teachers (379), telemarketers (156), and therapists (50) were classified by content using Colaizzi's inductive categorical analysis. The voice-related needs fall into 3 broad categories: 1) how to use, 2) how to care, and 3) how to be healthy. The category 'how to use my voice' was further divided into 6 sub-categories: (1) efficiently, (2) as I desired, (3) without pain (discomfort), (4) expressively, (5) phonation (methods), and (6) clear articulation. The results showed that the needs of the 3 groups of occupational voice users reflect the environments in which they have to use their voices as well as the voice characteristics wanted by their specific listeners.
https://doi.org/10.13064/KSSS.2015.7.2.039

First Record of a Brown Frog Rana huanrenensis (Family Ranidae) from Korea

Yang, Suh-Yung;Kim, Jong-Bum;Min, Mi-Sook;Suh, Jae-Hwa;Kang, Young-Jin;Matsui, Masafumi;Fei, Liang
- Animal cells and systems / v.4 no.1 / pp.45-50 / 2000
We found a brown frog species previously unrecorded from South Korea, Rana huanrenensis Fei, Ye, and Huang (1990). This species was originally described from northeastern China. In having 2n=24 chromosomes, this species is closely related to Rana dybowskii, R. chensinensis, R. ornativentris, R. pirica, and Chinese R. huanrenensis, but it differs from the first four species in ecological, morphological, and genetic characteristics. By contrast, it is identical to Chinese R. huanrenensis in its montane stream-breeding habitat, absence of the vocal sac, and genetic properties. This record is a significant range extension of R. huanrenensis.

On the Flattening Techniques of Vocal track characteristics by using position information of the LSP (Line Spectrum Pairs) (LSP parameter의 위치정보를 이용한 성도특성 평탄화기법)

Kim YoungKyou;MIN SoYeon;BAE MyungJin
- Proceedings of the Acoustical Society of Korea Conference / spring / pp.171-174 / 2002
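Speech signals tend to have attenuated high-frequency content because of glottal characteristics. A pre-emphasis filter is used to compensate for this. It can be written as the difference equation y(n) = s(n) - A s(n-1), where A is usually set to a value between 0.9 and 1. However, in compensating for the high-frequency characteristics, the pre-emphasis filter distorts the zeros as well as the poles. In this paper, we use the distribution of LSP (Line Spectrum Pairs) parameters, which varies with the speech characteristics, to preserve the zeros and to emphasize the high-frequency or low-frequency characteristics that are essential for vocoders and coding.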
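The pre-emphasis difference equation quoted in this abstract, y(n) = s(n) - A s(n-1), can be sketched in a few lines. This is only a minimal illustration of the conventional filter the abstract mentions, not the LSP-based flattening method the paper proposes. The function name `pre_emphasis`, the default `a=0.95`, and the choice s(-1) = 0 for the first sample are assumptions made for the example.

```python
def pre_emphasis(signal, a=0.95):
    """Apply y(n) = s(n) - a * s(n-1) to a sequence of samples.

    The sample before the start of the signal, s(-1), is assumed to be 0.
    The default a = 0.95 is an arbitrary pick from the 0.9 to 1 range
    mentioned in the abstract.
    """
    out = []
    prev = 0.0  # assumed s(-1)
    for s in signal:
        out.append(s - a * prev)
        prev = s
    return out


if __name__ == "__main__":
    # A constant (low-frequency) input is attenuated after the first sample,
    # while an alternating (high-frequency) input is boosted.
    print(pre_emphasis([1.0, 1.0, 1.0], a=0.5))
    print(pre_emphasis([1.0, -1.0, 1.0], a=0.5))
```

The two inputs illustrate why the filter compensates for weak high frequencies: slowly varying samples largely cancel against their predecessor, while rapidly alternating samples add to it.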

Voice Similarities between Sisters

Ko, Do-Heung
- Speech Sciences / v.8 no.3 / pp.43-50 / 2001
This paper deals with voice similarities between sisters, who are assumed to share physiological characteristics inherited from the same biological mother. Nine pairs of sisters who are believed to have similar voices participated in this experiment. The speech samples obtained from one pair of sisters were excluded from the analysis because their perceptual score was relatively low. The words were measured both in isolation and in context, and the subjects were asked to read the text five times with an interval of about three seconds between readings. Recordings were made at natural speed in a quiet room. The data were analyzed for pitch and formant frequencies using CSL (Computerized Speech Lab) and PCQuirer. It was found that data for the initial vowels are much more similar and homogeneous than those for vowels in other positions. The acoustic data showed that voice similarities are strikingly high in both pitch and formant frequencies. It is assumed that the statistical data obtained from this experiment can be used as a guideline for modelling speaker identification and speaker verification.

A Study on Korean, English and Japanese Speaker Recognitions Using the Peak and Valley Pitch Detection and the Fuzzy Theory (PVPF방법과 퍼지 이론을 이용한 한국어, 영어 및 일본어 화자 인식에 관한 연구)

Kim, Yeon-Suk
- The Transactions of the Korea Information Processing Society / v.6 no.2 / pp.522-533 / 1999
This paper proposes a speaker recognition algorithm that combines a pitch parameter with fuzzy inference. It proposes a pitch detection method, PVPF (peak and valley pitch detection function), which compares spectra by exploiting the transform characteristics between time and frequency. The method builds reference patterns using membership functions and performs vocal tract recognition of common characters using fuzzy pattern matching, in order to account for the width of time variation in non-linear utterance time.

Analyzing clinical and genetic aspects of axonal Charcot-Marie-Tooth disease

Kwon, Hye Mi;Choi, Byung-Ok
- Journal of Genetic Medicine / v.18 no.2 / pp.83-93 / 2021
Charcot-Marie-Tooth disease (CMT) is the most common hereditary motor and sensory peripheral neuropathy. CMT is usually classified into two categories based on pathology: demyelinating CMT type 1 (CMT1) and axonal CMT type 2 (CMT2) neuropathy. CMT1 can be distinguished by a median motor nerve conduction velocity below 38 m/s. The main clinical features of axonal CMT2 neuropathy are distal muscle weakness, sensory loss, and areflexia. In addition, patients show unusual clinical features, including delayed development, hearing loss, pyramidal signs, vocal cord paralysis, optic atrophy, and abnormal pupillary reactions. Recently, customized treatments for genetic diseases have been developed, and prenatal diagnosis can enable the birth of a normal child when the causative gene mutation is found in CMT2. Therefore, accurate diagnosis based on genotype/phenotype correlations is becoming more important. In this review, we describe the latest findings on the phenotypic characteristics of axonal CMT2 neuropathy. We hope that this review will be useful for clinicians in the diagnosis and treatment of CMT.
https://doi.org/10.5734/JGM.2021.18.2.83

A Case of Voice Therapy for Long Standing Functional Aphonia (장시간 지속된 기능적 실성증에 대한 음성치료 1예)

Kim, Bo Ram;Woo, Joo Hyun
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.33 no.2 / pp.119-122 / 2022
Functional aphonia is a disorder in which normal vocal ability is suddenly lost. When voice therapy is started at an early stage, the prognosis is good. However, if functional aphonia persists for a long time, the voice disorder may become fixed, though reports of such cases are rare. The authors treated a patient with functional aphonia that began in adolescence and lasted for 7 months, and report the result of treatment.
https://doi.org/10.22469/jkslp.2022.33.2.119
