Search | Korea Science

Pre-Processing for Performance Enhancement of Speech Recognition in Digital Communication Systems (디지털 통신 시스템에서의 음성 인식 성능 향상을 위한 전처리 기술)

Seo, Jin-Ho;Park, Ho-Chong
- The Journal of the Acoustical Society of Korea / v.24 no.7 / pp.416-422 / 2005
Speech recognition in digital communication systems performs very poorly because of the spectral distortion caused by speech codecs. In this paper, the spectral distortion introduced by speech codecs is analyzed, and a pre-processing method that compensates for this distortion is proposed to enhance speech recognition performance. Three standard speech codecs, IS-127 EVRC, ITU G.729 CS-ACELP and IS-96 QCELP, are considered for algorithm development and evaluation, and a single method that can be applied to all of them is developed. The performance of the proposed method is evaluated for the three codecs. By using speech features extracted from the compensated spectrum, the recognition rate is improved by up to 15.6% compared with that obtained using the degraded speech features.

Spectral Shape Invariant Real-time Voice Change System (스펙트럼 형태 불변 실시간 음성 변환 시스템)

Kim Weon-Goo
- Journal of the Korean Institute of Intelligent Systems / v.15 no.1 / pp.48-52 / 2005
In this paper, a spectral-shape-invariant real-time voice change method is proposed to change a speaker's voice into a mechanical voice. For this purpose, LPC analysis and synthesis are used to preserve the spectrum of the voice while allowing the pitch of the synthesized speech to be changed freely. In the proposed method, a gain matching method is applied to the excitation signal generator to make the changed voice sound natural. To evaluate the performance of the proposed method, voice change experiments were conducted. The experimental results showed that the original speech signal is changed into a mechanical voice signal in which the content of the speaker's voice is conveyed correctly despite the drastic change in pitch. The system is implemented on a TI TMS320C6711 DSK board to verify that it runs in real time.
https://doi.org/10.5391/JKIIS.2005.15.1.048

GRBAS and Voice Handicap Index (GRBAS 음성평가와 음성장애지수)

Sohn, Jin-Ho
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.19 no.2 / pp.89-95 / 2008
Subjective voice evaluation is a necessary and important complement to objective voice evaluation in assessing voice disorders. Subjective voice evaluation is divided into assessment by the examiner and assessment by the examinee. Examiner assessment is a perceptual judgment of the patient's voice, using tools such as the GRBAS scale, the Buffalo voice profile, and the consensus auditory perceptual evaluation of voice (CAPE-V). Examinee assessment consists of indirect methods, including the voice handicap index (VHI), voice outcome survey (VOS), voice symptom scale (VoiSS), and voice-related quality of life (V-RQOL), and a direct method, the patient's self-subjective voice rating. This review article describes the general rules, advantages, and pitfalls of the GRBAS scale, the VHI, and the patient's self-subjective voice rating, which are currently the most representative voice assessment tools.

The Effect of Helium Gas Intake on the Characteristics Change of the Acoustic Organs for Voice Signal Analysis Parameter Application (음성신호 분석 요소의 적용으로 헬륨가스 흡입이 음성 기관의 특성 변화에 미치는 영향)

Kim, Bong-Hyun;Cho, Dong-Uk
- The KIPS Transactions:PartB / v.18B no.6 / pp.397-404 / 2011
In this paper, we carried out experiments that apply voice analysis parameters to measure how the characteristics of the articulators change when helium gas is inhaled. Divers use helium gas to avoid the air embolism caused by nitrogen gas, which can be fatal. However, helium gas makes a diver's voice squeaky and poorly articulated, which causes considerable difficulty in interpreting it. Therefore, we carried out experiments that measure and analyze pitch and spectrograms to determine the influence on the vocal organs before and after helium gas is inhaled.
https://doi.org/10.3745/KIPSTB.2011.18B.6.397

Post-Processing of Voice Recognition Using Phonologic Rules and Morphologic analysis (음절 복원 규칙과 형태소 분석을 이용한 음성인식 후처리)

Seo, Sang-Hyun;Kim, Jae-Hong;Kim, Hae-Jin;Kim, Mi-Jin;Lee, Sang-Jo
- Annual Conference on Human and Language Technology / 1997.10a / pp.495-499 / 1997
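As computers have become ubiquitous, research on natural language interfaces that enable easy and natural communication between computers and users has been actively pursued. Among these, speech recognition has drawn particular attention as a field that can meet the needs of general computer users, for example through voice commands and dictation systems. However, speech recognition on its own is limited in its recognition rate, and a post-processing stage is needed to improve the recognition results. In this paper, to improve speech recognition performance, we implemented a system that restores the continuous Korean speech produced as a recognition result to the correct syllables. The system takes continuous Korean speech in eojeol (word-phrase) units as input, applies Korean pronunciation rules in reverse to restore the original syllables, and uses a morphological analyzer to check and correct the restored syllables. In an experiment on sentences from elementary school textbooks, the system achieved a restoration rate of 90.42%. Homonyms account for a large share of the cases that are currently not restored correctly, and this problem could be improved to some extent by using syntactic or semantic analysis.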

Effect Analysis of Kidney Cupping Therapy based on Voice Signal Analysis (음성신호 분석 기반의 신장 부항요법 효과 분석)

Cho, Dong-Uk;Jeong, Yeon-Ho;Ka, Min-Kyoung;Kim, Bong-Hyun
- Proceedings of the Korea Information Processing Society Conference / 2013.11a / pp.1474-1475 / 2013
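Cupping treats illness by using heat or a negative-pressure device to create negative pressure inside a cupping jar and attaching the jar to the skin, which draws blood or induces congestion and applies physical stimulation. The physical stimulation obtained from cupping promotes blood circulation and stimulates the blood vessels by drawing out stagnant blood, which yields various effects. In this paper, we therefore stimulated the Myeongmun acupoint, which corresponds to the kidney, and measured changes in kidney-related voice analysis parameters. To this end, we selected 10 subjects with no kidney abnormalities and collected their voices before and after stimulation of the Myeongmun acupoint. In the experiment, we applied the first formant bandwidth, a voice analysis parameter related to the kidney, to measure and analyze the change before and after stimulation. As a result, 90% of the subjects showed a decrease in the value, which allowed us to analyze the correlation between the kidney and the voice signal under stimulation of the Myeongmun acupoint.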
https://doi.org/10.3745/PKIPS.y2013m11a.1474

Classification of Sasang Constitution Taeumin by Comparative of Speech Signals Analysis (음성 분석 정보값 비교를 통한 사상체질 태음인의 분류)

Kim, Bong-Hyun;Lee, Se-Hwan;Cho, Dong-Uk
- The KIPS Transactions:PartB / v.15B no.1 / pp.17-24 / 2008
This paper proposes a method of classifying Sasang constitution by analyzing and comparing speech signal values. As a first step in an overall system intended to provide objective indices of Sasang constitution, we propose a method of classifying Taeumin from the output values of speech signal analysis, linked to the process of classifying Soeumin through skin diagnosis. First, we extract the phonetic elements that show clear features for each Sasang constitution group. We then classify Taeumin based on the differences and similarities among the constitution groups in the resulting values. Finally, the effectiveness of the method is verified through experiments.
https://doi.org/10.3745/KIPSTB.2008.15-B.1.17

Continuance Use Intention of Voice Commerce Using the Value-attitude-behavior Model (가치-태도-행동 모델에 기반한 음성 쇼핑 지속이용의도에 관한 연구)

Kim, Hyo-Jung
- The Journal of the Korea Contents Association / v.22 no.5 / pp.491-502 / 2022
Voice technology allows consumers to make purchases through smart devices, and interest in voice-driven conversational commerce has expanded significantly. In this study, we explored the continuance use intention of voice commerce by adopting a value-attitude-behavior model. An online survey was conducted on 360 individuals who had used an artificial intelligence assistant device in a voice commerce environment. We used Amos 23.0 and SPSS 25.0 for descriptive, confirmatory, and structural equation modeling analyses. The results indicated that functional value had the strongest influence on satisfaction with voice commerce, while social, emotional, and epistemic values also influenced it significantly. Additionally, satisfaction with voice commerce significantly influenced the continuance use intention of voice commerce. These findings could help us understand the characteristics of voice commerce users and the diverse values present in the voice commerce environment.
https://doi.org/10.5392/JKCA.2022.22.05.491

Customized Speech Synthesis for Children with Characteristic Behavioral Patterns (어린이 행동 패턴에 기반한 개별화된 음성 합성)

Lee, Ho-Joon;Park, Jong-C.
- 한국HCI학회:학술대회논문집 / 2006.02a / pp.571-578 / 2006
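Exchanging information between users through speech requires no additional training or equipment and has almost no spatial constraints, so it can be used regardless of the user's age, including by the elderly and infirm. In addition, because speech information can produce a synergistic effect through interaction with other channels such as vision or touch, using it as an interface between people and machines can provide user-friendly services while improving information delivery. However, when the same type of speech information is repeatedly given to the user in the same situation, the monotony of expression can sharply reduce how well the information is conveyed. Therefore, when conveying information through speech, the system should be able to hold the user's attention even in the same situation by choosing differentiated sentence structures and vocabulary according to the user's behavioral patterns, psychological state, and surroundings. In this paper, we propose a system that provides customized speech synthesis results for children around five years of age, based on an analysis of their behavioral patterns. To this end, we analyze children's main behavioral patterns in the physical space of a kindergarten and, by having working kindergarten teachers convey the same information, identify the sentence structures and lexical characteristics of utterances according to the child's behavioral pattern, location, age, and personality. Finally, to produce customized speech synthesis results, we simulate a kindergarten space and use RFID to identify children's behavioral patterns and locations. We then reconstruct the sentence structure and vocabulary of the sentence to be synthesized, reflecting the sentence structures and lexical characteristics analyzed for each situation, to generate speech synthesis results customized to the user. Through these results, we examine how children's behavioral patterns affect the sentence structure and vocabulary of utterances, and we evaluate the reconstructed utterances.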

Quasi-periodic waveform analysis for diplophonia (이중음성에 대한 음성파형분석)

홍기환;김미정;정상술
- Proceedings of the KOR-BRONCHOESO Conference / 1993.05a / pp.71-71 / 1993
Diplophonia is a voice with two separate tones, produced through quasi-periodic variations in vocal cord vibration. It is generally regarded as a symptom of laryngeal pathology. The difference in vibratory frequency between the vocal cords can arise from a tension imbalance and a difference in the level of the vocal folds under special conditions such as incomplete glottal closure. The authors have experienced 19 cases of patients with diplophonia due to unilateral vocal cord paralysis, intracordal cysts, and other mass lesions. We analyzed the diplophonic voice in terms of the peak variability and noise level of the quasi-periodic waveforms and spectrograms, pre- and postoperatively.
