Search | Korea Science

On a Study of Measurement Method of Utterance Velocity for the Reduction of Transmission Rate in CELP Vocoder. (CELP 보코더 전송률 감소를 위한 발성속도 측정 방법)

장경아;나덕수;배명진
- Proceedings of the IEEK Conference / 2000.09a / pp.175-179 / 2000
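Fast speech can be coded with less information than slow speech. When speech is uttered quickly, low-frequency-band information becomes more important to the listener than high-frequency-band information. Speech coding technology is advancing toward lower transmission rates, lower complexity, and better speech quality. CELP-type vocoders now in commercial use offer good quality for their low transmission rate, but existing schemes do not process speech differently according to speaking rate. If the speaking rate is measured and, for fast speech, only lower-band information is transmitted than for slow speech, the transmission rate can be reduced. This paper proposes a method of measuring speaking rate in order to reduce the transmission rate of a CELP coder. The phoneme change rate is measured from the information carried by the LSP parameters. Computing the phoneme change rate for speech samples with different speaking rates showed clearly different change rates, and the rate for fast speech was 42.8% higher than for slow speech.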
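The abstract above does not say how the change rate is computed from the LSP parameters. As a rough illustration only, the sketch below assumes one plausible proxy: the mean Euclidean distance between the LSP vectors of consecutive frames. The function name `lsp_change_rate` and the sample values are hypothetical and are not taken from the paper.

```python
import math

def lsp_change_rate(lsp_frames):
    """Mean Euclidean distance between consecutive LSP vectors.

    Assumption: this frame-to-frame distance stands in for the
    'phoneme change rate'; the paper's actual measure is not given
    in the abstract.
    """
    if len(lsp_frames) < 2:
        return 0.0
    total = 0.0
    for prev, cur in zip(lsp_frames, lsp_frames[1:]):
        total += math.sqrt(sum((c - p) ** 2 for p, c in zip(prev, cur)))
    return total / (len(lsp_frames) - 1)

# Hypothetical 3-coefficient LSP tracks (real coders use more coefficients).
slow = [[0.10, 0.30, 0.50], [0.11, 0.30, 0.50], [0.12, 0.30, 0.50]]
fast = [[0.10, 0.30, 0.50], [0.14, 0.30, 0.50], [0.18, 0.30, 0.50]]

if __name__ == "__main__":
    print("slow:", lsp_change_rate(slow))
    print("fast:", lsp_change_rate(fast))
```

Under this assumption, a track whose LSP values move more between frames yields a larger value, which is the kind of separation between fast and slow speech that the abstract reports.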

A Study on a Design of the Variable Bit-Rate Vocoder by Measuring of the Speaking Rate (발성 속도에 따른 가변전송률 CELP 부호화기 설계에 관한 연구)

나덕수;배명진
- Proceedings of the IEEK Conference / 2001.06d / pp.273-276 / 2001
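The CELP coder is based on the principle of analysis-by-synthesis coding with linear prediction. It codes the spectrum of the speech signal through LPC analysis using a fixed window. However, depending on the speaker's speaking rate, the waveform of a speech signal may change quickly over time or, conversely, a similar waveform may persist for some time. Coding can therefore be made more efficient if the window size is matched to the speaking rate. This paper proposes a method that applies different transmission rates according to speaking rate. Speaking rate is measured using the degree of spectral change. When speech is fast, the frame size is reduced so that the analysis adapts to the rapidly changing signal, and fewer bits are used to represent the parameters. Conversely, when speech is slow, the frame size is increased and more bits are allocated to the parameters. The proposed method was tested with the G.723.1 5.3 kbps ACELP coder, and an average transmission-rate reduction of 16.34% was obtained without degradation in speech quality.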

Study on the Improvement of Speech Recognizer by Using Time Scale Modification (시간축 변환을 이용한 음성 인식기의 성능 향상에 관한 연구)

이기승
- The Journal of the Acoustical Society of Korea / v.23 no.6 / pp.462-472 / 2004
In this paper, a method is proposed for compensating for the performance degradation of automatic speech recognition (ASR) that is mainly caused by speaking rate variation. Before the new method is proposed, a quantitative analysis of the performance of an HMM-based ASR system according to speaking rate is first performed. From this analysis, significant performance degradation was often observed for rapidly spoken speech signals. A quantitative measure that is able to represent speaking rate is then introduced. Time scale modification (TSM) is employed to compensate for the speaking rate difference between input speech signals and training speech signals. Finally, a method for compensating for the performance degradation caused by speaking rate variation is proposed, in which TSM is selectively employed according to speaking rate. ASR experiments on 10-digit mobile phone numbers confirmed that the error rate was reduced by 15.5% when the proposed method was applied to speech signals with a high speaking rate.

Effect of politician's voice on electors -Focused on ward head election (정치인의 발성이 유권자에 미치는 영향 -구청장 선거를 중심으로)

Park, Dug-Chun
- Journal of Digital Convergence / v.11 no.10 / pp.695-700 / 2013
This experimental research explores the effect of a politician's voice on electors. Four groups of university students were exposed to different TV address video clips in which the tone and speed of the voice had been manipulated. Subjects exposed to the low-tone video clip of the politician's address showed a higher degree of affect and support. Those exposed to the slower video clip showed a higher degree of affect, but this was not connected to a higher degree of support.
https://doi.org/10.14400/JDPM.2013.11.10.695

A Study on the Breathing Training Method at the Pre-phonation Stage in Beginning Acting Class (기초연기 교육과정에서 발성 이전 단계의 호흡훈련 방법 연구)

Choi, Young-Hwan
- The Journal of the Korea Contents Association / v.15 no.5 / pp.78-87 / 2015
An actor should be well trained in natural breathing and phonation in order to achieve good diction. This thesis focuses on the breathing training method at the pre-phonation stage. Past research on breathing and phonation training was generally for vocal music or yoga, or was approached from a medical viewpoint, whereas few studies approach it from the standpoint of acting training, even though it is quite important for actors. In addition, such studies have given an overview of breathing, phonation and diction. Therefore, this thesis, which focuses on breathing training at the pre-phonation stage, suggests a natural breathing training method that uses imagination and imagery in the various situations that could exist in a drama.
https://doi.org/10.5392/JKCA.2015.15.05.078

A Study on Speaker Adaptation in Continuous Digits Speech Recognition (연속숫자 음성인식에서 화자 적응에 관한 연구)

최광표
- Proceedings of the Acoustical Society of Korea Conference / 1998.06e / pp.319.2-322 / 1998
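This paper implements a two-stage speaker adaptation algorithm for a continuous-digit speech recognition system using demisyllable-unit HMMs. Even when a large amount of training data is used, speaker-independent speech recognition systems encounter many problems owing to speakers' utterance habits, such as speaking rate and loudness. In speech recognition for unspecified speakers, the dynamic characteristics of the spectrum are mainly used to extract acoustic features that are effective in coping with variation due to individual differences. Accordingly, this paper applies frequency warped spectral matching, one of the speaker adaptation techniques, to a continuous-digit speech recognition system, and confirms that the misrecognition rate decreases when an appropriate per-speaker scaling factor is selected through recognition.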

Tube phonation in water for patients with hyperfunctional voice disorders: The effect of tube diameter and water immersion depth on bubble height and maximum phonation time (과기능적 음성장애 환자의 물저항발성: 튜브 직경과 물 깊이가 물거품 높이 및 최대발성지속시간에 미치는 영향)

Min Gyeong Kim;Seong Hee Choi;Jong-In Youn
- Phonetics and Speech Sciences / v.15 no.2 / pp.31-40 / 2023
Tube phonation in water has been widely used for voice training among semi-occluded vocal tract (SOVT) exercises, in which the patient produces bubbles while phonating with the tube kept submerged in water. This study aims to investigate the effect of tube diameter and water depth on bubble height and maximum phonation time (MPT) for patients with hyperfunctional voice disorders. Seventeen patients with hyperfunctional voice disorders were asked to bubble with sustained /u/ using tubes of different inner diameters (5, 7, and 10 mm) at different water depths (4, 7, and 10 cm). A water resistance phonation biofeedback system using a water height sensor was used for recording bubble height and MPT. The bubble height was significantly changed by the tube diameter, while MPT was significantly changed by both the tube diameter and the water depth. Although the wider tube presented significantly lower bubble height for a given depth, relatively consistent bubble height was maintained. The bubble height did not differ significantly with water depth for a given tube diameter. In addition, MPT significantly decreased with water depth, and a wider tube led to significantly shorter MPT. A water level-driven water resistance biofeedback system provided useful information on bubble characteristics and vocal fold vibration depending on tube diameter and water depth. It can be useful for monitoring breath support during water resistance phonation for patients with hyperfunctional voice disorders.
https://doi.org/10.13064/KSSS.2023.15.2.031

Surgery for Primary Pulmonary Liposarcoma (원발성폐지방육종(Primary Pulmonary Liposarcoma)에 관한 수술치험 1예)

김수완;김진국;김관민;최용수;안긍환;심영목
- Journal of Chest Surgery / v.37 no.11 / pp.942-945 / 2004
Primary pulmonary liposarcoma is an extremely rare disease. It has a poor prognosis, with early multiple metastases and frequent local recurrences. Surgery is the treatment of choice for liposarcoma. Incomplete resection would result in rapid and aggressive growth of the tumor. We report a case of primary pulmonary liposarcoma that was successfully treated with complete resection, without local recurrence or distant metastasis for 10 months.

Improvements on Speech Recognition for Fast Speech (고속 발화음에 대한 음성 인식 향상)

Lee Ki-Seung
- The Journal of the Acoustical Society of Korea / v.25 no.2 / pp.88-95 / 2006
In this paper, a method for improving the performance of an automatic speech recognition (ASR) system for conversational speech is proposed, which mainly focuses on increasing the robustness against rapidly spoken utterances. The proposed method does not require an additional speech recognition task to represent speaking rate quantitatively. The energy distribution in specific bands is employed to detect the vowel regions, and the number of vowels per second is then computed as the speaking rate. To improve the performance for fast speech, previous methods expand the sequence of feature vectors by a given scaling factor, which is computed as the ratio between the standard phoneme duration and the measured one. In the method proposed herein, however, utterances are classified by their speaking rates, and the scaling factor is determined individually for each class. In this procedure, a maximum likelihood criterion is employed. ASR experiments on 10-digit mobile phone numbers confirmed that the overall error rate was reduced by 17.8% when the proposed method was employed.
https://doi.org/10.7776/ASK.2006.25.2.088
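The pipeline described in the preceding abstract (speaking rate as vowels per second, a rate class, a per-class scaling factor, then expansion of the feature sequence) can be sketched roughly as follows. This is a minimal illustration under stated assumptions: the class thresholds, the scaling-factor table, and the nearest-index resampling are all hypothetical placeholders. The paper selects its per-class factors with a maximum likelihood criterion, which is not reproduced here.

```python
def speaking_rate(vowel_count, duration_s):
    """Speaking rate as the number of detected vowels per second."""
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    return vowel_count / duration_s

def classify_rate(rate, thresholds=(4.0, 6.0)):
    """Map a rate to a class; the thresholds are hypothetical."""
    low, high = thresholds
    if rate < low:
        return "slow"
    if rate < high:
        return "normal"
    return "fast"

# Hypothetical per-class scaling factors (placeholders, not from the paper).
SCALE_BY_CLASS = {"slow": 1.0, "normal": 1.0, "fast": 1.5}

def stretch(frames, factor):
    """Expand a feature sequence by nearest-index resampling."""
    if not frames:
        return []
    n_out = max(1, round(len(frames) * factor))
    last = len(frames) - 1
    return [frames[min(last, int(i / factor))] for i in range(n_out)]

def compensate(frames, vowel_count, duration_s):
    """Pick the class-specific factor and expand the sequence."""
    rate_class = classify_rate(speaking_rate(vowel_count, duration_s))
    return stretch(frames, SCALE_BY_CLASS[rate_class])
```

For example, `compensate(features, vowel_count=13, duration_s=2.0)` would treat the utterance as fast under these placeholder thresholds and lengthen `features` by the placeholder factor for that class.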

Labor and Safety Case Law (노무안전판례)

Korea Industrial Health Association
- The Safety technology / no.75 / pp.30-31 / 2004
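Where a worker infected with hepatitis B was engaged in excessive work, was diagnosed with a primary liver tumor, and died, the death constitutes an occupational accident if the continued work caused sustained physical overwork and mental stress, so that the condition worsened faster than its natural course and led to death, even though the hepatitis B infection itself was unrelated to the work.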

