Search | Korea Science

A Study on Voice quality conversion for Korean vowels using spectrum envelope correction method (스펙트럼포락 수정법에 의한 한국어모음의 성질변환에 관한 연구)

이기영
- Proceedings of the Acoustical Society of Korea Conference / 1994.06c / pp.314-317 / 1994
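Noting that the individuality of a voice can be converted by changing the spectral envelope, this study investigates voice quality conversion by a spectral envelope correction method. In the experiments, the method was applied to Korean vowels uttered by a male speaker and a female speaker, and the conversion performance was confirmed by comparing spectrograms and listening test results.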

Experiments on Extraction of Non-Parametric Warping Functions for Speaker Normalization (화자 정규화를 위한 비정형 워핑함수 도출에 관한 실험)

Shin, Ok-Keun
- The Journal of the Acoustical Society of Korea / v.24 no.5 / pp.255-261 / 2005
In this paper, experiments are conducted to extract a set of non-parametric warping functions in order to examine the characteristics of warping among speakers' utterances. For this purpose, we made use of MFCC and LP spectra of vowels in choosing a reference spectrum for each vowel as well as representative spectra for each speaker. These spectra are compared by DTW to give the warping functions of each speaker. The set of warping functions is then defined by clustering the warping functions of all the speakers. Noting that the male and female warping functions have shapes similar to a piecewise linear function and a power function, respectively, a new hybrid set of warping functions is defined. The effectiveness of the extracted warping functions is evaluated by conducting phone-level recognition experiments, and improvements in accuracy are observed with both sets of warping functions.
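As a rough illustration of how DTW yields a warping function between two spectra, the sketch below aligns two short 1-D sequences and returns the alignment path. It is a minimal sketch under stated assumptions, not the paper's procedure: the function name `dtw_path`, the absolute-difference local cost, and the three-way step pattern are all illustrative choices that the abstract does not specify.

```python
def dtw_path(ref, test):
    """Align two 1-D sequences with basic DTW.

    Returns (total_cost, path), where path is a list of (ref_index,
    test_index) pairs. The local cost (absolute difference) and the
    step pattern are assumptions made for this illustration.
    """
    n, m = len(ref), len(test)
    inf = float("inf")
    # cost[i][j] = best cumulative cost aligning ref[:i] with test[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(ref[i - 1] - test[j - 1])
            cost[i][j] = d + min(cost[i - 1][j - 1],  # diagonal step
                                 cost[i - 1][j],      # advance ref only
                                 cost[i][j - 1])      # advance test only
    # Backtrack from the end to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [(cost[i - 1][j - 1], i - 1, j - 1),
                      (cost[i - 1][j], i - 1, j),
                      (cost[i][j - 1], i, j - 1)]
        _, i, j = min(candidates)
    path.reverse()
    return cost[n][m], path
```

Read as a mapping from reference index to test index, the returned path plays the role of a non-parametric warping function; clustering such paths across speakers is a separate step not shown here.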

Speaker Adaptation Algorithm Based on a Maximization of the Observation Probability (관찰 확률 최대화에 의한 화자 적응 알고리즘)

양태영;신원호;전원석;김지성;김지성;김원구;이충용;윤대희;차일환
- The Journal of the Acoustical Society of Korea / v.17 no.6 / pp.37-42 / 1998
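This paper proposes a speaker adaptation algorithm, applied to an SCHMM, that is based on maximizing the observation probability. To prevent the loss of recognition performance that occurs when the observation probability densities of the SCHMM do not represent a new speaker's speech features well, the proposed algorithm iteratively adapts the mean vector μ and covariance matrix Σ that determine the observation probability density, using a gradient search algorithm, so that each feature vector of the adaptation data attains the maximum observation probability. After the observation probability density adaptation, the state transition probability A and mixture density coefficient C of the SCHMM are adapted by repeatedly taking a weighted average of the probabilities obtained from the adaptation data and the existing probabilities. In isolated word recognition experiments using the proposed speaker adaptation algorithm, the recognition rate improved, compared with no speaker adaptation, by an average of 9.8% for the speaker-independent system, 46.0% for the male speaker-dependent system, and 52.7% for the female speaker-dependent system.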
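The two adaptation steps described above can be sketched in a much simplified form. The example below is an illustration under stated assumptions rather than the authors' algorithm: it adapts only the mean of a single 1-D Gaussian by gradient ascent on the average log-likelihood, and it interpolates probabilities with a fixed weight. The function names, learning rate, step count, and weight are all hypothetical.

```python
def adapt_mean(mu, var, data, lr=0.1, steps=200):
    """Gradient ascent on the average Gaussian log-likelihood w.r.t. the mean.

    For a 1-D Gaussian, d/dmu log N(x; mu, var) = (x - mu) / var.
    The variance is held fixed here (a simplification of this sketch).
    """
    for _ in range(steps):
        grad = sum((x - mu) / var for x in data) / len(data)
        mu += lr * grad
    return mu


def interpolate(old_probs, new_probs, weight=0.5):
    """Weighted average of existing and adaptation-data probabilities.

    `weight` is the share given to the adaptation-data estimate
    (a hypothetical parameter; the abstract does not state the weights).
    """
    return [(1.0 - weight) * o + weight * n
            for o, n in zip(old_probs, new_probs)]
```

With enough steps, `adapt_mean` moves the mean toward the sample mean of the adaptation data, which is the maximizer of the likelihood in this simplified setting. Because `interpolate` forms a convex combination, its output still sums to one when both inputs do.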

Speaker-dependent Speech Recognition Algorithm for Male and Female Classification (남녀성별 분류를 위한 화자종속 음성인식 알고리즘)

Choi, Jae-Seung
- Journal of the Korea Institute of Information and Communication Engineering / v.17 no.4 / pp.775-780 / 2013
This paper proposes a speaker-dependent speech recognition algorithm that uses a neural network to classify the gender of male and female speakers in white noise and car noise. The neural network is trained on LPC (Linear Predictive Coding) cepstrum coefficients to recognize the speaker's gender. In the experimental results, the maximal improvement of total speech recognition rate is 96% for white noise and 88% for car noise, respectively, after training a total of six neural networks. Finally, the proposed speech recognition algorithm is compared with the results of a conventional speech recognition algorithm in a noisy background environment.
https://doi.org/10.6109/jkiice.2013.17.4.775

The Recognition of Korean Continuous Speech using Syntactic Analysis and Level Building (구문 분석과 Level Building을 이용한 한국어 연속음 인식)

안태옥;변용규;김순협
- The Journal of the Acoustical Society of Korea / v.5 no.4 / pp.27-36 / 1986
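This paper proposes a recognition system that uses syntactic analysis and Level Building modified with the OGS technique for efficient recognition of Korean continuous speech from a specific speaker. The templates used in the system are isolated sounds segmented from continuous speech, and recognition is performed by finding the super reference with the minimum distance value under local path constraints and the global path constraint proposed in this paper. The continuous speech used in this study consisted of 13 subway station names built from 11 isolated sounds, each uttered 10 times by two male speakers and one female speaker. In a test on 130 words, the system achieved a word recognition rate of 97.7%.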

A study on speech recognition using pitch detection in a car-noisy environment (자동차 환경에서 피치검출을 이용한 음성인식 연구)

Lee Jeong-gi;Yoo Bong-keun;Kim Hak-jin;Kim Soon-kyob
- Proceedings of the Acoustical Society of Korea Conference / autumn / pp.97-100 / 1999
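To secure both convenience and safety in automobiles, this paper enables speech input and output at all times without operating an auxiliary switch, and uses a pitch detection method to distinguish male from female speakers, with results broken down by driving speed. In addition, a band-pass filter is used to perform end point detection automatically and accurately under noise. The reference patterns use the DMS (Dynamic Multi-Section) [1] model, and the speech feature parameters and recognition algorithm are 13th-order PLP and One Stage Dynamic Programming (OSDP), respectively. In experiments with 30 frequently used vehicle control command words in a car driving in the city, recognition rates at 40-80 km were 96% for male and 94.4% for female speakers in the speaker-independent case, and 97% for male and 95% for female speakers in the speaker-dependent case. Distinguishing male from female speakers improved the recognition rate.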
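The abstract does not say which pitch detection method was used. As one common possibility, the sketch below estimates the fundamental frequency from the peak of the unnormalized autocorrelation and applies a single threshold to separate the two speaker groups. The function names, the 60-400 Hz search range, and the 165 Hz threshold are illustrative assumptions, not values taken from the paper.

```python
import math


def sine(freq_hz, sample_rate, n_samples):
    """Generate a pure tone, used here only as a stand-in for voiced speech."""
    return [math.sin(2.0 * math.pi * freq_hz * i / sample_rate)
            for i in range(n_samples)]


def estimate_pitch(signal, sample_rate, fmin=60.0, fmax=400.0):
    """Estimate pitch (Hz) as sample_rate / lag of the autocorrelation peak.

    Only lags corresponding to fmin..fmax are searched (assumed range).
    """
    lag_min = int(sample_rate / fmax)
    lag_max = min(int(sample_rate / fmin), len(signal) - 1)
    best_lag, best_val = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        val = sum(signal[i] * signal[i + lag]
                  for i in range(len(signal) - lag))
        if val > best_val:
            best_lag, best_val = lag, val
    return sample_rate / best_lag


def classify_gender(pitch_hz, threshold_hz=165.0):
    """Label a speaker by pitch; the threshold is a hypothetical value."""
    return "male" if pitch_hz < threshold_hz else "female"
```

In a noisy car, a practical system would likely need voicing detection and smoothing over several frames before thresholding; this sketch omits both.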

General News - 1 (종합 - 1)

(사)한국여성발명협회
- The Inventors News / no.35 / pp.5-6 / 2005
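First patent-backed commercialization loan launched - Patent filed for a scheme to protect Korean rice - Doosan Food BG introduces a new Jonggajip `Jip Kimchi` - For science and engineering graduates entering public service, the patent office offers a path - Korea re-elected to chair an international organization on intellectual property rights - Ministry of Commerce, Industry and Energy to set up an `IP China Desk` in China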

Speaker-Independent Isolated Word Recognition Using A Modified ISODATA Method (Modified ISODATA 집단화방법을 이용한 불특정화자 단독어 인식)

황우근
- Proceedings of the Acoustical Society of Korea Conference / 1987.11a / pp.66-69 / 1987
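This paper studies speaker-independent recognition of Korean isolated words and proposes a new clustering method, the Modified ISODATA clustering method. The aim of the algorithm is to simplify the outlier handling and splitting steps of the conventional ISODATA algorithm and to remove the lumping step, so that cluster centers are found accurately and automatically. Applied to 11 ltnt자음 uttered by 10 male speakers and 4 female speakers, the algorithm gave a better recognition rate than the recently published Modified K-means method, demonstrating that it finds more accurate cluster centers.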

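Reform Proposals for the Architecture Profession as Seen by Women Members - Let Us Be Humble and Sincere Architects (여성회원이 보는 건축계 개혁방안 - 겸손하고 성실한 건축사가 되자)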

Kim, Hwa-Ja
- Korean Architects / no.1 s.297 / pp.60-62 / 1994

Korean Word Recognition Using Vector Quantization Speaker Adaptation (벡터 양자화 화자적응기법을 사용한 한국어 단어 인식)

Choi, Kap-Seok
- The Journal of the Acoustical Society of Korea / v.10 no.4 / pp.27-37 / 1991
This paper proposes ESFVQ (energy subspace fuzzy vector quantization), which employs energy subspaces to reduce the quantizing distortion below that of fuzzy vector quantization. The ESFVQ is applied to a speaker adaptation method by which Korean words spoken by unknown speakers are recognized. By generating mapped codebooks with fuzzy histograms for each energy subspace in the training procedure and by decoding a spoken word through the ESFVQ in the recognition procedure, we attempt to improve the recognition rate. The performance of the ESFVQ is evaluated by measuring the quantizing distortion and the speaker-adaptive recognition rate for DDD telephone area names uttered by 2 males and 1 female. The quantizing distortion of the ESFVQ is 22% lower than that of vector quantization and 5% lower than that of fuzzy vector quantization, and the speaker-adaptive recognition rate of the ESFVQ is 26% higher than that without speaker adaptation and 11% higher than that of vector quantization.
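To make the "fuzzy" part of fuzzy vector quantization concrete, the sketch below computes soft membership weights of an input vector over a small codebook, using the membership formula familiar from fuzzy c-means. It does not implement ESFVQ itself (the energy-subspace partitioning and mapped codebooks are not shown), and the function name and the fuzziness exponent `m = 2.0` are assumptions made for illustration.

```python
def fuzzy_memberships(x, codebook, m=2.0):
    """Soft membership of vector x in each codeword.

    u_k = 1 / sum_j (d2_k / d2_j) ** (1 / (m - 1)), where d2 is the
    squared Euclidean distance and m > 1 controls fuzziness (assumed).
    """
    d2 = [sum((xi - ci) ** 2 for xi, ci in zip(x, c)) for c in codebook]
    # If x coincides with one or more codewords, share all weight among them.
    zeros = [k for k, d in enumerate(d2) if d == 0.0]
    if zeros:
        return [1.0 / len(zeros) if k in zeros else 0.0
                for k in range(len(codebook))]
    power = 1.0 / (m - 1.0)
    return [1.0 / sum((dk / dj) ** power for dj in d2) for dk in d2]
```

Unlike hard vector quantization, which assigns all weight to the nearest codeword, these memberships spread weight across codewords and always sum to one, which is what allows histograms accumulated from them to be "fuzzy".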

Search Result 63, Processing Time 0.026 seconds
