Search | Korea Science

Self-Adaptation Algorithm Based on Maximum A Posteriori Eigenvoice for Korean Connected Digit Recognition (한국어 연결 숫자음 인식을 위한 최대 사후 Eigenvoice에 근거한 자기적응 기법)

Kim Dong Kook;Jeon Hyung Bae
- The Journal of the Acoustical Society of Korea, v.23 no.8, pp.590-596, 2004
This paper presents a new self-adaptation algorithm based on maximum a posteriori (MAP) eigenvoice for Korean connected digit recognition. The proposed MAP eigenvoice is developed by introducing a probability density model for the eigenvoice coefficients. The proposed approach provides a unified framework that incorporates the prior model into conventional eigenvoice estimation. Because a self-adaptation system adapts using only the single utterance that is about to be recognized, we use MAP eigenvoice, which provides the most robust adaptation. In a series of self-adaptation experiments on the Korean connected digit recognition task, we demonstrate that the proposed approach outperforms the conventional eigenvoice algorithm for a small amount of adaptation data.

Performance Evaluation of Variable-Vocabulary Isolated Word Speech Recognizers with Maximum a Posteriori (MAP) Estimation-Based Speaker Adaptation in an Office Environment (최대 사후 추정 화자 적응을 이용한 가변어휘 고립단어 음성인식기의 사무실 환경에서의 성능 평가)

권오욱
- The Journal of the Acoustical Society of Korea, v.17 no.2, pp.84-89, 1998
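In this paper, we evaluate the performance, in a real usage environment, of a variable-vocabulary isolated-word speech recognizer trained on a phonetically optimized (phonetically-optimized word) speech database so that it can recognize arbitrary words. To this end, phonetically balanced (phonetically-balanced word) isolated-word speech collected in an environment different from that of the training database was used as test data. The test data were recorded with the built-in microphone of a notebook PC operating in an ordinary office. The recognition rate of the isolated-word recognizer was measured using this recorded speech. The recognizer adapted to speaker variation using the maximum a posteriori (MAP) estimation algorithm. According to computer simulation results, the recognition rate of the baseline system without speaker adaptation dropped from 81.3% for clean speech to 69.8% for office-environment speech. For office-environment speech, applying the MAP speaker adaptation algorithm in unsupervised incremental mode reduced errors by 9% compared with no speaker adaptation, and applying it in supervised batch mode with 50 adaptation words reduced errors by 16%.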

A Statistical Model-Based Voice Activity Detection Employing the Conditional MAP Criterion with Spectral Deviation (조건 사후 최대 확률과 음성 스펙트럼 변이 조건을 이용한 통계적 모델 기반의 음성 검출기)

Kim, Sang-Kyun;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea, v.30 no.6, pp.324-329, 2011
In this paper, we propose a novel approach to improve the performance of statistical model-based voice activity detection (VAD), based on the conditional maximum a posteriori (CMAP) criterion with spectral deviation. In our approach, the VAD decision rule is expressed as the geometric mean of likelihood ratios (LRs) based on an adapted threshold according to the speech presence probability conditioned on both the speech activity decisions and the spectral deviation in the previous frame. Experimental results show that the proposed approach yields better results than the CMAP-based VAD using the LR test.
https://doi.org/10.7776/ASK.2011.30.6.324

Statistical Model-Based Voice Activity Detection Using the Second-Order Conditional Maximum a Posteriori Criterion with Adapted Threshold (적응형 문턱값을 가지는 2차 조건 사후 최대 확률을 이용한 통계적 모델 기반의 음성 검출기)

Kim, Sang-Kyun;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea, v.29 no.1, pp.76-81, 2010
In this paper, we propose a novel approach to improve the performance of statistical model-based voice activity detection (VAD), based on the second-order conditional maximum a posteriori (CMAP) criterion. In our approach, the VAD decision rule is expressed as the geometric mean of likelihood ratios (LRs) based on an adapted threshold according to the speech presence probability conditioned on both the current observation and the speech activity decisions in the previous two frames. Experimental results show that the proposed approach yields better results than the statistical model-based and the CMAP-based VAD using the LR test.
https://doi.org/10.7776/ASK.2010.29.1.076

Principles of Image Formation and Reconstruction in Emission Computed Tomography (방출전산화단층촬영기의 영상 형성과 재구성 원리)

Lee, Soo-Jin
- Journal of the Korean Society for Precision Engineering, v.25 no.1, pp.51-62, 2008

A study on classification accuracy improvements using orthogonal summation of posterior probabilities (사후확률 결합에 의한 분류정확도 향상에 관한 연구)

정재준
- Spatial Information Research, v.12 no.1, pp.111-125, 2004
Improving classification accuracy is a main issue in satellite image classification. Because multiple images of the same area are often available, research is needed on improving classification accuracy using multiple data sets. In this study, the orthogonal summation method of Dempster-Shafer theory (theory of evidence) is proposed as a multiple-imagery classification method, with posterior probabilities and classification uncertainty used in the calculation process. The proposed method achieves higher accuracy than conventional classification methods, namely maximum likelihood classification (MLC) of each data set and MLC of the merged data sets, as confirmed by statistical tests of mean difference.

Comparison of semiparametric methods to estimate VaR and ES (조건부 Value-at-Risk와 Expected Shortfall 추정을 위한 준모수적 방법들의 비교 연구)

Kim, Minjo;Lee, Sangyeol
- The Korean Journal of Applied Statistics, v.29 no.1, pp.171-180, 2016
The Basel Committee suggests using Value-at-Risk (VaR) and expected shortfall (ES) as measures of market risk. Various estimation methods for VaR and ES have been studied in the literature. This paper compares semiparametric methods, such as the conditional autoregressive value at risk (CAViaR) and conditional autoregressive expectile (CARE) methods, with a Gaussian quasi-maximum likelihood estimator (QMLE)-based method through backtesting. We use unconditional coverage (UC) and conditional coverage (CC) tests for VaR, and a bootstrap test for ES, to check adequacy. A real data analysis is conducted on the S&P 500 index and Hyundai Motor Co. stock price index data sets.
https://doi.org/10.5351/KJAS.2016.29.1.171

Postmortem Changes in Muscle of Sea Water Acclimated Tilapia, Oreochromis niloticus (해수순치한 틸라피아 근육의 사후변화)

Yoon Ho-Dong;KIM Tae-Jin;KIM Seong-Jun;LEE Jong-Ho
- Korean Journal of Fisheries and Aquatic Sciences, v.29 no.3, pp.279-286, 1996
Tilapia (Oreochromis niloticus) cultivated in fresh water were acclimated to sea water to improve the palatability of the fish meat. The physicochemical properties of the meat during rigor mortis were investigated during storage at $0^{\circ}\mathrm{C}$, $10^{\circ}\mathrm{C}$, and $20^{\circ}\mathrm{C}$. Rigor mortis set in faster in acclimated meat than in fresh-water-cultivated meat. Both meats showed faster rigor mortis when stored at $0^{\circ}\mathrm{C}$ than at $10^{\circ}\mathrm{C}$ and $20^{\circ}\mathrm{C}$. No significant difference was observed between the breaking strength and the rigor index. The breaking strength reached its maximum over 12 hrs after death and then gradually declined, while the rigor index increased slowly and reached its maximum over 18 hrs postmortem. Low temperature and acclimation to sea water affected the degradation of adenosine triphosphate (ATP) and the accumulation of inosine monophosphate (IMP) or lactate. These results suggest that the palatability of tilapia muscle cultivated in fresh water could be improved by acclimation to sea water, which induces prerigor at the early postmortem stage and physical changes in the fish muscle.

The Comparison of Speaker Adaptation Methods (화자 적응 방법들의 비교)

황영수
- The Journal of the Acoustical Society of Korea, v.18 no.1, pp.61-66, 1999
In this paper, we propose various speaker adaptation methods and study their performance. The methods studied are MAPE (Maximum A Posteriori Probability Estimation), linear spectral estimation, multi-layer perceptron, and ARTMAP. To evaluate these methods, we used Korean isolated digits as the experimental data. The hybrid speaker adaptation method, which unifies MAPE, linear spectral estimation, and the output probability of SCHMM, showed better recognition results than the other methods. The method using ARTMAP showed results similar to those of the hybrid method.

Speech Recognition Using the Energy and VQ (에너지와 VQ를 이용한 음성 인식)

Hwang, Young-Soo
- The Journal of The Korea Institute of Intelligent Transport Systems, v.6 no.3, pp.87-94, 2007
In this paper, the performance of speech recognition and speaker adaptation methods is studied. Speech recognition using energy state and VQ (Vector Quantization) is suggested, and speaker adaptation methods (maximum a posteriori probability estimation, linear spectrum estimation) are considered. The experimental results show that the recognition rate using energy state is 2-3% better than that of general VQ.

