통합 검색 | Korea Science

이기용;임재열
- 한국음향학회지
- /
- 제20권6호
- /
- pp.26-33
- /
- 2001
기존의 AR HMM(auroreg ressive hidden morkov model)에 의한 화자인식 방법은 그 성능이 우수하나, 잡음에 대한 것이 고려되지 않아 실제 환경에 적용시 성능저하가 문제가 된다. 본 논문에서는 실제 환경에 맞추기 위하여 관측 신호 모델에서 잡음을 고려하고, 화자인식 성능을 개선하고자 지속시간항 (duration-term)을 포함하는 AR HMM을 이용하여 잡음환경에서의 강인한 화자인식 시스템을 제안한다. 100명의 화자 (남자 77명, 여자 23명)가 2주에 걸쳐 6번 발성한 숫자음 데이터베이스을 가지고, 백색잡음 및 자동차 잡음하에서 실험한 결과, 제안된 방법으로 성능이 향상됨을 확인하였다.
PDF

이종민;황요하;김승종;송창섭
- 한국소음진동공학회논문집
- /
- 제13권1호
- /
- pp.48-55
- /
- 2003
Hidden Markov Model(HMM) has a doubly embedded stochastic process with an underlying stochastic process that can be observed through another set of stochastic processes. This structure of HMM is useful for modeling vector sequence that doesn't look like a stochastic process but has a hidden stochastic process. So, HMM approach has become popular in various areas in last decade. The increasing popularity of HMM is based on two facts : rich mathematical structure and proven accuracy on critical application. In this paper, we applied continuous HMM (CHMM) approach with AR coefficient to detect and predict the chatter of lathe bite and to diagnose the wear of oil Journal bearing using rotor shaft displacement. Our examples show that CHMM approach is very efficient method for machine health monitoring and prediction.
https://doi.org/10.5050/KSNVN.2003.13.1.048 인용 PDF KSCI

이기용;서창우;이주헌
- 한국음향학회지
- /
- 제21권1호
- /
- pp.11-18
- /
- 2002
본 논문에서는 부가 잡음에 오염된 음성신호에 이득 적응을 가지는 음성인식을 시간 영역에서 다루었다. 잡음은 유색잡음이라고 가정한다. 전화망에서 마찰음 (fricative), 운음 (glides), 유음 (liquds), 그리고 천이영역(transition region)과 같은 음성 신호의 뚜렷한 비정상상태를 극복하기 위해서 NAR-HMM (nonstationary autoregressive HMM)7을 제안하였다. 비정상상태 AR 처리는 M개의 알고 있는 기저 함수 (basis function)의 선형 결합으로 이루어진 다항 함수 (polynomial function)로 나타낼 수 있다. 오염된 신호만을 이용할 수 있을 때, 잡음의 추정 (estimation)문제는 필연적으로 발생한다. 다중 Kalman 필터를 사용함으로써, 잡음모델의 추정과 음성의 이득곡선 (gain contour)을 수행하였다. 제안한방법의 잡음 추정은 오염된 신호로부터 효과적으로 잡음을 제거하여 깨끗한 음성신호를 얻을 수 있었다. 또한 잡음 추정을 하는 일반적인 ARHMM보다 제안한 NAR-HMM이 약 2-3%의 인식성능을 향상시켰다.
PDF KSCI

Yoo, Soyoung;Kim, Gyeongryeong;Kim, Minji;Kim, Yeonjin;Park, Soeun;Kim, Dongho
- 한국정보기술학회 영문논문지
- /
- 제10권2호
- /
- pp.33-48
- /
- 2020
By Digital Transformation, new technologies like ML (Machine Learning), Big Data, Cloud, VR/AR are being used to video streaming technology. We choose ML to provide optimal QoE (Quality of Experience) in various network conditions. In other words, ML helps DASH in providing non-stopping video streaming. In DASH, the source video is segmented into short duration chunks of 2-10 seconds, each of which is encoded at several different bitrate levels and resolutions. We built and compared the performances of five prototypes after applying five different machine learning algorithms to DASH. The prototype consists of a dash.js, a video processing server, web servers, data sets, and five machine learning models.
https://doi.org/10.14801/JAITC.2020.10.2.33 인용