Integrated Search | Korea Science

Noise Robust Speech Recognition Based on Noisy Speech Acoustic Model Adaptation

정용주
- Phonetics and Speech Sciences / Vol. 6, No. 2 / pp. 29-34 / 2014
In Vector Taylor Series (VTS)-based noisy speech recognition methods, Hidden Markov Models (HMMs) are usually trained with clean speech. However, better performance is expected when the HMMs are trained with noisy speech. In a previous study, we found that Minimum Mean Square Error (MMSE) estimation of the training noisy speech in the log-spectrum domain produces improved recognition results, but because that algorithm operates in the log-spectrum domain, it could not be used for HMM adaptation. In this paper, we modify the previous algorithm to derive a novel mathematical relation between test and training noisy speech in the cepstrum domain, and we adapt the mean and covariance of the noisy speech HMM trained with Multi-condition TRaining (MTR). In noisy speech recognition experiments on the Aurora 2 database, the proposed method achieved a 10.6% relative improvement in Word Error Rate (WER) over the MTR method, whereas the previous MMSE estimation of the training noisy speech achieved a 4.3% relative improvement, which shows the superiority of the proposed method.
https://doi.org/10.13064/KSSS.2014.6.2.029
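
As background for the VTS-based methods mentioned in the abstract above, here is a minimal sketch of the commonly used log-spectral mismatch function relating clean speech `x`, additive noise `n`, and noisy speech `y` (channel effects ignored), together with its first-order Taylor expansion. It is not the cepstrum-domain relation derived in the paper, and the function names are illustrative assumptions.

```python
import math


def noisy_log_spectrum(x, n):
    """Log-spectral mismatch function: y = x + log(1 + exp(n - x))."""
    return x + math.log1p(math.exp(n - x))


def vts_first_order(x0, n0, x, n):
    """First-order Taylor approximation of y around the point (x0, n0)."""
    y0 = noisy_log_spectrum(x0, n0)
    g = 1.0 / (1.0 + math.exp(n0 - x0))  # dy/dx at the expansion point
    # dy/dn at the expansion point is (1 - g)
    return y0 + g * (x - x0) + (1.0 - g) * (n - n0)
```

Near the expansion point the linear approximation tracks the exact function closely, which is what makes it usable for adapting Gaussian means and covariances.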

Robust Video-Based Barcode Recognition via Online Sequential Filtering

Kim, Minyoung
- International Journal of Fuzzy Logic and Intelligent Systems / Vol. 14, No. 1 / pp. 8-16 / 2014
We consider the visual barcode recognition problem in a noisy video data setup. Unlike most existing single-frame recognizers, which require considerable user effort to acquire clean, motionless, and blur-free barcode signals, we eliminate this extra human effort by proposing a robust video-based barcode recognition algorithm. We treat a sequence of noisy, blurred barcode image frames as an online filtering problem. In the proposed dynamic recognition model, at each frame we infer the blur level of the frame as well as the digit class label. In contrast to a frame-by-frame approach with a heuristic majority voting scheme, our model propagates the class labels and frame-wise noise levels along the frame sequence, and hence exploits all cues from noisy frames that are potentially useful for predicting the barcode label in a probabilistically reasonable sense. We also suggest a visual barcode tracking approach that efficiently localizes barcode areas in video frames. The effectiveness of the proposed approaches is demonstrated empirically on both synthetic and real data.
https://doi.org/10.5391/IJFIS.2014.14.1.8
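
The idea of propagating a belief across frames, rather than voting frame by frame, can be sketched as one predict-update step of a generic discrete Bayes filter. This is a simplified illustration only: the paper's model jointly tracks digit labels and blur levels, whereas the state space, transition matrix, and likelihood values here are hypothetical.

```python
def filter_step(belief, transition, likelihood):
    """One predict-update step of a discrete Bayes filter.

    belief[i]        : posterior probability of state i after the previous frame
    transition[i][j] : probability of moving from state i to state j
    likelihood[j]    : probability of the current frame's observation in state j
    """
    n = len(belief)
    # Predict: push the previous belief through the transition model.
    predicted = [sum(belief[i] * transition[i][j] for i in range(n))
                 for j in range(n)]
    # Update: weight by the observation likelihood and renormalize.
    unnormalized = [predicted[j] * likelihood[j] for j in range(n)]
    total = sum(unnormalized)
    return [u / total for u in unnormalized]
```

With an identity transition matrix (a label that never changes), repeated calls simply accumulate evidence from every frame, so weak but consistent cues still sharpen the posterior.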

Development of a Phoneme and Tone Labeling Program

이윤경;곽철;권오욱
- Proceedings of the KIEE Conference / KIEE 2007 Symposium Proceedings, Information and Control Section / pp. 435-436 / 2007
Although previous speech analysis programs usually provide speech analysis and phoneme labeling functionality, they require much time for manual labeling and support only the English alphabet. To solve these problems, we develop a new Windows-based program with an improved phoneme and tone labeling method in addition to the conventional speech analysis functionality. A distinguishing feature of the developed program is semi-automatic phoneme and tone labeling based on hidden Markov models.

Cardiac Disorder Classification Using Heart Sounds Acquired by a Wireless Electronic Stethoscope

곽철;이윤경;권오욱
- Proceedings of the KIEE Conference / KIEE 2007 Symposium Proceedings, Information and Control Section / pp. 101-102 / 2007
Heart diseases are critical and should be detected as early as possible. A stethoscope is a simple device for detecting cardiac disorders, but it requires considerable experience in interpreting heart sounds. We evaluate a cardiac disorder classifier using heart sounds recorded by a digital wireless stethoscope developed in this work. The classifier uses hidden Markov models with circular state transitions to model the heart sounds. We train the classifier using two kinds of data: one recorded with our stethoscope and the other sampled from a clean heart sound database. In classification experiments using 165 sound clips, the classifier achieves a classification accuracy of 82% across 6 cardiac disorder categories.
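
A circular state transition structure, as named in the abstract above, can be sketched as a left-to-right HMM whose last state wraps around to the first, which suits a periodic signal such as the cardiac cycle. The state count and self-loop probability below are hypothetical; the abstract does not state the values used.

```python
def circular_transitions(n_states, stay_prob):
    """Transition matrix where each state either stays or advances,
    and the last state advances back to state 0."""
    a = [[0.0] * n_states for _ in range(n_states)]
    for i in range(n_states):
        a[i][i] = stay_prob
        a[i][(i + 1) % n_states] += 1.0 - stay_prob  # wrap-around transition
    return a


# Hypothetical example: 4 states, e.g. one per segment of a heartbeat cycle.
A = circular_transitions(4, 0.8)
```

Because the last row points back to the first state, a single model can absorb a recording containing any number of consecutive heartbeats.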

A Single Mobile Target Tracking in Voronoi-based Clustered Wireless Sensor Network

Chen, Jiehui;Salim, Mariam B.;Matsumoto, Mitsuji
- Journal of Information Processing Systems / Vol. 7, No. 1 / pp. 17-28 / 2011
Although both the deployment of sensor networks and target tracking could be managed by taking full advantage of Voronoi diagrams, very few efforts have been made in this regard. In this paper, we design an optimized barrier coverage scheme and an energy-efficient clustering algorithm for forming Voronoi-based Wireless Sensor Networks (WSNs), for which we propose a mobile target tracking scheme (CTT&MAV) that takes full advantage of Voronoi-diagram boundaries to improve detectability. Simulations verify that CTT&MAV outperforms random walk, random waypoint, random direction, and Gauss-Markov in terms of both the average hop distance that the mobile target moves before being detected and the sensor death rate. Moreover, we demonstrate that our results are robust under realistic sensing models and validate our observations through extensive simulations.
https://doi.org/10.3745/JIPS.2011.7.1.017
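
For readers unfamiliar with Voronoi diagrams, the defining property is that every point belongs to the cell of its nearest site. The sketch below illustrates only that property with hypothetical cluster-head coordinates; it is not the CTT&MAV scheme or the clustering algorithm from the paper.

```python
import math


def voronoi_cell(point, sites):
    """Index of the site whose Voronoi cell contains `point`,
    i.e. the nearest site by Euclidean distance."""
    return min(range(len(sites)), key=lambda i: math.dist(point, sites[i]))


# Hypothetical cluster-head positions in a 10 x 10 field.
heads = [(2.0, 2.0), (8.0, 2.0), (5.0, 8.0)]
owner = voronoi_cell((7.0, 3.0), heads)
```

A target crossing a cell boundary is equidistant from two sites, which is where responsibility for tracking it would change hands.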

A Study on the Reliability Evaluation and Design of TMR Systems According to the Structure of the Voter

김현기;신석균;이기서
- Proceedings of the KIEE Conference / KIEE 1998 Summer Conference Proceedings A / pp. 375-377 / 1998
This paper presents two models of the TMR system, a fail-passive system with the fault-tolerant characteristics used in airplane and railway systems. We design a single system, a single-voter TMR system, and a triplicated-voter TMR system based on the MC68000, calculate the failure rates of the components using MIL-SPEC-217F, and evaluate the reliability and MTTF (Mean Time To Failure) of the designed systems with a Markov model.
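
The textbook closed-form results for a 2-of-3 majority TMR system give a feel for the quantities evaluated above. The sketch assumes identical modules with constant failure rate `lam`, no repair, and an optional single voter in series with failure rate `lam_voter`. These assumptions and names are illustrative; the paper's Markov models and its triplicated-voter design are not reproduced here.

```python
def tmr_reliability(r_module, r_voter=1.0):
    """Reliability of 2-of-3 TMR with a single series voter:
    R = R_v * (3 R^2 - 2 R^3)."""
    return r_voter * (3.0 * r_module ** 2 - 2.0 * r_module ** 3)


def tmr_mttf(lam, lam_voter=0.0):
    """MTTF from integrating exp(-lam_v t) * (3 exp(-2 lam t) - 2 exp(-3 lam t))
    over t from 0 to infinity."""
    return 3.0 / (2.0 * lam + lam_voter) - 2.0 / (3.0 * lam + lam_voter)
```

With a perfect voter the MTTF is 5/(6·lam), which is shorter than the 1/lam of a single module, even though TMR is more reliable over short mission times. This is why reliability and MTTF are usually reported together.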

Derivation of MCE/GPD Training Algorithm Applicable to Weighted Hidden Markov Models

최홍섭
- The Journal of the Acoustical Society of Korea / Vol. 16, No. 1 / pp. 104-109 / 1997
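In this paper, we apply the well-known discriminative training method MCE/GPD to the WHMM, derive the corresponding training algorithm, and present experimental results on the E-set. The derived algorithm can be regarded as a natural generalization of the previously proposed adaptive training algorithm in that it can train the HMMs of multiple confusable classes. In recognition experiments on the E-set, we obtained recognition rate improvements of about 15% on the training data and about 12% on the test data.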
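
The MCE criterion is commonly built from a misclassification measure that compares the correct class score against a soft maximum over competing classes, passed through a sigmoid to obtain a smooth loss that GPD can minimize by gradient descent. The sketch below shows that standard form for generic discriminant scores. The parameter names `eta` and `gamma` are illustrative, and the WHMM-specific gradients derived in the paper are not shown.

```python
import math


def misclassification_measure(scores, correct, eta=1.0):
    """d = -g_correct + (1/eta) * log( mean over competitors of exp(eta * g_j) )."""
    competitors = [g for k, g in enumerate(scores) if k != correct]
    m = max(competitors)  # subtract the max for numerical stability
    soft_max = m + math.log(
        sum(math.exp(eta * (g - m)) for g in competitors) / len(competitors)
    ) / eta
    return -scores[correct] + soft_max


def mce_loss(scores, correct, eta=1.0, gamma=1.0):
    """Sigmoid of the misclassification measure: near 0 when the correct
    class wins clearly, near 1 when it loses clearly."""
    d = misclassification_measure(scores, correct, eta)
    return 1.0 / (1.0 + math.exp(-gamma * d))
```

A negative `d` means the correct class scores higher than its competitors, so the loss falls below 0.5.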

Bayesian Estimation for Inflection S-shaped Software Reliability Growth Model

김희수;이종형;박동호
- Journal of Korean Society for Quality Management / Vol. 37, No. 4 / pp. 16-22 / 2009
The inflection S-shaped software reliability growth model (SRGM) proposed by Ohba (1984) is one of the most commonly used models and has been discussed by many authors. The main purpose of this paper is to estimate the parameters of Ohba's SRGM within the Bayesian framework by applying Markov chain Monte Carlo techniques. While the maximum likelihood estimates for these parameters are well known, Bayesian methods for the inflection S-shaped SRGM have not been discussed in the literature. The proposed methods can be quite flexible depending on the choice of prior distributions for the parameters of interest. We also compare the Bayesian methods with the maximum likelihood method numerically using real data.
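
The inflection S-shaped model is usually written with mean value function m(t) = a(1 - e^{-bt}) / (1 + β e^{-bt}), where `a` is the expected total number of faults, `b` the detection rate, and `beta` the inflection parameter. The sketch below evaluates it and a Poisson log-likelihood for grouped fault counts, which is the quantity an MCMC sampler would combine with a log-prior. The grouped-data likelihood and all names are assumptions; the abstract does not specify the data format or priors used.

```python
import math


def mean_value(t, a, b, beta):
    """Expected cumulative number of faults detected by time t."""
    e = math.exp(-b * t)
    return a * (1.0 - e) / (1.0 + beta * e)


def log_likelihood(times, counts, a, b, beta):
    """NHPP log-likelihood for fault counts observed in consecutive
    intervals ending at the (strictly increasing) values in `times`."""
    ll, prev = 0.0, 0.0
    for t, k in zip(times, counts):
        cur = mean_value(t, a, b, beta)
        inc = cur - prev  # expected faults in this interval
        ll += k * math.log(inc) - inc - math.lgamma(k + 1)
        prev = cur
    return ll
```

Setting `beta = 0` reduces the mean value function to the exponential form a(1 - e^{-bt}), so the inflection parameter is what produces the S shape.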

A Semi-supervised Learning of HMM to Build a POS Tagger for a Low Resourced Language

Pattnaik, Sagarika;Nayak, Ajit Kumar;Patnaik, Srikanta
- Journal of information and communication convergence engineering / Vol. 18, No. 4 / pp. 207-215 / 2020
Part-of-speech (POS) tagging is an indispensable part of major NLP models. Progress can be seen in a number of languages around the globe, especially European languages. For Indian languages, however, there has been no major breakthrough, owing to a lack of supporting tools and resources, and the Odia language in particular has yet to establish itself. To make Odia suitable for different NLP operations, this paper attempts to develop a POS tagger for the language on an HMM (Hidden Markov Model) platform. The tagger uses a bigram HMM with the dynamic Viterbi algorithm to output annotated text with maximum accuracy. The model is evaluated on a tourism-domain corpus of approximately 0.2 million tokens. With a training-to-testing proportion of 3:1, the proposed model exhibits satisfactory results despite the limited training size.
https://doi.org/10.6109/jicce.2020.18.4.207
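
Viterbi decoding for a bigram HMM tagger, as described in the abstract above, can be sketched as follows. The probability tables and the two-word English example are hypothetical stand-ins (the paper's Odia corpus and tag set are not reproduced), and unseen events are floored to a small constant instead of being smoothed. The function assumes a non-empty word list.

```python
import math


def viterbi(words, tags, start_p, trans_p, emit_p, floor=1e-12):
    """Most likely tag sequence under a bigram HMM, computed in log space."""
    def lg(p):
        return math.log(max(p, floor))  # floor avoids log(0) for unseen events

    best = [{t: lg(start_p.get(t, 0.0)) + lg(emit_p[t].get(words[0], 0.0))
             for t in tags}]
    back = []
    for w in words[1:]:
        scores, pointers = {}, {}
        for t in tags:
            # Best previous tag for reaching tag t at this position.
            prev = max(tags, key=lambda p: best[-1][p] + lg(trans_p[p].get(t, 0.0)))
            scores[t] = (best[-1][prev] + lg(trans_p[prev].get(t, 0.0))
                         + lg(emit_p[t].get(w, 0.0)))
            pointers[t] = prev
        best.append(scores)
        back.append(pointers)
    # Trace back from the best final tag.
    path = [max(tags, key=lambda t: best[-1][t])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]


# Hypothetical toy model.
TAGS = ["NOUN", "VERB"]
START = {"NOUN": 0.8, "VERB": 0.2}
TRANS = {"NOUN": {"NOUN": 0.3, "VERB": 0.7}, "VERB": {"NOUN": 0.6, "VERB": 0.4}}
EMIT = {"NOUN": {"dogs": 0.9, "bark": 0.1}, "VERB": {"dogs": 0.1, "bark": 0.9}}
print(viterbi(["dogs", "bark"], TAGS, START, TRANS, EMIT))  # ['NOUN', 'VERB']
```

Working in log space keeps long sentences from underflowing, and the back-pointer table makes decoding linear in sentence length and quadratic in the number of tags.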

Fitting acyclic phase-type distributions by orthogonal distance

Pulungan, Reza;Hermanns, Holger
- Advances in Computational Design / Vol. 7, No. 1 / pp. 37-56 / 2022
Phase-type distributions are the distributions of the time to absorption in finite absorbing Markov chains. They generalize the exponential distributions and their family while retaining their tractability. They are widely used as stochastic models in fields ranging from queuing theory, reliability, dependability, and forecasting to computer networks, security, and computational design. The ability to fit phase-type distributions to intractable or empirical distributions is therefore highly desirable for many practical purposes. Many methods and tools currently exist for this fitting problem. In this paper, we present the results of our investigation into orthogonal-distance fitting as a method for fitting phase-type distributions, together with a comparison to existing fitting methods and tools.
https://doi.org/10.12989/acd.2022.7.1.037
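
One of the simplest acyclic phase-type distributions is the hypoexponential: the time to pass through a chain of exponential phases with distinct rates before absorption. Its CDF has a closed form, sketched below as a concrete example of the kind of function a fitting procedure would match against data. The orthogonal-distance fitting itself is not implemented here, and the function name is an illustrative assumption.

```python
import math


def hypoexponential_cdf(t, rates):
    """CDF of the sum of independent exponential phases with pairwise
    distinct rates: F(t) = 1 - sum_i c_i * exp(-rate_i * t),
    where c_i = prod_{j != i} rate_j / (rate_j - rate_i)."""
    survival = 0.0
    for i, li in enumerate(rates):
        coeff = 1.0
        for j, lj in enumerate(rates):
            if j != i:
                coeff *= lj / (lj - li)  # requires distinct rates
        survival += coeff * math.exp(-li * t)
    return 1.0 - survival
```

With a single phase this reduces to the ordinary exponential CDF, which shows the sense in which phase-type distributions generalize the exponential family.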

490 search results (processing time: 0.023 s)
