• Title/Summary/Keyword: 가변음향

Subband Affine Projection Algorithm Using Variable Step Size (가변 스텝사이즈를 이용한 부밴드 인접투사 알고리즘)

  • Choi, Hun;Bae, Hyeon-Deok
    • The Journal of the Acoustical Society of Korea / v.26 no.2 / pp.69-74 / 2007
  • In signal processing applications with highly correlated input signals, a subband affine projection (SAP) algorithm combined with step-size control is a good way to address the slow convergence rate and large computational complexity associated with LMS-type algorithms. This paper proposes a subband affine projection algorithm using a variable step size. By combining SAP and step-size control in a subband structure, the proposed method achieves a fast convergence rate and a small steady-state error with low computational complexity. Experimental results on highly correlated input signals show that the proposed method outperforms conventional methods.
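
As a rough illustration of why affine projection helps with highly correlated inputs, the sketch below runs a plain full-band affine projection update with a fixed step size on an AR(1) input. It is not the subband, variable step-size method described in the abstract. It compares projection order 2 with order 1, which reduces to NLMS. The filter coefficients, AR coefficient, step size, regularization constant, and all function names are assumptions chosen for the illustration.

```python
import random

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def solve(matrix, rhs):
    """Solve a small dense linear system by Gauss-Jordan elimination."""
    n = len(rhs)
    aug = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def apa_misalignment(order, num_samples=200, step=0.5, reg=1e-3, seed=2):
    """Run affine projection of the given order; return the squared weight error."""
    rng = random.Random(seed)
    true_w = [0.8, -0.5, 0.3, 0.1, -0.2, 0.05, 0.4, -0.1]  # hypothetical system
    w = [0.0] * len(true_w)
    taps = [0.0] * len(true_w)
    past_taps, past_desired = [], []  # newest first, at most `order` entries
    x = 0.0
    for _ in range(num_samples):
        x = 0.95 * x + rng.gauss(0.0, 1.0)  # highly correlated AR(1) input
        taps = [x] + taps[:-1]
        desired = dot(true_w, taps) + rng.gauss(0.0, 0.01)
        past_taps = [taps] + past_taps[:order - 1]
        past_desired = [desired] + past_desired[:order - 1]
        errors = [d - dot(w, t) for t, d in zip(past_taps, past_desired)]
        # Regularized Gram matrix X^T X + reg * I of the stored input vectors.
        gram = [[dot(ti, tj) + (reg if i == j else 0.0)
                 for j, tj in enumerate(past_taps)]
                for i, ti in enumerate(past_taps)]
        coeffs = solve(gram, errors)
        # w <- w + step * X (X^T X + reg * I)^{-1} e
        for c, t in zip(coeffs, past_taps):
            w = [wi + step * c * ti for wi, ti in zip(w, t)]
    return sum((a - b) ** 2 for a, b in zip(w, true_w))

nlms_error = apa_misalignment(order=1)
apa_error = apa_misalignment(order=2)
print("order 1 (NLMS) squared weight error:", nlms_error)
print("order 2 (APA)  squared weight error:", apa_error)
```

Both runs share the same seed, so they see the same input and noise. Using the last two input vectors removes most of the correlation of an AR(1) signal, which is why the order-2 run should end much closer to the true weights after the same number of samples.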

Variable Step Size LMS Algorithm Using the Error Difference (오류 차이를 활용한 가변 스텝 사이즈 LMS 알고리즘)

  • Woo, Hong-Chae
    • The Journal of the Acoustical Society of Korea / v.28 no.3 / pp.245-250 / 2009
  • In communications and signal processing, many least mean square (LMS) adaptive algorithms have been used because of their simplicity and robustness. However, the LMS algorithm is known to converge slowly and non-uniformly. Various variable step-size LMS adaptive algorithms have been introduced and studied to speed up convergence. This paper proposes a variable step-size LMS algorithm that uses the error difference to update the step size. Simulation results show that the proposed algorithm converges faster than other algorithms. The steady-state performance of the proposed algorithm is also analyzed theoretically.

Novel Variable Step-Size Gradient Adaptive Lattice Algorithm for Active Noise Control (능동 소음 제어를 위한 새로운 가변 수렴 상수 Gradient Adaptive Lattice Algorithm)

  • Lee, Keunsang;Kim, Seong-Woo;Im, Jaepoong;Seo, Young-Soo;Park, Youngcheol
    • The Journal of the Acoustical Society of Korea / v.33 no.5 / pp.309-315 / 2014
  • This paper proposes a novel variable step-size filtered-x gradient adaptive lattice (NVSS-FxGAL) algorithm for active noise control systems. The gradient adaptive lattice (GAL) algorithm can control narrowband noise effectively, and with a variable step size it can achieve both a fast convergence rate and a low steady-state level. However, its convergence performance suffers when signal characteristics vary, because a single global variable step size is applied equally to all lattice stages. The proposed algorithm therefore ensures stable and consistent convergence by using a local variable step size suited to each lattice stage. Simulation results confirm that the proposed algorithm attains a faster convergence rate and a lower steady-state level than conventional algorithms.

Voice Command Web Browser Using Variable Vocabulary Word Recognizer (가변어휘 단어 인식기를 사용한 음성 명령 웹 브라우저)

  • 이항섭
    • The Journal of the Acoustical Society of Korea / v.18 no.2 / pp.48-52 / 1999
  • In this paper, we describe a Voice Command Web Browser that uses a variable vocabulary word recognizer to enable Internet surfing through Korean speech recognition. The browser's distinguishing feature is that its links and menus can be operated by speech, so a speech interface can be used together with the mouse for web browsing. Because the recognition candidates change dynamically from one web page to the next, we use a variable vocabulary word recognizer. The recognizer was trained on the 3,848-word POW (Phonetically Optimized Words) set so that it can recognize new words that did not appear in the training data. Preliminary tests showed speaker-independent, vocabulary-independent recognition accuracy of 93.8% for 32 Korean words. The Voice Command Web Browser was developed on Windows 95/NT using Netscape Navigator, and it incorporates usability test results in order to offer an easy interface to users unfamiliar with speech interfaces. In an online experiment under speaker-independent and environment-independent conditions, the Voice Command Web Browser achieved a recognition accuracy of 90%.

A Variable Parameter Model based on SSMS for an On-line Speech and Character Combined Recognition System (음성 문자 공용인식기를 위한 SSMS 기반 가변 파라미터 모델)

  • 석수영;정호열;정현열
    • The Journal of the Acoustical Society of Korea / v.22 no.7 / pp.528-538 / 2003
  • An SCCRS (Speech and Character Combined Recognition System) has been developed for mobile devices such as PDAs (personal digital assistants). In SCCRS, feature extraction is carried out separately for speech and for handwritten characters, but recognition is performed in a common engine. The recognition engine is essentially a CHMM (Continuous Hidden Markov Model) with a variable parameter topology, which minimizes the number of model parameters and reduces recognition time. To generate a context-independent variable parameter model, we propose SSMS (Successive State and Mixture Splitting), which yields appropriate numbers of mixtures and states through splitting in the mixture domain and in the time domain. Recognition results show that, in a speech recognition system, the proposed SSMS method reduces the total number of GOPDDs (Gaussian Output Probability Density Distributions) by up to 40.0% compared with the conventional fixed-parameter model at the same recognition performance.

Enhanced Normalized Subband Adaptive Filter with Variable Step Size (가변 스텝 사이즈를 가지는 개선된 정규 부밴드 적응 필터)

  • Chung, Ik Joo
    • The Journal of the Acoustical Society of Korea / v.32 no.6 / pp.518-524 / 2013
  • In this paper, we propose a variable step-size algorithm that enhances the normalized subband adaptive filter, which was itself proposed to improve the convergence characteristics of the conventional full-band adaptive filter. Kwong's well-known variable step-size algorithm is simple yet outperforms the fixed step-size algorithm. However, when large additive noise is present, the performance of Kwong's algorithm deteriorates in proportion to the amount of additive noise. We devised a variable step-size algorithm that does not depend on the amount of additive noise by exploiting a normalized adaptation error, that is, the error with the estimated additive noise subtracted and then normalized by it. We compared the proposed algorithm with other algorithms using a system identification model. The results show that the proposed algorithm has good convergence characteristics in both stationary and non-stationary environments.
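
For reference, Kwong's rule (Kwong and Johnston) updates the step size as mu(n+1) = alpha * mu(n) + gamma * e(n)^2 and clips the result to [mu_min, mu_max]. The sketch below applies that rule to a plain full-band LMS system-identification loop. It does not implement the normalized subband filter or the noise-normalized rule described in the abstract. The unknown system, alpha, gamma, and the step-size bounds are illustrative assumptions. It is meant to show the noise sensitivity mentioned above: the step size settles at a larger value when the additive noise is stronger.

```python
import random

def kwong_vss_lms(noise_std, num_samples=5000, alpha=0.97, gamma=0.01,
                  mu_min=1e-4, mu_max=0.1, seed=1):
    """LMS system identification with Kwong's variable step size.

    Returns the final weights and the step size recorded at every sample.
    """
    rng = random.Random(seed)
    true_w = [0.8, -0.5, 0.3, 0.1]  # hypothetical unknown system
    w = [0.0] * len(true_w)
    taps = [0.0] * len(true_w)      # most recent input first
    mu = mu_max                     # start with the largest allowed step
    mu_history = []
    for _ in range(num_samples):
        taps = [rng.gauss(0.0, 1.0)] + taps[:-1]
        desired = sum(t * x for t, x in zip(true_w, taps)) + rng.gauss(0.0, noise_std)
        error = desired - sum(wi * x for wi, x in zip(w, taps))
        # Standard LMS weight update with the current step size.
        w = [wi + mu * error * x for wi, x in zip(w, taps)]
        # Kwong's rule: the squared error drives the next step size.
        mu = min(mu_max, max(mu_min, alpha * mu + gamma * error * error))
        mu_history.append(mu)
    return w, mu_history

def tail_mean(values, count=1000):
    return sum(values[-count:]) / count

quiet_w, quiet_mu = kwong_vss_lms(noise_std=0.1)
noisy_w, noisy_mu = kwong_vss_lms(noise_std=0.5)
print("steady-state step size, low noise: ", tail_mean(quiet_mu))
print("steady-state step size, high noise:", tail_mean(noisy_mu))
```

Because the squared error never falls below the noise power, the step size cannot shrink toward mu_min in the noisy run, which keeps the steady-state misadjustment high.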

Difference State Number of CHMM Model to Improve the Performance of SCCRS (한국어 음성/문자 공용인식기의 성능향상을 위한 가변 상태수 CHMM모델의 구성)

  • Suk Soo-Young;Kim Min-Jung;Kim Kwang-Soo;Jung Ho-Youl;Chung Hyun-Yeol
    • Proceedings of the Acoustical Society of Korea Conference / spring / pp.95-98 / 2002
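  • CHMMs (continuous hidden Markov models) used for character or speech recognition generally have a fixed-state-number structure in which every model has the same number of states. This ignores the characteristics of the individual recognition units, so a variable-state-number model that takes them into account can be expected to improve recognition accuracy. The representative methods for determining a suitable number of states for each recognition unit are the parameter histogram method and the BIC (Bayesian Information Criterion) method. These methods only improve the likelihood of individual recognition units, which is not directly proportional to the overall recognition rate. In this paper, we therefore compare the recognition rates of fixed-state-number models with those obtained as the number of states varies by recognition unit, and on that basis propose a method for constructing variable-state-number CHMMs in which the number of states differs for each model. To verify the proposed model, we applied it to handwritten character recognition in a speech/character combined recognizer. The variable-state-number model built with the proposed LM (Local Maximum) method maintained nearly the same recognition rate as the models built with MLE and BIC, while reducing the total number of states by 31% relative to the MLE model and by 22% relative to the BIC model, confirming the effectiveness of the proposed model.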
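
To make the BIC baseline mentioned in this abstract concrete, the sketch below scores candidate state counts with the standard criterion BIC = -2 ln L + k ln N (lower is better) and picks the best one. It illustrates only the baseline idea, not the proposed LM (Local Maximum) method. The log-likelihood values, the number of parameters per state, and the function names are hypothetical and chosen purely for illustration.

```python
import math

def bic(log_likelihood, num_params, num_samples):
    """Bayesian Information Criterion in its 'lower is better' form."""
    return -2.0 * log_likelihood + num_params * math.log(num_samples)

def pick_state_count(log_likelihoods, params_per_state, num_samples):
    """Return the state count whose model has the lowest BIC.

    log_likelihoods maps a candidate state count to the training
    log-likelihood of the model trained with that many states.
    """
    return min(
        log_likelihoods,
        key=lambda states: bic(log_likelihoods[states],
                               states * params_per_state,
                               num_samples),
    )

# Hypothetical training log-likelihoods for one recognition unit.
toy_log_likelihoods = {3: -5200.0, 4: -5050.0, 5: -5000.0, 6: -4990.0}
best = pick_state_count(toy_log_likelihoods, params_per_state=20, num_samples=2000)
print("state count selected by BIC:", best)
```

The likelihood keeps improving as states are added, but the penalty term k ln N grows with each extra state, so the criterion stops at the point where the gain no longer covers the added parameters.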

Rejection Performance Analysis in Vocabulary Independent Speech Recognition Based on Normalized Confidence Measure (정규화신뢰도 기반 가변어휘 고립단어 인식기의 거절기능 성능 분석)

  • Choi, Seung-Ho
    • The Journal of the Acoustical Society of Korea / v.25 no.2 / pp.96-100 / 2006
  • Kim et al. proposed the Normalized Confidence Measure (NCM) [1-2], which was successfully used to reject misrecognized words in isolated word recognition. However, their experiments were performed on fixed-vocabulary speech recognition. In this paper, we apply NCM to vocabulary-independent speech recognition (VISP) and show its rejection performance in that domain. Specifically, we propose a vector quantization (VQ) based method for overcoming the problem of unseen triphones, which arises because NCM uses triphone confidence statistics in triphone-based normalization. In speech recognition experiments, the phone-based normalization method shows better results than both RLJC [3] and the triphone-based normalization approach. These results differ from those of Kim et al. [1-2]. In conclusion, phone-based normalization shows robust performance in the VISP domain.

Categorized VSSLMS Algorithm (Categorized 가변 스텝 사이즈 LMS 알고리즘)

  • Kim, Seon-Ho;Chon, Sang-Bae;Lim, Jun-Seok;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea / v.28 no.8 / pp.815-821 / 2009
  • Information processing in variable and noisy environments is usually accomplished by means of adaptive filters. Among the various adaptive algorithms, Least Mean Square (LMS) has become the most popular because of its robustness, good tracking capability, and simplicity, both in computational load and in ease of implementation. In practical applications of the LMS algorithm, the key parameter is the step size. As is well known, a large step size gives rapid convergence but increases the steady-state mean square error (MSE), whereas a small step size gives a small steady-state MSE but slow convergence. Many studies have proposed variable step sizes to alleviate this trade-off. In this paper, a new variable step-size LMS (VSSLMS) algorithm called Categorized VSSLMS (CVSSLMS) is proposed. CVSSLMS updates the step size by categorizing the current status of the gradient, and thereby significantly improves the convergence rate. The performance of the proposed algorithm was verified experimentally in terms of convergence rate, excess mean square error (EMSE), and complexity.
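
The step-size trade-off described in this abstract can be reproduced with a few lines of fixed step-size LMS. The sketch below is a generic system-identification experiment, not the CVSSLMS algorithm. The unknown system, the noise level, and the two step sizes are assumptions chosen for the illustration.

```python
import random

def lms_squared_errors(step_size, num_samples=6000, noise_std=0.1, seed=0):
    """Identify a short FIR system with fixed-step LMS.

    Returns the squared a priori error at every sample.
    """
    rng = random.Random(seed)
    true_w = [0.8, -0.5, 0.3, 0.1]  # hypothetical unknown system
    w = [0.0] * len(true_w)
    taps = [0.0] * len(true_w)      # most recent input first
    squared_errors = []
    for _ in range(num_samples):
        taps = [rng.gauss(0.0, 1.0)] + taps[:-1]
        desired = sum(t * x for t, x in zip(true_w, taps)) + rng.gauss(0.0, noise_std)
        error = desired - sum(wi * x for wi, x in zip(w, taps))
        # LMS update: move the weights along the instantaneous gradient estimate.
        w = [wi + step_size * error * x for wi, x in zip(w, taps)]
        squared_errors.append(error * error)
    return squared_errors

def mean(values):
    return sum(values) / len(values)

large = lms_squared_errors(step_size=0.15)
small = lms_squared_errors(step_size=0.005)
print("MSE over first 200 samples  (large, small):", mean(large[:200]), mean(small[:200]))
print("MSE over last 2000 samples  (large, small):", mean(large[-2000:]), mean(small[-2000:]))
```

With these settings the large step size should show the lower error early on and the higher error at steady state, which is the behavior a variable step size tries to combine the better halves of.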

Performance Improvement of Variable Vocabulary Speech Recognizer (가변어휘 음성인식기의 성능개선)

  • Kim Seunghi;Kim Hoi-Rin
    • Proceedings of the Acoustical Society of Korea Conference / autumn / pp.21-24 / 1999
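  • This paper describes work on improving the performance of a variable-vocabulary speech recognizer. A total of 40 context-independent phoneme models, including silence, are used. Performance improved when LDA was used to pack more useful information into feature vectors of the same dimension and when different weights were applied to the Gaussian distributions and the mixture weights during likelihood computation. In experiments using only the ETRI POW 3848 DB, an error rate reduction of 21.7% was observed. To account for noisy and vocabulary-independent conditions, experiments were also conducted using the POW 3848 DB together with the PC 168 DB and the PBW 445 DB. In the vocabulary-independent recognition experiment using the PBW 445 DB, an error rate reduction of 56.8% was obtained.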
