• 제목/요약/키워드: Segmental Method

검색결과 264건 처리시간 0.022초

개선된 델타검색기법을 이용한 피치검색시간의 단축 (AN ALGORITHM TO REDUCE THE PITCH SEARCHING TIME USING MODIFIED DELTA SEARCH IN CELP VOCODER)

  • 이주헌
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1994년도 제11회 음성통신 및 신호처리 워크샵 논문집 (SCAS 11권 1호)
    • /
    • pp.214-217
    • /
    • 1994
  • The major drawback in the Code Excited Linear Prediction type vocoders is their large computational requirements. In this paper, a simple method is proposed to reduce the pitch searching time in the pitch filter almost without degradation of quality. On the basis of the observational regularity of the correlation function of speech, only the limited numbers of pitch lags are considered to be an optimum pitch. This is done by skipping the negative envelope side of the correlation function and limiting the maximum number of lags to be considered preliminarily. By doing so, we can reduce the computational time of pitch searching more than 51% with negligible quality degradation. In addition to that, by combining that method with the conventional delta search technique, we can reduce the computational time requirements more than 60% without serious lowering the speech quality in segmental SNR measure compared to the conventional full search method.

  • PDF

최소 자승오차 방식을 이용한 세그먼트 피치패턴의 정형화 (A New Stylization Method using Least-Square Error Minimization on Segmental Pitch Contour)

  • 이정철
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1994년도 제11회 음성통신 및 신호처리 워크샵 논문집 (SCAS 11권 1호)
    • /
    • pp.107-110
    • /
    • 1994
  • In this paper, we describe the features of the fundamental frequency contour of Korean read speech, and propose a new stylization method to characterize the Fø pattern of segments. Our algorithm consists of three stylization processes : the segment level, the syllable level, and the sord level. For stylization of Fø contour in the segment level , we applied least square error minimization method to determine Fø values at initial, medial, and final position in a segment. In the syllable level, we determine the stylized Fø pattern of a syllable using the mean Fø value of each word and style information for each word, syllable and segment, we reconstruct Fø contour of sentences. The simulation results show that the error is less than 10% of the actual Fø contour for each sentence. In perception test, there is little difference between the synthesized speech with the original difference between the synthesized speech with the original Fø contour and the synthesized speech with the stylized Fø contour.

  • PDF

Single-tooth dento-osseous osteotomy with a computer-aided design/computer-aided manufacturing surgical guide

  • Kang, Sang-Hoon;Kim, Moon-Key;Lee, Ji-Yeon
    • Journal of the Korean Association of Oral and Maxillofacial Surgeons
    • /
    • 제42권2호
    • /
    • pp.127-130
    • /
    • 2016
  • This clinical note introduces a method to assist surgeons in performing single-tooth dento-osseous osteotomy. For use in this method, a surgical guide was manufactured using computer-aided design/computer-aided manufacturing technology and was based on preoperative surgical simulation data. This method was highly conducive to successful single-tooth dento-osseous segmental osteotomy.

단순화된 다중 모드 방법을 이용한 음성 부호화기 (A Speech Coder using the Simplified Multi-mode Method)

  • 강홍구
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 1995년도 제12회 음성통신 및 신호처리 워크샵 논문집 (SCAS 12권 1호)
    • /
    • pp.146-149
    • /
    • 1995
  • This paper proposes a SM-CELP speech coder which applies different excitation signal according to the characteristic of speech segment at bit-rate below 4 kbps. Speech signal is divided with 2 modes such as stationary voice and etc. using the parameters of average energy of the short-time speech and the residual signal after long term prediction. Structured multi-pulse method is used for the excitation of mode-A and gaussian or pulse-like codebook for mode-B. 4.8kbps DoD-CELP are used to evaluate the performance of the proposed coder. As a result, the propose method shows 1~2 dB higher segmental signal to noise ratio and better subjectional quality without increasing the computational amount.

  • PDF

Landmark-Guided Segmental Speech Decoding for Continuous Mandarin Speech Recognition

  • Chao, Hao;Song, Cheng
    • Journal of Information Processing Systems
    • /
    • 제12권3호
    • /
    • pp.410-421
    • /
    • 2016
  • In this paper, we propose a framework that attempts to incorporate landmarks into a segment-based Mandarin speech recognition system. In this method, landmarks provide boundary information and phonetic class information, and the information is used to direct the decoding process. To prove the validity of this method, two kinds of landmarks that can be reliably detected are used to direct the decoding process of a segment model (SM) based Mandarin LVCSR (large vocabulary continuous speech recognition) system. The results of our experiment show that about 30% decoding time can be saved without an obvious decrease in recognition accuracy. Thus, the potential of our method is demonstrated.

Blind speech segmentation과 에너지 가중치를 이용한 문장 종속형 화자인식기의 성능 향상 (Performance improvement of text-dependent speaker verification system using blind speech segmentation and energy weight)

  • 김정곤;김형순
    • 대한음성학회지:말소리
    • /
    • 제47호
    • /
    • pp.131-140
    • /
    • 2003
  • We propose a new method of generating client models for HMM based text-dependent speaker verification system with only a small amount of training data. To make a client model, statistical methods such as segmental K-means algorithm are widely used, but they do not guarantee the quality or reliability of a model when only limited data are avaliable. In this paper, we propose a blind speech segmentation based on level building DTW algorithm as an alternative method to make a client model with limited data. In addition, considering the fact that voiced sounds have much more speaker-specific information than unvoiced sounds and energy of the former is higher than that of the latter, we also propose a new score evaluation method using the observation probability raised to the power of weighting factor estimated from the normalized log energy. Our experiment shows that the proposed methods are superior to conventional HMM based speaker verification system.

  • PDF

보청기에서 적응궤환제거의 성능 향상 (Improving the Performance of Adaptive Feedback Cancellation in Hearing Aids)

  • 김대경;허종;박장식;손경식
    • 한국음향학회지
    • /
    • 제18권4호
    • /
    • pp.38-46
    • /
    • 1999
  • 본 논문에서는 보청기에서의 적응궤환 제거 성능을 개선하기 위한 방법들을 제안하였다. 첫번째 방법은 순시 경사치를 모니터링하여 최적해를 추적해 가는 것으로 직교원리를 이용한 음향학적 궤환제거 방법이고 다른 하나는 본 실험실에서 제안된 적응 알고리즘인 보상기를 가진 적응알고리즘을 이용한 방법이다. 다양한 시뮬레이션 조건하에서 본 논문에서 제안된 적응 궤환제거 방법이 Greenberg가 제안한 합-방식(Sum-method) 최소자승오차 알고리즘 보다 시스템 부정합, 신호대 잡음비(SNR: Signal-to-Noise Ratio) 및 세그멘트 SNR에서 훨씬 좋은 성능을 나타내었다. 또한 적응 궤환제거에 있어서 직교원리를 이용한 방법은 시뮬레이션에서 보상기를 가진 적응알고리즘을 이용한 방법과 유사한 성능을 나타내었다.

  • PDF

시공단계를 고려환 곡선변단면 프리스트레스트 콘크리트 박스거더교량의 해석 (Segmental Analysis of Curved Non-Prismatic Prestressed Concrete Box Girder Bridges)

  • 박찬민;강영진
    • 대한토목학회논문집
    • /
    • 제14권1호
    • /
    • pp.71-81
    • /
    • 1994
  • 시공단계를 고려한 곡선변단면 프리스트레스트 콘크리트 박스거더교량의 해석을 수행하였다. 곡선변단면 박스요소를 사용하며 시공순서에 따른 구조계의 변화, 크리이프, 건조수축과 릴렉세이션 등의 효과를 고려하였다. 사용되는 단면형상은 양쪽에 캔틸레버를 갖는 직사각형 1실 박스단변이며 부재축은 평면상의 곡선으로 단면제원은 부재축을 따라 변할 수 있다. 각 요소는 3절점으로 구성되며 각 절점은 단면 찌그러짐과 ?을 포함하는 8자유도를 가진다. 본 연구에서 여러가지 경우의 예를 해석, 비교하였으며 실제교량에의 적용 가능성을 입증하였다.

  • PDF

심리음향 특성을 이용한 음성 향상 알고리즘 (A Speech Enhancement Algorithm based on Human Psychoacoustic Property)

  • 전유용;이상민
    • 전기학회논문지
    • /
    • 제59권6호
    • /
    • pp.1120-1125
    • /
    • 2010
  • In the speech system, for example hearing aid as well as speech communication, speech quality is degraded by environmental noise. In this study, to enhance the speech quality which is degraded by environmental speech, we proposed an algorithm to reduce the noise and reinforce the speech. The minima controlled recursive averaging (MCRA) algorithm is used to estimate the noise spectrum and spectral weighting factor is used to reduce the noise. And partial masking effect which is one of the human hearing properties is introduced to reinforce the speech. Then we compared the waveform, spectrogram, Perceptual Evaluation of Speech Quality (PESQ) and segmental Signal to Noise Ratio (segSNR) between original speech, noisy speech, noise reduced speech and enhanced speech by proposed method. As a result, enhanced speech by proposed method is reinforced in high frequency which is degraded by noise, and PESQ, segSNR is enhanced. It means that the speech quality is enhanced.

Numerical Study on the Joints between Precast Post-Tensioned Segments

  • Kim, Tae-Hoon;Kim, Young-Jin;Jin, Byeong-Moo;Shin, Hyun-Mock
    • International Journal of Concrete Structures and Materials
    • /
    • 제19권1E호
    • /
    • pp.3-9
    • /
    • 2007
  • This paper presents a numerical procedure for analyzing the joints between precast post-tensioned segments. A computer program for the analysis of reinforced concrete structures was run for this problem. Models of material nonlinearity considered in this study include tensile, compressive and shear models for cracked concrete and a model for reinforcing steel with smeared crack. An unbonded tendon element based on the finite element method, that can describe the interaction between the tendon and concrete of prestressed concrete member, was experimentally investigated. A joint element is newly developed to predict the inelastic behavior of the joints between segmental members. The proposed numerical method for the joints between precast post-tensioned segments was verified by comparison of its results with reliable experimental results.