• Title/Summary/Keyword: Speech Signals

Search Result 499, Processing Time 0.019 seconds

An efficient method of spatial cues and compensation method of spectrums on multichannel spatial audio coding (멀티채널 Spatial Audio Coding에서의 효율적인 Spatial Cues 사용과 그에 따른 Spectrum 보상방법)

  • Lee, Byong-Hwa;Beack, Seung-Kwon;Seo, Jeong-Gil;Han, Min-Soo
    • MALSORI
    • /
    • no.53
    • /
    • pp.157-169
    • /
    • 2005
  • This paper proposes an efficiently representing method of spatial cues on multichannel spatial audio coding. The Binaural Cue Coding (BCC) method introduced recently represents multichannel audio signals by means of Inter Channel Level Difference (ICLD) or Source Index (SI). We tried to express more efficiently ICLD and SI information based on Inter Channel Correlation in this paper. We adopt different spatial cues according to ICC and propose a compensation method of empty spectrums created by using SI. We performed a MOS test and measuring spectral distortion. The results show that the proposed method can reduce the bitrate of side information without large degradation of the audio quality.

  • PDF

A Study on the Affinity Between Pairs of Korean Vowels Using the Dynamic Paremeters of Vocal Tract (성도의 다이내믹 피라미터에 의한 한글 모음간의 근사도에 관한 연구)

  • 김중규;안수길
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.19 no.1
    • /
    • pp.1-8
    • /
    • 1982
  • Many researches on the parametric representation of speech ,signals using the adaptive linear prediction method have been studied for the past few years. In this paper, we used the LPC(Linear Predictive Coding)method to analyae the parameters of Korean vowels and by using those parameters we studied the affinity between every pair of Korean vowels. As a result of our study, it is found that each pair of Korean vowels that has a greater phonetic affinity also has a greater affinity of vocal tract parameters than other pairs.

  • PDF

A Study on the Efficient Speech Recognition System using Database Grouping (어휘 그룹화를 이용한 음성인식시스템의 성능향상에 관한 연구)

  • 우상욱;권승호;한수양;이동규;이두수
    • Proceedings of the IEEK Conference
    • /
    • 2003.07e
    • /
    • pp.2455-2458
    • /
    • 2003
  • In this paper, the Classification of Energy Labeling has been Proposed. Energy Parameters of input signal which is extracted from each phoneme is labelled. And groups of labelling according to detected energies of input signals are detected. Next, DTW processes in a selected group of labeling. This leads to DTW processing faster than a previous algorithm. In this Method, because an accurate detection of parameters is necessary on the assumption in steps of a detection of speeching duration and a detection of energy parameters, variable windows which are decided by pitch period is used. Extract algorithms don't search for exact frame energy, because 256 frame window-sizes is fixed. For this reason, a new energy extraction method has been proposed. A pitch period is detected firstly; next window scale is decided between 200 frames and 300 frames. The proposed method make it possible to cancel an influence of windows.

  • PDF

Pattern Recognition of Rotor Fault Signal Using Bidden Markov Model (은닉 마르코프 모형을 이용한 회전체 결함신호의 패턴 인식)

  • Lee, Jong-Min;Kim, Seung-Jong;Hwang, Yo-Ha;Song, Chang-Seop
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.27 no.11
    • /
    • pp.1864-1872
    • /
    • 2003
  • Hidden Markov Model(HMM) has been widely used in speech recognition, however, its use in machine condition monitoring has been very limited despite its good potential. In this paper, HMM is used to recognize rotor fault pattern. First, we set up rotor kit under unbalance and oil whirl conditions. Time signals of two failure conditions were sampled and translated to auto power spectrums. Using filter bank, feature vectors were calculated from these auto power spectrums. Next, continuous HMM and discrete HMM were trained with scaled forward/backward variables and diagonal covariance matrix. Finally, each HMM was applied to all sampled data to prove fault recognition ability. It was found that HMM has good recognition ability despite of small number of training data set in rotor fault pattern recognition.

On Altering the Pitch of Speech Signals in Waveform Coding -Alteration Method by the LPC and the Pitch Halving- (음성 파형코딩 음원피치 변경에 관한 연구 -LPC와 주기반분법에 의한 피치변경법-)

  • 배명진;윤희상;안수길
    • The Journal of the Acoustical Society of Korea
    • /
    • v.10 no.5
    • /
    • pp.11-19
    • /
    • 1991
  • 음성 신호의 합성기법들 중에서 파형코딩법은 음질이 우수하기 때문에 분석에 의한 합성법으로 많이 사용하고 있다. 그렇지만 음원과 성도의특성을 분리하지 않고 파형의 잉여분만을 제거한 후에 파 형자체를 저장하기 때문에 규칙에 의한 합성기법으로 사용하기에는 어려움이 많다. 본 논문은 파형코딩 법 중 선형 PCM 코딩법으로 저장된 음성파형에 대해 피치를 양분할 수 있는 주기반분법을 제안하여 파형자체의 음원을 분리하지 않고 피치 주기를 변경시킬 수 있는 새로운 피치 변경법을 제안하였다. 따 라서 음질이 우수한 파형코딩 합성법으로 규칙에 의한 합성을 수행할 수 있다.

  • PDF

A Study on the Digital Filter Structure for ADM Coded Signal (ADM 부호화신호를 위한 디지털필터구조에 관한 연구)

  • 신재호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.14 no.6
    • /
    • pp.642-649
    • /
    • 1989
  • In this paper the theory of digital filters which can directly process the ADM encoded signal, and their structures are studied. In order to investigate the frequency characteristics of DM filters with the strucutures presented in this paper, a sampled speech signal is used for the input data. The result of computer simulation shows that the presented DM filter structures can be used effectively for the direct process of ADM encoded signals, ven though they do not posses sufficiently sharp cutoff characteristics.

  • PDF

An Estimation method for Characteristic Parameters in a Low Frequency Signal Transformed by High Frequency Signals (고주파 신호에 의하여 변형된 저주파신호에서의 특성변수 추정 기법)

  • Yoo, Kyung-Yul
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.51 no.2
    • /
    • pp.86-88
    • /
    • 2002
  • An estimation method for the characteristic parameters in the low frequency signal is proposed in this paper. A low frequency signal is assumed to be modulated or distorted by high frequency terms. The algorithm proposed in this paper is designed to select set of local maximums in a successive manner, hence it is denoted as the iterative peak picking(IPP) algorithm. The IPP algorithm is operating in the time domain and is using only the comparison operation between two neighboring samples. Therefore, its computational complexity is very low and the delay caused by the computation is negligible, which make the real-time operation possible with economic hardware. The proposed algorithm is verified on the pitch estimation of speech signal and blood pulse estimation.

A Source Separation Algorithm for Stereo Panning Sources (스테레오 패닝 음원을 위한 음원 분리 알고리즘)

  • Baek, Yong-Hyun;Park, Young-Cheol
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.4 no.2
    • /
    • pp.77-82
    • /
    • 2011
  • In this paper, we investigate source separation algorithms for stereo audio mixed using amplitude panning method. This source separation algorithms can be used in various applications such as up-mixing, speech enhancement, and high quality sound source separation. The methods in this paper estimate the panning angles of individual signals using the principal component analysis being applied in time-frequency tiles of the input signal and independently extract each signal through directional filtering. Performances of the methods were evaluated through computer simulations.

A Correlation between Emile Sound and Other Waves (에밀레의 맥놀이와 다른 파동과의 상관관계)

  • 안정근;진용옥
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.1
    • /
    • pp.30-35
    • /
    • 2001
  • The most important characteristic of Emile Bell's sound is a beating. It is modulation phenomenon which appears as a result of interference multiplication in time domain. This modulation phenomenon can be modeled as DSB-SC which suppress carrier and signals distributed both sides. The beatiog wave is observed in Laman distribution signal for polyvinyl speech signal, water vein wave, tide wave. The beating wave is caused by asymmetry Property of the bell.

  • PDF

Subband IPNLMS Adaptive Filter for Sparse Impulse Response Systems (성긴임펄스 응답 시스템을 위한 부밴드 IPNLMS 적응필터)

  • Sohn, Sang-Wook;Choi, Hun;Bae, Hyeon-Deok
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.60 no.2
    • /
    • pp.423-430
    • /
    • 2011
  • In adaptive filtering, the sparseness of impulse response and input signal characteristics are very important factors of it's performance. This paper presents a subband improved proportionate normalized least square (SIPNLMS) algorithm which combines IPNLMS for impulse response sparseness and subband filtering for prewhitening the input signal. As drawing and combining the advantage of conventional approaches, the proposed algorithm, for impulse responses exhibiting high sparseness, achieve improved convergence speed and tracking ability. Simulation results, using colored signal(AR(4)) and speech input signals, show improved performance compared to fullband structure of existing methods.