• Title/Summary/Keyword: Speech separation

Search Result 89, Processing Time 0.023 seconds

On a Split Model for Analysis Techniques of Wideband Speech Signal (광대역 음성신호의 분할모델 분석기법에 관한 연구)

  • Park, Young-Ho;Ham, Myung-Kyu;You, Kwang-Bock;Bae, Myung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.7
    • /
    • pp.80-84
    • /
    • 1999
  • In this paper, the split model analysis algorithm, which can generate the wideband speech signal from the spectral information of narrowband signal, is developed. The split model analysis algorithm deals with the separation of the 10/sup th/ order LPC model into five cascade-connected 2/sup nd/ order model. The use of the less complex 2/sup nd/ order models allows for the exclusion of the complicated nonlinear relationships between model parameters and all the poles of the LPC model. The relationships between the model parameters and its corresponding analog poles is proved and applied to each 2/sup nd/ order model. The wideband speech signal is obtained by changing only the sampling rate.

  • PDF

Implementation of Environmental Noise Remover for Speech Signals (배경 잡음을 제거하는 음성 신호 잡음 제거기의 구현)

  • Kim, Seon-Il;Yang, Seong-Ryong
    • 전자공학회논문지 IE
    • /
    • v.49 no.2
    • /
    • pp.24-29
    • /
    • 2012
  • The sounds of exhaust emissions of automobiles are independent sound sources which are nothing to do with voices. We have no information for the sources of voices and exhaust sounds. Accordingly, Independent Component Analysis which is one of the Blind Source Separaton methods was used to segregate two source signals from each mixed signals. Maximum Likelyhood Estimation was applied to the signals came through the stereo microphone to segregate the two source signals toward the maximization of independence. Since there is no clue to find whether it is speech signal or not, the coefficients of the slope was calculated by the autocovariances of the signals in frequcency domain. Noise remover for speech signals was implemented by coupling the two algorithms.

Performance Enhancement of Speech Communication System using Reverberation Rejection (잔향제거를 이용한 음성통신 시스템 성능 향상)

  • Kim, Se-Young;Kang, Suk-Youb;Kim, Ki-Man
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.13 no.10
    • /
    • pp.2211-2217
    • /
    • 2009
  • In this paper, we propose the speech enhancement algorithm using an one-microphone in a reverberant room environments. Spectral subtraction is the effective method which can reduce the reverberation element and the noise in a spectrum domain. Spectral subtraction needs correct separation of voice section and silent section therefore to improve the performance, voice activity detection(VAD) based on entropy has been applied to the proposed method. We test a performance of the proposed method by comparing with conventional method which used VAD based on energy detection. Reverberation reduction ratio with variable of SNR and a reverberation time is used as a test index. From the simulation result, proposed method shows performance better than conventional method.

Frequency Domain Blind Source Seperation Using Cross-Correlation of Input Signals (입력신호 상호상관을 이용한 주파수 영역 블라인드 음원 분리)

  • Sung Chang Sook;Park Jang Sik;Son Kyung Sik;Park Keun-Soo
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.3
    • /
    • pp.328-335
    • /
    • 2005
  • This paper proposes a frequency domain independent component analysis (ICA) algorithm to separate the mixed speech signals using a multiple microphone array By estimating the delay timings using a input cross-correlation, even in the delayed mixture case, we propose a good initial value setting method which leads to optimal convergence. To reduce the calculation, separation process is performed at frequency domain. The results of simulations confirms the better performances of the proposed algorithm.

  • PDF

An Applicability of Teager Energy Operator and Energy Separation Algorithm for Waveform Distortion Analysis : Harmonics, Inter-harmonics and Frequency Variation

  • Cho, Soo-Hwan;Hur, Jin;Chung, Il-Yop
    • Journal of Electrical Engineering and Technology
    • /
    • v.9 no.4
    • /
    • pp.1210-1216
    • /
    • 2014
  • This paper deals with an application of Teager Energy Operator (TEO) and Energy Separation Algorithm(ESA) to detect and determine various voltage waveform distortions like harmonics, inter-harmonics and frequency variation. Because the TEO and DESA algorithm was initially proposed for speech or communication analysis, its applications are limited to some types of waveform in the power quality analysis area. For example, an undistorted voltage signal is similar with a pure sinusoid. A voltage fluctuation is very similar with an amplitude-modulated signal, from the viewpoint of signal theory. And a continuous frequency variation is similar with a frequency-modulated signal, which is also known as a chirp signal. This paper is written to show that the TEO and DESA algorithm can be used for detecting occurrences of the representative waveform distortions and determining their instantaneous information of amplitude and frequency.

Comparison of Independent Component Analysis and Blind Source Separation Algorithms for Noisy Data (잡음환경에서 독립성분 분석과 암묵신호분리 알고리즘의 성능비교)

  • O, Sang-Hun;Cichocki, Andrzej;Choe, Seung-Jin;Lee, Su-Yeong
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.39 no.2
    • /
    • pp.10-20
    • /
    • 2002
  • Various blind source separation (BSS) and independent component analysis (ICA) algorithms have been developed. However, comparison study for BSS/ICA algorithms has not been extensively carried out yet. The main objective of this paper is to compare various promising BSS/ICA algorithms in terms of several factors such as robustness to sensor noise, computational complexity, the conditioning of the mixing matrix, the number of sensors, and the number of training patterns. We propose several benchmarks which are useful for the evaluation of the algorithm. This comparison study will be useful for real-world applications, especially EEG/MEG analysis and separation of miked speech signals.

Phoneme Separation and Establishment of Time-Frequency Discriminative Pattern on Korean Syllables (음절신호의 음소 분리와 시간-주파수 판별 패턴의 설정)

  • 류광열
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.16 no.12
    • /
    • pp.1324-1335
    • /
    • 1991
  • In this paper, a phoneme separation and an establishment of discriminative pattern of Korean phonemes are studied on experiment. The separation uses parameters such as pitch extraction, glottal peak pulse width of each pitch. speech duration. envelope and amplitude bias. The first pitch is extracted by deviations of glottal peak and width. energy and normalization on a bias on the top of vowel envelope. And then, it traces adjacent pitch to vowel in whole. On vewel, amethod to be reduced gliding pattern and the possible of vowel distinction to be used just second formant are proposed, and shrinking pitch waveform has nothing to do with pitch length is estimated. A pattern of envelope, spectrum, shrinking waveform, and a method of analysis by mutual relation among phonemes and manners of articulation on consonant are detected. As experimental results, 90% on vowel phoneme, 80% and 60% on initial and final consonant are discriminated.

  • PDF

Blind Audio Source Separation Based On High Exploration Particle Swarm Optimization

  • KHALFA, Ali;AMARDJIA, Nourredine;KENANE, Elhadi;CHIKOUCHE, Djamel;ATTIA, Abdelouahab
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.5
    • /
    • pp.2574-2587
    • /
    • 2019
  • Blind Source Separation (BSS) is a technique used to separate supposed independent sources of signals from a given set of observations. In this paper, the High Exploration Particle Swarm Optimization (HEPSO) algorithm, which is an enhancement of the Particle Swarm Optimization (PSO) algorithm, has been used to separate a set of source signals. Compared to PSO algorithm, HEPSO algorithm depends on two additional operators. The first operator is based on the multi-crossover mechanism of the genetic algorithm while the second one relies on the bee colony mechanism. Both operators have been employed to update the velocity and the position of the particles respectively. Thus, they are used to find the optimal separating matrix. The proposed method enhances the overall efficiency of the standard PSO in terms of good exploration and performance. Based on many tests realized on speech and music signals supplied by the BSS demo, experimental results confirm the robustness and the accuracy of the introduced BSS technique.

Double-talk Control using Blind Signal Separation based on Geometric Concept in Acoustic Echo Canceller (음향반향제거기에서 기하학적 개념의 BSS를 이용한 동시통화 제어)

  • Lee, Haeng-Woo
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.12 no.3
    • /
    • pp.419-426
    • /
    • 2017
  • This paper describes an acoustic echo canceller with double-talk using BSS(: Blind Signal Separation) based on the geometric concept. The acoustic echo canceller may be deteriorated or diverged during the double-talk period. So we use the blind signal separation to detect the double talking by separating the near-end speech signal from the mixed microphone signal. In the closed reverberation environment, the blind signal separation extracts the near-end signal from unknown signals with the transformation and rotation based on the geometric concept. By this method, the acoustic echo canceller operates irrespective of double-talking. We verified performances of the proposed acoustic echo canceller by computer simulations. The results show that the acoustic echo canceller with this algorithm detects the double-talk periods thoroughly, and operates stably in the normal state without diverging of coefficients after ending the double-talking.

Vocal separation method using weighted β-order minimum mean square error estimation based on kernel back-fitting (커널 백피팅 알고리즘 기반의 가중 β-지수승 최소평균제곱오차 추정방식을 적용한 보컬음 분리 기법)

  • Cho, Hye-Seung;Kim, Hyoung-Gook
    • The Journal of the Acoustical Society of Korea
    • /
    • v.35 no.1
    • /
    • pp.49-54
    • /
    • 2016
  • In this paper, we propose a vocal separation method using weighted ${\beta}$-order minimum mean wquare error estimation (WbE) based on kernel back-fitting algorithm. In spoken speech enhancement, it is well-known that the WbE outperforms the existing Bayesian estimators such as the minimum mean square error (MMSE) of the short-time spectral amplitude (STSA) and the MMSE of the logarithm of the STSA (LSA), in terms of both objective and subjective measures. In the proposed method, WbE is applied to a basic iterative kernel back-fitting algorithm for improving the vocal separation performance from monaural music signal. The experimental results show that the proposed method achieves better separation performance than other existing methods.