• Title/Summary/Keyword: subband

Search Result 434, Processing Time 0.026 seconds

An investigation of subband decomposition and feature-dimension reduction for musical genre classification (음악 장르 분류를 위한 부밴드 분해와 특징 차수 축소에 관한 연구)

  • Seo, Jin Soo;Kim, Junghyun;Park, Jihyun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.36 no.2
    • /
    • pp.144-150
    • /
    • 2017
  • Musical genre is indispensible in constructing music information retrieval system, such as music search and classification. In general, the spectral characteristics of a music signal are obtained based on a subband decomposition to represent the relative distribution of the harmonic and the non-harmonic components. In this paper, we investigate the subband decomposition parameters in extracting features, which improves musical genre classification accuracy. In addition, the linear projection methods are studied to reduce the resulting feature dimension. Experiments on the widely used music datasets confirmed that the subband decomposition finer than the widely-adopted octave scale is conducive in improving genre-classification accuracy and showed that the feature-dimension reduction is effective reducing a classifier's computational complexity.

Subspace Speech Enhancement Using Subband Whitening Filter (서브밴드 백색화 필터를 이용한 부공간 잡음 제거)

  • 김종욱;유창동
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.3
    • /
    • pp.169-174
    • /
    • 2003
  • A novel subspace speech enhancement using subband whitening filter is proposed. Previous subspace speech enhancement method either assumes additive white noise or uses whitening filter as a pre-processing for colored noise. The proposed method tries to minimize the signal distortion while reducing residual noise by processing the signal using subband whitening filter. By incorporating the notion of subband whitening filter, spectral resolution in Karhunen-Loeve(KL) domain is improved with the negligible additional computational load. The proposed method outperforms both the subspace method suggested by Ephraim and the spectral subtraction suggested by Boll in terms of segmental signal-to-noise ratio (SNRseg) and perceptual evaluation of speech quality (PESQ).

Subband Acoustic Echo Canceller with Double-Talk Detector Using Weighted Overlap-add Method and Dedicated filter (동시 통화검출 전용필터와 가중 Overlap-Add 기법을 적용한 서브밴드 음향 반향 제거기)

  • 고충기;이원철;이충용
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.8
    • /
    • pp.35-46
    • /
    • 2000
  • In this paper, we propose a subband acoustic echo canceller using the weighted Overlap-add adaptive filter bank to prevent the decrease of convergence speed in full-band US processing, and make it possible to realize the adaptive filter in block-parallel processing, this paper introduces the weighted overlap-add technique for subband echo canceller. Moreover, we propose a new double-talk detector which employs dedicated filter in addition to the energy comparison method simultaneously. The computer simulation results show that the performance of the proposed subband adaptive echo canceller double-talk detection

  • PDF

An adaptive motion estimation based on the temporal subband analysis (시간축 서브밴드 해석을 이용한 적응적 움직임 추정에 관한 연구)

  • 임중곤;정재호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.21 no.6
    • /
    • pp.1361-1369
    • /
    • 1996
  • Motion estimation is one of the key components for high quality video coding. In this paper, a new motion estimation scheme for MPEG-like video coder is suggested. The proposed temporally adaptive motion estimation scheme consists of five functional blocks: Temporal subband analysis (TSBA), extraction of temporal information, scene change detection (SCD), picture type replacement (PTR), and temporally adapted block matching algorithm (TABMA). Here all the functional components are based on the temporal subband analysis. In this papre, we applied the analysis part of subband decompostion to the temporal axis of moving picture sequence, newly defined the temporal activity distribution (TAD) and average TAD, and proposed the temporally adapted block matching algorithm, the scene change detection algorithm and picture type replacement algorithm which employed the results of the temporal subband analysis. A new block matching algorithm TABMA is capable of controlling the block matching area. According to the temporal activity distribution of objects, it allocates the search areas nonuniformly. The proposed SCD and PTR can prevent unavailable motion prediction for abrupt scene changes. Computer simulation results show that the proposed motion estimation scheme improve the quality of reconstructed sequence and reduces the number of block matching trials to 40% of the numbers of trials in conventional methods. The TSBA based scene change detection algorithm can detect the abruptly changed scenes in the intentionally combined sequence of this experiment without additional computations.

  • PDF

Noise Robust Speaker Verification Using Subband-Based Reliable Feature Selection (신뢰성 높은 서브밴드 특징벡터 선택을 이용한 잡음에 강인한 화자검증)

  • Kim, Sung-Tak;Ji, Mi-Kyong;Kim, Hoi-Rin
    • MALSORI
    • /
    • no.63
    • /
    • pp.125-137
    • /
    • 2007
  • Recently, many techniques have been proposed to improve the noise robustness for speaker verification. In this paper, we consider the feature recombination technique in multi-band approach. In the conventional feature recombination for speaker verification, to compute the likelihoods of speaker models or universal background model, whole feature components are used. This computation method is not effective in a view point of multi-band approach. To deal with non-effectiveness of the conventional feature recombination technique, we introduce a subband likelihood computation, and propose a modified feature recombination using subband likelihoods. In decision step of speaker verification system in noise environments, a few very low likelihood scores of a speaker model or universal background model cause speaker verification system to make wrong decision. To overcome this problem, a reliable feature selection method is proposed. The low likelihood scores of unreliable feature are substituted by likelihood scores of the adaptive noise model. In here, this adaptive noise model is estimated by maximum a posteriori adaptation technique using noise features directly obtained from noisy test speech. The proposed method using subband-based reliable feature selection obtains better performance than conventional feature recombination system. The error reduction rate is more than 31 % compared with the feature recombination-based speaker verification system.

  • PDF

A study on motion prediction and subband coding of moving pictuers using GRNN (GRNN을 이용한 동영상 움직임 예측 및 대역분할 부호화에 관한 연구)

  • Han, Young-Oh
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.5 no.3
    • /
    • pp.256-261
    • /
    • 2010
  • In this paper, a new nonlinear predictor using general regression neural network(GRNN) is proposed for the subband coding of moving pictures. The performance of a proposed nonlinear predictor is compared with BMA(Block Match Algorithm), the most conventional motion estimation technique. As a result, the nonlinear predictor using GRNN can predict well more 2-3dB than BMA. Specially, because of having a clustering process and smoothing noise signals, this predictor well preserves edges in frames after predicting the subband signal. This result is important with respect of human visual system and is excellent performance for the subband coding of moving pictures.

Convergence Analysis of Multiple Constrained Subband Affine Projection Algorithm (다중제한조건을 갖는 부밴드 AP 알고리즘의 수렴해석)

  • Kim, Young-Min;Sohn, Sang-Wook;Choi, Hun;Bae, Hyeon-Deok
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.10a
    • /
    • pp.474-476
    • /
    • 2009
  • In the radio communication, such as echo cancellation and channel equalization, adaptive filtering is very practical. Its convergence behavior that is used for updating the weights depends on the correlation of the input signal and length of adaptive filter. Highly correlated input and long length of adaptive filter deteriorate the convergence behavior. To solve this problem, recently, subband affine projection algorithm which pre-whiten the correlation of the input and update the weights in subband structure has been presented. This paper presents convergence analysis method of multiple constrained subband affine projection algorithm.

  • PDF

Statistical Analysis of the MSE for the MDPSAP Adaptive Filter (MPDSAP 적응필터를 위한 MSE의 통계적 해석)

  • Kim, Young-min;Choi, Hun
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.05a
    • /
    • pp.883-887
    • /
    • 2009
  • This paper presents a statistical analysis of the MSE of adaptation for the MPDSAP (Maximally polyphase decomposed Subband Affine Projection) algorithm for the an autoregressive (AR) inputs with P order. In subband structure, the Affine Projection (AP) algorithm is transformed to the Normalized Least Mean Square (NLMS) algorithm by applying the polyphase decomposition and the noble identity to the adaptive filter. And also, AR input can be pre-whitened by subband filtering with the Orthonormal Analysis Filters(OAF). In the subband structure, the pre-whitening of the AR(P) inputs provides simple and valid approximations for a statistical analysis of the MSE behaviors for the SAP adaptive filter.

  • PDF

A half subcarrier guard band spectrum assignment scheme for multi-user FBMC systems

  • Huang, Wei;Xu, Hongbo;Li, Zhongnian
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.1
    • /
    • pp.350-364
    • /
    • 2022
  • Traditionally, in multi-user multi-carrier systems, the neighboring subband will be gapped by one subcarrier, which is set as guard band to reduce multiple access interference (MAI) between neighboring subbands. The empty subcarrier for guard band will degrade the spectral efficiency of the whole system. In order to enhance the spectral efficiency of multi-user filter bank multiple carrier (FBMC) systems, a new subband allocation method is introduced, in which the neighboring subband is gapped by half subcarrier instead of one subcarrier. Meanwhile, in order to implement the proposed resource allocation scheme, an optimized FBMC prototype filter is designed to decrease the inter-subband interference to the neighboring subband. The detailed simulations about the comparison between the proposed spectrum assignment and traditional FBMC are given, as well as the performance in the different interference scenarios. The simulation results show that the combination of the proposed spectrum assignment scheme and the optimized filter has better performance compared to the traditional scheme. The proposed scheme can be used in the system which serves massive users to get higher spectrum efficiency.

Image restoration based on wavelet filter bank (웨이블렛 필터 뱅크를 이용한 영상복원)

  • 김주헌;이종수
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1997.10a
    • /
    • pp.1387-1390
    • /
    • 1997
  • In this paper we propose a novel way to restore degraded image using wavelet transform & filterbank. First, we devide a degraded image into 4-suband images using UDWT(Undecimated Wavelet Transform), and then use a proper CLS (Constrained Least Square) filter in each subband. Using a proper CLS filter ineach subband, we can save high grequency components of original image. We reconstruct a restored image from the downsampled subband images using wavelet tansform. Even though there is a trade-off between ISNR and calculation loads, we reduce the calculation loads by using wavelet transform in reconstruction with a negligible degradatiion in ISNR.

  • PDF