• Title/Summary/Keyword: Audio Signal Processing

Search Result 155, Processing Time 0.04 seconds

Minimum Statistics-Based Noise Power Estimation for Parametric Image Restoration

  • Yoo, Yoonjong;Shin, Jeongho;Paik, Joonki
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.3 no.2
    • /
    • pp.41-51
    • /
    • 2014
  • This paper describes a method to estimate the noise power using the minimum statistics approach, which was originally proposed for audio processing. The proposed minimum statistics-based method separates a noisy image into multiple frequency bands using the three-level discrete wavelet transform. By assuming that the output of the high-pass filter contains both signal detail and noise, the proposed algorithm extracts the region of pure noise from the high frequency band using an appropriate threshold. The region of pure noise, which is free from the signal detail part and the DC component, is well suited for minimum statistics condition, where the noise power can be extracted easily. The proposed algorithm reduces the computational load significantly through the use of a simple processing architecture without iteration with an estimation accuracy greater than 90% for strong noise at 0 to 40dB SNR of the input image. Furthermore, the well restored image can be obtained using the estimated noise power information in parametric image restoration algorithms, such as the classical parametric Wiener or ForWaRD image restoration filters. The experimental results show that the proposed algorithm can estimate the noise power accurately, and is particularly suitable for fast, low-cost image restoration or enhancement applications.

Deep Learning based Raw Audio Signal Bandwidth Extension System (딥러닝 기반 음향 신호 대역 확장 시스템)

  • Kim, Yun-Su;Seok, Jong-Won
    • Journal of IKEEE
    • /
    • v.24 no.4
    • /
    • pp.1122-1128
    • /
    • 2020
  • Bandwidth Extension refers to restoring and expanding a narrow band signal(NB) that is damaged or damaged in the encoding and decoding process due to the lack of channel capacity or the characteristics of the codec installed in the mobile communication device. It means converting to a wideband signal(WB). Bandwidth extension research mainly focuses on voice signals and converts high bands into frequency domains, such as SBR (Spectral Band Replication) and IGF (Intelligent Gap Filling), and restores disappeared or damaged high bands based on complex feature extraction processes. In this paper, we propose a model that outputs an bandwidth extended signal based on an autoencoder among deep learning models, using the residual connection of one-dimensional convolutional neural networks (CNN), the bandwidth is extended by inputting a time domain signal of a certain length without complicated pre-processing. In addition, it was confirmed that the damaged high band can be restored even by training on a dataset containing various types of sound sources including music that is not limited to the speech.

Performance Comparison of Guitar Chords Classification Systems Based on Artificial Neural Network (인공신경망 기반의 기타 코드 분류 시스템 성능 비교)

  • Park, Sun Bae;Yoo, Do-Sik
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.3
    • /
    • pp.391-399
    • /
    • 2018
  • In this paper, we construct and compare various guitar chord classification systems using perceptron neural network and convolutional neural network without pre-processing other than Fourier transform to identify the optimal chord classification system. Conventional guitar chord classification schemes use, for better feature extraction, computationally demanding pre-processing techniques such as stochastic analysis employing a hidden markov model or an acoustic data filtering and hence are burdensome for real-time chord classifications. For this reason, we construct various perceptron neural networks and convolutional neural networks that use only Fourier tranform for data pre-processing and compare them with dataset obtained by playing an electric guitar. According to our comparison, convolutional neural networks provide optimal performance considering both chord classification acurracy and fast processing time. In particular, convolutional neural networks exhibit robust performance even when only small fraction of low frequency components of the data are used.

A Study on the Watermarking Methods with Chi-Square Distribution (카이 자승 분포를 이용한 워터마킹기법의 연구)

  • 강환일;김갑일;한승수
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2001.12a
    • /
    • pp.5-9
    • /
    • 2001
  • In this paper, we propose the new audio watermarking method and can be used on line processing. Instead of the wavelet transform, we use the integer wavelet transform for the reduction of the computational load. The watermark associated with the chi-square distribution is inserted into the signal on the integer wavelet domain. When extracting the watermark, the spread spectrum methods are used with the coefficients associated with the covariance sequence. We show that the chi-square distribution is a good tool for the spread spectrum method on the wavelet domain. This watermarking technique may be used for the control of the electrical product which can be controlled with the hidden signals and can be moved according to the audible signals simultaneously.

  • PDF

An Implementation of the Multichannel Digital Audio Signal Processing System (다채널 디지털 오디오 신호처리기 구현)

  • 이규하
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.06e
    • /
    • pp.39-42
    • /
    • 1998
  • 본 논문에서는 방송용 오디오 기기가 갖는 다채널의 특성과 각 채널에 대한 다양한 신호처리 기능의 특성을 고려하여 다채널 디지털 오디오 신호 처리기의 구조를 제안하고 범용 DSP를 이용하여 실시간 병렬 처리 시스템을 구현하였다. 구현된 시스템은 32비트 부동수소점 DSP를 이용하였으며 스테레오 채널의 48KHz 표본화 주파수를 지원하고 20비트 해상도를 갖는 시스템이다. 다채널 디지털 오디오 신호 처리 시스템의 구조는 디지털 신호 처리 과정을 수행하는 디지털 오디오 데이터 처리 부분과 시스템을 제어하기 위한 제어 정보 처리 부분으로 제안하였다. 이러한 구조에 적합한 실시간 시스템을 구현하기 위해 전체 시스템은 4부분의 모듈로 구성된다.

  • PDF

A Study on the Multigate Pulsed Doppler System. (다중 채널 펄스 도플러 장치에 관한 연구)

  • 김병철;박세현
    • Journal of Biomedical Engineering Research
    • /
    • v.7 no.1
    • /
    • pp.75-79
    • /
    • 1986
  • In this paper, a multigate pulsed doppler system with parallel signal processing which enables the transcutaneous assesment of the blood-flow velocities of five adjacent sample volumes is developed. The outputs of the system are audio signals, velocity curves and velocity profiles of 5 adjacent sample volumes. We have applied to the major superficial arteries and obtained the detailed information about the time dependent blood-flow behavior.

  • PDF

A Study on Image Retrieval Using Sound Classifier (사운드 분류기를 이용한 영상검색에 관한 연구)

  • Kim, Seung-Han;Lee, Myeong-Sun;Roh, Seung-Yong
    • Proceedings of the KIEE Conference
    • /
    • 2006.10c
    • /
    • pp.419-421
    • /
    • 2006
  • The importance of automatic discrimination image data has evolved as a research topic over recent years. We have used forward neural network as a classifier using sound data features within image data, our initial tests have shown encouraging results that indicate the viability of our approach.

  • PDF

Scheduling Computational Loads in Single Level Tree Network

  • Cui, Run;Sundaram, Suresh;Kim, Hyoung-Joong
    • 한국정보통신설비학회:학술대회논문집
    • /
    • 2009.08a
    • /
    • pp.131-135
    • /
    • 2009
  • This paper is the introduction of our work on distributed load scheduling in single-level tree network. In this paper, we derive a new calculation model in single-level tree network and show a closed-form formulation of the time for computation system. There are so many examples of the application of this technology such as distributed database, biology computation on genus, grid computing, numerical computing, video and audio signal processing, etc.

  • PDF

Design methodology of analog circuits for a digital-audio-signal processing 1-bit ???? DAC (디지털 오디오 신호처리용 1-bit Δ$\Sigma$ DAC 아날로그 단의 설계기법)

  • 이지행;김상호;손영철;김선호;김대정;김동명
    • Proceedings of the IEEK Conference
    • /
    • 2002.06b
    • /
    • pp.149-152
    • /
    • 2002
  • The performance of a 1-bit DAC depends on that of the analog circuits. The mixed SC-CT (switched capacitor-continuous time) architecture is an effective design methodology for the analog circuits. This paper Proposes a new buffer scheme for the 1-bit digital-to-analog subconverter and a new SF-DSC(smoothing filter and differential-to-sig le converter) which performs both the smoothing filter and the differential-to-single convertor simultaneously.

  • PDF

Design methodology of digital circuits for an audio-signal-processing DAC (오디오 신호처리용 DAC디지털 단의 설계기법)

  • 김선호;손영철;김상호;이지행;김대정;김동명
    • Proceedings of the IEEK Conference
    • /
    • 2002.06b
    • /
    • pp.157-160
    • /
    • 2002
  • This paper proposed a guideline for selecting the arithmetic circuit architecture. The guideline incorpo-rates the new concept of PDSP (power-delay-size product) and the weighting method. HSPICE simulations havc been performed to several full adders in order to prove the validity of the proposed guideline. We applied this guideline to select an optimized FA (full adder) architecture and successfully implemented the DAC's digital blocks.

  • PDF