• Title/Summary/Keyword: Psychoacoustic model

Search Result 55, Processing Time 0.032 seconds

Digital Audio Watermarking Based on Psychoacoustic Model (심리음향모델 기반의 디지털 오디오 워터마킹)

  • Song, You-Su;Kim, Jong-Hwan;Shin, Kyung-Wook
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.2
    • /
    • pp.772-775
    • /
    • 2005
  • This paper describes a study on the digital watermarking algorithm which is used to confirm the copyright protection of digital audio data. The digital audio watermarking algorithm based on psychoacoustic model is used for the inaudibility of the watermark data. The psychoacoustic model which is a key algorithm in MP3 audio compression is analyzed by MATLAB simulation, and is applied to digital audio watermark insertion.

  • PDF

An Optimization on the Psychoacoustic Model for MPEG-2 AAC Encoder (MPEG-2 AAC Encoder의 심리음향 모델 최적화)

  • Park, Jong-Tae;Moon, Kyu-Sung;Rhee, Kang-Hyeon
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.38 no.2
    • /
    • pp.33-41
    • /
    • 2001
  • Currently, the compression is one of the most important technology in multimedia society. Audio files arc rapidly propagated throughout internet Among them, the most famous one is MP-3(MPEC-1 Laver3) which can obtain CD tone from 128Kbps, but tone quality is abruptly down below 64Kbps. MPEC-II AAC(Advanccd Audio Coding) is not compatible with MPEG 1, but it has high compression of 1.4 times than MP 3, has max. 7.1 and 96KHz sampling rate. In this paper, we propose an algorithm that decreased the capacity of AAC encoding computation but increased the processing speed by optimizing psychoacoustic model which has enormous amount of computation in MPEG 2 AAC encoder. The optimized psychoacoustic model algorithm was implemented by C++ language. The experiment shows that the psychoacoustic model carries out FFT(Fast Fourier Transform) computation of 3048 point with 44.1 KHz sampling rate for SMR(Signal to Masking Ratio), and each entropy value is inputted to the subband filters for the control of encoder block. The proposed psychoacoustic model is operated with high speed because of optimization of unpredictable value. Also, when we transform unpredictable value into a tonality index, the speed of operation process is increased by a tonality index optimized in high frequency range.

  • PDF

A Perceptually Motivated Active Noise Control Design and Its Psychoacoustic Analysis

  • Bao, Hua;Panahi, Issa M.S.
    • ETRI Journal
    • /
    • v.35 no.5
    • /
    • pp.859-868
    • /
    • 2013
  • The active noise control (ANC) technique attenuates acoustic noise in a flexible and effective way. Traditional ANC design aims to minimize the residual noise energy, which is indiscriminative in the frequency domain. However, human hearing perception exhibits selective sensitivity for different frequency ranges. In this paper, we aim to improve the noise attenuation performance in perceptual perspective by incorporating noise weighting into ANC design. We also introduce psychoacoustic analysis to evaluate the sound quality of the residual noise by using a predictive pleasantness model, which combines four psychoacoustic parameters: loudness, sharpness, roughness, and tonality. Simulations on synthetic random noise and realistic noise show that our method improves the sound quality and that ITU-R 468 noise weighting even performs better than A-weighting.

Quality Improvement of Low Bitrate HE-AAC using Linear Prediction Pre-processor (저 전송률 환경에서 선형예측 전처리기를 사용한 HE-AAC의 성능 향상)

  • Lee, Jae-Seong;Lee, Gun-Woo;Park, Young-Chul;Youn, Dae-Hee
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.34 no.8C
    • /
    • pp.822-829
    • /
    • 2009
  • This paper proposes a new method of improving the quality of High Efficiency Advanced Audio Coding (HE-AAC). HE-AAC encodes input source by allocating bits for each scalefactor bands appropriately according to human ear's psychoacoustic property. As a result, insufficient bits are assigned to the bands which have relatively low energy. This imbalance between different energy bands can cause decreasing of sound quality like musical noise. In the proposed system, a Linear Prediction (LP) module is combined with HE-AAC as a pre-processor to improve sound quality by even bits distribution. To apply accurate human being's psychoacoustic property, the psychoacoustic model uses Fast Fourier Transform (FFT) spectrum of original input signal to make masking threshold. In its implementation, masking threshold of psychoacoustic model is normalized using the LP spectral envelope in prior to quantization of the LP residual. Experimental result shows that, the proposed algorithm allocates bits appropriately for insufficient bits condition and improves the performance of HE-AAC.

Variable Bitrate MPEG Audio (가변 전송율 MPEG 오디오)

  • Nam, Seung-Hyon
    • The Journal of Engineering Research
    • /
    • v.2 no.1
    • /
    • pp.57-62
    • /
    • 1997
  • Two psychoacoustic models used in MPEG-1 employ different masking patterns, different masking indexes, and different computational procedures. As a result, Model 1 is inferior to Model 2 due to its worst case approach in computing the SMR even though it determines tonality and masking levels accurately. In this study, we investigate the performances of psychoacoustic models when we modify the MPEG-1 audio coder for variable bitrates. Simulation results show that Model 2 has a gain of 30 kbps in the dual channel mode and 20 kbps in the joint stereo mode. It is generally known that the joint stereo mode has a gain in bitrate compare to the dual channel mode. For signals with frequent attacks, this gain becomes larger in Model 1 than in Model 2. This is due to the fact that Model 1 uses the worst case approach in computing the SMR to reduce pre-echo

  • PDF

Design of Audio Watermarks by Noise Shaping (잡음 형상화에 의한 오디오 워터마크 설계)

  • Lee, Jin-Geol
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.11
    • /
    • pp.1432-1438
    • /
    • 2005
  • A psychoacoustic model based noise shaping method is proposed. The method shapes the noise in the frequency domain such that its presence with a host signal will not be perceptually noticeable. The derivation of imperceptible noise levels from the masking thresholds of the signal involves deconvolution associated with the spreading function in the psychoacoustic model. It has been known as an ill-conditioned Problem. In this paper, a constrained optimization is applied such that the noise excitation level conforms to the masking thresholds of the signal. Thus, the noises embedded in the signal will not be perceived by human ear, and its performance is demonstrated experimentally.

  • PDF

Digital Watermarking Using Psychoacoustic Model

  • Poomdaeng, S.;Toomnark, S.;Amornraksa, T.
    • Proceedings of the IEEK Conference
    • /
    • 2002.07b
    • /
    • pp.872-875
    • /
    • 2002
  • A digital watermarking technique applying psychoacoustic model for audio signal is proposed in this paper. In the watermarking scheme, the pseudo-random bit stream used as a watermark signal is embedded into the audio signal in both speech and music. The strength of the embedded signal is subject to the human auditory system in such a way that the disturbances on host audio signal are beyond the sensing of human ears. The experimental results show that the quality of the watermarked audio signal, in term of signal to noise ratio, can be improved up to 3.2 dB.

  • PDF

Noise suppressor Using Psychoacoustic Model and Wavelet Packet Transform (심리음향 모델과 웨이블릿 패킷 변환을 이용한 잡음제거기)

  • Kim, Mi-Seon;Kim, Young-Ju;Lee, In-Sung
    • Proceedings of the IEEK Conference
    • /
    • 2006.06a
    • /
    • pp.345-346
    • /
    • 2006
  • In this paper, we propose the noise suppressor with the psychoacoustic model and wavelet packet transform. The objective of the scheme is to enhance speech corrupted by colored or non-stationary noise. If corrupted noise is colored, subband approach would be more efficient than whole band one. To avoid serious residual noise and speech distortion, we must adjust the Wavelet Coefficient threshold. In this paper, the subband is designed matching with the critical band. And WCT is adapted by noise masking threshold(NMT) and segmental signal to noise ratio(seg_SNR). Consequently this work improve the PESQ-MOS about 0.23 in the case of coded speech.

  • PDF

Wireless Speech Recognition System using Psychoacoustic Model (심리음향 모델을 이용한 무선 음성인식 시스템)

  • Noh, Jin-Soo;Rhee, Kang-Hyeon
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.43 no.6 s.312
    • /
    • pp.110-116
    • /
    • 2006
  • In this paper, we implement a speech recognition system to support ubiquitous sensor network application services such as switch control, authentication, etc. using wireless audio sensors. The proposed system is consist of the wireless audio sensor, the speech recognition algorithm using psychoacoustic model and LDPC(low density parity check) for correcting errors. The proposed speech recognition system is inserted in a HOST PC to use the sensor energy effectively mil to improve the accuracy of speech recognition, a FEC(Forward Error Correction) system is used. Also, we optimized the simulation coefficient and test environment to effectively remove the wireless channel noises and correcting wireless channel errors. As a result, when the distance between sensor and the source of voice is less then 1.0m FAR and FRR are 0.126% and 7.5% respectively.

Effect of Fabric Sound of Vapor Permeable Water Repellent Fabrics for Sportswear on Psychoacoustic Properties (스포츠웨어용 투습발수직물 소리가 심리음향학적 특성에 미치는 영향)

  • Lee, Jee-Hyun;Lee, Kyu-Lin;Jin, Eun-Jung;Yang, Yoon-Jung;Cho, Gil-Soo
    • Science of Emotion and Sensibility
    • /
    • v.15 no.2
    • /
    • pp.201-208
    • /
    • 2012
  • The objectives of this study were to investigate the psychoacoustic properties of PTFE(Poly tetra Fluoroethylene) laminated vapor permeable water repellent fabrics which are frequently used for sportswear, to examine the relationship among fabrics' basic characteristics, mechanical properties and the psychoacoustic properties, and finally to propose the predicting model to minimize the psychoacoustic fabric sound. A total of 8 specimens' frictional sound were recorded and Zwicker's psychoacoustic parameters such as loudness(Z), sharpness(Z), roughness(Z), and fluctuation strength(Z) were calculated using the Sound Quality Program. Mechanical properties of specimens were measured by KES-FB system. Loudness(Z) of specimen D-1 was the highest, which means the rustling sound of the specimen D-1 was the most noisy. Statistically significant difference among film type was observed only in loudness(Z) for fabric sound. Based on ANOVA and post-hoc test, specimens were classified into less loud PTFE film group (groupI) and loud PTFE film group (groupII). Loudness(Z) was higher when staple yarn was used compared when filament yarn was used. According to the correlation between the mechanical properties of fabrics and loudness(Z) in groupI, the shear properties, compression properties and weight showed positive correlation with loudness(Z). According to the regression equation predicting loudness(Z) of groupI, the layer variable was chosen. In groupII, variables explaining the loudness(Z) were yarn types and shear hysteresis(2HG5).

  • PDF