Search | Korea Science

Noise Shaping Based on Psychoacoustic Model (심리음향모델에 근거한 잡음 형상화)

Lee Jingeol
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.335-336
- /
- 2000
A psychoacoustic model based noise shaping method is proposed, where noise's presence with a host signal will not be perceptually noticeable. The derivation of imperceptible noise levels from the masking thresholds of the signal involves a deconvolution associated with the spreading function in the psychoacoustic model, which results in an ill-conditioned problem. In this paper, the problem is formulated as a constrained optimization, and it is demonstrated that the solution provides noise shaping where the noise excitation level conforms to the masking thresholds of the signal.
PDF

Speech Enhancement Based on Psychoacoustic Model (심리음향모델에 근거한 음성개선)

Lee Jingeol
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.337-338
- /
- 2000
The perceptual filter for speech enhancement was analytically derived where the frequency content of the input noisy signal was made the same as that of the estimated clean signal in auditory domain. However, the analytical derivation should rely on the deconvolution associated with the spreading function in the psychoacoustic model, which results in an ill-conditioned problem. In order to cope with the problem associated with the deconvolution, we propose a novel psychoacoustic model based speech enhancement filter whose principle is the same as the perceptual filter, however the filter is derived by a constrained optimization which provides solutions to the ill-conditioned problem.
PDF

Speech Enhancement Based on Psychoacoustic Model

Lee, Jingeol;Kim, Soowon
- The Journal of the Acoustical Society of Korea
- /
- v.19 no.3E
- /
- pp.12-18
- /
- 2000
Psychoacoustic model based methods have recently been introduced in order to enhance speech signals corrupted by ambient noise. In particular, the perceptual filter is analytically derived where the frequency content of the input noisy signal is made the same as that of the estimated clean signal in auditory domain. However, the analytical derivation should rely on the deconvolution associated with the spreading function in the psychoacoustic model, which results in an ill-conditioned problem. In order to cope with the problem associated with the deconvolution, we propose a novel psychoacoustic model based speech enhancement filter whose principle is the same as the perceptual filter, however the filter is derived by a constrained optimization which provides solutions to the ill-conditioned problem. It is demonstrated with artificially generated signals that the proposed filter operates according to the principle. It is shown that superior performance results from the proposed filter over the perceptual filter provided that a clean speech signal is separable from noise.
PDF

Noise Shaping Based on Psychoacoustic Model

Lee, Jingeol;Nam, Seung Hyon
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.2E
- /
- pp.9-16
- /
- 2001
A psychoacoustic model based noise shaping method which shapes the noise in the frequency domain is proposed, where its presence with a host signal will not be perceptually noticeable. The derivation of imperceptible noise levels from the masking thresholds of the signal involves a deconvolution associated with the spreading function in the psychoacoustic model, which results in an ill-conditioned problem. In this paper, the problem is formulated as a constrained optimization, and it is demonstrated that the solution provides noise shaping where the noise excitation level conforms to the masking thresholds of the signal, and thus the noises embedded in the signal will not be perceived by human ear.
PDF

Fixed-point Processing Optimization of MPEG Psychoacoustic Model-II Algorithm for ASIC Implementation (MPEG 심리음향 모델-ll 알고리듬의 ASIC 구현을 위한 고정 소수점 연산 최적화)

Lee Keun-Sup;Park Young-Cheol;Youn Dae Hee
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.29 no.11C
- /
- pp.1491-1497
- /
- 2004
The psychoacoustic model in MPEG audio layer-III (MP3) encoder is optimized for the fixed-point processing. The optimization process consists of determining the data word length of arithmetic unit and the algorithm for transcendental functions that are often used in the psychoacoustic model. In order to determine the data word length, we defined a statistical model expressing the relation between the fixed-point operation errors of the psychoacoustic model and the probability of alteration of the allocated bits doe to these errors. Based on the simulations using this model, we chose a 24-bit data path and constructed a 24-bit fixed-point MP3 encoder. Sound quality tests using the constructed fixed-point encoder showed a mean degradation of -0.2 on ITU-R 5-point audio impairment scale.
PDF KSCI

Speech Enhancement with Decomposition into Deterministic and Stochastic components and Psychoacoustic Model (결정적/확률적 요소로의 음성 분해와 심리음향 모델 기반 잡음 제거 기법)

Jo, Seok-Hwan;Yoo, Chang-D.
- Proceedings of the IEEK Conference
- /
- 2007.07a
- /
- pp.301-302
- /
- 2007
A speech enhancement algorithm based on both a decomposition of speech into deterministic and stochastic components and a psychoacoustic model is proposed. Noisy speech is decomposed into deterministic and stochastic components, and then each component is enhanced preserving its individual characteristics. A psychoacoustic model is taken into account when enhancing the stochastic component. Simulation results show that the proposed algorithm performs better than some of the more popular algorithms.
PDF

The Audio Watermarking method Using the MPEG-2 AAC Psychoacoustic Model (MPEG-2 AAC 심리음향 모델을 이용한 오디오 워터마킹 기법)

성종수;강상구;신재호
- Proceedings of the IEEK Conference
- /
- 1999.06a
- /
- pp.716-719
- /
- 1999
In this Paper, we Present a method for embedding digital watermarks into digital audio signals. The watermarking must be imperceptible and should be robust to attacks, such as filtering and compression etc. In our method, we adaptively embedded the watermarks changing the scale factor using the spread spectrum and MPEG-2 AAC psychoacoustic model.
PDF

Audio Forensic Marking using Psychoacoustic Model II and MDCT (심리음향 모델 II와 MDCT를 이용한 오디오 포렌식 마킹)

Rhee, Kang-Hyeon
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.49 no.4
- /
- pp.16-22
- /
- 2012
In this paper, the forensic marking algorithm is proposed using psychoacoustic model II and MDCT for high-quality audio. The proposed forensic marking method, that inserts the user fingerprinting code of the audio content into the selected sub-band, in which audio signal energy is lower than the spectrum masking level. In the range of the one frame which has 2,048 samples for FFT of original audio signal, the audio forensic marking is processed in 3 sub-bands. According to the average attack of the fingerprinting codes, one frame's SNR is measured on 100% trace ratio of the collusion codes. When the lower strength 0.1 of the inserted fingerprinting code, SNR is 38.44dB. And in case, the added strength 0.5 of white gaussian noise, SNR is 19.09dB. As a result, it confirms that the proposed audio forensic marking algorithm is maintained the marking robustness of the fingerprinting code and the audio high-quality.
PDF KSCI

High Quality Audio Watermarking using Spread Spectrum and Psychoacoustic Model (대역확산과 심리음향 모델을 이용한 고음질 오디오 워터마킹)

Noh Jin-Soo;Rhee Kang-Hyeon
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.43 no.5 s.311
- /
- pp.48-56
- /
- 2006
In this paper, we proposed the high quality audio watermarking algorithm using MDCT/IMDCT (Modified DCT/Inverse Modified DCT) with psychoacoustic model. Generally, a digital audio watermark is embedding the frequency domain after frequency transform of the digital audio data but the digital audio quality is affected by watermarking. In our scheme, the digital audio data is spread with PN((Pseudo Noise) code and then audio watermark is embedded in MDCT processing that refers psychoacoustic model. In MDCT processing, according to the shape of filter bank output, the block switching selects a window sequence that has 256, 1,024 or 2,048 points interval for high quality audio. The author confirm that when watermark weight ${\alpha}$ is 2.5 below, the detection ratio of watermark is a satisfied to SDMI's(Secure Digital Music Initiative) recommendation 50% above and SM is $50{\sim}68dB$ range with mainly 4 kind of attacks(Compression, Cropping, FFT and Echo).
PDF KSCI

An Efficient PN Sequence Embedding and Detection Method for High Quality Digital Audio Watermarking (고음질 디지털 오디오 워터마킹을 위한 효율적인 PN 시퀸스 삽입 및 검출 방법)

김현욱;오현오;김연정;윤대희
- Journal of Broadcast Engineering
- /
- v.6 no.1
- /
- pp.21-31
- /
- 2001
In the PN-sequence based audio watermarking system, the PN sequence is shaped by a filter derived from the psychoacoustic model to increase robustness and inaudibility The psychoacoustic model calculated in each audio segment, however, requires heavy computational loads. In this paper, we propose an efficient watermarking system adopting a fixed-shape perceptual filter that substitutes psychoacoustic model derived filter. The proposed filter can shape the PN-sequence to be inaudible and enable to embed the robust watermark in a simple manner. Moreover, we propose an anchitecture for the PN-sequence compensation fitter In the watermark detecter to increase correlation between the watermark and the PN-sequence. With the proposed architecture, the blind watermark detection performance has been enhanced.
PDF

Search Result 55, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)