• Title/Summary/Keyword: 심리음향모델

Search Result 71, Processing Time 0.026 seconds

Multi-Channel Audio Coding Technologies (멀티채널 오디오 (MPEG-2) 부호화 기술)

  • Hong, J.W.
    • Electronics and Telecommunications Trends
    • /
    • v.10 no.3 s.37
    • /
    • pp.15-27
    • /
    • 1995
  • 멀티미디어에서 비디오의 품질이 향상되고, 디지털 텔레비젼 (ADTV)이나 고선명 텔레비젼(HDTV) 등의 개발에 의해 화면 크기가 증가하면서 이에 어울리는 실감있는 오디오의 전송 및 재생이 요구된다. 따라서 멀티채널 오디오의 도입과 더불어 효율적이고, 경제적인 방법으로 낮은 비트율로 고품질의 멀티채널 오디오를 제공하기 위한 부호화 기술이 필요하게 된다. 최근에 인간의 청각 특성을 고려한 심리음향 모델을 이용한 멀티채널 오디오의 압축 부호화 기술이 MPEG-2 오디오의 국제 표준으로 제정되었다. MPEG-2 오디오 표준은 MPEG-1 오디오 표준을 기초로 하여 현장감을 필요로 하는 오디오를 위해 기본 스테레오 채널외에 중앙채널, 서라운드 채널, 그리고 저주파 효과채널을 부가한 방식으로 다채널, 음성다중 등의 부가서비스를 제공하기에 적합하다. 본고에서는 MPEG-2 오디오 표준의 계층 II를 중심으로 한 표준의 특징, 알고리즘, 데이터 구조, 그리고 응용분야 등에 대해 기술한다.

Study for Audio Watermarking Using Echo Signal (반향 신호를 이용한 오디오 워터마킹에 관한 연구)

  • 오현오;김현욱;윤대희;차일환
    • Proceedings of the IEEK Conference
    • /
    • 2000.09a
    • /
    • pp.767-770
    • /
    • 2000
  • 본 논문에서는 고음질 오디오 신호에 임의로 삽입된 반향(Echo)신호가 음질에 미치는 영향을 조사하고, 이를 이용한 오디오 워터마킹 기법에 대해 다룬다. 일반적으로 오디오 신호에 반향을 첨가하게 되면 음색이 더욱 풍부해지는 효과를 얻을 수 있지만. 이 때 삽입된 반향신호의 시간 지연과 크기가 충분히 작을 경우에는 심리 음향모델의 시간영역 마스킹 효과에 의해 지각되지 않을 수도 있다 한편 오디오 신호의 구간별로 임의 삽입된 반향의 시간지연을 검출할 수 있다면, 이를 이용한 정보 감춤(data hiding)및 워터마킹 기법에 활용할 수 있다. 반향신호를 이용하여 원 신호에 정보를 삽입하게 되면 가우시안 잡음이나 PN 시퀸스를 이용하는 경우처럼 오디오 신호에 이질적인 잡음을 첨가하지 않기 때문에 청감 특성상 유리하며, 오디오 신호 고유의 통계적 특성을 유지 할수 있는 장점이 있다. 그러나 반향의 첨가가 음질의 왜곡은 초래하지 않으면서 정보의 검출이 가능하도록 하기위해서는 원 신호의 특성에 따른 반향 첨가 기술이 요구된다.

  • PDF

Model Development and Analysis of the Car Interior Sound Quality (차량 실내 소음의 음질 분석 및 모델화)

  • Hur, Deog-Jae;Cho, Yeon;Kim, Hee-Seok;Lee, Keun-Soo;Park, Tae-Won
    • Journal of KSNVE
    • /
    • v.10 no.2
    • /
    • pp.254-260
    • /
    • 2000
  • the reduction of the interior nosie level has been the main interest of NVH engineers in the development of vehicles. However, the consumer's perception on the car noise is affected largely by the psychoacoustic characteristics of the noise, as well as the sound pressure level. In this study, the quality of the vehicle interior nosie is analyzed by employing the subjective evaluations and by representing them in temrs of the objective quantities. The subjective evaluatins were performed for the seven vehicles in the range of subcompact to luxury cars. The methods of paired comparisons and semantic differential were used to study the preference, the quality of interior noise and their correlation. The linear regression models were obtained for the subjective evaluation and the sound quality metrics.

  • PDF

Design of Hardware Accelerator for Portable Real-time MP3 Audio Encoder (휴대용 실시간 MP 오디오 부호화기를 위한 하드웨어 가속기 설계)

  • 여창훈;방경호;이근섭;박영철;윤대희
    • Proceedings of the IEEK Conference
    • /
    • 2003.07e
    • /
    • pp.2132-2135
    • /
    • 2003
  • 본 논문에서는 고정소수점 DSP로 구현한 실시간 MP3 오디오 부호화기에 사용되는 초월함수용 하드웨어 가속기 구조를 제안한다. 구현된 하드웨어 가속기는 MP3 부호화 성능을 저하시키는 초월함수 연산오차에 강인하도록 설계되었다. 제안된 가속기의 연산오차는 Q1.23 고정소수점 출력에서 2비트, 즉 2/sup -21/ 까지의 연산오차를 가진다. LAME 부호화기[5]심리음향 모델의 SMR 오차는 테이블 보간법[4]을 사용할 경우에 비해 4dB이상 향상되었으며, 연산량은 총 4 MIPS 감소하였다. 제안한 하드웨어 가속기는 Verilog HDL로 기술되었으며, SYNOPSYS에서 0.18㎛ CMOS 표준 셀 라이브러리 공정으로 합성되었다. 합성 면적은 7514 게이트이며 초월함수 연산에 대한 동작속도는 3 사이클이다.

  • PDF

Noise suppressor Using Psychoacoustic Model and Wavelet Packet Transform (심리음향 모델과 웨이블릿 패킷 변환을 이용한 잡음제거기)

  • Kim, Mi-Seon;Kim, Young-Ju;Lee, In-Sung
    • Proceedings of the IEEK Conference
    • /
    • 2006.06a
    • /
    • pp.345-346
    • /
    • 2006
  • In this paper, we propose the noise suppressor with the psychoacoustic model and wavelet packet transform. The objective of the scheme is to enhance speech corrupted by colored or non-stationary noise. If corrupted noise is colored, subband approach would be more efficient than whole band one. To avoid serious residual noise and speech distortion, we must adjust the Wavelet Coefficient threshold. In this paper, the subband is designed matching with the critical band. And WCT is adapted by noise masking threshold(NMT) and segmental signal to noise ratio(seg_SNR). Consequently this work improve the PESQ-MOS about 0.23 in the case of coded speech.

  • PDF

Speech Enhancement with Decomposition into Deterministic and Stochastic components and Psychoacoustic Model (결정적/확률적 요소로의 음성 분해와 심리음향 모델 기반 잡음 제거 기법)

  • Jo, Seok-Hwan;Yoo, Chang-D.
    • Proceedings of the IEEK Conference
    • /
    • 2007.07a
    • /
    • pp.301-302
    • /
    • 2007
  • A speech enhancement algorithm based on both a decomposition of speech into deterministic and stochastic components and a psychoacoustic model is proposed. Noisy speech is decomposed into deterministic and stochastic components, and then each component is enhanced preserving its individual characteristics. A psychoacoustic model is taken into account when enhancing the stochastic component. Simulation results show that the proposed algorithm performs better than some of the more popular algorithms.

  • PDF

Analysis and Evaluation Simulation System for Whistle Sound Related Marine Casualty (기적음관련 해양사고 분석.평가 시뮬레이션 시스템 개발)

  • 임정빈;김창경
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2004.04a
    • /
    • pp.61-67
    • /
    • 2004
  • This paper describes Three-Dimensional Listening Simulation System (3D-LSS) which is to analyze whistle sound related marine casualties, and is to evaluate the accident situations using 3D sound by Head Related Transfer Function. At first, the three-dimensional listening model from the analysis of accident situations is proposed, and then the reproduction and evaluation methods of 3D sounds are also discussed. The system is designed to explain the accident situations and to simulate the possible situations with GUI based graphics and 3D sound reproduction. The evaluation experiments using 3D-LSS are carried out with six cases that did not known whether it is true or not the blast and listening of the whistle sound between two vessels. As results of psychological assessments by five subjects, the six cases can be analyzed clearly by visual images and audio sounds, thus the usability of 3D-LSS as one of the judgment assistant system of marine casualty is verified.

  • PDF

Development of Analysis and Evaluation Simulation System for Whistle Sound Related Marine Casualty (기적음관련 해양사고 분석·평가 시뮬레이션 시스템 개발)

  • Yim, Jeong-Bin;Kim, Chang-Kyoung
    • Journal of Navigation and Port Research
    • /
    • v.28 no.8
    • /
    • pp.659-666
    • /
    • 2004
  • This paper describes Three-Dimensional Listening Simulation System (3D-LSS) which is to analyze whistle sound related marine casualties, and is to evaluate the accident situations using 3D sound by Head Related Transfer Function At first, the hree-dimensional listening model from the analysis of accident situations is proposed, and then the reproduction and evaluation methods of 3D sounds are also discussed. The system is designed to explain the accident situations and to simulate the possible situations with GUI based graphics and 3D sound reproduction. The evaluation experiments using 3D-LSS are carried out with six cases that did not known whether it is true or not the blast and listening of the whistle sound between two vessels. As results of psychological assessments by five subjects, the six cases can be analyzed clearly by visual images and audio sounds, thus the usability of 3D-LSS as one of the judgment assistant system of marine casualty is verified.

A method of the cross-talk cancellation for an sound reproduction of 5.1 channel speaker system (5.1 채널 스피커 시스템 음향재생을 위한 크로스토크 제거방법)

  • Lee, Soo-Jeong;Cho, Gab-Ken;Kim, Soon-Hyob
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.4 s.304
    • /
    • pp.159-166
    • /
    • 2005
  • This thesis deals with a method to deliver more realistic sound by cancelling the cross-talk which is inherent to the 5.1 channel speaker system. First, the cross-talk cancellation method that eliminates cross-talks on the path from left speaker to right ear and from right speaker to left ear is explained. Then the application and replaying method using the cross-talk cancellation explained here is introduced. The acoustical model for cross-talk cancellation is the free field model This model minimizes distortion of sound. Many experts also make studies on this model. I used the bark scale sound quality compensation based on psycho-acoustic. For the surround channels, band-limited sound quality compensation is performed in the frequency domain.

A Study on the Implementation of Realistic Sound Through Cross-Talk Cancellation (크로스토크 제거를 통한 입체 음향 구현에 관한 연구)

  • 김학진
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.2
    • /
    • pp.99-108
    • /
    • 2004
  • This thesis deals a method to deliver more realistic sound by cancelling the cross-talk which is inherent to the 5.1 channel speaker system. The acoustical model for cross-talk cancellation is the free field model. This model minimizes distortion of sound. I used the bark scale sound quality compensation which based on psycho-acoustic. For the surround channels, band-limited sound quality compensation is performed in the frequency domain. I also performed the sound quality assessment test on the traditional 2 channel stereo and 5.1 channel system. This test is performed in the test chamber which satisfies the ITU-R specifications. I uses the IACC(Inter-Aural Cross-Correlation) to determine the preferences of the amateur and the golden ear experts to asses the trans-aural filter. According to the result from the proposed method, I got more the 38㏈ separation rates with the Dolby standard speaker array. The results on the diffusion by the subjective test with the experts shows 0.4 point increased then before.