Search | Korea Science

Preprocessing method for enhancing digital audio quality in speech communication system (음성통신망에서 디지털 오디오 신호 음질개선을 위한 전처리방법)

Song Geun-Bae;Ahn Chul-Yong;Kim Jae-Bum;Park Ho-Chong;Kim Austin
- Journal of Broadcast Engineering
- /
- v.11 no.2 s.31
- /
- pp.200-206
- /
- 2006
This paper presents a preprocessing method to modify the input audio signals of a speech coder to obtain the finally enhanced signals at the decoder. For the purpose, we introduce the noise suppression (NS) scheme and the adaptive gain control (AGC) where an audio input and its coding error are considered as a noisy signal and a noise, respectively. The coding error is suppressed from the input and then the suppressed input is level aligned to the original input by the following AGC operation. Consequently, this preprocessing method makes the spectral energy of the music input redistributed all over the spectral domain so that the preprocessed music can be coded more effectively by the following coder. As an artifact, this procedure needs an additional encoding pass to calculate the coding error. However, it provides a generalized formulation applicable to a lot of existing speech coders. By preference listening tests, it was indicated that the proposed approach produces significant enhancements in the perceived music qualities.
PDF KSCI

Adaptive Enhancement Algorithm of Perceptual Filter Using Variable Threshold (가변 임계값을 이용한 지각 필터의 적응적인 음질 개선 알고리즘)

차형태
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.6
- /
- pp.446-453
- /
- 2004
In this paper, a new adaptive perceptual filter using variable threshold to enhance audio signals degraded by additively nonstationary noise is proposed. The adaptive perceptual filter updates variable threshold each time according to the power of signal and the effect of noise variation. So the noisy audio signal is enhanced by the method which controls a residual noise effectively. The proposed algorithm uses the perceptual filter which transforms a time domain signal into frequency domain and calculates an intensity energy and an excitation energy in bark domain. In this method. the stage updated the response of filter is decided by threshold. The proposed algorithm using vairable threshold effectively controls a residual noise using the energy difference of audio signals degraded by the additive nonstationary noise. The proposed method is tested with the noisy audio signals degraded by nonstationary noise at various signal -to-noise ratios (SNR). We carry out NMR and MOS test when the input SNR is 15dB. 20dB. 25dB and 30dB. An approximate improvement of 17.4dB. 15.3dB, 12.8dB. 9.8dB in NMR and enhancement of 2.9, 2.5, 2.3, 1.7 in MOS test is achieved with the input signals. respectively.
PDF KSCI

Performance Evaluation of Six Jitter Control Algorithms for Improving Audio Quality (오디오 품질을 개선하기 위한 6개의 Jitter Control 알고리즘의 성능 분석)

나승구;유홍준;안종석;이태진
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2000.11b
- /
- pp.29-35
- /
- 2000
음성 데이터의 패킷 지터(jitter)가 심할수록 오디오 플레이어가 오디오 데이터를 자연스럽게 재생하지 못하기 때문에 사용자는 원래의 음성을 거의 알아들을 수 없게 된다. 이 문제점을 해결하기 위하여 오디오 수신자는 전송 받은 오디오 데이터를 바로 재생하지 않고 재생시간을 지연시키는 방법을 사용한다. 본 연구자의 조사에 의하면 이러한 재생시간을 지연하는 대표적인 지터 컨트롤 알고리즘으로 6가지 방식이 제안되고 있다. 그 중 세 가지는 NeVot, Vat, Open H.323 프로그램 등에 구현되어 실제로 사용되고 있다 본 논문에서는 이들 6가지의 모델의 지터 컨트롤 알고리즘의 특성을 알아보고 어느 알고리즘이 효율적인지 알아보기 위해 현재 인터넷의 성능을 파악하고 이를 기초로 제안된 6가지 알고리즘 중 어느 것이 가장 효율적인가를 파악하여 오디오의 음질을 개선하기 위한 방법을 제시하고자 한다.
PDF

Audio Quality Enhancement at a Low-bit Rate Perceptual Audio Coding (저비트율로 압축된 오디오의 음질 개선 방법)

서정일;서진수;홍진우;강경옥
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.6
- /
- pp.566-575
- /
- 2002
Low-titrate audio coding enables a number of Internet and mobile multimedia streaming service more efficiently. For the help of next-generation mobile telephone technologies and digital audio/video compression algorithm, we can enjoy the real-time multimedia contents on our mobile devices (cellular phone, PDA notebook, etc). But the limited available bandwidth of mobile communication network prohibits transmitting high-qualify AV contents. In addition, most bandwidth is assigned to transmit video contents. In this paper, we design a novel and simple method for reproducing high frequency components. The spectrum of high frequency components, which are lost by down-sampling, are modeled by the energy rate with low frequency band in Bark scale, and these values are multiplexed with conventional coded bitstream. At the decoder side, the high frequency components are reconstructed by duplicating with low frequency band spectrum at a rate of decoded energy rates. As a result of segmental SNR and MOS test, we convinced that our proposed method enhances the subjective sound quality only 10%∼20% additional bits. In addition, this proposed method can apply all kinds of frequency domain audio compression algorithms, such as MPEG-1/2, AAC, AC-3, and etc.
PDF KSCI

A Study on the Digital Audio Watermarking for a High Quality Audio (고음질을 위한 디지털 오디오 워터마킹에 관한 연구)

Jo, Byeong-Rok;Jeong, Il-Yong;Park, Chang-Gyun;Lee, Gang-Hyeon
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.39 no.3
- /
- pp.53-61
- /
- 2002
In this paper, the authors proposed the digital audio watermarking algorithm for a high quality audio. Today, the digital watermark is used to confirm to the digital copyright protection, not only the digital image but the digital audio study is an activeness in the digital watermarking area. Especially, the watermark insertion in the digital audio area affects deeply not only a robustness but the audio quality of the watermarked audio data. Generally, the audio watermark is inserted in the frequence domain after FFT, the quality of audio data is affected by the watermark insertion. Thus, a high quality audio to be maintained at the same time, the study related a inserting of the robustness watermark happened to a hot issue. In this paper, the authors proposed the digital audio watermarking algorithm using psychoacoustic model and MDCT/IMDCT (Modified Discrete Cosine Transform/Inverse Modified Discrete Cosine Transform). In the proposed scheme, the authors experimented the stereo audio file with 44.1KHz, and 128kbps for the audio watermarking algorithm proposed. When the audio data is processed by MDCT, the watermark is able to insert into the frequence domain with 256, 1024 and 2048 interval. In case of 50㎳ RMS window, it was confirmed that the difference between the original audio data and the watermarked audio data of RMS power is 0.8㏈.
PDF KSCI

Unified Speech and Audio Coding Technology (통합 음성 오디오 부호화 기술)

Lee, Taejin;Beack, Seungkwon;Kang, Kyeongok;Kim, Whan-Woo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2011.07a
- /
- pp.264-267
- /
- 2011
다양한 기능을 가지는 모바일 기기들이 하나로 융합되어 가는 방향으로 기술이 발전함에 따라, 음성 및 오디오 모두에 대해 우수한 음질을 제공하는 부호화 기술에 대한 요구사항이 증대되고 있다. MPEG 에서는 2008 년 10 월부터 MPEG-D USAC 기술에 대해 CfP 를 시작으로 본격적으로 표준화를 진행하고 있으며, 2011 년 3 월 96 차 미팅에서 Study on DIS 까지 승인하였다. 본 논문에서는 LPD 모드의 TCX 윈도우의 변경을 통한 USAC 성능향상 방법은 제안한다. TCX 프레임의 연결에 고정된 크기의 중첩만을 이용하는 현재의 방식과는 달리, 이전 TCX 모드와 다음 TCX 모드, transient 의 존재 유무에 따라 적절하게 TCX 윈도우 중첩크기를 조절하여 음악 특성 신호에 대해 LPD 모드의 음질을 개선할 수 있다.
PDF

Next-generation loudspeaker layout for Ultra High Definition (UHD) Digital TV (초고선명 디지털 TV 를 위한 차세대 라우드스피커 레이아웃)

Lee, Young Woo;Kim, Sunmin
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2011.07a
- /
- pp.57-60
- /
- 2011
본 논문에서는 초고선명 디지털 TV 를 위한 차세대 멀티채널 사운드 시스템의 최적의 라우드스피커 레이아웃을 도출하기 위해 다양한 라우드스피커 배치 환경에서 인지 관점의 오디오 음질 주관평가를 실시하였다. NHK 22.2 채널 시스템, ITU-R BS.775-2 표준의 7.1 채널 시스템과, 실감 음향에 가장 중요한 역할을 하는 Top Layer 라우드스피커에 중점을 두고 다양한 신규 레이아웃 구성들을 비교하였으며, 스튜디오에서 믹싱된 컨텐츠와 B-format 레코딩을 멀티채널로 생성한 컨텐츠를 이용하여 주관 평가를 실시하였다. 주관 평가 결과, Top Layer 에 3 개의 라우드스피커를 가지는 10.2 채널 라우드스피커 레이아웃이 평가에서 사용된 전체적인 오디오 음질의 등급에서 NHK 22.2 채널 시스템과 차이를 인지하기 어렵다는 결과를 도출하였다.
PDF

Research for Multi-channel audio service system on Satellite DMB environment (위성 DMB 환경에서의 멀티채널 오디오 서비스 시스템 연구)

Lee, Yong Ju;Seo, Jeongil;Beack, Seung Kwon;Kang, Kyeongok
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2011.07a
- /
- pp.486-489
- /
- 2011
본 논문에서는 위성 DMB 환경에서 멀티채널 오디오 서비스를 제공할 수 있는 시스템을 제안한다. 위성 DMB 서비스는 2005 년부터 상용서비스를 시작한 이동멀티미디어방송 서비스로서, QVGA 급의 영상과 FM 음질의 오디오 서비스를 제공한다. 본 연구에서는 기존의 위성 DMB 시스템과 호환성을 유지하면서, 적은 비트율의 데이터를 추가하여 멀티채널 오디오 서비스를 제공하는 시스템에 대한 연구를 수행하였다. 이를 위하여 기존의 스테레오 오디오 시스템과 호환성을 가지면서도 적은 비트율의 데이터 추가만으로 멀티채널 오디오 신호의 재현이 가능한 멀티채널 오디오 부호화 기술을 적용하였고, 기존 위성 DMB 단말의 동작에 영향을 주지 않으면서, 멀티채널 오디오 서비스가 제공되는 것을 식별할 수 있도록 하는 시그널링 방법을 개발하여 적용하였다. 연구 결과의 검증을 위하여 위성 DMB 멀티채널 오디오 부호화기 및 단말을 개발하여 방송 실험을 수행하였고, 이를 통하여 제안한 방법으로 위성 DMB 환경에서 멀티채널 오디오 서비스를 효율적으로 제공할 수 있음을 증명하였다.
PDF

An Assessment on the Sound Quality of the Car Audio System Using the Orthogonal Designs (직교배열법을 이용한 차량 음향 시스템의 음질평가)

Doo, Se-Jin;Choi, Kyung-Mee
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.5
- /
- pp.229-238
- /
- 2008
Audio tuning improves not only the sound quality of the car audio but also the quality of the completed car itself. However without the subjective assessment on the users' preferences, it is hard to tune the car audio satisfying them. Even though there are lots of factors to be considered to assess the preferences, only a restricted number of factors should be included in the experiment because the total number of experiments increases rapidly as the number of factors in the experiment increases. A large number of factors make it hard to explore the relationship between the sound quality and the sound characteristics and also makes the panels exhausted. In this paper, 8 sound characteristics, each with 2 levels, are considered for the experiment. An orthogonal design of experiment is suggested to reduce the number of experiments from 256 to 16. The analysis of variance is applied to show that Treble is the most significant characteristic of the reproduced sound of the given pop music. Also Deep Bass, SAD, and the interaction between Treble and SAD are found to be significant. For the given classic music, SAD is the only characteristic which turns out to be significant.
https://doi.org/10.7776/ASK.2008.27.5.229 인용 PDF KSCI

Improved Phase Synthesis for Parametric Stereo Audio Coding (파라메트릭 스테레오 오디오 부호화를 위한 향상된 위상 합성 기법)

Hyun, Dong-Il;Park, Young-Cheol;Youn, Dae Hee
- Journal of the Institute of Electronics and Information Engineers
- /
- v.50 no.12
- /
- pp.184-190
- /
- 2013
Parametric stereo(PS) audio coding is a specific version of spatial audio coding. In this paper, the problem due to the conventional synthesis of phase differences. In the conventional upmix matrix, phase differences are synthesized not only on downmix signal but also ambient signal, which violates the assumption that the ambient signals are anti-phased. Deterioration due to the phase synthesis is analyzed, especially, for low interchannel correlation. To solve this problem, new upmix matrix is proposed, which synthesizes phase differences only on downmix signal. The performance of the proposed upmix matrix is verified by the subjective listening tests.
https://doi.org/10.5573/ieek.2013.50.12.184 인용 PDF KSCI

Search Result 131, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)