• Title/Summary/Keyword: MUSHRA test

Search Result 13, Processing Time 0.026 seconds

Enhancement of Super-wideband Coder by Considering Audio Feature in MDCT Domain (MDCT 도메인에서 오디오 신호 특징을 고려한 초광대역 코덱 개선)

  • Hong, Ki-Bong;Jeong, Gyu-Hyeok;Lee, In-Sung
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.48 no.5
    • /
    • pp.129-136
    • /
    • 2011
  • This paper presents the coding method that have multi-mode and efficiency of audio codecs using the feature of audio signal. Recently, the developed extension super-wideband codec based on G.718 wideband divides two mode between Generic and Sinusiodal. So codec efficently encode audio signal exist in super-wideband. But the codec is not as efficent coding for harmonic component of wind instrument and string instrument and individual-Line component of percussion instrument. The proposed method are modeling and encoding multiple pitch and individual-line feature using multi mode coding. For the performance evaluation, we used SNR in MDCT domain for objective test and MUSHRA test for subjective test. As a result, the performance of SNR and MUSHRA test of the proposed method have better performance than the G.718 super-wideband codec.

Performance Evaluation of the MPEG USAC According to the Spectral Band Replication Bandwidth (Spectral Band Replication 대역폭에 따른 MPEG USAC 부호화 성능 평가)

  • An, Kyung-Jun;Jung, Yoo-Sun;Beack, Seung-Kwon;Kang, Kyeong-Ok;Kim, Rin-Chul
    • Journal of Broadcast Engineering
    • /
    • v.16 no.5
    • /
    • pp.705-713
    • /
    • 2011
  • This paper deals with the effect of SBR bandwidth on the overall performance of the MPEG USAC. Here, the SBR bandwidth is termed the frequency region covered by the SBR codec, and is specified by the bs_stop_freq, which is one of the SBR bitstream components. The performance of the USACs with 5 different SBR bandwidths are compared in a subjective manner using the MUSHRA test. In the comparison, the bit rate is confined to 14~24kbps and only the LPD unit is selected for the core codec. From the comparison, it is observed that the SBR bandwidth that stretches up to 18KHz or above gives the better performance than the others.

Watermarking Algorithm for Copyright Protection of Haegeum Sound Contents (해금 사운드 콘텐츠의 저작권 보호를 위한 워터마킹 알고리듬)

  • Hong, Yeon-Woo;Kang, Myeong-Su;Cho, Sang-Jin;Chong, Ui-Pil
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.10 no.4
    • /
    • pp.214-219
    • /
    • 2009
  • This paper proposes a watermarking algorithm considering the frequency characteristics of Haegeum sounds for copyright protection of digital Haegeum sound contents. The harmonics of Haegeum sounds commonly have large magnitude values in 1500Hz~2000Hz and 2800Hz~3500Hz so that those bands are selected to embed a watermark. The proposed method computes the FFT (fast Fourier transform) of the original sound signal and embeds the watermark bits generated by PN (pseudo noise) sequence into the harmonics in the selected bands. Furthermore, the proposed method is robust to lowpass filter, bandpass filter, cropping, noise addition, MP3 compression attacks and the maximum BER (bit error rate) is 1.41% after lowpass filter attack. To measure the quality of the watermarked sound, subjective listening test, MUSHRA (multiple stimuli with hidden reference and anchor), was conducted. The mean value of MUSHRA listening test is bigger than 98 and 96.67 for every Haegeum sounds and Korean classical music with Haeguem, respectively.

  • PDF

A 3D Audio Codec Employing a Revised Noise Filling Method (수정된 잡음 채움 기법을 적용한 3D 오디오 부호기)

  • Kim, Rin Chul
    • Journal of Broadcast Engineering
    • /
    • v.26 no.3
    • /
    • pp.327-330
    • /
    • 2021
  • In this paper, a new noise filling method is proposed for improving the performance of the 3D audio codec. In the new method, the core band is limited up to MAX_SFB, not up to the IGF start frequency. And the noise filling is applied to all frequency range of the IGF source patches. We conduct the MUSHRA test and find that the proposed noise filling method demonstrates better performance than the conventional method.

A Performance Evaluation of the MPEG USAC with Variable Core-Band Down-Sampling Ratio (가변 핵심 대역 하향 표본화 비를 가진 MPEG USAC 성능 평가)

  • Lee, Jae Hwa;Kim, Rin Chul
    • Journal of Broadcast Engineering
    • /
    • v.18 no.1
    • /
    • pp.106-114
    • /
    • 2013
  • This paper deals with the effect of the internal sampling frequency and core band down sampling ratio on the overall performance of the MPEG USAC. Here, the internal sampling frequency is the sampling frequency of a signal actually coded. The core band down sampling ratio is the ratio of the width of the core band over that of the coded band. The performance was measured on 6 different test sound sources by the MUSHRA test with 10 subjects. The experiments showed that 1/3 or 1/4 core band down sampling ratio could yield the better performance than the conventional 1/2 ratio, especially at low rates.

An Audio Coding Technique Employing the Inter-channel Phase Difference Skip (채널 간 위상차 파라미터 생략 기법을 이용한 오디오 부호화)

  • Kim, Hyun-Hwi;Kim, Rin-Chul
    • Journal of Broadcast Engineering
    • /
    • v.21 no.3
    • /
    • pp.369-379
    • /
    • 2016
  • This paper deals with an efficient method for skipping inter-channel phase differences (IPD) in the MPEG surround of the unified speech and audio coding (USAC). Based on the psycho-acoustic sensitivity on the IPD, we estimate a threshold on IPD, below which we can not notice degradation in spatial cue. We propose an IPD skip method, in which any IPDs within the threshold are set to zero and are not transmitted. The proposed IPD skip method gives about 38% savings in terms of bit amount for IPD. Nevertheless, in the MUSHRA test, the proposed method does not show any noticeable degradation in the decoded audio quality.

A 3D Audio Core-Codec Employing an Improved Buffer Control Method (향상된 버퍼 제어 방법을 사용한 3D 오디오 핵심 부호화기)

  • Kim, Rin Chul
    • Journal of Broadcast Engineering
    • /
    • v.25 no.2
    • /
    • pp.233-241
    • /
    • 2020
  • In this paper, a new buffer control method is proposed for improving the performance of the frequency domain part of the 3D audio (3DA) core codec. For the proposed buffer control method, we first combine the 3DA RM9 with the 3GPP AAC buffer control method which includes the psychoacoustic model and rate-distortion control process with the spectral hole avoidance algorithm. Then, we revise the 3GPP buffer control method so as to achieve a faithful bit allocation to the frames with higher activity. With the MUSHRA test, we prove that the proposed buffer control method demonstrates better performance than the 3DA RM9 and 3GPP AAC.

Multi-band Approach to Deep Learning-Based Artificial Stereo Extension

  • Jeon, Kwang Myung;Park, Su Yeon;Chun, Chan Jun;Park, Nam In;Kim, Hong Kook
    • ETRI Journal
    • /
    • v.39 no.3
    • /
    • pp.398-405
    • /
    • 2017
  • In this paper, an artificial stereo extension method that creates stereophonic sound from a mono sound source is proposed. The proposed method first trains deep neural networks (DNNs) that model the nonlinear relationship between the dominant and residual signals of the stereo channel. In the training stage, the band-wise log spectral magnitude and unwrapped phase of both the dominant and residual signals are utilized to model the nonlinearities of each sub-band through deep architecture. From that point, stereo extension is conducted by estimating the residual signal that corresponds to the input mono channel signal with the trained DNN model in a sub-band domain. The performance of the proposed method was evaluated using a log spectral distortion (LSD) measure and multiple stimuli with a hidden reference and anchor (MUSHRA) test. The results showed that the proposed method provided a lower LSD and higher MUSHRA score than conventional methods that use hidden Markov models and DNN with full-band processing.

Evaluation of Spatial Audio Coding Tools for Multichannel Audio (Spatial Audio Coding 기술의 멀티채널 부호화 성능 비교)

  • Jang Inseon;Seo Jeongil;Mun Hangil;Kang Kyeongok
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.153-156
    • /
    • 2004
  • Spatial Audio Coding (SAC)은 낮은 대역폭에서 다채널/다객체 오디오 신호를 전송하기 위해 제안된 기술이다. 본 논문에서는 MPEG 에서 SAC 기술의 평가 방법으로 채택된 Multi-Stimulus test with Hidden Reference and Anchor (MUSHRA) 실험 절차에 대해서 설명한다. 또한 제 69 차 MPEG 회의에서 제안된 4 개 기관의 SAC 기술에 대한 청취실험을 수행하고 그 결과를 분석한다.

  • PDF

Implementation of a Person Tracking Based Multi-channel Audio Panning System for Multi-view Broadcasting Services (다시점 방송 서비스를 위한 사용자 위치추적 기반 다채널 오디오 패닝 시스템 구현)

  • Kim, Yong-Guk;Yang, Jong-Yeol;Lee, Young-Han;Kim, Hong-Kook
    • 한국HCI학회:학술대회논문집
    • /
    • 2009.02a
    • /
    • pp.150-157
    • /
    • 2009
  • In this paper, we propose a person tracking based multi-channel audio panning system for multi-view broadcasting services. Multi-view broadcasting is to render the video sequences that are captured from a set of cameras based on different viewpoints, and multi-channel audio panning techniques are necessary for audio rendering in these services. In order to apply such a realistic audio technique to this multi-view broadcasting service, person tracking techniques which are to estimate the position of users are also necessary. For these reasons, proposed methods are composed of two parts. The first part is a person tracking method by using ultrasonic satellites and receiver. We could obtain user's coordinates of high resolution and short duration about 10 mm and 150 ms. The second part is MPEG Surround parameter-based multi-channel audio panning method. It is a method to obtain panned multi-channel audio by controlling the MPEG Surround spatial parameters. A MUSHRA test is conducted to objectively evaluate the perceptual quality and measure localization performance using a dummy head. From the experiments, it is shown that the proposed method provides better perceptual quality and localization performance than the conventional parameter-based audio panning method. In addition, we implement the prototype of person tracking based multi-view broadcasting system by integrating proposed methods with multi-view display system.

  • PDF