Search | Korea Science

Analysis and Evaluation of PEAQ : Objective Method for Perceived Audio Quality Measurement (객관적 음질 평가를 위한 PEAQ의 성능 평가 및 분석)

Park Se-Hyoung;Ryu Seung-Wan;Park Jeong-Yeol;Shin Jae-Ho
- 한국정보통신설비학회:학술대회논문집
- /
- 2003.08a
- /
- pp.234-239
- /
- 2003
디지털방송, DAB 등과 같은 디지털 오디오 방송 서비스를 위한 디지털 시스템을 설계하기 위해서는 오디오 음질을 평가하기 위한 방법이 필수적이다. 기존의 방식은 인간의 귀를 이용한 주관적 방식을 이용함으로서 많은 시간과 비용을 들이게 되며, 음질평가를 하는 사람의 주관적 의견에 많이 좌우하게 된다. 그러나 최근 ITV-R에서는 오디오 음질의 객관적 평가를 위한 BS.1387(PEAQ)를 제안함으로 많은 시간과 비용을 절감하고 신뢰할 수 있는 결과를 얻게 되었다. PEAQ는 인간의 귀에서의 신호의 처리과정과 인식과정을 심리음향모델과 인식모델로 분리하여 구성함으로써 주관적 평가의 SDG(Subjective Difference Grade)에 대응하는 ODG(Objective Difference Grade)를 구하게 된다. 본 논문에서는 이러한 PEAQ의 심리음향 모델과 인식 모델을 원리와 과정을 평가 분석하였다.
PDF

Performance Evaluation of MCLT-based Audio Watermark in DTV System (DTV 시스템에서의 MCLT 기반 오디오 워터마크 성능 평가)

Jeong, Youngho;Lee, Misuk;Lee, Taejin;Kim, Huiyong
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2017.06a
- /
- pp.219-222
- /
- 2017
본 논문에서는 DTV 시스템을 대상으로 PN 시퀀스를 이용한 MCLT(Modulated Complex Lapped Transform) 기반 오디오 워터마크 알고리즘에 대한 BER 및 PEAQ(Perceptual Evaluation of Audio Quality) 성능 평가를 통해 오디오 신호 압축에 대한 워터마크의 강인성 및 워터마크 삽입에 따른 오디오 품질 열화 정도를 분석하였다. 이를 위해 오디오 신호 특성을 고려한 프로그램 장르별 시험용 방송 콘텐츠를 제작하고, Lab. Test 를 위한 DTV 송수신 시스템을 구축하였다. 오디오 인코딩 비트율 변화에 따른 성능 평가 결과, 광고 콘텐츠를 제외한 평균 BER(%)에서 192kbps 비트율이 128kpbs 비트율에 비해 0.0767 더 우수한 성능을 보였다. 오디오 워터마크 삽입에 따른 객관적 음질 평가에서는 PEAQ 점수가 약 -0.2 로 원래 오디오 신호와의 품질 차이가 매우 작은 것으로 나타났으며, 또한 DTV 시스템상의 신호 압축에 의해 발생하는 오디오 신호의 품질 저하 이외에 워터마크 삽입으로 인한 추가적인 음질 저하는 거의 발생하지 않는 것으로 분석되었다.
PDF

A Scalable Audio Coder for High-quality Speech and Audio Services

Lee, Gil-Ho;Lee, Young-Han;Kim, Hong-Kook;Kim, Do-Young;Lee, Mi-Suk
- MALSORI
- /
- no.61
- /
- pp.75-86
- /
- 2007
In this paper, we propose a scalable audio coder, which has a variable bandwidth from the narrowband speech bandwidth to the audio bandwidth and also has a bit-rate from 8 to 320 kbits/s, in order to cope with the quality of service(QoS) according to the network load. First of all, the proposed scalable coder splits bandwidth of the input audio into narrowband up to around 4 kHz and above. Next, the narrowband signals are compressed by a speech coding method compatible to an existing standard speech coder such as G.729, and the other signals whose bandwidth is above the narrowband are compressed on the basis of a psychoacoustic model. It is shown from the objective quality tests using the signal-to-noise ratio(SNR) and the perceptual evaluation of audio quality(PEAQ) that the proposed scalable audio coder provides a comparable quality to the MPEG-1 Layer III (MP3) audio coder.
PDF

Improvement of the TCX Module in AMR-WB+ Codec Using Pyramid VQ (Pyramid VQ를 이용한 AMR-WB+ 코덱 내 TCX 모듈의 성능 개선)

Park, Sang-Kuk;Park, Jung-Eun;Baik, Seung-Kweon;Seo, Jung-Il;Kang, Sang-Won
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.3
- /
- pp.109-114
- /
- 2007
In this paper, we Propose a pyramid VQ to quantize the transform coefficients of TCX module for the audio improvement of AMR-WB+ codec. The Proposed pyramid VQ is compared to the $RE_8$ Lattice VQ used in the AMR-WB+ standard codec. demonstrating improvement 4% and 5.7%. respectively, in Mean Squared Error (MSE) and 3.3% and 4.7%. respectively, in Perceptual Evaluation of Audio Quality (PEAQ) by 8-dimensional and 16-dimensional Pyramid VQ.
https://doi.org/10.7776/ASK.2007.26.3.109 인용 PDF KSCI

객관적 음질평가 기법 연구

이신열;최낙진;성광고
- Information and Communications Magazine
- /
- v.22 no.10
- /
- pp.24-34
- /
- 2005
시스템을 설계하고 제작한 후에 그 시스템과 구성 요소가 최종적으로 음질에 미치는 영향을 평가하는 일은 필수적이다. 음질평가 기법은 크게 두 가지가 있다. 첫 번째는 사람의 귀로 듣고 평가하는 주관평가 방법이고, 두 번째는 측정 데이터로부터 객관적으로 성능을 평가하는 방법이다. 주관적 음질평가 방법은 사람이 직접 귀로 듣고 평가하는 방법이기 때문에, 여러 가지 불안정한 요소를 안고 있다. 주관 평가자의 신체적$\cdot$심리적 상태에 따라 평가가 달라질 수 있으며, 개개인에 따라 다른 결과를 내기도 한다. 따라서, 주관평가 결과의 신뢰성을 확보하기 위해서는 통계적인 데이터를 얻고 평가자를 올바르게 훈련시켜야 한다. 그러기 위해서는 시간과 비용이 많이 소비된다. 따라서 측정 데이터로부터의 정교한 계산을 통하여 라우드스피커의 음질을 신뢰할만한 수준으로 평가할 수 있다면 신뢰성을 확보할 수 있을 뿐 아니라 시간 및 비용 절감 효과를 볼 수 있다. 본 연구에서는 측정 데이터로부터 시스템의 음질을 신뢰할만한 수준으로 평가할 수 있는 기법을 새롭게 제안한다. 이것은 ITU-R Recommendation BS. 1387인 PEAQ를 사용하여 라우드스피커의 음질을 평가하는 방법이다.
PDF KSCI

Design of the TCX module transform coefficients quantizer in AMR-WB+ codec using PVQ (PVQ 방식을 이용한 AMR-WB+ 코덱의 TCX 모듈 변환계수 양자화기 설계)

Park, Sang-Kuk;Park, Jung-Eun;Kang, Sang-Won
- Proceedings of the IEEK Conference
- /
- 2007.07a
- /
- pp.345-346
- /
- 2007
In this paper, we propose a Pyramid VQ(PVQ) to quantize the transform coefficients of TCX module for the music improvement of AMR-WB+ codec. The proposed PVQ is compared to the $RE_8$ Lattice VQ used in the AHR-WB+ standard codec, demonstrating improvement 4% and 5.7%, respectively, in Mean Squared Error(MSE) and 3.3% and 4.7%, respectively, in Perceptual Evaluation of Audio Quality(PEAQ) by 8-dimensional and 16-dimensional Pyramid VQ.
PDF

Performance Analysis of Audio Data Hiding Method based on Phase Information with Various Window Length (주파수 변환의 길이에 따른 위상 기반 오디오 정보 은닉 기술의 음질 및 성능 분석)

Cho, Kiho;Kim, Nam Soo
- Journal of the Institute of Electronics and Information Engineers
- /
- v.50 no.12
- /
- pp.232-237
- /
- 2013
The role of the window length of time-frequency transformation is important for the audio data hiding methods utilizing phase information. In this paper, the experiments for our audio data hiding method were conducted in order to evaluate the audio quality and robustness against reverberant environment. The experimental results showed the tendency that the worse audio quality but better robustness were obtained when the lengthy window was applied. The important reason for quality degradation was pre-echo which flatters the percussive sound. The results also indicated that the wireless communication theory related to the length of time-frequency transform can be applied in the field of audio data hiding and acoustic data transmission.
https://doi.org/10.5573/ieek.2013.50.12.232 인용 PDF KSCI

Real data-based active sonar signal synthesis method (실데이터 기반 능동 소나 신호 합성 방법론)

Yunsu Kim;Juho Kim;Jongwon Seok;Jungpyo Hong
- The Journal of the Acoustical Society of Korea
- /
- v.43 no.1
- /
- pp.9-18
- /
- 2024
The importance of active sonar systems is emerging due to the quietness of underwater targets and the increase in ambient noise due to the increase in maritime traffic. However, the low signal-to-noise ratio of the echo signal due to multipath propagation of the signal, various clutter, ambient noise and reverberation makes it difficult to identify underwater targets using active sonar. Attempts have been made to apply data-based methods such as machine learning or deep learning to improve the performance of underwater target recognition systems, but it is difficult to collect enough data for training due to the nature of sonar datasets. Methods based on mathematical modeling have been mainly used to compensate for insufficient active sonar data. However, methodologies based on mathematical modeling have limitations in accurately simulating complex underwater phenomena. Therefore, in this paper, we propose a sonar signal synthesis method based on a deep neural network. In order to apply the neural network model to the field of sonar signal synthesis, the proposed method appropriately corrects the attention-based encoder and decoder to the sonar signal, which is the main module of the Tacotron model mainly used in the field of speech synthesis. It is possible to synthesize a signal more similar to the actual signal by training the proposed model using the dataset collected by arranging a simulated target in an actual marine environment. In order to verify the performance of the proposed method, Perceptual evaluation of audio quality test was conducted and within score difference -2.3 was shown compared to actual signal in a total of four different environments. These results prove that the active sonar signal generated by the proposed method approximates the actual signal.
https://doi.org/10.7776/ASK.2024.43.1.009 인용 PDF

Search Result 8, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)