Search | Korea Science

Angle-Based Virtual Source Location Representation for Spatial Audio Coding

Beack, Seung-Kwon;Seo, Jeong-Il;Moon, Han-Gil;Kang, Kyeong-Ok;Hahn, Min-Soo
- ETRI Journal
- /
- v.28 no.2
- /
- pp.219-222
- /
- 2006
Virtual source location information (VSLI) has been newly utilized as a spatial cue for compact representation of multichannel audio. This information is represented as the azimuth of the virtual source vector. The superiority of VSLI is confirmed by comparison of the spectral distances, average bit rates, and subjective assessment with a conventional cue.
PDF

Low-bitrate Multichannel Audio Coding (저비트율 멀티채널 오디오 부호화)

Jang, Inseon;Seo, Jeongil;Beak, Seungkwon;Kang, Kyeongok
- Journal of Broadcast Engineering
- /
- v.10 no.3
- /
- pp.328-338
- /
- 2005
Technology for compressing low-bitrate multichannel audio coding is being standardized owing to the increasing need of consumer for multichannel audio contents. In this paper we propose the sound source location cue coding (SSLCC) for extremely compressing multichannel audio to be suitable at the narrow bandwidth transmission environment. To improve the compression capability of the conventional binaural cue coding(BCC), the SSLCC adopts the virtual source location information (VSLI) as a spatial cue parameter, a symmetric uniform quantizer, and Huffman coder. The objective and subjective assessment results show that the SSLCC provides lower bitrate and better audio quality than conventional BCC method.
PDF KSCI

Multi-channel Audio Service in a Terrestrial-DMB System Using VSLI-Based Spatial Audio Coding

Seo, Jeong-Il;Moon, Han-Gil;Beack, Seung-Kwon;Kang, Kyeong-Ok;Hong, Jae-Keun
- ETRI Journal
- /
- v.27 no.5
- /
- pp.635-638
- /
- 2005
Spatial audio coding (SAC) is an extremely high compact representation of encoded multi-channel audio material. This paper suggests a multi-channel audio service in the terrestrial digital multimedia broadcasting (T-DMB) system using a novel SAC tool, which is called a virtual source location information (VSLI)-based SAC tool. Intensive experiments are presented to evaluate the validity of the proposed VSLI-based SAC tool, and prototypical systems are also presented to demonstrate the reliability of the proposed multi-channel T-DMB system in real applications.
PDF

A Content-based Audio Retrieval System Supporting Efficient Expansion of Audio Database (음원 데이터베이스의 효율적 확장을 지원하는 내용 기반 음원 검색 시스템)

Park, Ji Hun;Kang, Hyunchul
- Journal of Digital Contents Society
- /
- v.18 no.5
- /
- pp.811-820
- /
- 2017
For content-based audio retrieval which is one of main functions in audio service, the techniques for extracting fingerprints from the audio source, storing and indexing them in a database are widely used. However, if the fingerprints of new audio sources are continually inserted into the database, there is a problem that space efficiency as well as audio retrieval performance are gradually deteriorated. Therefore, there is a need for techniques to support efficient expansion of audio database without periodic reorganization of the database that would increase the system operation cost. In this paper, we design a content-based audio retrieval system that solves this problem by using MapReduce and NoSQL database in a cluster computing environment based on the Shazam's fingerprinting algorithm, and evaluate its performance through a detailed set of experiments using real world audio data.
https://doi.org/10.9728/dcs.2017.18.5.811 인용 PDF KSCI

Robust Audio Watermarking Method Under Capturing Attacks (캡쳐링 공격에 강인한 오디오 워터마킹 방법)

Lee, Seung-Jae;Lee, Sang-Kwang;Seo, Jin-S.
- Proceedings of the IEEK Conference
- /
- 2006.06a
- /
- pp.375-376
- /
- 2006
In this paper, we propose a wavelet-based audio watermarking algorithm to be robust against capturing attack. Commercial capturing tools enable us to obtain audio contents without noticeable degradation in audio quality, and it is possible to be a source of illegal distribution. By adjusting mean values of the lowest subband in audio, the proposed method can survive after capturing attack including sampling rate conversion, random cropping and compression. By applying a simple human auditory model, the inaudibility of the watermark is achieved, and detection probability is improved based on the difference information. This is confirmed by experimental results.
PDF

Study of DRM Application for the Portable Digital Audio Device (휴대용 디지털 오디오 기기에서의 DRM 적용에 관한 연구)

Cho, Nam-Kyu;Lee, Dong-Hwi;Lee, Dong-Chun;J. Kim, Kui-Nam;Park, Sang-Min
- Convergence Security Journal
- /
- v.6 no.4
- /
- pp.21-27
- /
- 2006
With the introduction of sound source sharing over the high speed internet and portable digital audio, the digitalization of sound source has been rapidly expanded and the sales and distribution of sound sources of the former offline markets are stagnant. Also, the problem of infringement of copyright is being issued seriously through illegal reproduction and distribution of digitalized sound sources. To solve these problems, the DRM technology for protecting contents and copyrights in portable digital audio device began to be introduced. However, since the existing DRM was designed based on the fast processing CPU and network environment, there were many problems in directly applying to the devices with small screen resolution, low processing speed and network function such as digital portable audio devices which the contents are downloadable through the PC. In this study, the DRM structural model which maintains similar security level as PC environment in the limited hardware conditions such as portable digital audio devices is proposed and analyzed. The proposed model chose portable digital audio exclusive device as a target platform which showed much better result in the aspect of security and usability compared to the DRM structure of exiting portable digital audio device.
PDF

Improved Channel Level Difference Quantization for Spatial Audio Coding

Kim, Kwang-Ki;Beack, Seung-Kwon;Seo, Jeong-Il;Jang, Dae-Young;Hahn, Min-Soo
- ETRI Journal
- /
- v.29 no.1
- /
- pp.99-102
- /
- 2007
The channel level difference (CLD) is a main parameter in the reference model 0 (RM0) for MPEG Surround. Nevertheless, the CLD quantization method in the RM0 has problems such as the lack of theoretical background and inappropriate quantization levels. In this letter, a new CLD quantization method is proposed based on the virtual source location information which has strength in the quantization process. From experimental results, it is confirmed that the proposed scheme greatly reduces the quantization distortions measured in dB and degrees without any additional complexity.
PDF

Optimization of MPEG-4 AAC Codec on PDA (휴대 단말기용 MPEG-4 AAC 코덱의 최적화)

김동현;김도형;정재호
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.3
- /
- pp.237-244
- /
- 2002
In this paper we mention the optimization of MPEG-4 VM (Moving Picture Expert Group-4 Verification Model) GA (General Audio) AAC (Advanced Audio Coding) encoder and the design of the decoder for PDA (Personal Digital Assistant) using MPEG-4 VM source. We profiled the VMC source and several optimization methods have applied to those selected functions from the profiling. Intel Pentium III 600 MHz PC, which uses windows 98 as OS, takes about 20 times of encoding time compared to input sample running time, with additional options, and about 10 times without any option. Decoding time on PDA was over 35 seconds for the 17 seconds input sample. After optimization, the encoding time has reduced to 50% and the real time decoding has achieved on PDA.
PDF KSCI

Design and Development of T-DMB Multichannel Audio Service System Based on Spatial Audio Coding

Lee, Yong-Ju;Seo, Jeong-Il;Beack, Seung-Kwon;Jang, Dae-Young;Kang, Kyeong-Ok;Kim, Jin-Woong;Hong, Jin-Woo
- ETRI Journal
- /
- v.31 no.4
- /
- pp.365-375
- /
- 2009
In this paper, a terrestrial digital multimedia broadcasting (T-DMB) multichannel audio broadcasting system based on spatial audio coding is presented. The proposed system provides realistic multichannel audio service via T-DMB with a small increase of data rate as well as backward compatibility with the conventional stereo-based T-DMB player. To reduce the data rate for additional multichannel audio signals, we compress the multichannel audio signals using the sound source location cue coding algorithm, which is an efficient parametric multichannel audio compression technique. For compatibility, we use the dependent property of an elementary stream descriptor, and this property should be ignored in a conventional T-DMB player. To verify the feasibility of the proposed system, we implement the T-DMB multichannel audio encoder and a prototype player. We perform a compatibility test using the T-DMB multichannel audio encoder and conventional T-DMB players. The test demonstrates that the proposed system is compatible with a conventional T-DMB player and that it can provide a promisingly rich audio service.
https://doi.org/10.4218/etrij.09.0108.0557 인용 PDF

Real-time Audio Watermarking System Considering Audio Source and User (음원 및 사용자를 고려한 실시간 오디오 워터마킹 시스템)

Cho, Jung-Won;Jeong, Seung-Do
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.10 no.11
- /
- pp.3213-3217
- /
- 2009
Distribution, duplication and manipulation of the digital contents are very easy due to the characteristics of the digital contents. Thus, damages of invasion of property right rapidly increase due to infringement of copyright for the digital contents. To prevent illegal use and to settle conflict about ownership of the digital contents, continuous efforts with enormous expense are devoted. In this paper, we design and implement real-time audio watermarking system to protect ownership and copyright for the digital contents. The proposed system also clarifies where the responsibility about the illegal distribution lies. The system has convenient user interface so that general administrator without an expert knowledge of the protection of copyright can use easily. In addition, unlike the traditional watermarking system, our system has merit to offer information about the illegal distribution for clear post-management.
https://doi.org/10.5762/KAIS.2009.10.11.3213 인용 PDF

Search Result 31, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)