• Title/Summary/Keyword: 3-D audio

A Single-Bit 3rd-Order Feedforward Delta Sigma Modulator Using Class-C Inverters for Low Power Audio Applications (저전력 오디오 응용을 위한 Class-C 인버터 사용 단일 비트 3차 피드포워드 델타 시그마 모듈레이터)

  • Hwang, Jun-Sub;Cheon, Jimin
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology, v.15 no.5, pp.335-342, 2022
  • In this paper, a single-bit 3rd-order feedforward delta-sigma modulator is proposed for audio applications. The proposed modulator is based on a class-C inverter for low-voltage, low-power operation. To meet the high-precision requirement, the class-C inverter uses a regulated cascode structure, which increases its DC gain and lets it act as a low-voltage subthreshold amplifier. The proposed class-C inverter-based modulator is designed and simulated in a 180-nm CMOS process. Because it is compatible with a low supply voltage without performance loss, the proposed class-C inverter-based switched-capacitor modulator achieves high power efficiency. The design achieves a signal-to-noise-and-distortion ratio (SNDR) of 93.9 dB, a signal-to-noise ratio (SNR) of 108 dB, a spurious-free dynamic range (SFDR) of 102 dB, and a dynamic range (DR) of 102 dB at a signal bandwidth of 20 kHz and a sampling frequency of 4 MHz, while consuming only 280 μW from a 0.8-V power supply.
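
The figures quoted in the abstract above can be related with two standard formulas. The sketch below is not taken from the paper: it computes the oversampling ratio implied by the stated sampling frequency and bandwidth, and a Schreier-style figure of merit (DR + 10·log10(BW/P)) that the abstract itself does not report. The function names are hypothetical.

```python
import math


def oversampling_ratio(fs_hz: float, bw_hz: float) -> float:
    """OSR = fs / (2 * BW): sampling rate relative to the Nyquist rate of the band."""
    return fs_hz / (2.0 * bw_hz)


def schreier_fom_db(dr_db: float, bw_hz: float, power_w: float) -> float:
    """Schreier figure of merit in dB: DR + 10*log10(BW / P)."""
    return dr_db + 10.0 * math.log10(bw_hz / power_w)


# Values as stated in the abstract: fs = 4 MHz, BW = 20 kHz, DR = 102 dB, P = 280 uW.
osr = oversampling_ratio(4e6, 20e3)
fom = schreier_fom_db(102.0, 20e3, 280e-6)

print(f"OSR = {osr:.0f}")  # OSR = 100
print(f"Schreier FoM = {fom:.1f} dB")
```

The figure of merit here is only an illustration derived from the abstract's numbers; the paper may use a different metric or different operating conditions.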

Efficient Individualization Method of HRTFs Using Critical-band Based Spectral Cue Control

  • Hur, Yoo-Mi;Park, Young-Cheol;Lee, Seok-Pil;Youn, Dae-Hee
    • The Journal of the Acoustical Society of Korea, v.30 no.4, pp.167-180, 2011
  • 3-D audio technologies are now commonly implemented through headphones. A major problem of headphone-based 3-D audio is in-the-head localization, which is caused by an inaccurate Head-Related Transfer Function (HRTF). Since measuring HRTFs for each individual is impractical, several studies have addressed HRTF customization. In this paper, an efficient method of customizing HRTFs for sound externalization is proposed. First, psychoacoustic experiments determine which part of the HRTF should be customized. Then, a method that controls spectral notches and envelopes to provide individual localization cues is described. Since the proposed method is based on the critical-band rate, its structure is much simpler than that of previous studies while remaining effective. The performance was evaluated through a series of subjective tests, and the results confirmed that the HRTF customized with the proposed method could successfully replace the measured individual HRTF.

Introduction of MPEG-H TV Audio System from the Perspective of Decoder Implementation (디코더 구현 관점에서 본 MPEG-H TV Audio System 소개)

  • Kwak, Kyungchul;Yang, Jinyoung;Bae, Sungyong
    • Proceedings of the Korean Society of Broadcast Engineers Conference, 2018.06a, pp.158-160, 2018
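  • MPEG-H 3DA (3D Audio) has recently been adopted as one of the next-generation audio codecs in various international broadcasting standards, and immersive audio services based on it are being developed. To provide such immersive services smoothly, interoperability among products that implement the technologies defined in the standard must be verified. This paper describes the MPEG-H TV Audio System certification program developed for this purpose, covering the target products and the structure of the test service.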

A Cross-talk Cancelling Technique for Improved 3-Dimensional Audio Reproduction (개선된 3차원 오디오 재생을 위한 크로스토크 제거 기법)

  • 오승수;김기만
    • Journal of the Korea Institute of Information and Communication Engineering, v.5 no.1, pp.8-13, 2001
  • It is well known that a cross-talk canceller for loudspeaker-based 3D audio depends on the listener's position, called the sweet spot. A cross-talk canceller was therefore previously proposed that increases robustness to perturbations such as head movement, reverberation, and different head shapes. It used a three-loudspeaker structure that combines symmetric and asymmetric speaker geometries. In this paper, we propose a new cross-talk canceller that uses two loudspeakers and has the same efficiency as the existing three-loudspeaker cross-talk canceller. The results are verified through listening tests, and the cross-talk cancelling method for improved 3-D audio reproduction is presented in detail.
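
As background for the two-loudspeaker case described above, the textbook form of cross-talk cancellation inverts the 2x2 acoustic transfer matrix between loudspeakers and ears at each frequency. The sketch below shows that generic matrix-inversion idea at a single frequency. It is not the method proposed in the paper, and the transfer values (`ipsi`, `contra`) are made-up numbers chosen only for illustration.

```python
import cmath


def invert_2x2(h):
    """Invert a 2x2 complex matrix given as nested lists."""
    (a, b), (c, d) = h
    det = a * d - b * c
    if abs(det) < 1e-12:
        raise ValueError("transfer matrix is (nearly) singular at this frequency")
    return [[d / det, -b / det], [-c / det, a / det]]


def matmul_2x2(x, y):
    """Multiply two 2x2 matrices given as nested lists."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


# Hypothetical symmetric setup at one frequency.
# Rows are ears (left, right); columns are loudspeakers (left, right).
ipsi = 1.0 + 0.0j                      # same-side path
contra = 0.6 * cmath.exp(-1j * 0.8)    # opposite-side path: attenuated and delayed
H = [[ipsi, contra], [contra, ipsi]]

C = invert_2x2(H)      # canceller filters at this frequency
HC = matmul_2x2(H, C)  # should be close to the identity: each ear hears only its own channel
```

A practical canceller repeats this per frequency bin and usually regularizes the inversion, since `H` becomes ill-conditioned at some frequencies and listener positions.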

Design of a New Audio Watermarking System Based on Human Auditory System (청각시스템을 기반으로 한 새로운 오디오 워터마킹 시스템 설계)

  • Shin, Dong-Hwan;Shin, Seung-Won;Kim, Jong-Weon;Choi, Jong-Uk;Kim, Duck-Young;Kim, Sung-Hwan
    • The Transactions of the Korean Institute of Electrical Engineers D, v.51 no.7, pp.308-316, 2002
  • In this paper, we propose a robust digital copyright-protection technique based on the human auditory system. First, we propose a watermarking technique that withstands various attacks such as time scaling, pitch shifting, and noise addition, as well as lossy compression such as MP3, AAC, and WMA. Second, we implement an audio PD (portable device) for copyright protection using the proposed method. The proposed watermarking technique is built on digital filtering. Because they are designed according to the critical bands of the HAS (human auditory system), the digital filters embed the watermark with almost no effect on audio quality. Before digital filtering, a wavelet transform decomposes the input audio signal into several signals, each composed of specific frequencies. The watermark is then embedded in the decomposed signal (0 kHz to 11 kHz) by the designed band-stop digital filter. The watermark detection algorithm is implemented on the audio PD. The proposed watermarking technology embeds 2 bits of information per 15 seconds. If the PD detects the watermark '11', which indicates an illegal song, it displays an "Illegal Song" message on the LCD, skips the song, and plays the next one. The detection algorithm implemented in the PD requires 19 MHz of computational power, 7.9 kBytes of ROM, and 10 kBytes of RAM. The suggested technique satisfies the SDMI (Secure Digital Music Initiative) requirements of platform3 based on the ARM9E core.

Classification of Infant Crying Audio based on 3D Feature-Vector through Audio Data Augmentation

  • JeongHyeon Park;JunHyeok Go;SiUng Kim;Nammee Moon
    • Journal of the Korea Society of Computer and Information, v.28 no.9, pp.47-54, 2023
  • Infants use crying as a non-verbal means of communication [1]. However, deciphering infant cries is challenging. Extensive research has been conducted to interpret infant cry audio [2,3]. This paper proposes classifying infant cries using 3D feature vectors combined with various audio data augmentation techniques. The study dataset has 5 classes (belly pain, burping, discomfort, hungry, tired). The data is augmented using 5 techniques (Pitch, Tempo, Shift, Mixup-noise, CutMix). Of these, the Tempo, Shift, and CutMix augmentation techniques improved performance. Ultimately, applying the effective data augmentation techniques together resulted in a 17.75% performance improvement compared to models using single feature vectors and the original data.
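
Two of the augmentation techniques named above, Shift and CutMix, are simple enough to sketch on a raw waveform. The code below is a minimal illustration on plain Python lists. The abstract does not say how the authors implemented these steps, so the circular shift, the segment-replacement form of CutMix, and the function names are all assumptions.

```python
def time_shift(wave, offset):
    """Circularly shift samples by `offset` positions (positive moves samples later)."""
    if not wave:
        return []
    k = offset % len(wave)
    return wave[-k:] + wave[:-k] if k else list(wave)


def cutmix(wave_a, wave_b, start, length):
    """Replace wave_a[start:start+length] with the same span taken from wave_b.

    Returns the mixed clip and the fraction of samples that still come from
    wave_a, which can be used to weight the two class labels.
    """
    if not wave_a or len(wave_a) != len(wave_b):
        raise ValueError("clips must be non-empty and of equal length")
    if not 0 <= start <= len(wave_a) or length < 0:
        raise ValueError("segment is outside the clip")
    end = min(start + length, len(wave_a))
    mixed = wave_a[:start] + wave_b[start:end] + wave_a[end:]
    keep_ratio = 1.0 - (end - start) / len(wave_a)
    return mixed, keep_ratio


shifted = time_shift([1, 2, 3, 4, 5], 2)                   # [4, 5, 1, 2, 3]
mixed, keep = cutmix([0, 0, 0, 0], [1, 1, 1, 1], 1, 2)     # ([0, 1, 1, 0], 0.5)
```

In training, `offset`, `start`, and `length` would normally be drawn at random per example; they are explicit arguments here so the behavior is easy to check.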

Spatial Audio Signal Processing Technology Using Multi-Channel 3D Microphone (멀티채널 3차원 마이크를 이용한 입체음향 처리 기술)

  • Kang Kyeongok;Lee Taejin
    • The Journal of the Acoustical Society of Korea, v.24 no.2, pp.68-77, 2005
  • The purpose of a spatial audio system is to give a listener the impression of being present in the recorded environment when its sound is reproduced. A dummy head microphone is generally used for this purpose. Because of its human-like shape, a dummy head microphone can reproduce spatial images through headphone reproduction. However, its shape and size restrict its public use, and it is difficult to convert the output signal of a dummy head microphone into a multi-channel signal for a multi-channel environment. In this paper, we therefore propose a multi-channel 3D microphone technology. The multi-channel 3D microphone acquires spatial audio using five microphones placed around the horizontal plane of a rigid sphere, and through post-processing it can generate signals for headphone, stereo, stereo dipole, 4-channel, and 5-channel reproduction environments. Because the computation is complex, we implemented a hardware-based post-processing system. To verify the performance of the multi-channel 3D microphone, localization experiments were performed. The results show that front/back confusion, one of the common limitations of conventional dummy head technology, can be reduced dramatically.

Robust Audio Watermarking in Frequency Domain for Copyright Protection (저작권 보호를 위한 주파수 영역에서의 강인한 오디오 워터마킹)

  • Dhar, Pranab Kumar;Kim, Jong-Myon
    • Journal of the Korea Society of Computer and Information, v.15 no.2, pp.109-117, 2010
  • Digital watermarking has drawn extensive attention as a way to protect digital content from unauthorized copying. This paper proposes a new frequency-domain watermarking scheme for copyright protection of digital audio. In the proposed watermarking system, the original audio is segmented into non-overlapping frames. Watermarks are then embedded into selected prominent peaks in the magnitude spectrum of each frame. Watermarks are extracted by performing the inverse of the embedding process. Simulation results indicate that the proposed scheme is robust against various kinds of attacks such as noise addition, cropping, resampling, re-quantization, MP3 compression, and low-pass filtering. The proposed watermarking system outperforms Cox's method in terms of imperceptibility while maintaining robustness comparable to Cox's method. The proposed system achieves SNR (signal-to-noise ratio) values ranging from 20 dB to 28 dB, whereas Cox's method achieves SNR values ranging from only 14 dB to 23 dB.
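
The two mechanical pieces mentioned above, splitting audio into non-overlapping frames and reporting imperceptibility as an SNR in dB, can be sketched as follows. This uses the common definition SNR = 10·log10(signal energy / distortion energy). The abstract does not give the authors' exact formula or frame length, so both the definition and the function names are assumptions, and the peak-embedding step itself is not shown.

```python
import math


def split_frames(signal, frame_len):
    """Split a signal into non-overlapping frames; a trailing partial frame is dropped."""
    if frame_len <= 0:
        raise ValueError("frame_len must be positive")
    count = len(signal) // frame_len
    return [signal[i * frame_len:(i + 1) * frame_len] for i in range(count)]


def snr_db(original, watermarked):
    """SNR in dB between an original signal and its watermarked version."""
    signal_energy = sum(x * x for x in original)
    noise_energy = sum((x - y) ** 2 for x, y in zip(original, watermarked))
    if noise_energy == 0:
        return math.inf  # identical signals: no distortion at all
    return 10.0 * math.log10(signal_energy / noise_energy)


frames = split_frames(list(range(10)), 4)      # [[0, 1, 2, 3], [4, 5, 6, 7]]
example_snr = snr_db([1.0] * 100, [1.1] * 100)  # a 10% amplitude error is about 20 dB
```

Under this definition, a higher SNR means the watermark changes the audio less, which is why the abstract treats it as an imperceptibility measure.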

A Spatial Audio System Using Multiple Microphones on a Rigid Sphere

  • Lee, Tae-Jin;Jang, Dae-Young;Kang, Kyeong-Ok;Kim, Jin-Woong;Jeong, Dae-Gwon;Hamada, Hareo
    • ETRI Journal, v.27 no.2, pp.153-165, 2005
  • The main purpose of a spatial audio system is to give a listener the same impression as if he/she were present in a recorded environment. A dummy head microphone is generally used for this purpose. Because of its human-like shape, it yields good spatial sound images. However, its shape restricts its public use, and it is difficult to convert a 2-channel recording into multi-channel signals for efficient rendering over a multi-speaker arrangement. To solve these problems, a spatial audio system is proposed that uses multiple microphones on a rigid sphere. The system has five microphones placed at special points on the rigid sphere, and it generates audio signals for headphone, stereo, stereo dipole, 4-channel, and 5-channel reproduction environments. Subjective localization experiments show that front/back confusion, which is a common limitation of spatial audio systems using the dummy head microphone, can be reduced dramatically in 4-channel and 5-channel reproduction environments and reduced slightly in headphone reproduction.
