• Title/Summary/Keyword: Audio System

Search Result 1,041, Processing Time 0.029 seconds

Implementation of a High-Quality Audio Collaboration System Over IP Networks (IP 네트워크 기반 고품질 오디오 협업 시스템)

  • Kang, Jin-Ah;Kim, Hong-Kook
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.218-223
    • /
    • 2008
  • In this paper, we implement several methods to improve an audio collaboration system over IP networks, and then evaluate the performance of the implemented methods. In general, speech and audio quality degrades depending on the characteristics of IP networks such as jitter and packet loss. In order to reduce this quality degradation, we propose a lower bit rate audio delivery scheme using the MPEG-2 AAC (Advanced Audio Coding) audio codec in a viewpoint that a packet loss rate could be reduced by a smaller packet size. In addition, iLBC (Internet Low-Bitrate Codec) and the G.711 packet loss concealment algorithm defined by IEFT and ITU-T, respectively, are applied to a audio collaboration system. RAT (Robust-Audio Tool)[7] is used as a baseline platform for the implementation of the proposed methods. It is shown from the implementation that the implemented MPEG-2 AAC audio codec with a bitrate of 256 kbit/s performs as similar as the uncompressed audio quality with a bitrate of 512 kbit/s, and that iLBC and the G.711 packet loss concealment algorithm can improve speech quality when a packet loss rate is 2~10%.

  • PDF

Digital Audio Effect System-on-a-Chip Based on Embedded DSP Core

  • Byun, Kyung-Jin;Kwon, Young-Su;Park, Seong-Mo;Eum, Nak-Woong
    • ETRI Journal
    • /
    • v.31 no.6
    • /
    • pp.732-740
    • /
    • 2009
  • This paper describes the implementation of a digital audio effect system-on-a-chip (SoC), which integrates an embedded digital signal processor (DSP) core, audio codec intellectual property, a number of peripheral blocks, and various audio effect algorithms. The audio effect SoC is developed using a software and hardware co-design method. In the design of the SoC, the embedded DSP and some dedicated hardware blocks are developed as a hardware design, while the audio effect algorithms are realized using a software centric method. Most of the audio effect algorithms are implemented using a C code with primitive functions that run on the embedded DSP, while the equalization effect, which requires a large amount of computation, is implemented using a dedicated hardware block with high flexibility. For the optimized implementation of audio effects, we exploit the primitive functions of the embedded DSP compiler, which is a very efficient way to reduce the code size and computation. The audio effect SoC was fabricated using a 0.18 ${\mu}m$ CMOS process and evaluated successfully on a real-time test board.

FIR ROOM RESPONSE CORRECTION SYSTEM (FIR 필터를 사용한 청취 환경 보정 시스템)

  • Arora Manish;Sung Ho-Young;Lee Hyuck-Jae;Lee Joon-Hyon
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.283-286
    • /
    • 2004
  • Due to advances in electronics very high quality audio reproduction is today possible. But the listening environment causes deviation of the audio system from the expected behavior. Firstly the listening Room significantly changes the audio signal frequencies and their phase. Secondly the position of the user in the room affects the perceived sound. With existing DSP technology it is possible to adequately correct these effects. In our work we developed a room correction system, correcting up to 7.1 channels using dual Motorola 56367 fixed point DSP's, implementing position dependent room effects measurement, real time compensation filter design and equalization filtering procedures.

  • PDF

A Robust Audio Fingerprinting System with Predominant Pitch Extraction in Real-Noise Environment

  • Son, Woo-Ram;Yoon, Kyoung-Ro
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.390-395
    • /
    • 2009
  • The robustness of audio fingerprinting system in a noisy environment is a principal challenge in the area of content-based audio retrieval. The selected feature for the audio fingerprints must be robust in a noisy environment and the computational complexity of the searching algorithm must be low enough to be executed in real-time. The audio fingerprint proposed by Philips uses expanded hash table lookup to compensate errors introduced by noise. The expanded hash table lookup increases the searching complexity by a factor of 33 times the degree of expansion defined by the hamming distance. We propose a new method to improve noise robustness of audio fingerprinting in noise environment using predominant pitch which reduces the bit error of created hash values. The sub-fingerprint of our approach method is computed in each time frames of audio. The time frame is transformed into the frequency domain using FFT. The obtained audio spectrum is divided into 33 critical bands. Finally, the 32-bit hash value is computed by difference of each bands of energy. And only store bits near predominant pitch. Predominant pitches are extracted in each time frames of audio. The extraction process consists of harmonic enhancement, harmonic summation and selecting a band among critical bands.

  • PDF

An Efficient Audio Indexing Scheme based on User Query Patterns (사용자 질의 패턴을 이용한 효율적인 오디오 색인기법)

  • 노승민;박동문;황인준
    • Journal of KIISE:Databases
    • /
    • v.31 no.4
    • /
    • pp.341-351
    • /
    • 2004
  • With the popularity of digital audio contents, querying and retrieving audio contents efficiently from database has become essential. In this paper, we propose a new index scheme for retrieving audio contents efficiently using audio portions that have been queried frequently. This scheme is based on the observation that users have a tendency to memorize and query a small number of audio portions. Detecting and indexing such portions enables fast retrieval and shows better performance than sequential search-based audio retrieval. Moreover, this scheme is independent of underlying retrieval system, which means this scheme can work together with any other audio retrieval system. We have implemented a prototype system and showed its performance gain through experiments.

A Content-based Audio Retrieval System Supporting Efficient Expansion of Audio Database (음원 데이터베이스의 효율적 확장을 지원하는 내용 기반 음원 검색 시스템)

  • Park, Ji Hun;Kang, Hyunchul
    • Journal of Digital Contents Society
    • /
    • v.18 no.5
    • /
    • pp.811-820
    • /
    • 2017
  • For content-based audio retrieval which is one of main functions in audio service, the techniques for extracting fingerprints from the audio source, storing and indexing them in a database are widely used. However, if the fingerprints of new audio sources are continually inserted into the database, there is a problem that space efficiency as well as audio retrieval performance are gradually deteriorated. Therefore, there is a need for techniques to support efficient expansion of audio database without periodic reorganization of the database that would increase the system operation cost. In this paper, we design a content-based audio retrieval system that solves this problem by using MapReduce and NoSQL database in a cluster computing environment based on the Shazam's fingerprinting algorithm, and evaluate its performance through a detailed set of experiments using real world audio data.

A 3D Audio-Visual Animated Agent for Expressive Conversational Question Answering

  • Martin, J.C.;Jacquemin, C.;Pointal, L.;Katz, B.
    • 한국정보컨버전스학회:학술대회논문집
    • /
    • 2008.06a
    • /
    • pp.53-56
    • /
    • 2008
  • This paper reports on the ACQA(Animated agent for Conversational Question Answering) project conducted at LIMSI. The aim is to design an expressive animated conversational agent(ACA) for conducting research along two main lines: 1/ perceptual experiments(eg perception of expressivity and 3D movements in both audio and visual channels): 2/ design of human-computer interfaces requiring head models at different resolutions and the integration of the talking head in virtual scenes. The target application of this expressive ACA is a real-time question and answer speech based system developed at LIMSI(RITEL). The architecture of the system is based on distributed modules exchanging messages through a network protocol. The main components of the system are: RITEL a question and answer system searching raw text, which is able to produce a text(the answer) and attitudinal information; this attitudinal information is then processed for delivering expressive tags; the text is converted into phoneme, viseme, and prosodic descriptions. Audio speech is generated by the LIMSI selection-concatenation text-to-speech engine. Visual speech is using MPEG4 keypoint-based animation, and is rendered in real-time by Virtual Choreographer (VirChor), a GPU-based 3D engine. Finally, visual and audio speech is played in a 3D audio and visual scene. The project also puts a lot of effort for realistic visual and audio 3D rendering. A new model of phoneme-dependant human radiation patterns is included in the speech synthesis system, so that the ACA can move in the virtual scene with realistic 3D visual and audio rendering.

  • PDF

Implementation of the TMS320C6701 DSP Board for Multichannel Audio Coding (멀티채널 오디오 부호화를 위한 TMS320C6701 DSP 보드 구현)

  • 장대영;홍진우;곽진석
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 1999.11a
    • /
    • pp.199-203
    • /
    • 1999
  • This paper is on the DSP system design and implementation for real time MPEG-2 AAC multichannel audio, and MPEG-4 object oriented audio coding. This DSP system employs two DSPs of the state of the art TMS320C6701, developed by TI semiconductor. DSP board has PCI interface for downloading application program and control the system. DSP board was designed to use for both encoder and decoder, by setting several switches. The system contains external input and output box also, for A/D and D/A conversion for eight channel audio. The input box converts multi channel digital audio to ADI format, that provides serial interface for eight channel digital audio. And the output box converts ADI format signal to multi channel audio. Through this ADI interface, DSP boards can be connected to input, output box. Implemented DSP system was tested for integration with MPEG-2 AAC encoder and decoder S/W. Currently the DSP system performs realtime AAC 4-channel audio encoding with two DSPs, and 8-channel decoding with one DSP.

  • PDF

Stability of Digital Audio Amplifier and Analysis on the Effect of Hysteresis (디지털 오디오 앰프의 안정성과 히스테리시스에 의한 영향 해석)

  • Doh, Tae-Yong;Jang, Byung-Tak;Ryoo, Tae-Ha;Ryoo, Ji-Yeol;Park, Hwan-Wook
    • Proceedings of the KIEE Conference
    • /
    • 2004.11c
    • /
    • pp.605-607
    • /
    • 2004
  • A class D digital audio amplifier with small size, low cost, and high quality is positively necessary in the multimedia era made of home theater system and the digital audio broadcasting (DAB). It is impossible to analyze the stability of the digital audio amplifier, which is based on the PWM signal processing. To solve this problem, the digital audio amplifier is analyzed using variable structure control theory which is one of nonlinear system theories. Moreover, the magnitude and the frequency of ripple signal, which generated by hysteresis in the comparator, is obtained using describing function which is useful to represent the input-output relation of nonlinear system.

  • PDF

A System-on-a-Chip Design for Digital TV

  • Rhee, Seung-Hyeon;Lee, Hun-Cheol;Kim, Sang-Hoon;Choi, Byung-Tae;Lee, Seok-Soo;Choi, Seung-Jong
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.5 no.4
    • /
    • pp.249-254
    • /
    • 2005
  • This paper presents a system-on-a-chip (SOC) design for digital TV. The single LSI incorporates almost all essential parts such as CPU, ISO/IEC 11172/13818 system/audio/video decoders, a video post-processor, a graphics/OSD processor and a display processor. It has analog IP's inside such as video DACs, an audio PLL, and a system PLL to reduce the system-level implementation cost. Descramblers and Smart Card interface are included to support widely used conditional access systems. The video decoder can decode two video streams simultaneously. The DSP-based audio decoder can process various audio coding specifications. The functional blocks for video quality enhancement also form outstanding features of this SoC. The SoC supports world-wide major DTV services including ATSC, ARIB, DVB, and DIRECTV.