• Title/Summary/Keyword: Digital audio

Search Result 626, Processing Time 0.027 seconds

An Efficient Audio Indexing Scheme based on User Query Patterns (사용자 질의 패턴을 이용한 효율적인 오디오 색인기법)

  • 노승민;박동문;황인준
    • Journal of KIISE:Databases
    • /
    • v.31 no.4
    • /
    • pp.341-351
    • /
    • 2004
  • With the popularity of digital audio contents, querying and retrieving audio contents efficiently from database has become essential. In this paper, we propose a new index scheme for retrieving audio contents efficiently using audio portions that have been queried frequently. This scheme is based on the observation that users have a tendency to memorize and query a small number of audio portions. Detecting and indexing such portions enables fast retrieval and shows better performance than sequential search-based audio retrieval. Moreover, this scheme is independent of underlying retrieval system, which means this scheme can work together with any other audio retrieval system. We have implemented a prototype system and showed its performance gain through experiments.

Audio Event Detection Using Deep Neural Networks (깊은 신경망을 이용한 오디오 이벤트 검출)

  • Lim, Minkyu;Lee, Donghyun;Park, Hosung;Kim, Ji-Hwan
    • Journal of Digital Contents Society
    • /
    • v.18 no.1
    • /
    • pp.183-190
    • /
    • 2017
  • This paper proposes an audio event detection method using Deep Neural Networks (DNN). The proposed method applies Feed Forward Neural Network (FFNN) to generate output probabilities of twenty audio events for each frame. Mel scale filter bank (FBANK) features are extracted from each frame, and its five consecutive frames are combined as one vector which is the input feature of the FFNN. The output layer of FFNN produces audio event probabilities for each input feature vector. More than five consecutive frames of which event probability exceeds threshold are detected as an audio event. An audio event continues until the event is detected within one second. The proposed method achieves as 71.8% accuracy for 20 classes of the UrbanSound8K and the BBC Sound FX dataset.

A study on the hearing characteristic based equalizer design for the elderly (고령층의 가청주파수 특성을 고려한 이퀄라이저 연구)

  • Lee, Chul-Hee;Hong, Sung-Kyoo
    • Journal of Digital Contents Society
    • /
    • v.19 no.4
    • /
    • pp.779-787
    • /
    • 2018
  • This study delves into how the equalizer can compensate for a sound pressure of lost frequencies. The targeted audiences are senior citizens who have difficulties hearing high-frequency because of a decline of audio frequency. Through investigations, this study confirms that the reason why reduction of high-frequency hearing increases depending on senescence. By considering the features of audio frequency of senior citizens, it also clarifies the necessity of equalizer reflecting features of audio frequency for the senior citizens, which have dramatically increased in Korea. There are application programs having functions, which provide several options of equalizer setup that people can adjust it depending on their own audio frequency. Some of them provide different equalizer setup depending on age. This study, however, reveals that they are not fully enough to compensate for the range of hearing loss of the senior citizens. Therefore, by pointing out limitations of existing functions and suggesting improvements, this study explores the way of improvements that enhance the sound transmissions of digital media contents for senior citizens.

The development of Intuitive User Interface and Control Software for Audio Mixer in Digital PA System (디지털전관방송시스템을 위한 오디오믹서의 직관적인 사용자 인터페이스 및 제어 소프트웨어 개발)

  • Kim, Kwan Woong;Cho, Juphil
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.11 no.3
    • /
    • pp.307-312
    • /
    • 2018
  • In this paper, we can confirm the result of intuitive interface software implementation for operating a digital PA(Public Address) controller and the performance of audio mixer control part. Developed user interface software provides the maintaining management and control function of digital hybrid mixer. This SW loaded in the integrated control server controls an sound status of the audio mixer TAD-168M and checks the device status for Public Address integrated system. Also, this SW enables the integrated control and the continuous upgrade. Developed SW is connected to TAD-168M with Ethernet and linked to PC Lan port and the 4-port switch, located in the backside of TAD-168M, by LAN cable for communicating with operating PC. Integrated control including system management, audio control and uplink broadcasting control for broadcasting system will be made available with this novel developed system.

The Modern Reader and The Past Literature (현대(現代)의 독자(讀者)와 과거(過去)의 문학(文學))

  • Kim, Kyun-tae
    • Journal of Korean Classical Literature and Education
    • /
    • no.16
    • /
    • pp.5-27
    • /
    • 2008
  • It is not a simple topic how let the modern readers read the past literature in the these days of digital. But even though the changes of the times, we must not let 'the paper-books(the thing written with letters)' disappear because of 'the audio-visual texts(the thing made with digital media as drama-opera, animated cartoon, animated image)'. The Electronic medias should be used so as helping for us to understand contents of the paper-books. Because of them, the paper-books must not be expelled. It is no need certainly for the reading materials to be made with Paper-books. For example, the electronic-books in order to read also would not become problems. Moreover, the electronic-books to be made with various electronic media can also provide the audio-visual materials for readers well to understand contents of the books. For that reason, the electronic-books would be helped to read effectively. Besides after reading the original texts, the readers to try the 'rewriting', with using the meanings for oneself to get from the texts would be able to make a synopsis or story-telling for other art performances. These works are things positively to be stimulated, because of giving the achievement motivations to the readers. To conclude, the audio-texts reading and the visual-texts reading should be developed so that the paper-books to be revitalize. And though the modern readers dislike to read the paper-books, We should try to make the audio-visual texts base on the paper-books. Therefore the paper-books and audio-visual texts are inter-complementary relationships, not competitive relationships.

Effective BER Measurement System for Terrestrial DMB (지상파 DMB를 위한 효율적인 비트오류율 측정시스템)

  • 김상훈;임중곤;김만식;이종화
    • Journal of Broadcast Engineering
    • /
    • v.8 no.3
    • /
    • pp.250-258
    • /
    • 2003
  • Recently, the transition from conventional analog broadcasting to digital broadcasting has been proceeding as a result of the advance in digital multimedia broadcasting technique. In radio broadcasting, Eureka-147 DAB(Digital Audio Broadcasting) was decided as the standard system of digital radio broadcasting in Korea. In addition to CD quality audio, a variety of data services and excellent performance in mobile reception can be served by DAB, and DAB was evolved into DMB(Digital Multimedia Broadcasting) in Korea for the purpose of emphasizing moving picture multimedia service by DAB. In case of digital broadcasting, it is absolutely essential to measure the BER(Bit Error Rate) in the received signal in order to evaluate the coverage obtained by a transmitter and the quality of the received signal. In this paper, we propose efficient subchannel data structure and BER measurement algorithm. and then verify it by laboratory experiments. With a proposed method, the synchronization for BER measurement is easily obtained and especially the exact results can be obtained by classifying the lost bits which are included in the reception-failed CIFs(Common Interleaved Frame) into errors. This makes the proposed BER measurement system especially appropriate to DMB in which the frequent changes in channel status caused by mobile reception environment exist.

Digital Audio Watermarking Based on Spread Spectrum Techniques (스프레드 스펙트럼 기반 디지털 오디오 워터마킹 기법 연구)

  • 진창윤;최창렬;정제창
    • Proceedings of the IEEK Conference
    • /
    • 2001.06d
    • /
    • pp.257-260
    • /
    • 2001
  • In this paper, we propose a robust audio watermarking method. The proposed watermarking algorithm is composed of a psychoacoustic model to achieve perceptual transparency and spread spectrum technique to embed watermark. The watermark is embedded in each audio frame by adding a perceptually-shaped pseudo-random sequence. We demonstrate the robustness of the watermarking algorithm.

  • PDF

A 2.5 V 109 dB DR ΔΣ ADC for Audio Application

  • Noh, Gwang-Yol;Ahn, Gil-Cho
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.10 no.4
    • /
    • pp.276-281
    • /
    • 2010
  • A 2.5 V feed-forward second-order deltasigma modulator for audio application is presented. A 9-level quantizer with a tree-structured dynamic element matching (DEM) was employed to improve the linearity by shaping the distortion resulted from the capacitor mismatch of the feedback digital-toanalog converter (DAC). A chopper stabilization technique (CHS) is used to reduce the flicker noise in the first integrator. The prototype delta-sigma analogto-digital converter (ADC) implemented in a 65 nm 1P8M CMOS process occupies 0.747 $mm^2$ and achieves 109.1 dB dynamic range (DR), 85.4 dB signal-to-noise ratio (SNR) in a 24 kHz audio signal bandwidth, while consuming 14.75 mW from a 2.5 V supply.