• Title/Summary/Keyword: Audio Data

Search Result 883, Processing Time 0.026 seconds

Elevating Utilization Efficiency through the Multimedia Database Construction of Accompanying Materials (딸림자료의 멀티미디어 데이터베이스 구축을 통한 이용 효율 제고에 관한 연구)

  • Lee, Ju-Hyun;Lee, Eung-Bong
    • Journal of Information Management
    • /
    • v.35 no.2
    • /
    • pp.41-55
    • /
    • 2004
  • This study is expected to discuss some methods regarding the uplift of user's use convenience and the efficiency of material management by constructing the multimedia database through the digitalization of the especial audiotape-typed materials among accompanying materials. In order to do that, this paper dealt with the present management conditions of accompanying materials, the sorts of audio data formats, data format transformation, the methods of administration and utilization, etc. And this paper also presented the expected effect and problems caused by multimedia databases construction of accompanying materials.

Authoring Tool of Musical Slide Show MAF Contents

  • Sabirin Muhammad Syah Houari;Kim Mun-Churl
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2006.11a
    • /
    • pp.289-295
    • /
    • 2006
  • The Musical Slide Show MAF, which currently being standardized by MPEG, conveys the concept of combining several established standard technologies in a single file format. It defines the format of packing up MP3 audio data, along with MPEG-7 Simple Metadata Profile and MPEG-21 Digital Item Declaration metadata; with JPEC images and optional text, and synchronizes them all together to create a slideshow of JPEC image data associated to MP3 audio data during the audio playback. The implementation of Musical Slide Show MAF can be a music karaoke file where users can sing along while listening to the music, view the JPEG slideshow and reading the lyrics; or a story-telling file where users can listen to the narrated story by looking at the related illustration slideshow of the story In this paper we present the tool to producing the Musical Slide Show MAF contents. Regardless the knowledge of user on the MAF file format, the authoring tool simplify the manner of packaging several multimedia contents into single file.

  • PDF

An Experimental Delay Analysis Based on M/G/1-Vacation Queues for Local Audio/Video Streams

  • Kim, Doo-Hyun;Lee, Kyung-Hee;Kung, Sang-Hwan;Kim, Jin-Hyung
    • ETRI Journal
    • /
    • v.19 no.4
    • /
    • pp.344-362
    • /
    • 1997
  • The delay which is one of the quality of service parameters is considered to be a crucial factor for the effective usage of real-time audio and video streams in interactive multimedia collaborations. Among the various causes of the delay, we focus in this paper on the local delay concerned with the schemes which handle continuous inflow of encoded data from constant or variable bit-rate audio and video encoders. We introduce two kinds of implementation approaches, pull model and push model. While the pull model periodically pumps out the incoming data from the system buffer, the push model receives events from the device drivers. From our experiments based on Windows NT 3.51, it is shown that the push model outperforms the other for both constant and variable bit-rate streams in terms of the local delay, when the system suffers reasonable loads. We interpret this experimental data with M/G/1 multiple vacation queuing theories, and show that it is consistent with the queuing theoretic interpretations.

  • PDF

A PSIP Information Generating System for Produce Digital Access Program (디지털 방송 콘텐츠 제작을 위한 PSIP 정보 생성 시스템)

  • Hwang, Kyung-Min;Kim, Jong-Moon;Bang, Jin-Suk;Cho, Tae-Beom;Jung, Hoe-Kyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2007.10a
    • /
    • pp.615-618
    • /
    • 2007
  • It has able to express digital video/audio data from analog and to broadcast it via improvement of video/audio compression technology and publishing standard of MPEG-2 System. Nowaday many System Operators are provide regular digital broadcasting program to customer with their own access program. To provide access program, two process needs that are creating broadcasting information and remultiplexing it with video/audio data, and this solution is providing with high-cost system only. For this reason, digital access program bas week point to product. In this paper, we designed and implemented Generating PSIP Information System to product digital access program which generate PSIP information via receiving broadcasting information from user, and map PSIP information directly to video/audio data.

  • PDF

Audio Transformation Filter for Multimedia I/O Server (멀티미디어 입출력 서버를 위한 오디오 변환 필터)

  • Cho, Byoung-Ho;Jang, Yu-Tak;Kim, Woo-Jin;Kim, Ki-Jong;Yoo, Ki-Young
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.6 no.6
    • /
    • pp.580-587
    • /
    • 2000
  • In this paper, we present a design method of a digital filter converting humming voice melody into MIDI data and a method of adapting it to a distributed multimedia I/O server. MuX uses device-independent DLMs(Dynamic Linking Module) for the inteliace with various I/O devices, and has wave-form audio DLM and MIDI DLM for audio interfaces. In order to expand the audio device interfacing ability of MuX system, we have designed and implemented a filter transforming human voice into MIDI messages. As the methods to input MIDI data are expanded to human voice in addition to MIDI files and MIDI instrument, someone who is not good at playing instruments can also generate the MIDI data, which enables our media interfaces to be used in various applications.

  • PDF

New Interactive TV Service Model based on the MPEG-4 System

  • Kim, Jongho;Jechang Jeong
    • Proceedings of the IEEK Conference
    • /
    • 2002.07a
    • /
    • pp.125-128
    • /
    • 2002
  • In this paper, a new interactive TV service model is proposed. The MPEG-4 system is specified for composing and managing various object streams including user interactions. The data broadcasting model supporting user interactions is designed using MPEG-4 system in our proposal. We evaluate possibility of proposed service model using simulation player. This player supports MPEG-2 TS which contains MPEG-2 video and AC-3 audio streams as a main service and MPEC-4 system data as interactive services as well as user specific EPG information, and XML data, etc as supplemetary services. The player also supports a multi-channel environment. The synchronization between audio and visual data is achieved by DTS and PTS in TS.

  • PDF

A Study on Implementing of AC-3 Decoding Algorithm Software (AC-3 Decoding Algorithm Software 구현에 관한 연구)

  • 이건욱;박인규
    • Proceedings of the IEEK Conference
    • /
    • 1998.10a
    • /
    • pp.1215-1218
    • /
    • 1998
  • 본 논문은 Digital Audio Compression(AC-3) Standard 인 A-52를 기반으로 하였으며 Borland C++3.1 Compiler를 사용하여 AC-3 Decoding Algorithm 구현하였다. Input Stream은 DVD VOB File에서 AC-3 Stream만을 분리하여 사용하며 최종 출력은 16 Bit PCM File이다. AC-3의 Frame구조는 Synchronization Information, Bit Stream Information, Audio Block, Auxiliary Data, Error Check로 구성된다. Aduio Block 은 모두 6개의 Block으로 나뉘어져 있다. BSI와 Side Information을 참조하여 Exponent를 추출하여 Exponent Strategy에 따라 Exponent를 복원한다. 복원된 Exponent 정보를 이용하여 Bit Allocation을 수행하여 각각의 Mantissa에 할당된 Bit수를 계산하고 Stream으로부터 Mantissa를 추출한다. Coupling Parameter를 참조하ㅕ Coupling Channel을 Original Channel로 복원시킨다. Stereo Mode에 대해서는 Rematrixing을 수행한다. Dynamic Range는 Mantissa와 Exponent의 Magnitude를 바꾸는 것으로 선택적으로 사용할 수 있다. Mantissa와 Exponent를 결합하여 Floating Point coefficient로 만든 후 Inverse Transform을 수행하면 PCM Data를 얻을 수 있다. PC에서 듣기 위해서는 Multi Channel을 Stereo나 Mono로 Downmix를 수행한다. 이렇게 만들어진 PCM data는 PCM Data를 재생하는 프로그램으로 재생할 수 있다.

  • PDF

Speech Watermark Based on Patchwork for Digital Broadcasting (디지털 방송을 위한 패치워크 기반 음성 워터마크)

  • 여인권;김형중;최용희;김기섭
    • Journal of Broadcast Engineering
    • /
    • v.5 no.2
    • /
    • pp.220-226
    • /
    • 2000
  • A novel audio watermark algorithm, the Modified Patchwork Algorithm, is applied to the speech to show that it is effective for digital broadcasting systems. Digital broadcasting system does not separate speech from audio data. However. speech data is very important especially for educational broadcasting. Speech can carry more information than video data. Thus, intellectual property management and protection for speech data is urgent. This paper addresses the technical issues, speech watermark algorithm, and its robustness against malicious attacks.

  • PDF

An Evaluation on the Audio-visual Investment Fund's Contribution to Korean Film Production Capital (한국영화 제작자본에 대한 영상전문투자조합 정책의 기여도 평가)

  • Kim, Mee-Hyun
    • The Journal of the Korea Contents Association
    • /
    • v.19 no.9
    • /
    • pp.212-220
    • /
    • 2019
  • This study evaluates the extent to which the government's financial support policy, the Audio-visual investment fund, contributed to raising capital for Korean films. Audio-visual investment fund in the Korean film industry, which has been formed through the public sector support since 1999. The Audio-visual investment fund is a leading financial support policy for the Korean film industry, and began with the investment of the Small and Medium Business Administration and the Korean Film Council. It has become an important source of Korean film production costs and has spread to other cultural industry sectors, as a way of capital procurement for a start-up companies and cultural projects. This study reconstruct the data of the organizations such as the size of a new investment fund by public sector, the ratio of public capital contribution, the amount and number of investment in Korean films, investment multiplier compared to equity investment, and the internal return rate(IRR) of liquidation funds in the Korean film capital market from 1999 to 2017. The purpose of this project was to provide the basis for assessing the achievements of the Audio-visual investment fund policy in contributing to the growth of the film industry.

Towards Low Complexity Model for Audio Event Detection

  • Saleem, Muhammad;Shah, Syed Muhammad Shehram;Saba, Erum;Pirzada, Nasrullah;Ahmed, Masood
    • International Journal of Computer Science & Network Security
    • /
    • v.22 no.9
    • /
    • pp.175-182
    • /
    • 2022
  • In our daily life, we come across different types of information, for example in the format of multimedia and text. We all need different types of information for our common routines as watching/reading the news, listening to the radio, and watching different types of videos. However, sometimes we could run into problems when a certain type of information is required. For example, someone is listening to the radio and wants to listen to jazz, and unfortunately, all the radio channels play pop music mixed with advertisements. The listener gets stuck with pop music and gives up searching for jazz. So, the above example can be solved with an automatic audio classification system. Deep Learning (DL) models could make human life easy by using audio classifications, but it is expensive and difficult to deploy such models at edge devices like nano BLE sense raspberry pi, because these models require huge computational power like graphics processing unit (G.P.U), to solve the problem, we proposed DL model. In our proposed work, we had gone for a low complexity model for Audio Event Detection (AED), we extracted Mel-spectrograms of dimension 128×431×1 from audio signals and applied normalization. A total of 3 data augmentation methods were applied as follows: frequency masking, time masking, and mixup. In addition, we designed Convolutional Neural Network (CNN) with spatial dropout, batch normalization, and separable 2D inspired by VGGnet [1]. In addition, we reduced the model size by using model quantization of float16 to the trained model. Experiments were conducted on the updated dataset provided by the Detection and Classification of Acoustic Events and Scenes (DCASE) 2020 challenge. We confirm that our model achieved a val_loss of 0.33 and an accuracy of 90.34% within the 132.50KB model size.