• Title/Summary/Keyword: Audio file

Search Result 95, Processing Time 0.051 seconds

A Fully Synthesizable Bluetooth Baseband Module for a System-on-a-Chip

  • Chun, Ik-Jae;Kim, Bo-Gwan;Park, In-Cheol
    • ETRI Journal
    • /
    • v.25 no.5
    • /
    • pp.328-336
    • /
    • 2003
  • Bluetooth is a specification for short-range wireless communication using the 2.4 GHz ISM band. It emphasizes low complexity, low power, and low cost. This paper describes an area-efficient digital baseband module for wireless technology. For area-efficiency, we carefully consider hardware and software partitioning. We implement complex control tasks of the Bluetooth baseband layer protocols in software running on an embedded microcontroller. Hardware-efficient functions, such as low-level bitstream link control; host controller interfaces (HCIs), such as universal asynchronous receiver transmitter (UART) and universal serial bus (USB)interfaces; and audio Codec are performed by dedicated hardware blocks. Furthermore, we eliminate FIFOs for data buffering between hardware functional units. The design is done using fully synthesizable Verilog HDL to enhance the portability between process technologies so that our module can be easily integrated as an intellectual property core no system-on-a-chip (SoC) ASICs. A field programmable gate array (FPGA) prototype of this module was tested for functional verification and realtime operation of file and bitstream transfers between PCs. The module was fabricated in a $0.25-{\mu}m$ CMOS technology, the core size of which was only 2.79 $mm{\times}2.80mm$.

  • PDF

Design and Implementation of Voice EPG Platform within Voice EPG Generator for Terrestrial DMB (음성 EPG 생성기를 내장한 지상파 DMB용 음성 EPG 플랫폼 설계 및 구현)

  • Kim, Kyung-Nam;Lim, Choong-Soo;Cheon, Kyeong-Jae;Kim, Hwan-Chul;Choi, Jung-Hoon
    • Proceedings of the KIEE Conference
    • /
    • 2007.04a
    • /
    • pp.275-277
    • /
    • 2007
  • Recent activation of DMB has enabled various high quality video, audio and data services. And there are various user facilities functions using digital data transmission. One of the various user facilities functions is EPG(Electronic Program Guide). EPG supports schedule of programs on screen for. audiences. EPG is composed to time, title, channel, genre etc. Users can select a program what they want to browsing. Currently EPG services are displaying program schedule on screen visually and make users to input ke:ywords with keypads, remote control devices or touch screen etc. However, this approach could cause a serious restriction to some users like to drivers or visually handicapped persons. A standard for a voice EPG to T-DMB is proposed. This method must be transferred VoiceXML based EPG files from the transmitter to receivers. This approach has a problem to process a standardization because the transmitter and receivers should be modified. We proposed and implemented a voice EPG platform that generates the voice EPG files from T-DMB SI without transferring voice EPG file from the transmitter.

  • PDF

MPEG-A PART 9 DIGITAL MULTIMEDIA BROADCASTING APPLICATION FORMAT

  • Sabirin, Muhammad Syah Houari;Lee, Jung-Soo;Kim, Hui-Yong;Kim, Mun-Churl;Kim, Yong-Han
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.346-350
    • /
    • 2009
  • Digital Multimedia Broadcasting (DMB) is the mobile TV service based on a digital radio transmission system that provides high quality audio/video and other auxiliary data services. As users want to store the DMB content in their device to be consumed later or to be shared among users, a standardized format needs to be specified to guarantee the interoperability for the DMB contents for various devices. DMB AF (Application Format) specification defines a file format for DMB contents and services. It specifies how to combine the variety of DMB contents with associated information for a presentation in a well-defined format that facilitates storage, interchange, management, editing, and presentation of the DMB contents in protected, governed, and interoperable ways. In this paper we present our implementation of DMB AF as part of the development of DMB AF reference software. Our implementation of DMB AF is developed as the reference software for the standard specification that consists of a three applications: packager, media player, metadata browser and collection of supporting libraries used by the applications.

  • PDF

Development of software demultiplexer for DTV Signal Reception (DTV 수신을 위한 소프트웨어 Demultiplexer 개발)

  • Jin, Hyun-Joon;Park, Nho-Kyung;Kim, Moo-Han
    • Journal of IKEEE
    • /
    • v.7 no.2 s.13
    • /
    • pp.188-196
    • /
    • 2003
  • In this paper, a demultiplexer for MPEG-2 Transport Stream which can be applied to terrestrial digital broadcast is developed. The demultiplexer separates video, audio, and data from MPEG-2 multimedia stream and transports them to each decoders respectively. While most existing demultiplexers of MPEG-2 transport stream have been developed as hardware systems, but the fast increment of computer's performance enables a software demultiplexer to be worked in realtime. The developed demultiplexer is implemented as a software module called a filter using DirectShow of Microsoft which is based on COM(Component Object Model)and works on the Windows system. The operation of the demultiplexer is verified by using the GraphEdit tool and rendering a test file formatted as MPEG-2 transport stream.

  • PDF

Design and Analysis of a New Video Conference System Supporting the NAT of Firewall (방화벽 NAT를 지원하는 새로운 다자간 화상회의 시스템의 설계 및 분석)

  • Jung, Yong-Deug;Kim, Gil-Choon;Jeon, Moon-Seog
    • The Journal of Society for e-Business Studies
    • /
    • v.9 no.4
    • /
    • pp.137-155
    • /
    • 2004
  • A video-conference system is being utilized in web based application services in various fields due to the widespread use of Internet and the progress of computer technologies. This system should use the public IP address for sharing file and white board and it is difficult to manage the internal network users of the firewall and non-public IP address users. In this paper, we propose an Application Level Gateway which transforms non-public IP address into public IP address. This mechanism is for the internal network users of the firewall or non-public IP address users over the Internet. We also propose a Control Daemon which manages video and audio media dynamically according to network bandwidth. This mechanism can start and terminate a video conference and manage the process of the video conference.

  • PDF

Implementation of A Multimedia Streaming System using MPEG-4 (MPEG-4 표준을 이용한 멀티미디어 스트리밍 시스템 구현)

  • 임동근;이정우;김선태;마평수;호요성
    • Journal of Broadcast Engineering
    • /
    • v.6 no.3
    • /
    • pp.215-224
    • /
    • 2001
  • In recent days, research activities on multimedia services mainly focus on the multiplexing system with timing synchromization for media components, such as video, audio and text. The MPEG-4 standard emphasizes object-based coding which includes analysis and understanding of the Image content. Since in MPEG-4 we can define objects and encode them independently, we can manipulate and display each object for different applications. This feature of MPEG-4 is also vero useful for multimedia services, such as video streaming cia different network channels, digital versatile disc, internet TV, video E-mail, and so on. In this Paper, we implement a multimedia streaming system which is compliant with the MPEG-4 system and the MP4 file format.

  • PDF

Design and Implementation of Multimedia Data Retrieval System using Image Caption Information (영상 캡션 정보를 이용한 멀티미디어 데이터 검색 시스템의 설계 및 구현)

  • 이현창;배상현
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.8 no.3
    • /
    • pp.630-636
    • /
    • 2004
  • According to the increase of audio and video data utilization, the presentation of multimedia data contents and the work of retrieving, storing and manipulating a multimedia data have been the focus of recent work. The display for multimedia data should retrieve and access the contents easily that users want to present. This study is about the design and implementation of a system to retrieve multimedia data based on the contents of documentation or the caption information of a multimedia data for retrieving documentation including multimedia data. It intends to develop an filtering step to retrieve all of keyword within the caption information of multimedia data and text of a documentation. Also, the system is designed to retrieve a large amount of data quickly using an inverted file structure available for B+ tree.

Design and Implementation of a Bluetooth Baseband Module based on IP (IP에 기반한 블루투스 기저대역 모듈의 설계 및 구현)

  • Lim, Ji-Suk;Chun, Ik-Jae;Kim, Bo-Gwan
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.04b
    • /
    • pp.1285-1288
    • /
    • 2002
  • Bluetooth wireless technology is a publicly available specification proposed for Radio Frequency (RF) communication for short-range and point-to- multipoint voice and data transfer. It operates in the 2.4GHz ISM(Industrial, Scientific and Medical) band and offers the potential for low-cost, broadband wireless access for various mobile and portable devices at range of about 10 meters. In this paper, we describe the structure and the test results of the bluetooth baseband module we have developed. This module was developed based on IP reuse. So Interface of each module such as link controller UART, and audio CODEC is designed based on ARM7 comfortable processor. We also considered various interfaces of related external chips. The fully synthesizable baseband module was fabricated in a $0.25{\mu}m$ CMOS technology occupying $2.79{\times}2.8mm^2$ area including the ARM TDMI processor. And a FPGA implementation of this module is tested for file and bit-stream transfers between PCs.

  • PDF

CutPaste-Based Anomaly Detection Model using Multi Scale Feature Extraction in Time Series Streaming Data

  • Jeon, Byeong-Uk;Chung, Kyungyong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.8
    • /
    • pp.2787-2800
    • /
    • 2022
  • The aging society increases emergency situations of the elderly living alone and a variety of social crimes. In order to prevent them, techniques to detect emergency situations through voice are actively researched. This study proposes CutPaste-based anomaly detection model using multi-scale feature extraction in time series streaming data. In the proposed method, an audio file is converted into a spectrogram. In this way, it is possible to use an algorithm for image data, such as CNN. After that, mutli-scale feature extraction is applied. Three images drawn from Adaptive Pooling layer that has different-sized kernels are merged. In consideration of various types of anomaly, including point anomaly, contextual anomaly, and collective anomaly, the limitations of a conventional anomaly model are improved. Finally, CutPaste-based anomaly detection is conducted. Since the model is trained through self-supervised learning, it is possible to detect a diversity of emergency situations as anomaly without labeling. Therefore, the proposed model overcomes the limitations of a conventional model that classifies only labelled emergency situations. Also, the proposed model is evaluated to have better performance than a conventional anomaly detection model.

Music Transcription Using Non-Negative Matrix Factorization (비음수 행렬 분해 (NMF)를 이용한 악보 전사)

  • Park, Sang-Ha;Lee, Seok-Jin;Sung, Koeng-Mo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.2
    • /
    • pp.102-110
    • /
    • 2010
  • Music transcription is extracting pitch (the height of a musical note) and rhythm (the length of a musical note) information from audio file and making a music score. In this paper, we decomposed a waveform into frequency and rhythm components using Non-Negative Matrix Factorization (NMF) and Non-Negative Sparse coding (NNSC) which are often used for source separation and data clustering. And using the subharmonic summation method, fundamental frequency is calculated from the decomposed frequency components. Therefore, the accurate pitch of each score can be estimated. The proposed method successfully performed music transcription with its results superior to those of the conventional methods which used either NMF or NNSC.