• Title/Summary/Keyword: Audio Data

Search Result 887, Processing Time 0.029 seconds

Digital Watermarking by Rearranging and Modifying DCT Coefficients

  • Lee, Hee sup;Oh, Sang-Heun;Lee, Keun-Young
    • Proceedings of the IEEK Conference / 2000.07b / pp.902-905 / 2000
  • Because of the rapid growth of the Internet and multimedia applications, protecting intellectual property rights (IPR) has become a critical issue. Digital watermarking is one way to address this problem. It can be applied to multimedia data such as digital images, digital video, and digital audio. In this paper, we propose a digital watermarking technique for digital images that authenticates an owner or an image by embedding visually recognizable patterns, such as logos, signatures, or stamps, into images in the BDCT (block discrete cosine transform) frequency domain. The proposed method sorts the components of an original image twice. At the same time, it rearranges the components of the watermark twice for greater robustness, and finally embeds the watermark into the image. Experimental results using a combination of three similarity measurements show that the proposed method is robust to image cropping, image filtering, and JPEG (Joint Photographic Experts Group) compression, both subjectively and objectively.


Screaming data analysis for security system with audio capability

  • Lee, So-Min;Byun, Sung-Woo;Li, Shi-Cong;Kim, Kwang-Yong;Chung, Il-Gu;Lee, Seok-Pil
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2013.11a / pp.85-87 / 2013
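  • This paper analyzes the characteristics of screams in order to build a crime-prevention system that detects human screams in environmental noise and identifies dangerous situations. Screams were recorded in three situations (when startled, when in urgent need, and when in pain) and converted into frequency-domain signals for analysis. We analyze the frequency band in which the amplitude is largest in the scream data, the differences in frequency distribution across situations, and the differences in frequency band and distribution between men and women.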

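The frequency analysis described in the abstract above (finding the band where amplitude is largest) can be illustrated with a minimal sketch. This is not the authors' procedure: the function name `dominant_frequency`, the plain discrete Fourier transform, and the synthetic two-tone signal standing in for a recording are all assumptions made for illustration.

```python
import cmath
import math


def dominant_frequency(samples, sample_rate):
    """Return the frequency (Hz) of the largest-magnitude DFT bin, ignoring DC."""
    n = len(samples)
    best_bin, best_mag = 1, 0.0
    # For a real-valued signal only bins 1 .. n/2 carry distinct information.
    for k in range(1, n // 2 + 1):
        acc = sum(
            x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(samples)
        )
        if abs(acc) > best_mag:
            best_bin, best_mag = k, abs(acc)
    return best_bin * sample_rate / n


# Hypothetical stand-in for a recording: a strong 1500 Hz tone plus a weaker 300 Hz tone.
SAMPLE_RATE = 8000  # Hz (assumed)
N = 400             # gives a bin spacing of 20 Hz
signal = [
    1.0 * math.sin(2 * math.pi * 1500 * i / SAMPLE_RATE)
    + 0.3 * math.sin(2 * math.pi * 300 * i / SAMPLE_RATE)
    for i in range(N)
]
peak_hz = dominant_frequency(signal, SAMPLE_RATE)
```

With real recordings one would typically use an FFT library and compare peak bands per situation or per speaker group; the direct DFT here only keeps the sketch dependency-free.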

An efficient video multiplexer for the transmission of the DMB multimedia data

  • Na Nam-Woong;Baek Sun-Hye;Hong Sung-Hoon
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2003.11a / pp.183-186 / 2003
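  • DMB (Digital Multimedia Broadcasting) is a new broadcasting standard for providing multimedia services, including video, audio, and text data, based on the Eureka-147 DAB (Digital Audio Broadcasting) transmission system, the European digital audio broadcasting standard. Accordingly, in addition to the Eureka-147 DAB transmission part, a DMB system includes a media encoder/decoder that compresses video and audio, and a video multiplexer/demultiplexer that multiplexes the compressed media streams. Through an analysis of the video multiplexer of the DMB standard, this paper presents a new video multiplexing structure that can provide extended transmission functions and high transmission efficiency. To evaluate the standard and the proposed video multiplexers, we analyze them functionally and measure their transmission efficiency through simulation.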


An Implementation of TPEG Transmission and Reception Systems Based on Digital Radio Mondiale

  • Kwon, Ki-Won;Park, Kyung-Won;Jeon, Won-Gi;Choi, Young-Keon;Chang, Tae-Wook
    • Proceedings of the Korea Institute of Information and Telecommunication Facilities Engineering Conference / 2008.08a / pp.125-128 / 2008
  • In this paper, a Transport Protocol Experts Group (TPEG) transmission and reception system based on Digital Radio Mondiale (DRM) is implemented for traffic information services. DRM is the European radio broadcasting standard for bringing AM radio into the digital domain, designed to work at frequencies below 30 MHz. DRM can offer various data services, such as text messages and a slide show service, as well as audio services. Despite its low data rate, a TPEG system on DRM has the advantage that a nationwide traffic information service can be established, because DRM offers a wider service range than terrestrial digital multimedia broadcasting (T-DMB) and can easily achieve a single-frequency network.


Design Method of Variable Point Prime Factor FFT For DRM Receiver

  • Kim, Hyun-Sik;Lee, Youn-Sung;Seo, Jeong-Wook;Baik, Jong-Ho
    • Proceedings of the Korea Institute of Information and Telecommunication Facilities Engineering Conference / 2008.08a / pp.257-261 / 2008
  • The Digital Radio Mondiale (DRM) system is a digital broadcasting standard designed for use in the LF, MF, and HF broadcasting bands below 30 MHz. The system provides both superior audio quality and improved user services and operability compared with existing AM transmissions. In this paper, we propose a variable-point Prime Factor FFT design method for the DRM system. The proposed method efficiently processes the various IFFT/FFT sizes of the robustness modes in the DRM standard by building a Radix-Prime Factor FFT processing unit, similar in form to a radix-4 unit, through the insertion of a variable prime-factor twiddle factor and garbage data. This overcomes two limitations: an existing radix processor cannot process the 112/176/256/288-point FFTs of the DRM modes, and processing the IFFT/FFT in software on an existing high-speed DSP increases memory size and memory access time.

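To see why the FFT sizes named in the abstract above do not all suit one fixed-radix unit, it helps to factor them. The sketch below is only illustrative arithmetic, not the paper's design; the helper names `prime_factors` and `is_power_of_four` are assumptions introduced here.

```python
def prime_factors(n):
    """Return the prime factorization of n as a {prime: exponent} dict."""
    factors = {}
    p = 2
    while p * p <= n:
        while n % p == 0:
            factors[p] = factors.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        factors[n] = factors.get(n, 0) + 1
    return factors


def is_power_of_four(n):
    """True when n = 4**k, i.e. a pure radix-4 decomposition covers it."""
    f = prime_factors(n)
    return set(f) == {2} and f[2] % 2 == 0


# FFT sizes listed in the abstract.
DRM_FFT_SIZES = [112, 176, 256, 288]
factorizations = {n: prime_factors(n) for n in DRM_FFT_SIZES}
radix4_only = [n for n in DRM_FFT_SIZES if is_power_of_four(n)]
```

Only 256 is a pure power of four. The sizes 112, 176, and 288 contain the odd factors 7, 11, and 3 respectively, which is consistent with the abstract's motivation for a prime-factor approach that handles all four sizes in one unit.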

Collaborative Filtering and Genre Classification for Music Recommendation

  • Byun, Jeong-Yong;Nasridinov, Aziz
    • Proceedings of the Korea Information Processing Society Conference / 2014.11a / pp.693-694 / 2014
  • This short paper briefly describes a proposed music recommendation method that provides suitable music pieces to a listener based on both listeners' ratings and the content of the music pieces. The proposed method consists of two parts. First, the rating prediction method combines the traditional user-based and item-based collaborative filtering methods. Second, the genre classification method combines feature extraction and classification procedures. The feature extraction step obtains audio signal information and stores it in a data structure, while the classification step assigns the music pieces to genres using a decision tree algorithm.

Multimodal Cough Detection Model Using Audio and Acceleration Data

  • Kang, Jae-Sik;Back, Moon-Ki;Choi, Hyung-Tak;Won, Yoon-Seung;Lee, Kyu-Chul
    • Proceedings of the Korea Information Processing Society Conference / 2018.10a / pp.746-748 / 2018
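  • Worldwide, influenza causes 290,000 to 640,000 deaths every year, along with social and economic damage. Droplets produced by coughing are a major transmission route for influenza, so cough detection technology can help prevent its spread. Previous studies on cough detection used cough sounds and traditional machine learning techniques. This paper proposes a multimodal deep-learning-based cough detection model that simultaneously learns from cough sounds and the body movement that occurs during coughing. A performance comparison between the resulting model and existing models shows that the proposed model recognizes coughs more accurately than previous cough detection models. Applied to wearable devices such as smartwatches, the proposed model could contribute greatly to preventing the spread of influenza.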

Scheduling Algorithms for Downlink Rate Allocation in Heterogeneous CDMA Networks

  • Varsou, Aikaterini C.;Poor, H. Vincent
    • Journal of Communications and Networks / v.4 no.3 / pp.199-208 / 2002
  • The downlink rate scheduling problem is considered for CDMA networks with multiple users carrying packets of heterogeneous traffic (voice/audio only, bursty data only, or mixed traffic), with each type having its own distinct quality of service requirements. Several rate scheduling algorithms are developed, the common factor of which is that part of the decision on which users to serve is based on a function of the deadline of their head-of-line packets. An approach of Andrews et al., in which the basic Earliest-Deadline-First algorithm is studied for similar systems, is extended to achieve better performance by using power more efficiently and by allowing service of more than one user per timeslot if the power resources permit it. Finally, the performance of the proposed schemes is compared through simulations.
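The general idea in the abstract above (ordering users by the deadline of their head-of-line packet, and serving more than one user per timeslot when power permits) can be sketched as a simple greedy loop. This is not any of the paper's algorithms: the `User` record, its fields, the per-user power figures, and the `schedule_slot` function are hypothetical, and the raw deadline is used where the paper speaks of "a function of the deadline".

```python
from dataclasses import dataclass


@dataclass
class User:
    name: str
    hol_deadline: float   # deadline of the head-of-line packet (assumed units: slots)
    power_needed: float   # power required to serve this user in one slot (assumed)


def schedule_slot(users, power_budget):
    """Greedy earliest-deadline-first: serve users in deadline order while power lasts."""
    served = []
    remaining = power_budget
    for user in sorted(users, key=lambda u: u.hol_deadline):
        if user.power_needed <= remaining:
            served.append(user.name)
            remaining -= user.power_needed
    return served


# Hypothetical example: three users competing for one timeslot.
users = [
    User("A", hol_deadline=5, power_needed=0.6),
    User("B", hol_deadline=2, power_needed=0.5),
    User("C", hol_deadline=3, power_needed=0.3),
]
served = schedule_slot(users, power_budget=1.0)  # B and C fit; A does not
```

A plain Earliest-Deadline-First scheduler would serve only user B in this slot; letting C share the leftover power is the kind of multi-user service the abstract mentions.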

An Operational and Data Model for ARS Control

  • Min, Kyoung-Seok;Kim, Suk-il;Jeon, Joong-Nam
    • Proceedings of the Korea Information Processing Society Conference / 2000.04a / pp.373-378 / 2000
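  • To implement an ARS (Audio Response System), it is common to analyze the application domain, design the required data structures and processing procedures, and then implement the system in C using the low-level library provided by the manufacturer of the ARS hardware. In this paper, we analyze the processing flow of an ARS and present an implementation model that separates it into a part that controls operations and a part that describes operations. For the operation control part, which is independent of the application domain, we present a finite state machine model consisting of waiting, processing, and termination states. For the operation description, which depends on the application domain, we present a data structure that systematically organizes the required information. With the proposed model, an ARS can be implemented by supplying only the operation description.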

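The separation described in the abstract above, between a generic control state machine (waiting, processing, terminated) and an application-specific operation description, might look roughly like the sketch below. It is a minimal illustration under stated assumptions, not the paper's model: the `MENU` table, its keys, the prompts, and the `run_ars` function are all hypothetical.

```python
from enum import Enum, auto


class State(Enum):
    WAITING = auto()
    PROCESSING = auto()
    TERMINATED = auto()


# Hypothetical operation description: the only application-specific part.
MENU = {
    "start": {
        "prompt": "Press 1 for balance, 2 to hang up.",
        "next": {"1": "balance", "2": None},
    },
    "balance": {"prompt": "Balance inquiry selected.", "next": {}},
}


def run_ars(description, key_presses, entry="start"):
    """Generic control loop: walks the description and returns the prompts played."""
    state = State.WAITING
    prompts = []
    keys = iter(key_presses)
    node = None
    while state is not State.TERMINATED:
        if state is State.WAITING:
            # A call arrives: begin at the entry step.
            node = entry
            state = State.PROCESSING
        elif state is State.PROCESSING:
            step = description[node]
            prompts.append(step["prompt"])
            if not step["next"]:
                # Leaf step: nothing more to ask.
                state = State.TERMINATED
                continue
            key = next(keys, None)
            node = step["next"].get(key)
            if node is None:
                # Hang-up choice, unknown key, or no input.
                state = State.TERMINATED
    return prompts


played = run_ars(MENU, ["1"])
```

Changing the service then means editing only the `MENU` table; the control loop stays the same, which mirrors the abstract's claim that supplying the operation description is enough.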

A Computer-Assisted Pronunciation Training System for Correcting Pronunciation of Adjacent Phonemes

  • Lee, Jaesung
    • Journal of the Korea Society of Computer and Information / v.24 no.2 / pp.9-16 / 2019
  • A Computer-Assisted Pronunciation Training system is considered a useful tool for pronunciation learning for students who have received elementary-level English pronunciation education, especially those who have difficulty correcting their pronunciation in front of others or who cannot receive face-to-face training. A conventional Computer-Assisted Pronunciation Training system shows a word to the user, the user pronounces the word, and the system then provides phoneme or audio feedback based on the user's pronunciation. In this paper, we propose a Computer-Assisted Pronunciation Training system that lets users practice pronunciation that varies with the positions of adjacent phonemes. To achieve this, the proposed system recommends a series of words, focusing on adjacent phonemes for simplicity and clarity. Experimental results showed that word recommendation considering adjacent phonemes improves pronunciation accuracy.