• Title/Summary/Keyword: Digital audio

Search Result 626, Processing Time 0.027 seconds

A Study on Contents-based Retrieval using Wavelet (Wavelet을 이용한 내용기반 검색에 관한 연구)

  • 강진석;박재필;나인호;최연성;김장형
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.4 no.5
    • /
    • pp.1051-1066
    • /
    • 2000
  • According to the recent advances of digital encoding technologies and computing power, large amounts of multimedia informations such as image, graphic, audio and video are fully used in multimedia systems through Internet. By this, diverse retrieval mechanisms are required for users to search dedicated informations stored in multimedia systems, and especially it is preferred to use contents-based retrieval method rather than text-type keyword retrieval method. In this paper, we propose a new contents-based indexing and searching algorithm which aims to get both high efficiency and high retrieval performance. To achieve these objectives, firstly the proposed algorithm classifies images by a pre-processing process of edge extraction, range division, and multiple filtering, and secondly it searches the target images using spatial and textural characteristics of colors, which are extracted from the previous process, in a image. In addition, we describe the simulation results of search requests and retrieval outputs for several images of company's trade-mark using the proposed contents-based retrieval algorithm based on wavelet.

  • PDF

Watermarking Algorithm using Power of Subbands Decomposed by Wavelet Packet and QIM (웨이블릿 패킷 변환한 후의 대역별 에너지와 QIM을 이용한 워터마킹 알고리즘)

  • Seo, Ye-Jin;Cho, Sang-Jin;Chong, Ui-Pil
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.11
    • /
    • pp.1431-1437
    • /
    • 2011
  • This paper proposes a novel watermarking algorithm that protects digital copyrights and is robust to attacks. Watermarks are embedded in the subband including the significant part of the signal such as a pitch. Generally, the subband containing the pitch has the biggest energy. In order to find this subband, wavelet packet transform is used to decompose the subbands and their energy are calculated. The signal of the selected subbands is transformed in frequency domain using FFT. The watermarks are embedded using QIM for samples higher than a certain threshold. The blind detection uses the Euclidean distance. The proposed method shows less than 5% BER in the audio watermark benchmarking.

The Development of Terrestrial DMB System for Video Associated Data Services (비디오 부가데이터 서비스를 위한 지상파 DMB 시스템 개발)

  • Kim, Hyun-Soon;Kyung, Il-Soo;Kim, Sang-Hun;Kim, Man-Sik
    • Journal of Broadcast Engineering
    • /
    • v.11 no.4 s.33
    • /
    • pp.541-553
    • /
    • 2006
  • Since DMB on-air was started, not high-qualified audio, video services but various service models have been required. This paper is about systems for one of these services, video associated data service. A terrestrial DMB system to make contents of video associated data services and transmit them on DMB channel is proposed in this paper. This system satisfies standard of the video associated data services for terrestrial DMB; MPEG-4 BIFS (BInary Format for Scene) Core2D scene description profile and graphics profile. This system is designed to support two major features of broadcasting, real-time authoring non automatic transmission and non real-time authoring automatic transmission, and focuses on the abilities to make high-qualified contents efficiently and transmit them to video encoder reliably. This system proved its performance through conformance tests with various receivers, so can be used in future on-air.

Implementation of 24bit Sigma-delta D/A Converter for an Audio (오디오용 24bit 시그마-델타 D/A 컨버터 구현)

  • Heo, Jeong-Hwa;Park, Sang-Bong
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.8 no.4
    • /
    • pp.53-58
    • /
    • 2008
  • This paper designs sigma-delta D/A Converter with a high resolution and low power consumption. It reorganizes the input data along LJ, RJ, I2S mode and bit mode to the output data of A/D converter. The D/A converter decodes the original analog signal through HBF, Hold and 5th CIFB(Cascaded Integrators with distributed Feedback as well as distributed input coupling) sigma-delta modulation blocks. It uses repeatedly the addition operation in instead of the multiply operation for the chip area and the performance. Also, the half band filters of similar architecture composed the one block and it used the sample-hold block instead of the sinc filter. We supposed simple D/A Converter decreased in area. The filters of the block analyzed using the matlab tool. The top block designed using the top-down method by verilog language. The designed block is fabricated using Samsung 0.35um CMOS standard cell library. The chip area is 1500*1500um.

  • PDF

A Study on Factors Affecting Users' Satisfaction Level in Using PMP for Learning Purpose (학습목적의 PMP사용자에 대한 만족도 영향요인 분석)

  • Um, Myoungyong;Kim, Mi-Ryang
    • The Journal of Korean Association of Computer Education
    • /
    • v.10 no.1
    • /
    • pp.77-88
    • /
    • 2007
  • More flexible learning models are needed, and learning environments that operate through mobile technologies such as portable multimedia players(PMP) provide useful tools in implementing these learning models. The main attractant of PMP is often their versatility: being able to load and play different formats of video, audio, digital images, and interactive media. In this paper, we investigate the factors influencing the usage and acceptance of the PMP for study, based on the extended version of the Technology Acceptance Model (TAM). Based on data collected from online survey, we show that perceived usefulness, perceived ease of use, flow and perceived enjoyment are the major determinants for users to play PMP for study purpose. Factors, including ease of use, contents-credibility are shown to determine the level of perceived usefulness; additionally, perceived usefulness, ease of use and perceived enjoyment are shown to directly affect the level of flow. Based upon the statistical results, some useful guidelines for developing learning contents are also provided.

  • PDF

A Novel Third-Order Cascaded Sigma-Delta Modulator using Switched-Capacitor (스위치형 커패시터를 이용한 새로운 형태의 3차 직렬 접속형 시그마-델타 변조기)

  • Ryu, Jee-Youl;Noh, Seok-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.14 no.1
    • /
    • pp.197-204
    • /
    • 2010
  • This paper proposes a new body-effect compensated switch configuration for low voltage and low distortion switched-capacitor (SC) applications. The proposed circuit allows rail-to-rail switching operation for low voltage SC circuits and has better total harmonic distortion than the conventional bootstrapped circuit by 19 dB. A 2-1 cascaded sigma-delta modulator is provided for performing the high-resolution analog-to-digital conversion on audio codec in a communication transceiver. An experimental prototype for a single-stage folded-cascode operational amplifier (opamp) and a 2-1 cascaded sigma-delta modulator has been implemented m a 0.25 micron double-poly, triple-metal standard CMOS process with 2.7 V of supply voltage. The 1% settling time of the opamp is measured to be 560 ns with load capacitance of 16 pF. The experimental testing of the sigma-delta modulator with bit-stream inspection and analog spectrum analyzing plot is performed. The die size is $1.9{\times}1.5\;mm$.

The First Formant Characteristics in Vocalize of One Soprano (소프라노 1인의 모음곡 발성 시 제 1 포먼트의 변화양상)

  • Song, Yun-Kyung;Jin, Sung-Min
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.16 no.1
    • /
    • pp.10-14
    • /
    • 2005
  • Background and Objectives : Vowels are characterized on the basis of formant patterns. The first formant(F1) is determined by high-low placement of the tongue, and the second formant (F2) by front-back placement of the tongue. The fundamental frequency(F0) of a soprano often exceed the normal frequency of the first formant. And the vocal intensity is boosted when F0 is high and a harmonic coincides with a formant. This is called a formant tuning. Experienced singers thus learned how to tune their formants over a resonable range by lowering the tongue to maximize their vocal intensity. So, the current study aimed to identify the formant tuning in one experienced soprano by comparing the first formants of vowel [i] in three different voice production : speech, ascending scale, and vocalize. Materials and Method : All voices recordings of vowel [i] in speech, ascending scale (from F4 note to A4 note), and vocalize(:Ridente la calam") were made with digital audio tape-corder in a sound treated room. And the captured data were analyzed by the long term average(LTA) power spectrum using the FFT algorithm of the Computerized Speech Lab(CSL, Kay elementrics, Model, 4300B). Results : Although the first formant of vowel [i] in speech was 238Hz, those of ascending scale [i] were 377Hz, 405Hz, 453Hz respectively in F4(349z), G4(392Hz), A4(440Hz) note, and 722Hz, 820Hz, 918Hz respectively in F5 (698Hz), G5(784Hz), A5(880Hz) note. In vocalize, first formants of [i] were 380Hz, 398Hz, 453Hz respectively in F4, G4, A4 note, and 720Hz, 821Hz, 890Hz respectively in F5, G5, A5 note. Conclusion : These results showed that the first formant of ascending scale and vocalize sustained higher frequency than fundamental frequency in high pitch. This finding implicates that the formant tuning of vowel [i] in ascending scale was also noted in vocalize.

  • PDF

A Study on TCP-friendly Congestion Control Scheme using Hybrid Approach for Multimedia Streaming in the Internet (인터넷에서 멀티미디어 스트리밍을 위한 하이브리드형 TCP-friendly 혼잡제어기법에 관한 연구)

  • 조정현;나인호
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2003.10a
    • /
    • pp.837-840
    • /
    • 2003
  • Recently the multimedia streaming traffic such as digital audio and video in the Internet has increased tremendously. Unlike TCP, the UDP protocol, which has been used to transmit streaming traffic through the Internet, does not apply any congestion control mechanism to regulate the data flow through the shared network. If this trend is let go unchecked, these traffic will effect the performance of TCP, which is used to transport data traffic, and may lead to congestion collapse of the Internet. To avoid any adverse effort on the current Internet functionality, A study on a new protocol of modification or addition of some functionality to existing transport protocol for transmitting streaming traffic in the Internet is needed. TCP-frienly congestion control mechanism is classified with window-based congestion control scheme and rate-based congestion control scheme. In this paper, we propose an algorithm for improving the transmitting rate on a hybrid TCP-friendly congestion control scheme combined with widow-based and rate-based congestion control for multimedia streaming in the internet.

  • PDF

Test Case Generation for Conformance Test of DSM-CC U-U (DSM-CC U-U 적합성 시험을 위한 시험열 생성)

  • Kim, Young-Gyu;Lee, Ok-Bin;Kim, Hak-Suh;Kwon, Young-Duk;Lee, Sang-Ho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.8
    • /
    • pp.2171-2178
    • /
    • 1999
  • In these days, as rapid growth of multimedia industries and development of techniques, and effort to develop DAVIC(Digital Audio-Visual Council) systems which support multimedia services such as VOD(Video onn Demand) and teleshopping is being done. Therefore it will be indispendable to establish a new conformance testing method related DAVIC system with respect to their standard specification. DSM-CC is a core part of DAVIC and adopts DSM-CC U-N for S3 information stream which plays a part in connection establishment and release of session and transmission layer, and DSM-CC U-U for S2 which operates user application of the system. In this paper, we propose a new conformance testing architecture and methodology based on scenario in order to test DSM-CC U-U which doesn't have any message sequences.

  • PDF

Smart device research for the prevention of missing child (미아 방지용 스마트 디바이스 구현에 관한 연구)

  • Ahn, Jong-Chan;Kim, Young-Kil
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2007.10a
    • /
    • pp.437-440
    • /
    • 2007
  • Recently embedded system developed a lot. Physically, embedded systems range from portable devices such as digital watches and MP3 players, to large stationary installations like traffic lights, factory controllers, or the systems controlling nuclear power plants. This paper focuses on implementation of portable device which is applicable to the child-kidnap or missing child prevention system in residential area or public area. To be specific, this device is to transmit video data which comes from the camera in the device into the host PC via WLAN. Embedded hardware platform consists of s3c2440 with ARM9 core, WindowsCE OS and other sensors. OS enables the platform to do multitasking jobs which are handling GPS data, taking video, capturing audio via microphone in the device and transfer all kind of realtime data to the host PC.

  • PDF