• Title/Summary/Keyword: Audio signal

Search Result 476, Processing Time 0.03 seconds

Efficient Primary-Ambient Decomposition Algorithm for Audio Upmix (오디오 업믹스를 위한 효율적인 주성분-주변성분 분리 알고리즘)

  • Baek, Yong-Hyun;Jeon, Se-Woon;Lee, Seok-Pil;Park, Young-Cheol
    • Journal of Broadcast Engineering
    • /
    • v.17 no.6
    • /
    • pp.924-932
    • /
    • 2012
  • Decomposition of a stereo signal into the primary and ambient components is a key step to the stereo upmix and it is often based on the principal component analysis (PCA). However, major shortcoming of the PCA-based method is that accuracy of the decomposed components is dependent on both the primary-to-ambient power ratio (PAR) and the panning angle. Previously, a modified PCA was suggested to solve the PAR-dependent problem. However, its performance is still dependent on the panning angle of the primary signal. In this paper, we proposed a new PCA-based primary-ambient decomposition algorithm whose performance is not affected by the PAR as well as the panning angle. The proposed algorithm finds scale factors based on a criterion that is set to preserve the powers of the mixed components, so that the original primary and ambient powers are correctly retrieved. Simulation results are presented to show the effectiveness of the proposed algorithm.

A Study on the Effectiveness of the Lungs Hand Acupuncture Based on Bio Signal Analysis (생체신호분석 기술을 적용한 폐 수지침 요법에 대한 효과성 연구)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The KIPS Transactions:PartB
    • /
    • v.19B no.2
    • /
    • pp.77-82
    • /
    • 2012
  • We carried out study to prove effectiveness as stimulating corresponding points to lung in hand to experiment applied analysis parameters for image and audio signals in this paper. To this end we collected facial image and voice before and after stimulating corresponding points to lung in hand to a male 20s 25 people. In addition, we analyzed change color, voice energy and speaking rate of right cheek area corresponding points to lung to suggest the theory of the Oriental medicine diagnosis based on data collected. As a result, after performing hand acupuncture, L value of right cheek area decreased average 2.33 and a value b value increased 0.76, 0.97 on average. In addition, size of voice energy increased average 0.42, speaking rate decreased average 0.07. In other words, effect of lung function was improved using hand acupuncture corresponding points to lung.

A Scheduler for Multimedia Data and Evaluation Method (멀티미디어 데이터를 위한 스케쥴러 및 평가법 설계)

  • 유명련;김현철
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.3 no.2
    • /
    • pp.1-7
    • /
    • 2002
  • Since multimedia data such as video and audio data are displayed within a certain time constraint, their computation and manipulation should be handled under limited condition. Traditional real-time scheduling algorithms could not be directly applicable, because they are not suitable for multimedia scheduling applications which support many clients at the same time. Rate Regulating Proportional Share Scheduling Algorithm is a scheduling algorithm considered the time constraint of the multimedia data. This scheduling algorithm uses a rate regulator which prevents tasks from receiving more resource than its share in a given period. But this algorithm loses fairness, and does not show graceful degradation of performance under overloaded situation. This paper proposes a new modified algorithm, namely Modified Proportional Share Scheduling Algorithm considering the characteristics of multimedia data such as its continuity and time dependency. Proposed scheduling algorithm shows graceful degradation of performance in overloaded situation and the reduction in the number of context switching. Furthermore, a new evaluation method is proposed which can evaluate the flexibility of scheduling algorithm.

  • PDF

Design of 8K Broadcasting System based on MMT over Heterogeneous Networks

  • Sohn, Yejin;Cho, Minju;Paik, Jongho
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.8
    • /
    • pp.4077-4091
    • /
    • 2017
  • This paper presents the design of a broadcasting scenario and system for an 8K-resolution content. Due to an 8K content is four times larger than the 4K content in terms of size, many technologies such as content acquisition, video coding, and transmission are required to deal with it. Therefore, high-quality video and audio for 8K (ultra-high definition television) service is not possible to be transmitted only using the current terrestrial broadcasting system. The proposed broadcasting system divides the 8K content into four 4K contents by area, and each area is hierarchically encoded by Scalable High-efficiency Video Coding (SHVC) into three layers: L0, L1, and L2. Every part of the 8K video content divided into areas and hierarchy is independently treated. These parts are transmitted over heterogeneous networks such as digital broadcasting and broadband networks after going through several processes of generating signal messages, encapsulation, and packetization based on MPEG media transport. We propose three methods of generating streams at the sending entity to merge the divided streams into the original content at the receiving entity. First, we design the composition information, which defines the presentation structure for displays. Second, a descriptor for content synchronization is included in the signal message. Finally, we define the rules for generating "packet_id" among the packet header fields and design the transmission scheduler to acquire the divided streams quickly. We implement the 8K broadcasting system by adapting the proposed methods and show that the 8K-resolution contents are stably received and serviced with a low delay.

User Visit Certification System using Inaudible Frequency

  • Chung, Myoungbeom
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.7
    • /
    • pp.57-64
    • /
    • 2021
  • In this paper, we propose and test the efficacy of an easy-to-use user location certification system for public places that relies on frequencies outside the audible range for humans. The inaudible frequencies come in signal frequency between 18-20 kHz and are generated by general audio speaker. After an individual's smart device detects the signal frequency, it sends the frequency value, user's personal ID, and user's location to a system server that certifies the user's visit location currently. The system server then saves a user visit record and categorizes it by individual. To show the usefulness of this proposed system, we developed a user visit certification application for smart devices linked to a system server. We then conducted a user visit certification experiment using the proposed system, with the result showing 99.6% accuracy. For a comparison, we then held a user visit certification experiment using a QR code, which confirmed that our proposed system performs better than QR code location certification. This proposed system can thus provide restaurants and other facilities reliable user contact tracing and electronic visitor access lists in the age of COVID-19.

Proposal of Hostile Command Attack Method Using Audible Frequency Band for Smart Speaker (스마트 스피커 대상 가청 주파수 대역을 활용한 적대적 명령어 공격 방법 제안)

  • Park, Tae-jun;Moon, Jongsub
    • Journal of Internet Computing and Services
    • /
    • v.23 no.4
    • /
    • pp.1-9
    • /
    • 2022
  • Recently, the functions of smart speakers have diversified, and the penetration rate of smart speakers is increasing. As it becomes more widespread, various techniques have been proposed to cause anomalous behavior against smart speakers. Dolphin Attack, which causes anomalous behavior against the Voice Controllable System (VCS) during various attacks, is a representative method. With this method, a third party controls VCS using ultrasonic band (f>20kHz) without the user's recognition. However, since the method uses the ultrasonic band, it is necessary to install an ultrasonic speaker or an ultrasonic dedicated device which is capable of outputting an ultrasonic signal. In this paper, a smart speaker is controlled by generating an audio signal modulated at a frequency (18 to 20) which is difficult for a person to hear although it is in the human audible frequency band without installing an additional device, that is, an ultrasonic device. As a result with the method proposed in this paper, while humans could not recognize voice commands even in the audible band, it was possible to control the smart speaker with a probability of 82 to 96%.

Commercial 4K UHD Streaming Device over 5G Mobile Communication Network (5G 이동통신망을 통한 상용 4K UHD 스트리밍 장치)

  • Junghoon, Paik;Yongsuk, Kim
    • Journal of Broadcast Engineering
    • /
    • v.27 no.6
    • /
    • pp.914-922
    • /
    • 2022
  • In this paper, we construct a commercial 4K UHD(Ultra High Definition) streaming device that utilizes a 5G mobile communication network as a transport channel and conduct a streaming performance test. It uses RTP(Realtime Transport Protocol) which has transmission quality monitoring capability as a transmission protocol to apply adaptive streaming. In addition, it provides the function to adjust the encoding rate of the video signal so that encoding can be optimized for the change in the bandwidth of the transmission channel. Through the performance test, it is confirmed that the H.265 encoding rate for 4K UHD signal is 48.69Mbps, the average glass-to-glass delay time is 293.60ms, and the average time difference between video and audio for lip sync is 120ms. With the result of performance test, it is shown that the streaming device is applied to 4K UHD Streaming device over 5G mobile communication network.

Implementation and Design of Objective Quality Assurance System for Multimedia Service Video (멀티미디어 서비스 영상의 객관적 품질측정 시스템 설계 및 구현)

  • Joo, Hae-Jong;Hong, Bong-Hwa;On, Jin-Oh;Hong, Suk-Ju
    • 전자공학회논문지 IE
    • /
    • v.45 no.1
    • /
    • pp.58-64
    • /
    • 2008
  • This Paper provides perceptual metrics for video quality based on properties of human visual system, and audio quality based on human audition. All metrics work without reference signals, allowing non-intrusive, in-service measurements. A simple and easy-to-learn user interface displays the metrics and saves them in popular file formats like CSV. In this paper, proposed method was able to various and corrective measurement for the multimedia service video quality. As that it was able to application to set up service guide line and the methode of measurement and system for the set up standardization of the high quality video service.

The Research On the improvements of Speaker's Frequency Characteristic using DSP Audio Processor (DSP 오디오 프로세서를 이용한 스피커 주파수 특성 개선에 관한 연구)

  • Lee, Soon-Reyo;Choi, Hong-Sub
    • Journal of Digital Contents Society
    • /
    • v.8 no.3
    • /
    • pp.341-346
    • /
    • 2007
  • The purpose of this paper is to propose the design of VADSM(Value-Added Digital Speaker Module) which tunes up the speaker unit by measuring the speaker's frequency responses and controlling EQ band. This module can reduce audible distortions at particular frequency band and improve some flatness in the speaker's frequency response. VADSM is composed of DSP AMP and speaker unit. When a speaker transforms electrical signal to sound, the magnitude response at some frequencies are more or less than normal level. So, DSP AMP can be used to adjust those magnitudes up or down by controlling its EQ bands.

  • PDF

Efficient Parsing and Caching Mechanism for Data Carousels (데이터 캐루셀을 위한 효율적인 파싱 및 캐슁 기법)

  • Jeon, Je-Min;Won, Jae-Hoon;Kim, Se-Chang;Ko, Sang-Won;Kim, Jung-Sun
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.635-638
    • /
    • 2008
  • Unlike traditional analog broadcasting, digital broadcasting provides users with various additional services that we have never seen before. To receive these kind of services. data broadcasting includes not only audio, video signal, but also additional data associated with the program. In this paper, we present the efficient parsing and caching mechianism for data carousel in digital broadcasting set-top box. In order to speed up the process of parsing, we use the Message Pool that stores elementary_pid syntax of DSM-CC message packets.

  • PDF