• Title/Summary/Keyword: Audio and Video

Search Result 805, Processing Time 0.025 seconds

Remote Mobile robot control system using multimedia data (멀티미디어 기반의 원격 이동 로봇 제어 시스템)

  • 변재영;문호석;정재한;고성제
    • Proceedings of the IEEK Conference
    • /
    • 2002.06c
    • /
    • pp.235-238
    • /
    • 2002
  • This paper presents a remote mobile robot system that transmits streaming video and audio over the lossy packet networks such as (Wireless) LAN. The error resilient video and audio packets are transmitted on the RTP/UDPfP Protocol stack. The mobile robot can be accessed by a certified user from the remoted area. Thus, the movement of mobile robot can be controlled by the operator observing the working surroundings.

  • PDF

Implementation of an RF Module for 2.4GHz Wireless Audio/Video Transmission (2.4GHz 무선 음성/영상 송신용 RF 모듈 구현)

  • 김거성;권덕기;박종태;유종근
    • Proceedings of the IEEK Conference
    • /
    • 2002.06e
    • /
    • pp.55-58
    • /
    • 2002
  • This paper describes an RF module for 2.4GHz wireless audio/video transmission. The pre-processed baseband input signals are FM-modulated using a VCO and then transmitted through an antenna after RF filtering. The designed circuits are implemented using a Teflon board of which the size is 52mm${\times}$62mm. The measured maximum output signal levels are around -3dBm and the harmonics are less than -450dBc. The manufactured module consumes 130mA from a 8V supply.

  • PDF

Development of a Digital Down-mixer to Convert 5.1 Channel Audio Signals to Stereo Signals (5.1 채널 오디오 신호를 스테레오 신호로 변환하는 디지털 다운믹서 개발)

  • Jeon, Kwang-Sub;Cheong, Ho-Yong;Lee, Seung-Yo
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.62 no.12
    • /
    • pp.1764-1770
    • /
    • 2013
  • Use of the 5.1 channel audio signals suitable for the television system is improper for the radio broadcasting system, which uses the stereo audio system. Therefore, it is necessary to develop an audio down-mixer to convert 5.1 multi-channel audio signals to stereo signals for radio broadcasting. In this paper, a development of an audio down-mixer was carried out to convert 5.1 multi-channel audio signals to stereo signals. The down-mixer which was developed can use the audio signals separated from video signals, including sound signals or individual signals provided from 3-channel AES/EBU signals including Left(L), Right(R), Left Surround(Ls), Right Surround(Rs), Center(C) and Low Frequency Effect(Lfe) sounds as mixer inputs.

Low-Delay, Low-Power, and Real-Time Audio Remote Transmission System over Wi-Fi

  • Hong, Jinwoo;Yoo, Jeongju;Hong, Jeongkyu
    • Journal of information and communication convergence engineering
    • /
    • v.18 no.2
    • /
    • pp.115-122
    • /
    • 2020
  • Audiovisual (AV) facilities such as TVs and signage are installed in various public places. However, audio cannot be used to prevent noise and interference from individuals, which results in a loss of concentration and understanding of AV content. To address this problem, a total technique for remotely listening to audio from audiovisual facilities with clean sound quality while maintaining video and lip-syncing through personal smart mobile devices is proposed in this paper. Through the experimental results, the proposed scheme has been verified to reduce system power consumption by 8% to 16% and provide real-time processing with a low latency of 120 ms. The system described in this paper will contribute to the activation of audio telehearing services as it is possible to provide audio remote services in various places, such as express buses, trains, wide-area and intercity buses, public waiting rooms, and various application services.

The Effect of Audio and Visual Cues on Korean and Japanese EFL Learners' Perception of English Liquids

  • Chung, Hyun-Song
    • English Language & Literature Teaching
    • /
    • v.11 no.2
    • /
    • pp.135-148
    • /
    • 2005
  • This paper investigated the effect of audio and visual cues on Korean and Japanese EFL learners' perception of the lateral/retroflex contrast in English. In a perception experiment, the two English consonants /l/ and /r/ were embedded in initial and medial position in nonsense words in the context of the vowels /i, a, u/. Singletons and clusters were included in the speech material. Audio and video recordings were made using a total of 108 items. The items were presented to Korean and Japanese learners of English in three conditions: audio-alone (A), visual-alone (V) and audio-visual presentation (AV). The results showed that there was no evidence of AV benefit for the perception of the /l/-/r/ contrast for either Korean or Japanese learners of English. Korean listeners showed much better identification rates of the /l/-/r/ contrast than Japanese listeners when presented in audio or audio-visual conditions.

  • PDF

A Study on the Development of Web-based Full Motion Video E-mail System using MPEG-4 (웹을 기반으로 한 MPEG-4 동영상 E-mail 시스템의 개발)

  • 고재승
    • Journal of the Korea Computer Industry Society
    • /
    • v.3 no.3
    • /
    • pp.283-294
    • /
    • 2002
  • Now is the time for web-based video e-mail system because of world wide use of internet. But video data is so large, then data compression is much needed for transmission by web. In this paper, my colleagues and I implement full motion video e-mail system using MPEG-4, the international standards for audio-visual data. This video e-mail system is made of web-based active-X control, so easily accessible by web, and applies real-time audio-video compression. It's possible for everyone to send video e-mail for free to everywhere in the world if this system is used. The main application areas of this system are multimedia mailing service, web-based video advertisement, remote education, remote medical service and shopping mall construction, etc.

  • PDF

IMPROVING THE SPEECH INTELLIGIBILITY IN AN AIR-TRFFIC CONTROL ROOM

  • Pavuza, Franz G.;Beszedics, Geza W.;Pichler, Heinrich
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.912-918
    • /
    • 1994
  • Poor speech intelligibility in an air traffic control room is frequently a result of many, quite different causes and occasionally leads to complaints of the controller personnel. The paper describes a sequence of successful tasks performed in a local control room. The initial measurements included an investigation of the background noise (caused by fans, air condition, computer and radar equipment) and performance checks of the electronic audio and communication equipment with respect to the audio transmission behavior. The spectral composition of the noise as well as the characteristics of the audio communication path between the controllers and the pilots(which showed a loss of spectral information in the audio band due to built-in notch filters for the suppression of control tones) required adaptations of the amplitude behavior of the amplifiers through user adjustable tone controls. The radar console fans, which contributed significantly to the overall noise floor of the room, underwent a substantial reconstruction by replacing the tight mounting with an elastic double suspension, reducing the noise level by 50%. Finally, a possible source of untimely fatigue of the controllers during their working hours has been found in strong spectral components of the noise above the audio band, radiated by numerous video monitors in the control through vibrating components excited by the line frequency of the video signal.

  • PDF

A PSIP Information Generating System for Produce Digital Access Program (디지털 방송 콘텐츠 제작을 위한 PSIP 정보 생성 시스템)

  • Hwang, Kyung-Min;Kim, Jong-Moon;Bang, Jin-Suk;Cho, Tae-Beom;Jung, Hoe-Kyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2007.10a
    • /
    • pp.615-618
    • /
    • 2007
  • It has able to express digital video/audio data from analog and to broadcast it via improvement of video/audio compression technology and publishing standard of MPEG-2 System. Nowaday many System Operators are provide regular digital broadcasting program to customer with their own access program. To provide access program, two process needs that are creating broadcasting information and remultiplexing it with video/audio data, and this solution is providing with high-cost system only. For this reason, digital access program bas week point to product. In this paper, we designed and implemented Generating PSIP Information System to product digital access program which generate PSIP information via receiving broadcasting information from user, and map PSIP information directly to video/audio data.

  • PDF

Design and Implementation of an Embedded Audio Video Bridging Platform for Multichannel Multimedia Transmission (다채널 멀티미디어 전송용 임베디드 Audio Video Bridging 플랫폼 설계 및 구현)

  • Wee, Jungwook;Park, Kyoungwon;Kwon, Kiwon;Song, Byoungchul;Kang, Mingoo
    • Journal of Internet Computing and Services
    • /
    • v.16 no.2
    • /
    • pp.1-6
    • /
    • 2015
  • In this paper, we designed an embedded audio video bridging (AVB) platform based on IEEE 802.1BA for real-time multimedia transmission in smart-car, smart-home, smart-theater, and then evaluated a performance of the implemented platform by analysis of IEEE 802.1AS (time synchronization protocol) and IEEE 802.1Qat (stream reservation protocol). Especially, the AVB Layer-2 protocol of MRP(Multiple Registration Protocol), MMAP(Multicast Address Acquisition Protocol), IEEE 1722, 1722.1 etc. was and implemented by linux based operating system. It is shown by interoperability tests with commercial products that the implemented platform transmits real-time multichannel AV data over AVB networks for Multichannel Multimedia Transmission.

Improved Bimodal Speech Recognition Study Based on Product Hidden Markov Model

  • Xi, Su Mei;Cho, Young Im
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.13 no.3
    • /
    • pp.164-170
    • /
    • 2013
  • Recent years have been higher demands for automatic speech recognition (ASR) systems that are able to operate robustly in an acoustically noisy environment. This paper proposes an improved product hidden markov model (HMM) used for bimodal speech recognition. A two-dimensional training model is built based on dependently trained audio-HMM and visual-HMM, reflecting the asynchronous characteristics of the audio and video streams. A weight coefficient is introduced to adjust the weight of the video and audio streams automatically according to differences in the noise environment. Experimental results show that compared with other bimodal speech recognition approaches, this approach obtains better speech recognition performance.