• Title/Summary/Keyword: Audio to Video Synchronization

Search Result 44, Processing Time 0.03 seconds

Design and Implementation of a Realtime Video Player on Tiled-Display System (타일드-디스플레이 시스템에서 실시간 동영상 상영기의 설계 및 구현)

  • Choe, Gi-Seok;Yu, Jeong-Soo;Choi, Jeong-Hooni;Nang, Jong-Ho
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.35 no.4
    • /
    • pp.150-157
    • /
    • 2008
  • This paper presents a design and implementation of realtime video player that operates on a tiled-display system consisting of multiple PCs to provide a very large and high resolution display. In the proposed system, the master process transmits a compressed video stream to multiple PCs using UDP multicast. All slaves(PC) receive the same video stream, decompress, clip their designated areas from the decompressed video frame, and display it to their displays while being synchronized with each other. A simple synchronization mechanism based on the H/W clock of each slave is proposed to avoid the skew between the tiles of the display, and a flow-control mechanism based on the bit-rate of the video stream and a pre-buffering scheme are proposed to prevent the jitter The proposed system is implemented with Microsoft DirectX filter technology in order to decouple the video/audio codec from the player.

The DLB Method for Multimedia Synchronization in the ATM Networks (ATM 망에서 멀티미디어 동기화를 위한 DLB 기법)

  • 구경옥;이병수;조용환
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.22 no.4
    • /
    • pp.842-854
    • /
    • 1997
  • In this paper, the improved Dual Leaky-Bucket(DLB) algorithm is proposed to reduce the synchronous cell loss rate. The conventional DLB algorithm does not support synchronous cells, but the proposed algorithm gives higher priority to synchronous cells. To reduce synchronous cell loss rate, the synchronous cell detector is used in the proposed algorithm. Synchronous cell detector detects synchronous cells, and passes them cells to the 2nd Leaky-Bucket. So it is similar to give higher priority to synchronous cells. In this paper, the proposed algorithm used audio/videl traffic modeled by On/Off and Two-state MMPP, and simulated by SLAM II package. As simulation results, the proposed algorithm gets lower synchronous cell loss rate than the conventional DLB algorithms. The improved DLB algorithm for multimedia synchronization can be extended to any other cells which require higher priority.

  • PDF

A Web-based Remote Instruction System on Real-time using Action Synchronization between the Instructor and Learners (교수와 학습자간의 행동 동기화를 이용한 웹 기반의 실시간 원격 강의 시스템)

  • 이부권;박규석;서영건
    • Journal of Korea Multimedia Society
    • /
    • v.3 no.6
    • /
    • pp.611-616
    • /
    • 2000
  • By the most important media to deliver the contents on a remote instruction we commonly use audio and documents. A number of remote instruction system are trying to offer the video, but they did not acquire satisfiable results because of the limited network and width. Also, they use the general web browsers that have a lot of unspecific users access the contents. Also, they use the general web browsers that have a lot of unspecific users access the contents. Like this most systems that use the continuous media have not offer the satisfiable contents because of the network limitation. Moreover, because they use the web browser, they offer the contents having documents(web pages) only. In this paper, we propose a web-based remote instruction system on real-time using audio and documents which are the most important media for a information delivery. In addition to the system, we use a action synchronization mechanism between the web browsers. If the instructor uses web pages on his computer and explains the contents of them, the learners see the same web pages as the instructor's and listens to his voice.

  • PDF

Implementation of SMIL Editor for Multimedia Broadcasting (멀티미디어 방송을 위한 SMIL 편집 시스템 구현)

  • 장대영;김창수;정회경
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.8 no.3
    • /
    • pp.622-629
    • /
    • 2004
  • Recently, as digital broadcasting and internet are spreaded out of the world, we can easily use informations with less restrictions of time and space. According to the current trends, concerns for the ways of representing multimedia data has been rapidly increased, and users demand the services with integrated document that takes not only simple text and image but also time varying audio-visual data. Therefore, in 1998, W3C presented an international standard, SMIL in order to solve multimedia object representation and synchronization problems. By using SMIL, various multimedia elements can be integrated as a multimedia document with proper view in a space and time. Using this SMIL document, we can create new internet radio broadcasting service that delivers not only audio data but also various text, image and video. In this paper, we describe on a SMIL document editor for the common users to be able to represent time varying multimedia data with special layout and synchronization of time and space.

The implementation of Media Processing Part in the DMB receiver (DMB 방송 수신을 위한 수신기의 멀티미디어 처리기 구현)

  • Park Jeong Hoon;Lee Sang Rae
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2003.11a
    • /
    • pp.187-190
    • /
    • 2003
  • In this paper, the efficient implementation technique of media processing part in the terrestrial and satellite DMB (Digital Multimedia Broadcasting) receiver is presented. To implement the unified multimedia Processor of DMB receiver, we investigated the characteristic of DMB service and the functionality of each processing part in the DMB receiver. To implement the synchronization between audio and video media, we present the general method to use the reference clock of the stream in the DMB receiver. Also we present the method to handle the bit error of the received bitstream within the wireless net work for robust media processor.

  • PDF

Comparisons between Distributed Connections and Centralized Connections of Multimedia Streams for Computer-based Audio-Video Teleconferences (컴퓨터 영상회의 시스템을 위한 분산형과 집중형 스트림 연결 구조 비교)

  • Lee, Gyeong-Hui;Kim, Du-Hyeon;Im, Heon-Gyu;Im, Yeong-Hwan
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.3
    • /
    • pp.591-607
    • /
    • 1996
  • To support various multimedia applications. MuX server produces object-oriented and consistent interfaces for creation, copying, splitting, mixing and interleaving of streams. In this paper, we describes distributed connection structures and centralized connection structures which can be used in building a teleconferencing system using basic objects of MuX and compares merits and demerits of each structure from the viewpoint of multimedia related performance like delay and synchronization.

  • PDF

Design of 8K Broadcasting System based on MMT over Heterogeneous Networks

  • Sohn, Yejin;Cho, Minju;Paik, Jongho
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.8
    • /
    • pp.4077-4091
    • /
    • 2017
  • This paper presents the design of a broadcasting scenario and system for an 8K-resolution content. Due to an 8K content is four times larger than the 4K content in terms of size, many technologies such as content acquisition, video coding, and transmission are required to deal with it. Therefore, high-quality video and audio for 8K (ultra-high definition television) service is not possible to be transmitted only using the current terrestrial broadcasting system. The proposed broadcasting system divides the 8K content into four 4K contents by area, and each area is hierarchically encoded by Scalable High-efficiency Video Coding (SHVC) into three layers: L0, L1, and L2. Every part of the 8K video content divided into areas and hierarchy is independently treated. These parts are transmitted over heterogeneous networks such as digital broadcasting and broadband networks after going through several processes of generating signal messages, encapsulation, and packetization based on MPEG media transport. We propose three methods of generating streams at the sending entity to merge the divided streams into the original content at the receiving entity. First, we design the composition information, which defines the presentation structure for displays. Second, a descriptor for content synchronization is included in the signal message. Finally, we define the rules for generating "packet_id" among the packet header fields and design the transmission scheduler to acquire the divided streams quickly. We implement the 8K broadcasting system by adapting the proposed methods and show that the 8K-resolution contents are stably received and serviced with a low delay.

Lip and Voice Synchronization Using Visual Attention (시각적 어텐션을 활용한 입술과 목소리의 동기화 연구)

  • Dongryun Yoon;Hyeonjoong Cho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.13 no.4
    • /
    • pp.166-173
    • /
    • 2024
  • This study explores lip-sync detection, focusing on the synchronization between lip movements and voices in videos. Typically, lip-sync detection techniques involve cropping the facial area of a given video, utilizing the lower half of the cropped box as input for the visual encoder to extract visual features. To enhance the emphasis on the articulatory region of lips for more accurate lip-sync detection, we propose utilizing a pre-trained visual attention-based encoder. The Visual Transformer Pooling (VTP) module is employed as the visual encoder, originally designed for the lip-reading task, predicting the script based solely on visual information without audio. Our experimental results demonstrate that, despite having fewer learning parameters, our proposed method outperforms the latest model, VocaList, on the LRS2 dataset, achieving a lip-sync detection accuracy of 94.5% based on five context frames. Moreover, our approach exhibits an approximately 8% superiority over VocaList in lip-sync detection accuracy, even on an untrained dataset, Acappella.

A Study on the Development of T-DMB Frame Analysis Simulator and its Utilization in Education (T-DMB 프레임 분석 시뮬레이터 개발 및 교육활용에 관한 연구)

  • Hwang, In-Tae;Kim, Han-Jong
    • Journal of Practical Engineering Education
    • /
    • v.7 no.1
    • /
    • pp.31-37
    • /
    • 2015
  • Terrestrial digital multimedia broadcasting (TDMB) is a method of bringing multimedia images, radio, internet, and television to portable devices through terrestrial digital radio transmissions. TDMB related educations being carried out in colleges are focusing on developing firmware which enables users to choose a wanted service. TDMB transmission frame is made up of synchronization channel (SC), fast information channel (FIC), and main service channel (MSC). Services such as video, audio and date are transmitted in the form of subchannel in the MSC. FIC carries information related to each services and subchannels. This paper presents a TDMB frame analysis simulator for analyzing and displaying FIC data on PC. TDMB frame analysis simulator contains functions such as controlling TDMB receiver through USB, establishing the frequency, bringing FIC to PC, displaying ensemble ID and levels, and displaying informations related to services and subchannels. In addition to that, this simulator has a function of being able to store FIC date and subchannel data. This simulator being developed with C++ is expected to be used to view those data visually so that it helps students to understand the TDMB system better and bring about the educational motivation.

Multimedia Network Teaching System based on SMIL (SMIL을 기반으로 한 멀티미디어 네트워크 교육시스템)

  • Yu, Lei;Cao, Ke-Rang;Bang, Jin-Suk;Cho, Tae-Beom;Jung, Hoe-Kyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2008.10a
    • /
    • pp.524-527
    • /
    • 2008
  • Recently, digital and the Internet are widespread out of the world, and multimedia processing technology and the development of information and communication technology in education using the Internet as the demand is rapidly increasing. Also, we tan easily use informations with less restrictions of time and space. however, several kinds of audio, media to integrate multimedia data, such as the proliferation of demands for representation. Therefore, in 1998, W3C presented an international standard, SMIL in order to solve multimedia object representation and synchronization problems. By using SMIL, various multimedia elements can be integrated as a multimedia document with proper view in a spate and time. Using this SMIL document, we can create new internet radio broadcasting service that delivers not noly audio data but also various text, image and video. In this paper, with the system, teachers can easily create multimedia courseware and living broadcast their torture on network, students can receive audio-video information of the teacher, screen displays of the teachers computer. Moreover students can communicate with teacher simultaneously by text editor windows. Students can also order courseware after class.

  • PDF