• Title/Summary/Keyword: Audio-Visual Information

Design of a Format Converter from MPEG-4 Over MPEG-2 TS to MP4 (MPEG-4 Over MPEG-2 TS로부터 MP4 파일로의 포맷 변환기 설계)

  • 최재영;정제창
    • Journal of Broadcast Engineering / v.5 no.2 / pp.176-187 / 2000
  • MPEG-4 is a digital bit stream format and associated protocols for representing multimedia content consisting of natural and synthetic audio, video and object data. This paper describes an application in which multiple audio/visual data streams are combined in MPEG-4 and transported via MPEG-2 transport streams (TS). It also describes how to convert MPEG-4 over MPEG-2 TS bit streams into an MP4 file, which is designed to contain the media information of an MPEG-4 presentation in a flexible, extensible format. MPEG-4 content is presented as audio-visual objects that are arranged into an audio-visual scene by means of a scene descriptor and composed by means of an object descriptor. These descriptor streams are not defined in MPEG-2 TS, so this paper focuses on handling these descriptors and on parsing TS streams to extract the MPEG-4 data. The MPEG-4 over MPEG-2 TS to MP4 format converter is implemented in the demonstrated system.
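
The TS parsing step mentioned in this abstract starts from the fixed 4-byte header of each 188-byte MPEG-2 transport packet. The following is only a minimal sketch of that first step, not the converter described in the paper; the function name `parse_ts_header` and the dictionary keys are hypothetical, and the handling of MPEG-4 scene and object descriptors is not shown.

```python
TS_PACKET_SIZE = 188  # fixed MPEG-2 transport packet length in bytes
TS_SYNC_BYTE = 0x47   # every transport packet starts with this byte


def parse_ts_header(packet: bytes) -> dict:
    """Decode the 4-byte header of one MPEG-2 TS packet (hypothetical helper)."""
    if len(packet) != TS_PACKET_SIZE:
        raise ValueError("a transport packet must be exactly 188 bytes")
    if packet[0] != TS_SYNC_BYTE:
        raise ValueError("missing sync byte 0x47")
    b1, b2, b3 = packet[1], packet[2], packet[3]
    return {
        "transport_error": bool(b1 & 0x80),
        "payload_unit_start": bool(b1 & 0x40),
        "pid": ((b1 & 0x1F) << 8) | b2,          # 13-bit packet identifier
        "adaptation_field_control": (b3 >> 4) & 0x3,
        "continuity_counter": b3 & 0x0F,
    }


# Illustrative packet: PID 0x0011, payload only, start of a new payload unit.
example = bytes([0x47, 0x40, 0x11, 0x10]) + bytes(184)
header = parse_ts_header(example)
print(header["pid"], header["payload_unit_start"])
```

A demultiplexer would group packets by `pid` and reassemble their payloads before interpreting any MPEG-4 data carried inside them.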

XCRAB: A Content and Annotation-based Multimedia Indexing and Retrieval System (XCRAB: 내용 및 주석 기반의 멀티미디어 인덱싱과 검색 시스템)

  • Lee, Soo-Chelo;Rho, Seung-Min;Hwang, Een-Jun
    • The KIPS Transactions:PartB / v.11B no.5 / pp.587-596 / 2004
  • In recent years, a new framework has been developed that aims to bring a unified, global approach to indexing, browsing and querying various digital multimedia data such as audio, video and images. This framework partitions each media stream into smaller units based on actual physical events. The physical events within each media stream can then be effectively indexed for retrieval. In this paper, we present a new approach that exploits audio, image and video features to segment and analyze audio-visual data. Integrating audio and visual analysis can overcome the weakness of previous approaches that were based on image or video analysis only. We implement a web-based multimedia data retrieval system called XCRAB and report its experimental results.

Robust Feature Extraction Based on Image-based Approach for Visual Speech Recognition (시각 음성인식을 위한 영상 기반 접근방법에 기반한 강인한 시각 특징 파라미터의 추출 방법)

  • Gyu, Song-Min;Pham, Thanh Trung;Min, So-Hee;Kim, Jing-Young;Na, Seung-You;Hwang, Sung-Taek
    • Journal of the Korean Institute of Intelligent Systems / v.20 no.3 / pp.348-355 / 2010
  • Despite advances in speech recognition technology, speech recognition in noisy environments is still a difficult task. To address this problem, researchers have proposed various methods that use visual information in addition to audio information. However, visual information is also subject to noise, just as audio information is, and this visual noise degrades visual speech recognition. How to extract visual feature parameters that improve visual speech recognition performance is therefore an area of active interest. In this paper, we propose a visual feature parameter extraction method based on an image-based approach to improve the recognition performance of an HMM-based visual speech recognizer. For the experiments, we constructed an audio-visual database consisting of 105 speakers, each of whom uttered 62 words. We applied histogram matching, lip folding, RASTA filtering, a linear mask, DCT and PCA. The experimental results show that the recognition performance of the proposed method is about 21% better than that of the baseline method.

Abnormal Active Pig Detection System using Audio-visual Multimodal Information (Audio-visual 멀티모달 정보 기반의 비정상 활성 돼지 탐지 시스템)

  • Chae, Heechan;Lee, Junhee;Lee, Jonguk;Chung, Yonghwa;Park, Daihee
    • Annual Conference of KIPS / 2022.05a / pp.661-664 / 2022
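  • In pig farming, building a system that can identify abnormal individuals and track or isolate them in advance is essential for efficient pigsty management. However, although studies on detecting abnormal situations inside pigsties have been reported, studies that pinpoint and identify the specific pig involved are hard to find. This study therefore proposes a system that uses sound to detect that an abnormal situation has occurred and then uses video to identify the specific pig that made the sound. The main algorithm of the system is inspired by the active speaker detection problem and adapted to the pigsty, so that the active pig making an abnormal sound can be identified. Simulation experiments confirmed that the proposed method can identify the pig in which an abnormal situation has occurred within the pigsty.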

The Implementation of Real-Time Speaker Localization Using Multi-Modality (멀티모달러티를 이용한 실시간 음원추적 시스템 구현)

  • Park, Jeong-Ok;Na, Seung-You;Kim, Jin-Young
    • Proceedings of the KIEE Conference / 2004.11c / pp.459-461 / 2004
  • This paper presents an implementation of real-time speaker localization using audio-visual information. Four channels of microphone signals are processed to detect the vertical as well as the horizontal position of the speaker. First, short-time average magnitude difference function (AMDF) signals are used to determine whether the microphone signals are human voices. Then the orientation and distance of the sound source are obtained from interaural time differences and interaural level differences. Finally, visual information from a camera is used to fine-tune the speaker orientation. Experimental results of the real-time localization system show that performance improves to 99.6%, compared with 88.8% when only audio information is used.
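
The AMDF mentioned in this abstract averages the absolute difference between a frame and a delayed copy of itself; for a periodic signal such as voiced speech it dips sharply at lags equal to the pitch period. The following is a minimal sketch of that computation only, not the authors' voice detector; the function names, the lag range and the synthetic test tone are assumptions made for illustration.

```python
import math


def amdf(frame, max_lag):
    """Average magnitude difference for lags 1..max_lag (requires max_lag < len(frame))."""
    n = len(frame)
    values = []
    for lag in range(1, max_lag + 1):
        diffs = [abs(frame[i] - frame[i - lag]) for i in range(lag, n)]
        values.append(sum(diffs) / len(diffs))
    return values


def estimate_period(frame, min_lag, max_lag):
    """Return the lag in [min_lag, max_lag] with the deepest AMDF valley."""
    values = amdf(frame, max_lag)
    return min(range(min_lag, max_lag + 1), key=lambda lag: values[lag - 1])


# Synthetic "voiced" frame: a sine wave with a period of 40 samples.
tone = [math.sin(2 * math.pi * i / 40) for i in range(400)]
print(estimate_period(tone, min_lag=20, max_lag=60))
```

A voiced/unvoiced decision could then compare the depth of the deepest valley with the average AMDF level; any such threshold would have to be tuned on real recordings.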

Design and Implementation of Multimedia Integrated Processing Unit for Computer-Based Video Conference (컴퓨터 영상회의를 위한 멀티미디어 통합처리장치의 설계 및 구현)

  • 김현기;홍재근
    • Journal of the Korean Institute of Telematics and Electronics C / v.35C no.3 / pp.59-68 / 1998
  • This paper proposes a hardware architecture for a multimedia system that performs integrated processing of multimedia data such as audio and video, and describes the design and implementation of a multimedia integrated processing unit. The unit provides the multimedia processing functions most commonly needed for computer-based video conferencing: audio-visual data capture, playback, compression and decompression, as well as interleaving and deinterleaving of compressed audio-visual data. The proposed architecture minimizes the CPU overhead that multimedia data processing might cause and ensures a smooth data flow among system components. The unit is also tested and analyzed in a computer-based video conference, using a communication protocol and application software over Ethernet and FDDI (Fiber Distributed Data Interface) networks, to validate the proposed architecture.

A Study on the Use of Supplementary Teaching Materials and Implements in the High School Home Economics Education (고등학교 가정과 교육에서 보조학습 교재.교구의 활용실태 연구)

  • 조은경;김용숙
    • Journal of Korean Home Economics Education Association / v.9 no.1 / pp.1-17 / 1997
  • This study was conducted to obtain basic data for improving Home Economics teaching methods by theoretically examining the supplementary teaching materials and implements usable in teaching the Costume History area. Based on these data, the types and uses of the supplementary teaching materials and implements owned by high schools were examined. The subjects were 111 high school teachers of the Home Economics and Housework curriculum in the country, surveyed with self-administered questionnaires. The SAS program was used to calculate frequencies, percentages, averages and standard deviations and to perform $\chi^2$ tests. The results of the study were as follows:
    1. Most of the high school teachers used the school's experiment expenses to prepare supplementary teaching materials or implements.
    2. Of the supplementary teaching materials and implements concerning Costume History, visual implements such as slides and pictures were the most commonly owned. CDs and audio implements such as cassette tapes were not used.
    3. Most of the teachers recognized the importance of audio-visual teaching materials and implements concerning Costume History.
    4. Among the audio-visual materials and implements concerning Costume History that teachers of the Home Economics and Housework curriculum can make themselves, the most used was ‘cutting pictorials from magazines and newspapers’, followed by ‘orbital materials’ and ‘copy the pictorials’; the least used was ‘recording from the radio’.
    5. Most of the annual expenses assigned to the Home Economics department were spent on cooking practice, and the least on buying audio-visual teaching materials and implements.
    6. The time assigned to Home Economics was for the most part one or two hours per week, and of this, the time assigned to the history of Western costume and the history of Korean costume was for the most part five to eight hours.
    7. The area in which the high school teachers felt the most difficulty during the clothing and textiles curriculum was ‘textiles’, followed by ‘knitting’, ‘western costume history’, and ‘korean clothing construction’.
    8. The difficulty the high school teachers most often faced while teaching Costume History was that ‘the pictorials in the text is not fully explainable’, followed by ‘most of the supplementary teaching materials or implements are not owned’, ‘have to explain very much in a short time’, and ‘the lectural explanation is insufficient’.
    9. The solution most often suggested for these difficulties was ‘the information, on which audio-visual materials and implements are distributed in the market, should be easy to obtain’, followed by ‘the school should provide enough experiment and practice expenses to buy audio-visual materials and implements’, and ‘education facilities of the Home Economics Department should be the main aspects in improving the teaching methods and should give special lectures about it’.

The Effects of the Presentation Mode of Web Contents on the Children's Information Processing Process (웹 콘텐츠의 정보제시유형이 어린이 뉴스정보처리과정에 미치는 영향)

  • Choi E-Jung
    • The Journal of the Korea Contents Association / v.5 no.3 / pp.113-122 / 2005
  • The major purpose of this study is to explore how the presentation mode of web content, formed by combining the four main media (moving image, audio, text, image), affects children's information processing. Children were assigned to one of five experimental medium conditions: 'moving image 1 (auditory-visual redundancy)', 'moving image 2 (auditory-visual dissonance)', 'text', 'text-with-image', and 'audio'. The results indicated that the moving image was the most effective transmitter of internet news information for children's recall. The recall advantage of the moving image was particularly pronounced for verbal information supplemented with redundant visuals.

Research on Audiovisual Type Preservation Format Selection Criteria and Recommended Formats: Focusing on Audio Types (시청각 유형 보존포맷 선정기준 및 권고포맷 연구 - 오디오 유형을 중심으로 -)

  • Hanyeok Jeon;Dongmin Yang
    • Journal of the Korean BIBLIA Society for Library and Information Science / v.35 no.1 / pp.273-300 / 2024
  • In the electronic records environment, along with discussions on ways to digitize analog records, it is important to prepare preservation strategies for each type of record produced and received electronically. In the same context, there is a need to discuss applying a preservation format selection system aimed at the long-term preservation of data sets and audio-visual electronic records, in addition to document types. Audiovisual records require preservation strategies appropriate to the characteristics of each medium, such as images, audio, and video. This study establishes unique standards for selecting a preservation format for audio-visual electronic records by analyzing Significant Properties based on a literature review, composes suitability evaluation items for audio-type preservation formats, and proposes a recommended format based on the results of applying them.