• Title/Summary/Keyword: Representation of video Data

Search Result 64, Processing Time 0.028 seconds

A New Anchor Shot Detection System for News Video Indexing

  • Lee, Han-Sung;Im, Young-Hee;Park, Joo-Young;Park, Dai-Hee
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.18 no.1
    • /
    • pp.133-138
    • /
    • 2008
  • In this paper, we propose a novel anchor shot detection system, named to MASD (Multi-phase Anchor Shot Detection), which is a core step of the preprocessing process for the news video analysis. The proposed system is composed of four modules and operates sequentially: 1) skin color detection module for reducing the candidate face regions; 2) face detection module for finding the key-frames with a facial data; 3) vector representation module for the key-frame images using a non-negative matrix factorization; 4) one class SVM module for determining the anchor shots using a support vector data description. Besides the qualitative analysis, our experiments validate that the proposed system shows not only the comparable accuracy to the recently developed methods, but also more faster detection rate than those of others.

A New Anchor Shot Detection System for News Video Indexing

  • Lee, Han-Sung;Im, Young-Hee;Park, Joo-Young;Park, Dai-Hee
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2007.11a
    • /
    • pp.217-220
    • /
    • 2007
  • In this paper, we present a new anchor shot detection system which is a core step of the preprocessing process for the news video analysis. The proposed system is composed of four modules and operates sequentially: 1) skin color detection module for reducing the candidate face regions; 2) face detection module for finding the key-frames with a facial data; 3) vector representation module for the key-frame images using a non-negative matrix factorization; 4) anchor shot detection module using a support vector data description. According to our computer experiments, the proposed system shows not only the comparable accuracy to the recent other results, but also more faster detection rate than others.

  • PDF

Core Technology and Service Trends of Multimedia Service Using Satellite (위성을 이용한 멀티미디어 서비스의 요소 기술과 제공 현황)

  • 김정호
    • Journal of the Korean Professional Engineers Association
    • /
    • v.34 no.4
    • /
    • pp.36-40
    • /
    • 2001
  • Multimedia service via satellite Is supported voice, data, Image and video signals. The representation case model of satellite multimedia are satellite TV. satellite Internet. In the early 1990s, satellite communication and broad casting services successfully expanded form C/Ku band to Ka band. The benefits of operation at Ka-band are greater bandwidth available to accommodate the increased demand for high-speed Information exchange. By the early years of the 21s1 century, millions of households worldwide with dual Ku / Ka-band dishes Satellite multimedia systems receive hundreds of TV channels, originating from around the world, and delivering entertainment, information and education. Many Ku-band satellites have been ordered, but few Ka-band systems are moving into production. So Ka-band systems are characterized that low-cost access to low and high peed, two-way voice, data, and video communications.

  • PDF

Video Event Detection according to Generating of Semantic Unit based on Moving Object (객체 움직임의 의미적 단위 생성을 통한 비디오 이벤트 검출)

  • Shin, Ju-Hyun;Baek, Sun-Kyoung;Kim, Pan-Koo
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.2
    • /
    • pp.143-152
    • /
    • 2008
  • Nowadays, many investigators are studying various methodologies concerning event expression for semantic retrieval of video data. However, most of the parts are still using annotation based retrieval that is defined into annotation of each data and content based retrieval using low-level features. So, we propose a method of creation of the motion unit and extracting event through the unit for the more semantic retrieval than existing methods. First, we classify motions by event unit. Second, we define semantic unit about classified motion of object. For using these to event extraction, we create rules that are able to match the low-level features, from which we are able to retrieve semantic event as a unit of video shot. For the evaluation of availability, we execute an experiment of extraction of semantic event in video image and get approximately 80% precision rate.

  • PDF

H.264 Encoding Technique of Multi-view Image expressed by Layered Depth Image (계층적 깊이 영상으로 표현된 다시점 영상에 대한 H.264 부호화 기술)

  • Kim, Min-Tae;Jee, Inn-Ho
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.10 no.1
    • /
    • pp.81-90
    • /
    • 2010
  • This paper presents H.264 coding schemes for multi-view video using the concept of layered depth image(LDI) representation and efficient compression technique for LDI. After converting those data to the proposed representation, we encode color, depth, and auxiliary data representing the hierarchical structure, respectively, Two kinds of preprocessing approaches are proposed for multiple color and depth components. In order to compress auxiliary data, we have employed a near lossless coding method. Finally, we have reconstructed the original viewpoints successfully from the decoded approach that is useful for dealing with multiple color and depth data simultaneously.

Rate-Constrained Key Frame Selection Method using Iteration (반복 과정을 통한 율-제한 주요 화명 선택 기법)

  • Lee, Hun-Cheol;Kim, Seong-Dae
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.39 no.4
    • /
    • pp.388-398
    • /
    • 2002
  • Video representation through representative frames (key frames) has been addressed frequently as an efficient way of preserving the whole temporal information of sequence with a considerably smaller amount of data. Such compact video representation is suitable for the purpose of video browsing in limited storage or transmission bandwidth environments. In a case like this, the controllability of the total key frame number (i.e. key frame rate) depending on the storage or bandwidth capacity is an important requirement of a key frame selection method. In this paper, we present a sequential key frame selection method when the number of key frames is given as a constraint. It first selects the desired number of initial key frames and determines non-overlapping initial time intervals that are represented by each key frame. Then, it adjusts the positions of key frames and time intervals by iteration, which minimizes the distortion. Experimental result demonstrates the improved performance of our algorithm over the existing approaches.

A Study on Image Representation of Bisexual Lighting (바이섹슈얼 라이팅(Bisexual Lighting)의 영상 표현 연구)

  • QIAO, YINA
    • Trans-
    • /
    • v.11
    • /
    • pp.119-142
    • /
    • 2021
  • Video was a cultural practice based on image. The audience longs to experience new things, not everyday things through by video images. There are many components of the image, but among them, color, a visual representation, plays a big role. Since the advent of color films, color has constantly evolved as an important component of visual art and has become an important role in innovative visual art design. According to film history data, filmmakers were interested in color since the film was created in 1895, but in the early stages of film development, film colors were only black and white. Because these two colors no longer satisfy viewers, more natural colors began to emerge from the film as it was colored. However, with the development of historical paintings, the lack of artistic creation and the public's level increased, making people more active in using colors because simple reproduction of natural colors alone does not satisfy people. The colors in the video are both techniques of expression and can be understood by mind and thought. It is also an indication that colors do not just exist, but they work strongly on human psychology. Now people are so motivated by repetitive and unimportant information that they find that the human intuitive system simplifies the information they receive unconsciously that they have certain customs and characteristics when they see things. Color is part of the film language, or color language can express the film's ideological themes or portray vivid characters in the film, and people are receiving more intuitive messages. This study analyzed the basic color components of bisexual lighting, namely, pink, blue, and purple, and analyzed how human psychology is affected through color, combining the scenes from the video. The purpose of this paper is to explore what color language bisexual lighting is expressed using color properties in images and how bisexual lighting interacts with human psychology through color.

Semantic-based Scene Retrieval Using Ontologies for Video Server (비디오 서버에서 온톨로지를 이용한 의미기반 장면 검색)

  • Jung, Min-Young;Park, Sung-Han
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.45 no.5
    • /
    • pp.32-37
    • /
    • 2008
  • To ensure access to rapidly growing video collection, video indexing is becoming more and more important. In this paper, video ontology system for retrieving a video data based on a scene unit is proposed. The proposed system creates a semantic scene as a basic unit of video retrieval, and limits a domain of retrieval through a subject of that scene. The content of semantic scene is defined using the relationship between object and event included in the key frame of shots. The semantic gap between the low level feature and the high level feature is solved through the scene ontology to ensure the semantic-based retrieval.

Hardware Architecture for PC-based MPEG-4 Video CODEC (PC 기반 MPEG-4 비디오 코덱 구현을 위한 하드웨어 아키텍쳐)

  • 곽진석;임영권;박상규;김진웅
    • Journal of Broadcast Engineering
    • /
    • v.2 no.2
    • /
    • pp.86-93
    • /
    • 1997
  • Fast growth of multimedia applications requires new functions for video data processing. such as obj;cted-based video representation and manipulation. which are not supported by 11PEG-l and 11PEG-2. To support these requirements. 11PEG-4 video coding allows users to manipulate every video object easily by decomposing a scene into several video objects and coding each of them independently. However. the large amount of computations and flexible structure of 11PEG-4 video CODEC make it difficult to be implemented by either the general purpose DSP or a dedicated VLSI. In this paper, we propose a hardware architecture using a hybrid of a high performance programmable DSP and an application specific IC to implement a flexible 11PEG-4 video codec requiring the large amount of computations. The application specific IC has the functions of motion estimation and compensation.

  • PDF

Consistency of Responses to Affective Stimuli Across Individuals using Intersubject Representational Similarity Analysis based on Behavioral and Physiological Data (참가자 간 표상 유사성 분석을 이용한 정서 자극 반응 일치성 비교: 행동 및 생리 데이터를 기반으로)

  • Junhyuk Jang;Hyeonjung Kim;Jongwan Kim
    • Science of Emotion and Sensibility
    • /
    • v.26 no.3
    • /
    • pp.3-14
    • /
    • 2023
  • This study used intersubject representational similarity analysis (IS-RSA) to identify participant-response consistency patterns in previously published data. Additionally, analysis of variance (ANOVA) was utilized to detect any variations in the conditions of each experiment. In each experiment, a combination of ASMR stimulation, visual and auditory stimuli, and time-series emotional video stimulation was employed, and emotional ratings and physiological measurements were collected in accordance with the respective experimental conditions. Every pair of participants' measurements for each stimulus in each experiment was correlated using Pearson correlation coefficient as part of the IS-RSA. The results of study revealed a consistent response pattern among participants exposed to ASMR, visual, and auditory stimuli, in contrast to those exposed to time-series emotional video stimulation. Notably, the ASMR experiment demonstrated a high level of response consistency among participants in positive conditions. Furthermore, both auditory and visual experiments exhibited remarkable consistency in participants' responses, especially when subjected to high arousal levels and visual stimulation. The findings of this study confirm that IS-RSA serves as a valuable tool for summarizing and presenting multidimensional data information. Within the scope of this study, IS-RSA emerged as a reliable method for analyzing multidimensional data, effectively capturing and presenting comprehensive information pertaining to the participants.