• Title/Summary/Keyword: Video Summarization

Structural similarity based efficient keyframes extraction from multi-view videos (구조적인 유사성에 기반한 다중 뷰 비디오의 효율적인 키프레임 추출)

  • Hussain, Tanveer;Khan, Salman;Muhammad, Khan;Lee, Mi Young;Baik, Sung Wook
    • The Journal of Korean Institute of Next Generation Computing / v.14 no.6 / pp.7-14 / 2018
  • Extracting salient information from multi-view videos is challenging because of inter-view and intra-view correlations and computational complexity. Several techniques have been developed for keyframe extraction from multi-view videos, but they have very high computational complexity. In this paper, we present a keyframe extraction approach for multi-view videos that uses the entropy and complexity information present in each frame. In the first step, we extract representative shots of the whole video from each view based on the difference in structural similarity index measure (SSIM) values between frames. In the second step, entropy and complexity scores are computed for all frames of the shots in the different views. Finally, the frames with the highest entropy and complexity scores are selected as keyframes. The proposed system is subjectively evaluated on the available office benchmark dataset, and the results are favorable in terms of accuracy and time complexity.
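
The final selection step described in this abstract can be illustrated with a minimal sketch. It assumes each frame is a flat list of grayscale intensities and scores frames by Shannon entropy only; the SSIM-based shot segmentation and the complexity score are not reproduced, and the names `frame_entropy` and `pick_keyframe` are hypothetical, not taken from the paper.

```python
import math
from collections import Counter

def frame_entropy(pixels):
    """Shannon entropy (in bits) of a flat list of grayscale intensities."""
    counts = Counter(pixels)
    total = len(pixels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def pick_keyframe(shot):
    """Return the index of the highest-entropy frame in one shot."""
    return max(range(len(shot)), key=lambda i: frame_entropy(shot[i]))

# Toy shot of three 4-pixel "frames" (hypothetical data, not from the paper).
shot = [
    [0, 0, 0, 0],       # one intensity level
    [0, 0, 255, 255],   # two equally frequent levels
    [0, 85, 170, 255],  # four equally frequent levels
]
keyframe_index = pick_keyframe(shot)
```

Under these assumptions the frame with the most evenly spread intensities wins; a fuller version would combine this score with a complexity measure before ranking.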

MPEG-21 Terminal (MPEG-21 터미널)

  • 손유미;박성준;김문철;김종남;박근수
    • Journal of Broadcast Engineering / v.8 no.4 / pp.410-426 / 2003
  • MPEG-21 defines a digital item as an atomic unit for creation, delivery and consumption in order to provide an integrated multimedia framework in networked environments. MPEG-21 standardization is expected to make it possible for users to universally access their preferred content in the way they want. To achieve this goal, MPEG-21 has standardized the specifications for Digital Item Declaration (DID), Digital Item Identification (DII), Rights Expression Language (REL), Rights Data Dictionary (RDD) and Digital Item Adaptation (DIA), and is standardizing the specifications for Digital Item Processing (DIP), Persistent Association Technology (PAT) and Intellectual Property Management and Protection (IPMP) for transparent and secure usage of multimedia. In this paper, we design an MPEG-21 terminal architecture based on the MPEG-21 standard with DID, DIA and DIP, and implement the MPEG-21 terminal. We build a video summarization service scenario to validate the feasibility of DID, DIA and DIP on our proposed MPEG-21 terminal. We then present a series of experimental results showing that digital items are adapted to a form that fits the characteristics of the MPEG-21 terminal, processed, and consumed interoperably on PC and PDA platforms. We believe this paper is significant in that we implement, for the first time, an MPEG-21 terminal that supports a video summarization service application in an interoperable way for digital item adaptation and processing, with experimental results.

The Production of CD-ROM for the Class and the Development of Effective Master Plan Applied by It -In the Point of Wearing Korean Traditional Costume for First Grade of Junior Middle School Students in Home Economics Teaching- (수업용 CD-ROM 제작 및 이를 적용시킨 효과적인 학습지도안 개발 -중학교 1학년 가정 한복 입기를 중심으로-)

  • 이은선;김병미
    • Journal of Korean Home Economics Education Association / v.11 no.2 / pp.13-26 / 1999
  • The goals of this research are to produce and optimize a CD-ROM and an effective, practical teaching-learning method. The subject is wearing Korean traditional costume, taught to first-grade middle school students in home economics. The research is summarized as follows. First, a multimedia CD-ROM on wearing Korean traditional costume is produced using PowerPoint to help students learn the difficult content through video and audio. Second, the Open Education model is introduced to increase the efficiency of the class. Third, the class is designed to proceed with the CD-ROM and small-group study of place activity. Fourth, appropriate sound effects whenever the slides change help students concentrate on the class, and links are provided to web sites related to Korean traditional costume. Finally, the following suggestions are made. The effectiveness of this software needs to be verified by testing and applying it in the field for a given period. It is also necessary to upgrade the CD-ROM and the supplementary teaching materials for Korean traditional costume education.

Video Summarization System Based on Multi-Camera (멀티카메라 기반 동영상 요약 시스템)

  • Im, Seung-Bin;Park, Han-Saem;Min, Jun-Ki;Hwang, Keum-Sung;Cho, Sung-Bae
    • Proceedings of the Korean Information Science Society Conference / 2006.10b / pp.44-48 / 2006
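  • With the advancement and spread of digital camera technology, the means of collecting video data have increased greatly, from security cameras in public buildings to cameras in personal mobile devices, and their use has become commonplace. Video data contains far more concrete and realistic information than other data such as documents or speech, so it can be a useful way to organize and recover past memories. As video data has grown, research on video summarization has become active, but most of this work aims to summarize and analyze a single video. In this paper, we install several cameras in an office to record data and build a system that effectively summarizes and searches the collected video. Using multiple cameras has the advantage that the same event can be viewed from several directions and the camera that best describes the situation can be selected. Experts assign annotations according to predefined events, and camera selection and summarization are performed according to utilities set by the experts. A user evaluation was conducted on the summaries produced under various options.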
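
The utility-driven camera selection mentioned in this abstract might look roughly like the following sketch. The abstract does not give the actual data model, so the event labels, the utility values, and the function name `select_cameras` are all hypothetical.

```python
def select_cameras(annotations, utility):
    """For each time segment, pick the camera whose annotated event
    has the highest expert-assigned utility (unknown events score 0)."""
    chosen = []
    for segment in annotations:
        best = max(segment, key=lambda cam: utility.get(segment[cam], 0.0))
        chosen.append(best)
    return chosen

# Hypothetical expert-assigned utilities per event type.
utility = {"meeting": 0.9, "typing": 0.4, "empty": 0.0}

# One dict per time segment: camera name -> annotated event (assumed format).
annotations = [
    {"cam1": "empty", "cam2": "typing"},
    {"cam1": "meeting", "cam2": "typing"},
]
selected = select_cameras(annotations, utility)
```

A summary could then be assembled by concatenating, segment by segment, the footage from the selected camera and dropping segments whose best utility falls below a chosen cutoff.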

Highlight Detection in Personal Broadcasting by Analysing Chat Traffic : Game Contests as a Test Case (채팅 트래픽 분석을 통한 개인방송 하이라이트 검출 : 게임 콘텐츠를 중심으로)

  • Kim, Eunyul;Lee, Gyemin
    • Journal of Broadcast Engineering / v.23 no.2 / pp.218-226 / 2018
  • As the volume of personal broadcasting content rapidly increases, demand is growing for a service that provides highlights. A highlight, a collection of interesting scenes, can improve the quality of the viewing experience. In this paper, we propose a method to automatically detect highlights using only chat traffic information. We also propose methods for evaluating the effectiveness of chat traffic in highlight detection. We apply the detection algorithm to game broadcasting, which has a larger audience, and demonstrate its performance.
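
One simple way to turn chat traffic into highlight candidates is sketched below. The abstract does not describe the detection algorithm itself, so the rule used here (flag time windows whose message count exceeds the mean by `k` standard deviations) is an assumption, as are the function names and the toy timestamps.

```python
import math
from statistics import mean, pstdev

def chat_counts(timestamps, window, duration):
    """Count chat messages in consecutive windows of `window` seconds."""
    n = math.ceil(duration / window)
    counts = [0] * n
    for t in timestamps:
        # Clamp so a message stamped exactly at `duration` lands in the last window.
        counts[min(int(t // window), n - 1)] += 1
    return counts

def detect_highlights(counts, k=1.0):
    """Return indices of windows whose count exceeds mean + k * std."""
    threshold = mean(counts) + k * pstdev(counts)
    return [i for i, c in enumerate(counts) if c > threshold]

# Hypothetical message timestamps in seconds for a 40-second clip.
timestamps = [1, 12, 21, 22, 23, 24, 25, 26, 35]
counts = chat_counts(timestamps, window=10, duration=40)
highlights = detect_highlights(counts)
```

Here the burst of messages between 20 and 30 seconds stands out against the otherwise quiet chat, so only that window is flagged.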

A Hybrid Comparing Method of a Similar Frame for Generating Video Summarization Sequences (동영상 요약 시퀀스 생성을 위한 하이브리드 유사 프레임 비교 기법)

  • Ock, Chang-Seok;Kwon, Dae-Gun;Cho, Hwan-Gue
    • Proceedings of the Korea Information Processing Society Conference / 2012.04a / pp.394-397 / 2012
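  • As the volume of multimedia grows rapidly, videos such as movies are considerably larger than photos and contain correspondingly more information. Users must spend a great deal of time to obtain this information. To remedy this inefficiency, a method is needed that judges the similarity of the frames in a video, groups similar frames together, separates dissimilar frames, and presents the result as a summarized sequence. From this viewpoint, video has the advantage that frames are arranged in temporal order and coherence exists between neighboring frames. We therefore propose a technique for comparing similar frames, the first step needed to generate a video summary sequence, that makes the most of this advantage. The proposed technique identifies similar frames by applying, in a hybrid manner, a feature-point-based method that exploits the spatial information of each frame and a histogram-based method that exploits its color distribution. The results are verified using precision and recall, which are widely used for statistical validation.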
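
A hybrid comparison of this kind can be sketched as follows. The abstract does not say how the two cues are combined, so this example assumes that two frames count as similar only when both a histogram-intersection score and a feature-overlap score pass a threshold. Feature points are stood in for by sets of hypothetical identifiers rather than real keypoint descriptors, and all names and thresholds are illustrative.

```python
def histogram(pixels, bins=4, max_value=256):
    """Normalized intensity histogram of a flat list of pixel values."""
    hist = [0] * bins
    for p in pixels:
        hist[p * bins // max_value] += 1
    return [h / len(pixels) for h in hist]

def histogram_intersection(h1, h2):
    """Overlap of two normalized histograms, from 0.0 to 1.0."""
    return sum(min(a, b) for a, b in zip(h1, h2))

def feature_overlap(f1, f2):
    """Jaccard overlap of two sets of feature identifiers."""
    if not f1 and not f2:
        return 1.0
    return len(f1 & f2) / len(f1 | f2)

def is_similar(a, b, hist_thr=0.8, feat_thr=0.5):
    """Hybrid test: both the color cue and the spatial cue must agree."""
    h = histogram_intersection(histogram(a["pixels"]), histogram(b["pixels"]))
    f = feature_overlap(a["features"], b["features"])
    return h >= hist_thr and f >= feat_thr

# Hypothetical frames: pixel values plus matched-feature identifiers.
frame_a = {"pixels": [0, 10, 200, 250], "features": {"k1", "k2", "k3"}}
frame_b = {"pixels": [5, 20, 210, 240], "features": {"k1", "k2", "k4"}}
frame_c = {"pixels": [100, 110, 120, 130], "features": {"k9"}}
```

With these toy frames, `frame_a` and `frame_b` share both color distribution and half of their features, while `frame_c` shares neither.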

Analysis of Korean Mathematics Class Organization and Teacher's Approach and Activities: Focused on the Lessons from Learner's Perspective Study (한국 수학 수업의 조직 및 교수 활동 분석: LPS(Learner's Perspective Study) 수업 자료를 중심으로)

  • Park, Kyung-Mee
    • Journal of Educational Research in Mathematics / v.17 no.2 / pp.127-145 / 2007
  • There have been several international lesson studies, such as the TIMSS Video Study and the Learner's Perspective Study (LPS). According to the TIMSS Video Study report, the differences found among lessons within each country are much smaller than the differences found among lessons across countries. This means that each country has its own way of teaching, a so-called 'national script'. In contrast, LPS researchers are skeptical about the existence of a 'national script', since significant differences are identified among lessons conducted by the same teacher. The purpose of this study is to analyze the LPS Korean data in terms of class organization and the teacher's approach and activities. The categories of class organization are classwork, small group seatwork, and individual seatwork, and those of the teacher's approach and activities are exploratory, directive, summarization, exercises and practice, and assigning homework. Ten lessons were videotaped at each of two Korean schools, so twenty lessons in total were recorded and analyzed. Each lesson shows a unique class organization and teacher's approach and activities; however, the averages of each category of class organization and teacher's approach and activities are very similar for the two schools. This result supports the TIMSS Video Study in that there is a commonality among lessons within the country, but it also confirms the LPS result that it is difficult to assume a 'national script'. This study is a preliminary investigation into the LPS Korean data, and further in-depth interpretation of the LPS lessons will follow.

Acceleration of Viewport Extraction for Multi-Object Tracking Results in 360-degree Video (360도 영상에서 다중 객체 추적 결과에 대한 뷰포트 추출 가속화)

  • Heesu Park;Seok Ho Baek;Seokwon Lee;Myeong-jin Lee
    • Journal of Advanced Navigation Technology / v.27 no.3 / pp.306-313 / 2023
  • Realistic and graphics-based virtual reality content is based on 360-degree videos, for which viewport extraction driven by the viewer's intention or by an automatic recommendation function is essential. This paper designs a viewport extraction system based on multiple object tracking in 360-degree videos and proposes a parallel computing structure for extracting multiple viewports. Viewport extraction is parallelized with pixel-wise threads, each of which transforms ERP coordinates to 3D spherical surface coordinates and transforms the 3D spherical surface coordinates within the viewport to 2D coordinates. The proposed structure was evaluated on the computation time of up to 30 viewport extraction processes in aerial 360-degree video sequences and achieved a speedup of up to 5240 times over CPU-based computation, whose time is proportional to the number of viewports. With high-speed I/O or memory buffers that reduce ERP frame I/O time, viewport extraction can be accelerated by a further factor of 7.82. The proposed parallelized viewport extraction structure can be applied to simultaneous multi-access services for 360-degree videos or virtual reality content and to video summarization services for individual users.
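
The per-pixel coordinate mapping that such a pixel-wise thread performs can be sketched on the CPU as follows. This is a generic perspective-viewport to equirectangular (ERP) mapping written under assumed conventions (camera x right, y down, z forward; positive pitch looks up); it is not the paper's GPU kernel, and the function name and parameters are hypothetical.

```python
import math

def viewport_pixel_to_erp(u, v, vp_w, vp_h, fov, yaw, pitch, erp_w, erp_h):
    """Map viewport pixel (u, v) to fractional ERP pixel coordinates.

    fov is the horizontal field of view; fov, yaw and pitch are in radians.
    """
    # Ray through the pixel in camera coordinates (x right, y down, z forward).
    f = (vp_w / 2) / math.tan(fov / 2)
    x, y, z = u - (vp_w - 1) / 2, v - (vp_h - 1) / 2, f
    # Rotate by pitch about the x-axis, then by yaw about the y-axis.
    y, z = (y * math.cos(pitch) - z * math.sin(pitch),
            y * math.sin(pitch) + z * math.cos(pitch))
    x, z = (x * math.cos(yaw) + z * math.sin(yaw),
            -x * math.sin(yaw) + z * math.cos(yaw))
    # Project onto the unit sphere and read off longitude and latitude.
    norm = math.sqrt(x * x + y * y + z * z)
    lon = math.atan2(x, z)
    lat = math.asin(y / norm)
    # Longitude/latitude to ERP pixel coordinates (image center = lon 0, lat 0).
    return (lon / (2 * math.pi) + 0.5) * erp_w, (lat / math.pi + 0.5) * erp_h

# Center pixel of a 101x101 viewport looking straight ahead into a 3840x1920 ERP frame.
center = viewport_pixel_to_erp(50, 50, 101, 101, math.radians(90), 0.0, 0.0, 3840, 1920)
```

Because each output pixel depends only on its own coordinates and the viewport parameters, the loop over pixels has no cross-pixel dependencies, which is what makes a one-thread-per-pixel parallelization natural.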

Semantic Event Detection and Summary for TV Golf Program Using MPEG-7 Descriptors (MPEG-7 기술자를 이용한 TV 골프 프로그램의 이벤트검출 및 요약)

  • 김천석;이희경;남제호;강경옥;노용만
    • Journal of Broadcast Engineering / v.7 no.2 / pp.96-106 / 2002
  • We introduce a novel scheme to characterize and index events in TV golf programs using MPEG-7 descriptors. Our goal is to identify and localize the golf events of interest to facilitate highlight-based video indexing and summarization. In particular, we analyze multiple low-level visual features using a domain-specific model to create a perceptual relation for identifying semantically meaningful (high-level) events. Furthermore, we summarize a TV golf program with TV-Anytime segmentation metadata, a standard XML-based metadata description, in which the golf events are represented by temporally localized segments and segment groups of highlights. Experimental results show that the proposed technique provides reasonable performance in identifying a variety of golf events.

Considerations for Applying Korean Natural Language Processing Technology in Records Management (기록관리 분야에서 한국어 자연어 처리 기술을 적용하기 위한 고려사항)

  • Haklae, Kim
    • Journal of Korean Society of Archives and Records Management / v.22 no.4 / pp.129-149 / 2022
  • Records have temporal characteristics, spanning the past and present; linguistic characteristics not limited to a specific language; and various types categorized in complex ways. Processing records such as text, video, and audio across the life cycle of creation, preservation, and utilization entails exhaustive effort and cost. Primary natural language processing (NLP) technologies, such as machine translation, document summarization, named-entity recognition, and image recognition, can be widely applied to electronic records and to the digitization of analog records. In particular, Korean deep learning-based NLP technologies effectively recognize various record types and generate record management metadata. This paper provides an overview of Korean NLP technologies and discusses considerations for applying NLP technology in records management. The use of NLP technologies such as machine translation and optical character recognition for the digital conversion of records is introduced through an example implemented in a Python environment. In addition, a plan to improve environmental factors and record digitization guidelines is proposed for applying NLP technology in the records management field.