Search | Korea Science

Semantic-based Scene Retrieval Using Ontologies for Video Server (비디오 서버에서 온톨로지를 이용한 의미기반 장면 검색)

Jung, Min-Young;Park, Sung-Han
- Journal of the Institute of Electronics Engineers of Korea CI / v.45 no.5 / pp.32-37 / 2008
To ensure access to rapidly growing video collections, video indexing is becoming increasingly important. This paper proposes a video ontology system for retrieving video data on a scene-by-scene basis. The proposed system creates a semantic scene as the basic unit of video retrieval and limits the retrieval domain through the subject of that scene. The content of a semantic scene is defined using the relationships between the objects and events included in the key frames of its shots. The semantic gap between low-level and high-level features is bridged through the scene ontology to enable semantic-based retrieval.

Automatic Text Extraction from News Video using Morphology and Text Shape (형태학과 문자의 모양을 이용한 뉴스 비디오에서의 자동 문자 추출)

Jang, In-Young;Ko, Byoung-Chul;Kim, Kil-Cheon;Byun, Hye-Ran
- Journal of KIISE:Computing Practices and Letters / v.8 no.4 / pp.479-488 / 2002
In recent years the amount of digital video in use has risen dramatically, keeping pace with the increasing use of the Internet, and consequently an automated method is needed for indexing digital video databases. Textual information appearing in a digital video, both superimposed text and embedded scene text, can be a crucial clue for video indexing. In this paper, a new method is presented to extract both superimposed and embedded scene texts from a freeze-frame of news video. The algorithm consists of three steps. In the first step, a color image is converted into a gray-level image, and contrast stretching is applied to enhance the contrast of the input image. Then, a modified local adaptive thresholding is applied to the contrast-stretched image. The second step is divided into three processes: eliminating text-like components by applying erosion, dilation, and (OpenClose+CloseOpen)/2 morphological operations; maintaining text components using the (OpenClose+CloseOpen)/2 operation with a new Geo-correction method; and subtracting the two result images to eliminate false-positive components further. In the third step, filtering, the characteristics of each component are used, such as the ratio of the number of pixels in each candidate component to the number of its boundary pixels and the ratio of the minor to the major axis of each bounding box. Acceptable results have been obtained using the proposed method on 300 news images, with a recognition rate of 93.6%. The method also performs well on various kinds of images when the size of the structuring element is adjusted.
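Two of the building blocks named in this abstract, contrast stretching and the (OpenClose+CloseOpen)/2 operation, can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the square k x k structuring element, the clipping of windows at the image border, and the reading of "OpenClose" as an opening followed by a closing are all assumptions, and the modified adaptive thresholding and Geo-correction steps are not modeled.

```python
def contrast_stretch(img, out_min=0, out_max=255):
    """Linearly rescale gray levels so the darkest pixel maps to out_min
    and the brightest to out_max."""
    lo = min(min(row) for row in img)
    hi = max(max(row) for row in img)
    if hi == lo:  # flat image: nothing to stretch
        return [[out_min for _ in row] for row in img]
    scale = (out_max - out_min) / (hi - lo)
    return [[round(out_min + (p - lo) * scale) for p in row] for row in img]


def _window_op(img, k, op):
    """Apply op (min or max) over a k x k window, clipped at the border."""
    h, w, r = len(img), len(img[0]), k // 2
    return [[op(img[yy][xx]
                for yy in range(max(0, y - r), min(h, y + r + 1))
                for xx in range(max(0, x - r), min(w, x + r + 1)))
             for x in range(w)] for y in range(h)]


def erode(img, k=3):
    return _window_op(img, k, min)


def dilate(img, k=3):
    return _window_op(img, k, max)


def opening(img, k=3):
    return dilate(erode(img, k), k)


def closing(img, k=3):
    return erode(dilate(img, k), k)


def open_close_avg(img, k=3):
    """(OpenClose + CloseOpen) / 2, assuming OpenClose means opening then closing."""
    oc = closing(opening(img, k), k)
    co = opening(closing(img, k), k)
    return [[(a + b) / 2 for a, b in zip(ra, rb)] for ra, rb in zip(oc, co)]
```

The parameter `k` plays the role of the structuring-element size that the abstract says can be adjusted for different kinds of images.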

A New Anchor Shot Detection System for News Video Indexing

Lee, Han-Sung;Im, Young-Hee;Park, Joo-Young;Park, Dai-Hee
- Proceedings of the Korean Institute of Intelligent Systems Conference / 2007.11a / pp.217-220 / 2007
In this paper, we present a new anchor shot detection system, which is a core step of preprocessing for news video analysis. The proposed system is composed of four modules that operate sequentially: 1) a skin color detection module for reducing the candidate face regions; 2) a face detection module for finding the key frames that contain facial data; 3) a vector representation module for the key-frame images using non-negative matrix factorization; 4) an anchor shot detection module using support vector data description. According to our computer experiments, the proposed system shows not only accuracy comparable to other recent results but also a faster detection rate than the others.

Localization of captions in MPEG compression images based on I frame (I 프레임에 기반한 MPEG 압축영상에서의 자막 탐지)

유태웅
- Journal of the Korea Computer Industry Society / v.2 no.11 / pp.1465-1476 / 2001
For applications such as video indexing, text understanding, and automatic caption localization systems, real-time localization of captions is an essential task. This paper presents an algorithm for localizing captions in MPEG-compressed images based on I frames. In this algorithm, caption text regions are segmented from background images using their distinguishing texture characteristics and chrominance information. Unlike previously published algorithms, which fully decompress the video sequence before extracting the text regions, this algorithm locates candidate caption text regions directly in the DCT compressed domain.
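The idea of using texture in the DCT domain can be sketched as follows. The abstract does not say which texture measure is used, so the sum of absolute AC coefficients per 8x8 block and the `threshold` parameter are assumptions chosen for illustration, and the chrominance cue is not modeled. In a real MPEG decoder the I-frame coefficients would be read from the bitstream rather than recomputed; the sketch computes the DCT from pixels only so that it is self-contained.

```python
import math

N = 8  # MPEG codes I frames in 8x8 blocks


def dct2_block(block):
    """Orthonormal 2-D DCT-II of an 8x8 block given as a list of 8 rows."""
    def c(u):
        return math.sqrt(1 / N) if u == 0 else math.sqrt(2 / N)

    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            s = 0.0
            for y in range(N):
                for x in range(N):
                    s += (block[y][x]
                          * math.cos((2 * y + 1) * u * math.pi / (2 * N))
                          * math.cos((2 * x + 1) * v * math.pi / (2 * N)))
            out[u][v] = c(u) * c(v) * s
    return out


def ac_energy(coeffs):
    """Sum of absolute AC coefficients (everything except the DC term)."""
    return sum(abs(coeffs[u][v])
               for u in range(N) for v in range(N) if (u, v) != (0, 0))


def is_text_candidate(block, threshold):
    """Hypothetical rule: a block is a caption candidate if its AC energy is high."""
    return ac_energy(dct2_block(block)) > threshold
```

A flat background block has almost no AC energy, while a block with sharp strokes, as in caption text, has a large one, which is why such a measure can separate the two without full decompression.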

Dimension-Reduced Audio Spectrum Projection Features for Classifying Video Sound Clips

Kim, Hyoung-Gook
- The Journal of the Acoustical Society of Korea / v.25 no.3E / pp.89-94 / 2006
For audio indexing and targeted search of specific audio or corresponding visual contents, the MPEG-7 standard has adopted a sound classification framework in which dimension-reduced Audio Spectrum Projection (ASP) features are used to train continuous hidden Markov models (HMMs) for classification of various sounds. MPEG-7 employs Principal Component Analysis (PCA) or Independent Component Analysis (ICA) for the dimensional reduction. Other well-established techniques include Non-negative Matrix Factorization (NMF), Linear Discriminant Analysis (LDA) and Discrete Cosine Transformation (DCT). In this paper we compare the performance of different dimensional reduction methods with Gaussian mixture models (GMMs) and HMMs in classifying video sound clips.

A New Anchor Shot Detection System for News Video Indexing

Lee, Han-Sung;Im, Young-Hee;Park, Joo-Young;Park, Dai-Hee
- Journal of the Korean Institute of Intelligent Systems / v.18 no.1 / pp.133-138 / 2008
In this paper, we propose a novel anchor shot detection system, named MASD (Multi-phase Anchor Shot Detection), which is a core step of preprocessing for news video analysis. The proposed system is composed of four modules that operate sequentially: 1) a skin color detection module for reducing the candidate face regions; 2) a face detection module for finding the key frames that contain facial data; 3) a vector representation module for the key-frame images using non-negative matrix factorization; 4) a one-class SVM module for determining the anchor shots using support vector data description. Besides the qualitative analysis, our experiments validate that the proposed system shows not only accuracy comparable to recently developed methods but also a faster detection rate than the others.
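The vector representation step (module 3) relies on non-negative matrix factorization. The abstract does not state which NMF algorithm is used, so the sketch below assumes the standard multiplicative updates for the squared-error objective; treating each column of `v` as one vectorized key-frame image is also an assumption, as are all function and parameter names.

```python
import random


def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def frob_error(v, w, h):
    """Squared Frobenius norm of v - w @ h."""
    wh = matmul(w, h)
    return sum((a - b) ** 2 for rv, rw in zip(v, wh) for a, b in zip(rv, rw))


def nmf(v, rank, iters=200, seed=0, eps=1e-9):
    """Factor a non-negative n x m matrix v into w (n x rank) and h (rank x m)."""
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(rank)]
    for _ in range(iters):
        # h <- h * (w^T v) / (w^T w h), element-wise
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(rank)]
        # w <- w * (v h^T) / (w h h^T), element-wise
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][j] * num[i][j] / (den[i][j] + eps) for j in range(rank)]
             for i in range(n)]
    return w, h
```

Under this layout, each column of `h` would be the low-dimensional code of one key frame, which is the kind of vector a one-class classifier such as support vector data description could then be trained on.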
https://doi.org/10.5391/JKIIS.2008.18.1.133

Automatic Summary Method of Linguistic Educational Video Using Multiple Visual Features (다중 비주얼 특징을 이용한 어학 교육 비디오의 자동 요약 방법)

Han Hee-Jun;Kim Cheon-Seog;Choo Jin-Ho;Ro Yong-Man
- Journal of Korea Multimedia Society / v.7 no.10 / pp.1452-1463 / 2004
The need for automatic video summarization is increasing as bi-directional broadcasting content grows and as user requests and preferences in the bi-directional broadcast environment become more varied. Automatic video summarization is also needed by service providers for efficient management and use of large amounts of content. In this paper, we propose a method to automatically generate a content-based summary of linguistic educational videos. First, shot boundaries and keyframes are generated from the linguistic educational video, and then multiple (low-level) visual features are extracted. Next, the semantic parts (Explanation part, Dialog part, Text-based part) of the linguistic educational video are generated using the extracted visual features. Lastly, an XML document describing the summary information is created based on the Hierarchical Summary architecture of MPEG-7 MDS (Multimedia Description Scheme). Experimental results show that the proposed algorithm provides reasonable performance for automatic summarization of linguistic educational videos. We verified that the proposed method is useful for a video summary system that provides various services as well as management of educational content.

Retrieval System Adopting Statistical Feature of MPEG Video (MPEG 비디오의 통계적 특성을 이용한 검색 시스템)

Yu, Young-Dal;Kang, Dae-Seong;Kim, Dai-Jin
- Journal of the Institute of Electronics Engineers of Korea CI / v.38 no.5 / pp.58-64 / 2001
Recently, a great deal of information is transmitted and stored as video data, and its volume is increasing rapidly because of the popularization of high-performance computers and the Internet. In this paper, to retrieve video data, shots are found through analysis of the video stream, and a method for detecting key frames is studied, so that users can retrieve video efficiently. This paper suggests a new feature for shot boundary detection that is robust to object movement within a shot and insensitive to color changes, and proposes a characterizing value that reflects the characteristics of the kind of video (movie, drama, news, music video, etc.). The key frames are extracted from the many frames by using the local minima and maxima of the differential of this value. After the original frames (not DC images) are reconstructed for the key frames, the indexing process is performed by computing parameters. Key frames that are similar to the user's query image are retrieved by computing parameters. Experiments show that the proposed methods perform better than the conventional method and achieve high retrieval accuracy.
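Key-frame selection from the local minima and maxima of the differential can be sketched as below. The abstract does not define the characterizing value, so `values` is simply assumed to hold one number per frame; using the first difference as the differential, requiring strict extrema, and mapping an extremum at `diff[i]` to frame `i + 1` are likewise assumptions.

```python
def key_frames_from_extrema(values):
    """Return frame indices where the first difference of `values`
    has a strict local minimum or maximum."""
    # diff[i] is the change between frame i and frame i + 1
    diff = [b - a for a, b in zip(values, values[1:])]
    picks = []
    for i in range(1, len(diff) - 1):
        is_max = diff[i] > diff[i - 1] and diff[i] > diff[i + 1]
        is_min = diff[i] < diff[i - 1] and diff[i] < diff[i + 1]
        if is_max or is_min:
            picks.append(i + 1)  # attribute the extremum to the later frame
    return picks
```

For the per-frame values `[0, 1, 3, 4, 4, 3, 0, 0]` the differences are `[1, 2, 1, 0, -1, -3, 0]`, so the function selects frames 2 and 6, where the value changes fastest in each direction.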

A multidisciplinary analysis of the main actor's conflict emotions in Animation film's Turning Point (장편 애니메이션 극적전환점에서 주인공의 갈등 정서에 대한 다학제적 분석)

Lee, Tae Rin;Kim, Jong Dae;Liu, Guoxu;Ingabire, Jesse;Kim, Jae Ho
- Korea Science and Art Forum / v.34 / pp.275-290 / 2018
The study began with the recognition that animated films need objective and reasonable methods for classifying conflicts visually in order to analyze conflicts centered on narratives. It examines the emotions of the protagonist in conflict. The purpose of the study is to analyze conflict intensity and emotion. The results and contents of the study are as follows. First, we identified the Turning Point and proposed a conflict classification model (Conflict 6B Model). Second, based on the conflict classification model, a conflict-based shot DB was extracted. Third, intensity and emotion were identified in inner and super-personal conflicts. Fourth, experiments and tests of intensity and emotion were conducted for inner and super-personal conflicts. The results of this study are metadata extracted from the emotional research on conflict, and they are expected to be applied to video indexing of conflicts.
https://doi.org/10.17548/ksaf.2018.06.30.275

A Design and Implementation of algorithm choosing Context-based Image used Multimedia Communication (멀티미디어 통신을 이용한 내용기반 이미지 추출 알고리즘 설계 및 구현)

안병규
- Journal of the Korea Computer Industry Society / v.2 no.11 / pp.1421-1426 / 2001
Nowadays, as the quantity of multimedia information increases rapidly, efficient management of multimedia has become more important. In this paper, to index and search multimedia contents efficiently, we design an algorithm that searches for a specific image and saves the extracted image using a content-based semantic information extraction scheme, which is one of the schemes for indexing and searching video data. After the RGB information is extracted from the input image, all frames of the video are inspected sequentially, and the specific image is saved by referring to the position and distribution of contents from the collection scheme of RGB ranges. With the proposed image extraction algorithm, the search time can be reduced because only the saved video is searched instead of the whole video.
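The sequential inspection of frames against an RGB range might look like the following sketch. It models only the distribution part of the description, as the fraction of pixels that fall inside the range; the position cue is left out, and the frame layout (a list of `(r, g, b)` tuples) and the `min_fraction` parameter are assumptions made for illustration.

```python
def in_range(pixel, lo, hi):
    """True if every channel of pixel lies within [lo, hi] for that channel."""
    return all(l <= c <= h for c, l, h in zip(pixel, lo, hi))


def matching_frames(frames, lo, hi, min_fraction=0.5):
    """Scan frames in order and return the indices of those in which at
    least min_fraction of the pixels fall inside the RGB range."""
    saved = []
    for idx, frame in enumerate(frames):
        if not frame:  # skip empty frames
            continue
        hits = sum(1 for px in frame if in_range(px, lo, hi))
        if hits / len(frame) >= min_fraction:
            saved.append(idx)
    return saved
```

Later searches would then run over the returned indices only, which is the source of the reduced search time that the abstract describes.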

Search Result 198, Processing Time 0.022 seconds