• Title/Summary/Keyword: Scene Understanding


View Variations and Recognition of 2-D Objects (화상에서의 각도 변화를 이용한 3차원 물체 인식)

  • Whangbo, Taeg-Keun
    • The Transactions of the Korea Information Processing Society / v.4 no.11 / pp.2840-2848 / 1997
  • Recognition of 3D objects using computer vision is complicated by the fact that geometric features vary with view orientation. An important factor in designing recognition algorithms in such situations is understanding how certain critical features vary. The features selected in this paper are the angles between landmarks in a scene. In a class of polyhedral objects, the angles at certain vertices may form a distinct and characteristic alignment of faces. For many other classes of objects, it may be possible to identify distinctive spatial arrangements of some readily identifiable landmarks. In this paper, given an isotropic view orientation and an orthographic projection, the two-dimensional joint density function of two angles in a scene is derived. The joint density of all defining angles of a polygon in an image is also derived. The analytic expressions for the densities are useful in determining statistical decision rules to recognize surfaces and objects. Experiments to evaluate the usefulness of the proposed methods are reported. Results indicate that the method is useful and powerful.

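
The abstract above derives the joint density of two projected angles analytically. As a rough numerical counterpart, the following minimal sketch estimates that joint distribution by Monte Carlo sampling, assuming three edge vectors meeting at a vertex, a view direction drawn uniformly on the unit sphere, and orthographic projection onto the plane perpendicular to that direction. All function names here are hypothetical and are not taken from the paper.

```python
import math
import random


def random_view_direction(rng):
    """Draw a viewing direction uniformly on the unit sphere (isotropic view)."""
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    r = math.sqrt(max(0.0, 1.0 - z * z))
    return (r * math.cos(phi), r * math.sin(phi), z)


def project(v, n):
    """Orthographic projection: remove the component of v along unit vector n."""
    d = sum(a * b for a, b in zip(v, n))
    return tuple(a - d * b for a, b in zip(v, n))


def angle_between(u, v):
    """Angle in radians between u and v, or None if either is (near) zero."""
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(a * a for a in v))
    if nu < 1e-12 or nv < 1e-12:
        return None
    c = sum(a * b for a, b in zip(u, v)) / (nu * nv)
    return math.acos(max(-1.0, min(1.0, c)))


def sample_projected_angle_pairs(e0, e1, e2, n_samples=10000, seed=0):
    """Sample pairs (angle(e0, e1), angle(e1, e2)) as seen in the image plane."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n_samples):
        n = random_view_direction(rng)
        p0, p1, p2 = project(e0, n), project(e1, n), project(e2, n)
        a01 = angle_between(p0, p1)
        a12 = angle_between(p1, p2)
        # Skip degenerate views where an edge projects to a point.
        if a01 is not None and a12 is not None:
            pairs.append((a01, a12))
    return pairs


# Example: the three mutually perpendicular edges at a cube corner.
pairs = sample_projected_angle_pairs((1, 0, 0), (0, 1, 0), (0, 0, 1))
```

Binning `pairs` into a two-dimensional histogram gives an empirical estimate that could be compared against an analytic joint density.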

Composition of a Nonlinear Storytelling Board while Maintaining Vertical and Horizontal Context of Scenes (비선형 스토리텔링보드 구성과 종적 횡적 장면의 맥락 유지)

  • Hongsik Pak;Suhyeon Choi;Taegu Lee
    • The Journal of the Convergence on Culture Technology / v.9 no.4 / pp.423-430 / 2023
  • This paper discusses the formulation of a nonlinear storytelling board that preserves the contextual perspective of characters. Storytelling encompasses the director's creative intention by leveraging the interaction of various elements to construct a logical narrative that explores cause and effect. Its primary objective is to enhance viewers' empathy. Consequently, there is a pressing need for comprehensive research on differentiating storytelling from storyboarding. Moreover, an integrated approach to storytelling and storyboarding holds scholarly value in understanding the process of narrative composition and visualization. This study therefore proposes a method for constructing nonlinear storytelling boards that considers discrete camera perspectives and contextual scene continuity, ultimately contributing to the comprehension of visual complexity and correlations. This approach enables careful and simultaneous consideration of the correlations that deepen cognition, including the physical, emotional, and event rhythms described in Karen Pearlman's theory.

Raising Visual Experience of Soccer Video for Mobile Viewers (이동형 단말기 사용자를 위한 축구경기 비디오의 시청경험 향상 방법)

  • Ahn, Il-Koo;Ko, Jae-Seung;Kim, Won-Jun;Kim, Chang-Ick
    • Journal of KIISE:Computing Practices and Letters / v.13 no.3 / pp.165-178 / 2007
  • Recent progress in multimedia signal processing and transmission technologies has contributed to the extensive use of multimedia devices with small LCD panels for watching sports games. However, most video sequences are captured for normal viewing on standard TV or HDTV and, for cost reasons, are merely resized and delivered without additional editing. This may make it uncomfortable for small-display viewers to understand what is happening in a scene. For instance, in a soccer video sequence taken with a long-shot camera technique, tiny objects (e.g., the soccer ball and players) may not be clearly visible on a small LCD panel. Moreover, it is also difficult to recognize the contents of the scorebox, which contains the elapsed time and scores. This requires an intelligent display technique to provide small-display viewers with a better experience. To this end, one of the key technologies is to determine the region of interest (ROI) and display the magnified ROI on the screen, where the ROI is a part of the scene that viewers pay more attention to than other regions. Examples include a region surrounding the ball in a long shot and a scorebox located in the corner of each frame. In this paper, we propose a scheme for improving the viewing experience of multimedia mobile device users. Instead of taking generic approaches that use visually salient features to extract the ROI in a scene, we take a domain-specific approach that exploits unique attributes of soccer video. The proposed scheme consists of two modules: ROI determination and scorebox extraction. The experimental results show that the proposed scheme offers useful tools for intelligent video display on multimedia mobile devices.
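
The display step described above (magnifying a region around the ball) can be illustrated with a minimal sketch. It assumes the ball position is already known, since the abstract does not describe how the ball is located, and it uses a fixed-size window with nearest-neighbour magnification. The function names and the frame representation (a list of pixel rows) are hypothetical.

```python
def roi_window(center, frame_size, roi_size):
    """Return (left, top, right, bottom) of an ROI centred on `center`,
    shifted as needed so that it stays inside the frame."""
    cx, cy = center
    frame_w, frame_h = frame_size
    roi_w = min(roi_size[0], frame_w)
    roi_h = min(roi_size[1], frame_h)
    left = min(max(cx - roi_w // 2, 0), frame_w - roi_w)
    top = min(max(cy - roi_h // 2, 0), frame_h - roi_h)
    return left, top, left + roi_w, top + roi_h


def crop_and_magnify(frame, window, scale):
    """Crop `window` from `frame` and enlarge it by an integer factor
    using nearest-neighbour replication."""
    left, top, right, bottom = window
    out = []
    for y in range(top, bottom):
        row = []
        for x in range(left, right):
            row.extend([frame[y][x]] * scale)
        for _ in range(scale):
            out.append(list(row))
    return out


# Example: a 40x20 window around a ball near the top-left corner of a 100x50 frame.
window = roi_window((5, 5), (100, 50), (40, 20))
```

Clamping the window rather than shrinking it keeps the magnified view at a constant size as the ball approaches the frame edge.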

A new approach for overlay text detection from complex video scene (새로운 비디오 자막 영역 검출 기법)

  • Kim, Won-Jun;Kim, Chang-Ick
    • Journal of Broadcast Engineering / v.13 no.4 / pp.544-553 / 2008
  • With the development of video editing technology, overlay text is increasingly inserted into video content to provide viewers with better visual understanding. Since the content of the scene or the editor's intention can be well represented by inserted text, it is useful for video information retrieval and indexing. Most previous approaches are based on low-level features, such as edge, color, and texture information. However, existing methods have difficulty handling text with various contrasts or text inserted in a complex background. In this paper, we propose a novel framework to localize the overlay text in a video scene. Based on our observation that transient colors exist between inserted text and its adjacent background, a transition map is generated. Candidate regions are then extracted using the transition map, and overlay text is finally determined based on the density of state in each candidate. The proposed method is robust to the color, size, position, style, and contrast of overlay text. It is also language independent. Text region updates between frames are also exploited to reduce the processing time. Experiments are performed on diverse videos to confirm the efficiency of the proposed method.

The chemical reactivity of detecting tube detection equipment for incident responder (화학사고 초기대응자를 위한 검지관식 탐지장비의 반응성 연구)

  • Ahn, Seung-Young;Kim, Jungmin;Kim, Sungbum;Chun, Kwangsoo;Lee, Jin-Seon;Park, Choonhwa
    • Journal of the Society of Disaster Information / v.10 no.1 / pp.33-39 / 2014
  • At the scene of a chemical accident, initial responders need to identify the substances involved and their concentrations quickly and easily, and the direct-reading detection equipment used for this purpose by the U.S. Environmental Protection Agency (EPA) is widely used by initial response teams. Among the direct-reading detection equipment that the Ministry of Environment uses at accident scenes, detector-tube gas detection equipment is useful for rapid detection and identification, giving an approximate qualitative and quantitative picture of the causative pollutants before precise analysis. However, initial responders at the scene often lack understanding of direct-reading detection equipment and, because the equipment is simple to use, check only the displayed values, which has raised growing questions about the accuracy of detection results. To obtain accurate detection results at accident scenes, this paper examines the reactivity of the Kitagawa and Draeger detector tubes used in the Ministry of Environment's detector-tube gas detectors, with the aim of improving the accuracy of on-site detection results obtained by initial responders.

An Analysis of Earth System Understandings (ESU) of 8th-grade Students' Imagery about 'the Earth' Represented by Words and Drawings (단어와 그림으로 표현된 8학년 학생들의 '지구'에 대한 심상에서 나타난 지구계 이해 분석)

  • Oh, Hyun-Seok;Kim, Chan-Jong
    • Journal of the Korean earth science society / v.31 no.1 / pp.71-87 / 2010
  • The purpose of this study was to explore 8th-grade students' imagery of the Earth. We analyzed middle school students' imagery of the Earth, represented with words and drawings, within the Earth Systems Understanding (ESU) framework. The students' imagery of 'the Earth' varied with their experiences and prior knowledge, which significantly affected how they constructed that imagery. In particular, the students' ESU was characterized by two aspects: one is a macroscopic viewpoint based on the Earth as a whole object, drawn from indirect experiences, and the other is an everyday viewpoint based on scenes of the Earth's surface and environment, drawn from direct experiences. The results revealed that students' imagery of the Earth was influenced by visual experiences and that their ESU was represented more through drawings, as visual imagery, than through words, as formal language. Negative imagery was mainly represented through interactions of the Earth's subsystems.

Underwater 3D Reconstruction for Underwater Construction Robot Based on 2D Multibeam Imaging Sonar

  • Song, Young-eun;Choi, Seung-Joon
    • Journal of Ocean Engineering and Technology / v.30 no.3 / pp.227-233 / 2016
  • This paper presents an underwater structure 3D reconstruction method using a 2D multibeam imaging sonar. Compared with other underwater environmental recognition sensors, the 2D multibeam imaging sonar offers high resolution images in water with a high turbidity level by showing the reflection intensity data in real-time. With such advantages, almost all underwater applications, including ROVs, have applied this 2D multibeam imaging sonar. However, the elevation data are missing in sonar images, which causes difficulties with correctly understanding the underwater topography. To solve this problem, this paper concentrates on the physical relationship between the sonar image and the scene topography to find the elevation information. First, the modeling of the sonar reflection intensity data is studied using the distances and angles of the sonar beams and underwater objects. Second, the elevation data are determined based on parameters like the reflection intensity and shadow length. Then, the elevation information is applied to the 3D underwater reconstruction. This paper evaluates the presented real-time 3D reconstruction method using real underwater environments. Experimental results are shown to appraise the performance of the method. Additionally, with the utilization of ROVs, the contour and texture image mapping results from the obtained 3D reconstruction results are presented as applications.
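
The abstract mentions shadow length as one cue for recovering elevation. The paper's own model is not given here, so the following is only a sketch of the textbook flat-seabed relation from similar triangles, assuming a known sonar altitude above the seabed, a slant range measured to the top of the object, and a shadow length measured along the slant range. The function and parameter names are hypothetical.

```python
def height_from_shadow(sonar_altitude, range_to_object, shadow_length):
    """Estimate object height above a flat seabed from its acoustic shadow.

    The ray grazing the object's top reaches the seabed at slant range
    (range_to_object + shadow_length), so by similar triangles:
        height = sonar_altitude * shadow_length / (range_to_object + shadow_length)
    All quantities share the same length unit.
    """
    if sonar_altitude <= 0 or range_to_object <= 0 or shadow_length < 0:
        raise ValueError("altitude and range must be positive; shadow length non-negative")
    return sonar_altitude * shadow_length / (range_to_object + shadow_length)


# Example: sonar 2.0 m above the seabed, object top at 3.0 m slant range,
# shadow extending a further 1.0 m along the slant range.
height = height_from_shadow(2.0, 3.0, 1.0)
```

A sloping or uneven seabed breaks the similar-triangles assumption, which is one reason a method might combine shadow length with reflection intensity as the abstract describes.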

SVDD based Scene Understanding using Color Space Information (색 공간 정보를 이용한 지지벡터 영역 묘사 기반의 장면 이해)

  • Kim, Soo-Wan;Chang, Hyung-Jin;Kang, Woo-Sung;Choi, Jin-Young
    • Proceedings of the KIEE Conference / 2008.10b / pp.264-265 / 2008
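  • Object detection algorithms in existing video surveillance systems are mainly based on background modeling. Although this approach outperforms frame differencing, it still works only with a stationary camera, and many of the algorithm's thresholds must be adjusted one by one to the current situation depending on the surrounding environment. This paper therefore proposes a method that detects objects of interest without background modeling, using the color information of the input image to directly classify the various targets in the image. The proposed algorithm first performs a simple segmentation of the current image so that each area presumed to be a single object forms one region. It then computes a representative color value for each region, classifies it with the Support Vector Domain Description (SVDD) algorithm against previously learned data, and uses the result to determine what the region is. The method can be used with moving cameras as well as stationary ones, and because it relies on few kinds of thresholds, it can be applied generally in many situations.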


AN IMAGE SEGMENTATION LEVEL SET METHOD FOR BUILDING DETECTION

  • Konstantinos, Karantzalos;Demetre, Argialas
    • Proceedings of the KSRS Conference / v.2 / pp.610-614 / 2006
  • In this paper, the advanced method of geodesic active contours was developed for the task of building detection from aerial and satellite images. Automatic extraction of man-made structures, including buildings, building blocks, or roads, from remote sensing data is useful for land use mapping, scene understanding, robotic navigation, image retrieval, surveillance, emergency management procedures, cadastral applications, etc. A level set method based on a region-driven segmentation model was implemented, and building boundaries were detected through this curve propagation technique. The essence of this approach is to optimize the position and the geometric form of the curve by measuring information along that curve and within the regions that compose the image partition. To this end, one can assume uniform intensities inside the objects and in the background. Thus, given an initial position of the curve, one can determine global, region-driven functions and provide a statistical description of the area inside and outside the object. The calculus of variations and a gradient descent method were used to optimize the variational functional by an iterative steady-state process. Experimental results demonstrate the potential of the proposed processing scheme.

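
The region-driven idea in the abstract above (uniform intensity inside the object and in the background, optimized by iterative gradient descent) can be sketched as a simplified two-phase update in the style of the Chan-Vese model. This is not the paper's functional: the sketch deliberately omits the curvature (length) term and the smoothed delta function, so it shows only the region competition part. The image is assumed to be a small list of rows of grey values, and all names are hypothetical.

```python
def region_means(image, phi):
    """Mean intensity inside (phi > 0) and outside (phi <= 0) the contour."""
    in_sum = out_sum = 0.0
    in_n = out_n = 0
    for img_row, phi_row in zip(image, phi):
        for value, p in zip(img_row, phi_row):
            if p > 0:
                in_sum += value
                in_n += 1
            else:
                out_sum += value
                out_n += 1
    c_in = in_sum / in_n if in_n else 0.0
    c_out = out_sum / out_n if out_n else 0.0
    return c_in, c_out


def evolve(image, phi, steps=20, dt=0.5):
    """Gradient-descent iterations on the level set function phi.

    Each pixel is pushed towards the region whose mean it matches better:
    phi grows where the pixel is closer to the inside mean and shrinks
    where it is closer to the outside mean.
    """
    phi = [list(row) for row in phi]
    for _ in range(steps):
        c_in, c_out = region_means(image, phi)
        for y, img_row in enumerate(image):
            for x, value in enumerate(img_row):
                force = (value - c_out) ** 2 - (value - c_in) ** 2
                phi[y][x] += dt * force
    return phi


# Example: a bright 2x2 "building" on a dark background, with an initial
# contour that covers the top-left 3x3 block.
image = [[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 0]]
phi0 = [[1.0 if x < 3 and y < 3 else -1.0 for x in range(4)] for y in range(4)]
mask = [[1 if p > 0 else 0 for p in row] for row in evolve(image, phi0)]
```

On real aerial imagery a regularization term is normally needed to keep the contour smooth, which is what the omitted curvature term provides.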

Kinematic Analysis of the Linking Motion from the Swallow Skill to the Nakayama Skill on the Rings (링의 스왈로에서 나까야마 기술로의 연결 동작에 대한 운동학적 분석)

  • Chung, Nam-Ju
    • Korean Journal of Applied Biomechanics / v.14 no.2 / pp.1-14 / 2004
  • This study aimed to help athletes improve their technical understanding of two high-difficulty motions, the Swallow and the Nakayama, and to enhance their competitiveness by analyzing the kinematic factors required to link the two motions on the rings in competition, using current national athletes. For this purpose, the rings event was videotaped for male artistic gymnasts participating in the final elimination match of the 2004 Athens Olympic Games. The linking motions from the Swallow to the Nakayama performed by the first- and second-place athletes were selected and analyzed using the DLT (direct linear transformation) method. The conclusions were as follows. A1 properly performed the flexing and extending movements, using the angular velocity of the segments and joints to switch motions with the body when linking the Swallow skill to the Nakayama skill. A2 was judged to perform the skill by relying on force in a static state. Therefore, A1 should take care to avoid shaking when using the elasticity of the body. For A2, proper use of the elasticity of the body, while taking care to avoid shaking during the switching motion and taking advantage of his strength, is expected to contribute to his competitiveness.