• 제목/요약/키워드: scene image

검색결과 947건 처리시간 0.037초

Multimodal Context Embedding for Scene Graph Generation

  • Jung, Gayoung;Kim, Incheol
    • Journal of Information Processing Systems
    • /
    • 제16권6호
    • /
    • pp.1250-1260
    • /
    • 2020
  • This study proposes a novel deep neural network model that can accurately detect objects and their relationships in an image and represent them as a scene graph. The proposed model utilizes several multimodal features, including linguistic features and visual context features, to accurately detect objects and relationships. In addition, in the proposed model, context features are embedded using graph neural networks to depict the dependencies between two related objects in the context feature vector. This study demonstrates the effectiveness of the proposed model through comparative experiments using the Visual Genome benchmark dataset.

스캔 영상 기반의 밀리미터파(Ka 밴드) 복합모드 탐색기 표적인식 알고리즘 연구 (Target Recognition Algorithm Based on a Scanned Image on a Millimeter-Wave(Ka-Band) Multi-Mode Seeker)

  • 노경아;정준영;송성찬
    • 한국전자파학회논문지
    • /
    • 제30권2호
    • /
    • pp.177-180
    • /
    • 2019
  • 유도무기의 명중률 개선을 위해 해상 클러터 환경에서 표적을 정확하게 탐지하고 인식하는 연구가 다수 수행되고 있다. 해상 표적과 클러터의 신호가 다양하고 복잡한 특성을 보이기 때문에 능동 표적인식 기술에 대한 연구가 필수적으로 요구된다. 본 논문에서는 스캔 영상(scan image)으로 형성된 이미지에 프랙탈 차원기법(fractal dimension)인 FS(Fractal Signature) 분류기와 영상정합기법(scene matching)인 HRTI(High Resolution Target Image)을 적용하여 표적과 클러터를 구분하고 표적 간의 인식하는 알고리즘을 제안한다. 알고리즘을 적용한 시뮬레이션 수행 결과, HRTI 분류기는 표적1과 표적2를 모두 100 %, FS 분류기는 표적 1과 표적 2를 각 각 90 %, 93 % 이상 구분 및 인식한다.

Video Content Manipulation Using 3D Analysis for MPEG-4

  • Sull, Sanghoon
    • 방송공학회논문지
    • /
    • 제2권2호
    • /
    • pp.125-135
    • /
    • 1997
  • This paper is concerned with realistic mainpulation of content in video sequences. Manipulation of content in video sequences is one of the content-based functionalities for MPEG-4 Visual standard. We present an approach to synthesizing video sequences by using the intermediate outputs of three-dimensional (3D) motion and depth analysis. For concreteness, we focus on video showing 3D motion of an observer relative to a scene containing planar runways (or roads). We first present a simple runway (or road) model. Then, we describe a method of identifying the runway (or road) boundary in the image using the Point of Heading Direction (PHD) which is defined as the image of, the ray along which a camera moves. The 3D motion of the camera is obtained from one of the existing 3D analysis methods. Then, a video sequence containing a runway is manipulated by (i) coloring the scene part above a vanishing line, say blue, to show sky, (ii) filling in the occluded scene parts, and (iii) overlaying the identified runway edges and placing yellow disks in them, simulating lights. Experimental results for a real video sequence are presented.

  • PDF

Acoustooptical Approach for Moving Scene Holography

  • Petrov, Vladimir
    • 한국정보디스플레이학회:학술대회논문집
    • /
    • 한국정보디스플레이학회 2003년도 International Meeting on Information Display
    • /
    • pp.451-462
    • /
    • 2003
  • At the paper the method of 3D holographic moving image reconstruction is discused. The main idea of this method is based on the substitution of optically created static hologram by equal diffraction array created by acoustical (AO) field which formed by bulk sound waves. Such sound field can be considered as dynamic optical hologram, which is electrically controlled. At the certain moment of time when the whole hologram already formed, the reference optical beam illuminates it, and due to acoustooptical interaction the original optical image is reconstructed. As the acoustically created dynamic optical hologram is electronically controlled, it can be used for moving 3-dimentional scene reconstruction in real time. The architecture of holographic display for moving scene reconstruction is presented at this paper. The calculated variant of such display laboratory model is. given and discussed. The mathematical simulation of step by step images recording and reconstruction is given. The pictures of calculated reconstructed images are presented. The prospects, application areas, shortcomings and main problems are discussed.

  • PDF

Imaging a scene from experience given verbal experssions

  • Sakai, Y.;Kitazawa, M.;Takahashi, S.
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 1995년도 Proceedings of the Korea Automation Control Conference, 10th (KACC); Seoul, Korea; 23-25 Oct. 1995
    • /
    • pp.307-310
    • /
    • 1995
  • In the conventional systems, a human must have knowledge of machines and of their special language in communicating with machines. In one side, it is desirable for a human but in another side, it is true that achieving it is very elaborate and is also a significant cause of human error. To reduce this sort of human load, an intelligent man-machine interface is desirable to exist between a human operator and machines to be operated. In the ordinary human communication, not only linguistic information but also visual information is effective, compensating for each others defect. From this viewpoint, problem of translating verbal expressions to some visual image is discussed here in this paper. The location relation between any two objects in a visual scene is a key in translating verbal information to visual information, as is the case in Fig.l. The present translation system advances in knowledge with experience. It consists of Japanese Language processing, image processing, and Japanese-scene translation functions.

  • PDF

원격탐사 데이타의 정확도 향상을 위한 Bitemporal Classification 기법의 적용 (Application of Bitemporal Classification Technique for Accuracy Improvement of Remotely Sensed Data)

  • 안철호;안기원;윤상호;박민호
    • 한국측량학회지
    • /
    • 제5권2호
    • /
    • pp.24-33
    • /
    • 1987
  • 본 논문은 원격탐사 Data를 이용한 분야에서 보다 효과적인 좌상처리 기법 및 보다 정확한 분류화상을 얻는 것을 목적으로 하고 있다. 이의 실행을 위해 여름 좌상과 겨울 화상을 합성한 토지이용 분류결과와 여름 화상만의 분류결과를 비교분석 하였다. 위의 분석결과로부터 Bitemporal Classification 기법과 $tan^{-1}$변환이 유효함을 알아내었다. 특히 Bitemporal Classification 기법을 적용함으로써 농경지를 논과 밭으로 구별하여 분류하는 것이 보다 가능하였다.

  • PDF

Acoustooptical Approach for Moving Scene Holography

  • Petrov, Vladimir
    • Journal of Information Display
    • /
    • 제4권3호
    • /
    • pp.29-34
    • /
    • 2003
  • At the paper the method of 3D holographic moving image reconstruction is discused. The main idea of this method is based on the substitution of optically created static hologram by equal diffraction array created by acoustical (AO) field which formed by bulk sound waves. Such sound field can be considered as dynamic optical hologram, which is electrically controlled. At the certain moment of time when the whole hologram already formed, the reference optical beam illuminates it, and due to acoustooptical interaction the original optical image is reconstructed. As the acoustically created dynamic optical hologram is electronically controlled, it can be used for moving 3-dimentional scene reconstruction in real time. The architecture of holographic display for moving scene reconstruction is presented at this paper. The calculated variant of such display laboratory model is given and discussed. The mathematical simulation of step by step images recording and reconstruction is given. The pictures of calculated reconstructed images are presented. The prospects, application areas, shortcomings and main problems are discussed.

IMAGE SYNTHESIS FOR DYNAMIC SCENES

  • Feng, Chen-Chin;Chang, Su-Yuan;Yang, Shi-Nine
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 1999년도 KOBA 방송기술 워크샵 KOBA Broadcasting Technology Workshop
    • /
    • pp.15.1-21
    • /
    • 1999
  • Radiosity method is a global illumination model for image synthesis. It computes all energy interactions among diffuse elements in a virtual environment. One of the major drawbacks if its time consuming computation. Existing radiosity algorithms for static scene is difficult to be applicable to dynamic environments. In this paper we proposed an hierarchical scene partition scheme to speedup the link update computations in the dynamic environments. Since the proposed spatial data structure is global, it not only can be used to speedup the culling of non-affected links after geometry change, but also can be used to accelerate the subsequent visibility computation. Several empirical tests are given to show the efficiency of our improved algorithm.

On-Board Satellite MSS Image Compression

  • Ghassemian, Hassan;Amidian, Asghar
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2003년도 Proceedings of ACRS 2003 ISRS
    • /
    • pp.645-647
    • /
    • 2003
  • In this work a new method for on-line scene segmentation is developed. In remote sensing a scene is represented by the pixel-oriented features. It is possible to reduce data redundancy by an unsupervised segment-feature extraction process, where the segment-features, rather than the pixelfeatures, are used for multispectral scene representation. The algorithm partitions the observation space into exhaustive set of disjoint segments. Then, pixels belonging to each segment are characterized by segment features. Illustrative examples are presented, and the performance of features is investigated. Results show an average compression more than 25, the classification performance is improved for all classes, and the CPU time required for classification is reduced by the same factor.

  • PDF

Noise Reduction for Photon Counting Imaging Using Discrete Wavelet Transform

  • Lee, Jaehoon;Kurosaki, Masayuki;Cho, Myungjin;Lee, Min-Chul
    • Journal of information and communication convergence engineering
    • /
    • 제19권4호
    • /
    • pp.276-283
    • /
    • 2021
  • In this paper, we propose an effective noise reduction method for photon counting imaging using a discrete wavelet transform. Conventional 2D photon counting imaging was used to visualize the object under dark conditions using statistical methods, such as the Poisson random process. The photons in the scene were estimated using a statistical method. However, photons which disturb the visualization and decrease the image quality may occur in the background where there is no object. Although median filters are used to reduce the noise, the noise in the scene remains. To remove the noise effectively, our proposed method uses the discrete wavelet transform, which removes the noise in the scene using a specific thresholding method that utilizes photon counting imaging characteristics. We conducted an optical experiment to demonstrate the denoising performance of the proposed method.