• Title/Abstract/Keywords: Image Synthesis


Disparity Refinement near the Object Boundaries for Virtual-View Quality Enhancement

  • Lee, Gyu-cheol;Yoo, Jisang
    • Journal of Electrical Engineering and Technology
    • /
    • Vol. 10, No. 5
    • /
    • pp.2189-2196
    • /
    • 2015
  • Stereo matching algorithms are commonly used to obtain a disparity map from a pair of images. However, a disparity map obtained by stereo matching contains considerable noise and many error regions. In this paper, we propose a virtual-view synthesis algorithm that uses disparity refinement to improve the quality of the synthesized image. First, error regions are detected by checking the consistency of the disparity maps. Then, motion information is acquired by applying optical flow to the texture component of the image in order to improve performance. Next, the occlusion region is found using optical flow on the texture component of the image, which improves the performance of the optical flow. Finally, the refined disparity map is used to synthesize the virtual-view image. Experimental results show that the proposed algorithm improves the quality of the generated virtual view.
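
The abstract above detects error regions by checking the consistency of the disparity maps but does not give the exact test. A common way to do this is a left-right consistency check, sketched below under stated assumptions: disparities are integers, a left pixel at column x with disparity d matches the right pixel at column x - d, and `threshold` is a hypothetical tolerance. This is an illustration, not the authors' implementation.

```python
def left_right_consistency(disp_left, disp_right, threshold=1):
    """Return a mask where True marks pixels that fail the consistency check.

    Assumption: disp_left[y][x] = d means left pixel (x, y) matches
    right pixel (x - d, y), and disparities are integers.
    """
    height = len(disp_left)
    width = len(disp_left[0])
    error_mask = [[False] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            d = disp_left[y][x]
            xr = x - d  # matching column in the right image
            if xr < 0 or xr >= width:
                # The match falls outside the right image: treat as an error.
                error_mask[y][x] = True
            elif abs(d - disp_right[y][xr]) > threshold:
                # The two maps disagree by more than the tolerance.
                error_mask[y][x] = True
    return error_mask


if __name__ == "__main__":
    left = [[1, 1, 1, 1]]
    right = [[1, 1, 3, 1]]
    print(left_right_consistency(left, right))
```

Pixels flagged in the mask are candidates for refinement; how they are then filled is a separate step that this sketch does not cover.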

Real-time Multiple Stereo Image Synthesis using Depth Information

  • 장세훈;한충신;배진우;유지상
    • 한국통신학회논문지
    • /
    • Vol. 30, No. 4C
    • /
    • pp.239-246
    • /
    • 2005
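  • In this paper, we use the RGB information of an image and per-pixel 8-bit depth information to generate a virtual right image that forms a stereo pair with the current image. In this process, the depth information is converted into disparity information, and the resulting disparity is used to generate the right image. We also propose a technique that, after synthesizing the stereo image, applies geometric transformations such as rotation and translation to synthesize multi-view stereo images that account for the observer's position. In addition, by analyzing the relationship between depth and disparity, we propose a fast LUT (look-up table) method for real-time per-pixel processing. Experimental results show that, for SD-class video, stereo images for 11 viewpoints can be synthesized in real time from 8-bit depth information alone.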
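
Because the depth is 8-bit, the depth-to-disparity conversion can be precomputed once for all 256 values and then applied per pixel by table lookup. The sketch below illustrates that idea for a single image row. The linear depth-to-disparity mapping, the `max_disparity` parameter, and the function names are assumptions for illustration; the abstract does not give the paper's actual conversion formula.

```python
def build_disparity_lut(max_disparity=16):
    """Precompute a disparity for each of the 256 possible depth values.

    Assumption: disparity grows linearly with the 8-bit depth value,
    with 255 meaning nearest to the camera.
    """
    return [round(v * max_disparity / 255) for v in range(256)]


def synthesize_right_row(left_row, depth_row, lut, hole=None):
    """Forward-warp one row of the left image to a virtual right view."""
    width = len(left_row)
    right_row = [hole] * width
    best_disp = [-1] * width  # largest disparity written to each target pixel
    for x in range(width):
        d = lut[depth_row[x]]  # table lookup instead of per-pixel arithmetic
        xr = x - d
        # When two source pixels land on the same target, keep the nearer one.
        if 0 <= xr < width and d > best_disp[xr]:
            right_row[xr] = left_row[x]
            best_disp[xr] = d
    return right_row


if __name__ == "__main__":
    lut = build_disparity_lut(max_disparity=2)
    print(synthesize_right_row(["a", "b", "c", "d"], [0, 0, 255, 0], lut))
```

Target pixels that receive no source pixel stay at the `hole` value and would need hole filling in a full pipeline.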

Text Augmentation Using Hierarchy-based Word Replacement

  • Kim, Museong;Kim, Namgyu
    • 한국컴퓨터정보학회논문지
    • /
    • Vol. 26, No. 1
    • /
    • pp.57-67
    • /
    • 2021
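  • Recently, multi-modal deep learning techniques that use heterogeneous data together in deep learning analysis have been widely adopted, and research on text-to-image synthesis, which automatically generates images from text, has been particularly active. Because deep learning for image synthesis requires vast amounts of data consisting of pairs of images and text describing them, data augmentation techniques have been devised to generate large amounts of data from a small amount. For text data augmentation, many techniques based on synonym replacement are used, but they have a limitation: replacing nouns with synonyms may produce text that differs from the content of the image. This study therefore presents a text data augmentation approach that exploits the part-of-speech-specific characteristics of words, namely replacing words of certain parts of speech using word hierarchy information. To evaluate the performance of the proposed methodology, experiments were conducted on MSCOCO data and the results are presented.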
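
The abstract does not say how the word hierarchy is used, so the following is only a toy sketch of one plausible reading: a noun is replaced by its parent in a hierarchy, which keeps the caption true to the image while changing the wording. The `HYPERNYMS` table, the part-of-speech tags, and the function name are all hypothetical; a real system would draw on a lexical resource and a tagger.

```python
# Hypothetical toy hierarchy: each word maps to its parent (more general) term.
HYPERNYMS = {
    "puppy": "dog",
    "dog": "animal",
    "sedan": "car",
    "car": "vehicle",
}


def augment_caption(tagged_tokens, hierarchy=HYPERNYMS, target_pos="NOUN"):
    """Return one new caption per replaceable word.

    tagged_tokens is a list of (word, part_of_speech) pairs. Only words with
    the target part of speech that appear in the hierarchy are replaced.
    """
    captions = []
    for i, (word, pos) in enumerate(tagged_tokens):
        if pos == target_pos and word in hierarchy:
            words = [w for w, _ in tagged_tokens]
            words[i] = hierarchy[word]  # swap in the more general term
            captions.append(" ".join(words))
    return captions


if __name__ == "__main__":
    tokens = [("a", "DET"), ("puppy", "NOUN"), ("rides", "VERB"),
              ("a", "DET"), ("sedan", "NOUN")]
    for caption in augment_caption(tokens):
        print(caption)
```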

A Comparison of Pan-sharpening Algorithms for GK-2A Satellite Imagery

  • 이수봉;최재완
    • 한국측량학회지
    • /
    • Vol. 40, No. 4
    • /
    • pp.275-292
    • /
    • 2022
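  • To use satellite data for climate change monitoring, GCOS (Global Climate Observing System) specifies requirements such as spatiotemporal resolution, stability over time, and uncertainty. For GK-2A, sensor limitations often prevent its products from meeting the spatial resolution requirement. This study therefore applies pan-sharpening techniques to GK-2A imagery to find the best technique for use in product generation. To this end, a total of six pan-sharpening techniques from the CS (Component Substitution), MRA (Multiresolution Analysis), VO (Variational Optimization), and DL (Deep Learning) categories were used. For DL, a synthesis-property-based method was used to build the training data. In this method, the resolutions of the PAN (Panchromatic) and MS (Multispectral) images are both lowered by the spatial resolution difference between them, a fused image is generated, and the result is compared with the original MS image. The synthesis-property-based method can produce a fused image at the level the user wants only if the degraded PAN and MS images share the same geometric characteristics. However, because dissimilarity exists when building the training data, a PSGAN model using random ratios (PSGAN_RD) was additionally used to minimize it. The fused images were validated through qualitative and quantitative analyses based on the consistency and synthesis properties. The analysis showed that, among the pan-sharpening algorithms, GSA scored highest on the indices of spatial similarity, while the PSGAN_RD model was the most accurate on the indices of spectral similarity. Considering both the spatial and spectral characteristics of the fused images, the PSGAN_RD model was judged to be the best suited for producing GK-2A products.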
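
The synthesis-property-based procedure in the abstract above can be sketched as follows: both images are degraded by the PAN-to-MS resolution ratio, so a fusion of the degraded pair has the same resolution as the original MS image, which then serves as the reference. The block-average `downsample` is an assumption made to keep the example short; practical pipelines usually apply a sensor-matched low-pass filter before decimation. All function names are hypothetical.

```python
def downsample(image, ratio):
    """Average non-overlapping ratio x ratio blocks of a 2-D list."""
    out_h = len(image) // ratio
    out_w = len(image[0]) // ratio
    out = []
    for by in range(out_h):
        row = []
        for bx in range(out_w):
            block = [image[by * ratio + dy][bx * ratio + dx]
                     for dy in range(ratio) for dx in range(ratio)]
            row.append(sum(block) / len(block))
        out.append(row)
    return out


def make_training_pair(pan, ms, ratio):
    """Return (degraded_pan, degraded_ms, reference_ms).

    A fusion model takes the two degraded images as input, and its output
    is compared with reference_ms, the original MS image.
    """
    return downsample(pan, ratio), downsample(ms, ratio), ms


if __name__ == "__main__":
    pan = [[1, 1, 2, 2], [1, 1, 2, 2], [3, 3, 4, 4], [3, 3, 4, 4]]
    ms = [[10, 20], [30, 40]]
    print(make_training_pair(pan, ms, ratio=2))
```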

View Synthesis and Coding of Multi-view Data in Arbitrary Camera Arrangements Using Multiple Layered Depth Images

  • Yoon, Seung-Uk;Ho, Yo-Sung
    • Journal of Multimedia Information System
    • /
    • Vol. 1, No. 1
    • /
    • pp.1-10
    • /
    • 2014
  • In this paper, we propose a new view synthesis technique for coding multi-view color and depth data in arbitrary camera arrangements. We treat each camera position as a 3-D point in world coordinates and build clusters of those vertices. Color and depth data within a cluster are gathered into one camera position using a hierarchical representation based on the concept of the layered depth image (LDI). Since one camera can cover only a limited viewing range, we set multiple reference cameras so that multiple LDIs are generated to cover the whole viewing range. As a result, the visual quality of views reconstructed from multiple LDIs is higher than that of views reconstructed from a single LDI. Experimental results show that the proposed scheme achieves better coding performance under arbitrary camera configurations in terms of PSNR and subjective visual quality.


Interaction art using Video Synthesis Technology

  • Kim, Sung-Soo;Eom, Hyun-Young;Lim, Chan
    • International Journal of Advanced Culture Technology
    • /
    • Vol. 7, No. 2
    • /
    • pp.195-200
    • /
    • 2019
  • Media art, which combines media technology and art, is advancing rapidly in combination with AI, IoT, and VR. This paper aims to meet people's needs by creating a video that simulates the dance moves of a subject that users admire, using media art built on interaction between users and works. The project proposes a universal image synthesis system that minimizes equipment constraints by using a deep learning-based skeleton estimation system and a deep learning neural network structure, rather than Kinect-based skeleton images. Experimental results showed that, through inference and synthesis of motions the users never actually performed, the deep learning system generated images that looked as if the users had actually danced.

Image Restoration and Object Removal Using Prioritized Adaptive Patch-Based Inpainting in a Wavelet Domain

  • Borole, Rajesh P.;Bonde, Sanjiv V.
    • Journal of Information Processing Systems
    • /
    • Vol. 13, No. 5
    • /
    • pp.1183-1202
    • /
    • 2017
  • Image restoration has mostly been carried out by texture synthesis for large regions and by inpainting algorithms for small cracks in images. In this paper, we propose a new approach that allows for the simultaneous fill-in of different structures and textures by processing in a wavelet domain. A combination of structure inpainting and patch-based texture synthesis, known as patch-based inpainting, is carried out to fill and update the target region. The wavelet transform is used for its very good multiresolution capabilities. The proposed algorithm uses the wavelet domain subbands to resolve the structure and texture components into a smooth approximation and high-frequency structural details. The subbands are processed separately by prioritized patch-based inpainting, with isophote-energy-driven texture synthesis at the core. The algorithm automatically estimates the wavelet coefficients of the target regions of the various subbands using optimized patches from the surrounding DWT coefficients. The suggested performance enhancement drastically improves execution speed over the existing algorithm. The proposed patch optimization strategy improves the quality of the fill. The fill-in gives higher priority to structures and isophotes arriving at target boundaries. The effectiveness of the algorithm is demonstrated with natural and textured images with varying textural complexions.

View synthesis with sparse light field for 6DoF immersive video

  • Kwak, Sangwoon;Yun, Joungil;Jeong, Jun-Young;Kim, Youngwook;Ihm, Insung;Cheong, Won-Sik;Seo, Jeongil
    • ETRI Journal
    • /
    • Vol. 44, No. 1
    • /
    • pp.24-37
    • /
    • 2022
  • Virtual view synthesis, which generates novel views with characteristics similar to those of actually acquired images, is an essential technical component for delivering immersive video with realistic binocular disparity and smooth motion parallax. It is typically achieved in sequence by warping the given images to the designated viewing position, blending the warped images, and filling the remaining holes. For 6DoF use cases with large motion, patch-unit warping is preferable to conventional pixel-unit methods. In the former case, the quality of the synthesized image depends heavily on the blending method. On this basis, we propose a novel blending architecture that exploits the similarity of ray directions and the distribution of depth values. Results show that the proposed method synthesizes better views than the well-designed synthesizers used within the Moving Picture Experts Group (MPEG-I). We also describe a GPU-based implementation that synthesizes and renders views in real time, with a view to applicability in immersive video services.

An Intermediate Image Generation Method using Multiresolution-based Hierarchical Disparity Map

  • 허경무;유재민
    • 제어로봇시스템학회논문지
    • /
    • Vol. 9, No. 11
    • /
    • pp.899-905
    • /
    • 2003
  • An intermediate image generation method using a multi-resolution-based hierarchical block matching disparity map is proposed. The method consists of disparity estimation, occlusion detection, and intermediate image synthesis. For disparity estimation, one of the important processes in intermediate image synthesis, we use a multi-resolution-based hierarchical block matching algorithm to overcome the imperfections of the block matching algorithm. The proposed method makes disparity maps more accurate and dense through multi-resolution-based hierarchical block matching, and the estimated disparity maps are used to generate intermediate images of stereo images. The generated intermediate images show 0.1 to 1.4 dB higher PSNR than images obtained by the block matching algorithm.
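
The PSNR gain quoted above compares a generated intermediate image with a reference image. For readers unfamiliar with the metric, a minimal sketch of the standard definition follows, assuming 8-bit grayscale images stored as nested lists (peak value 255). The function name is illustrative and not taken from the paper.

```python
import math


def psnr(reference, test, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized images."""
    total = 0.0
    count = 0
    for ref_row, test_row in zip(reference, test):
        for r, t in zip(ref_row, test_row):
            total += (r - t) ** 2
            count += 1
    mse = total / count  # mean squared error over all pixels
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak ** 2 / mse)


if __name__ == "__main__":
    reference = [[100, 110], [120, 130]]
    generated = [[101, 109], [121, 129]]
    print(psnr(reference, generated))
```

Because the scale is logarithmic, a 1 dB gain corresponds to roughly a 21% reduction in mean squared error.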

Design and Performance Improvement of a Digital Tomosynthesis System for Object-Detector Synchronous Rotation

  • 강성택;조형석;노병옥
    • 제어로봇시스템학회논문지
    • /
    • Vol. 5, No. 4
    • /
    • pp.471-480
    • /
    • 1999
  • This paper presents the design and performance improvement of a new digital tomosynthesis (DTS) system based on object-detector synchronous rotation. First, a new DTS system, called ODSR (Object-Detector Synchronous Rotation), is proposed and designed to acquire X-ray digital images. Second, the shape distortion of DTS images generated by an image intensifier is modeled, a new synthesis algorithm that overcomes the limitations of the existing synthesis algorithm is proposed to improve the sharpness of the synthesized image, and an artifact analysis of the DTS system is performed. Third, performance indices that quantitatively evaluate the performance improvement are defined, and the improvement is verified experimentally with the newly designed ODSR system. The advantages of the ODSR system are expressed quantitatively in comparison with an existing system.
