• Title/Summary/Keyword: 시각적 깊이

Search Result 192, Processing Time 0.036 seconds

Unseen Object Pose Estimation using a Monocular Depth Estimator (단안 카메라 깊이 추정기를 이용한 미지 물체의 자세 추정)

  • Song, Sung-Ho;Kim, Incheol
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.05a
    • /
    • pp.637-640
    • /
    • 2022
  • 3차원 물체의 탐지와 자세 추정은 실내외 환경에서 장면 이해, 로봇의 물체 조작 작업, 자율 주행, 증강 현실 등과 같은 다양한 응용 분야들에서 공통적으로 요구되는 매우 중요한 시각 인식 기술이다. 깊이 지도를 요구하는 기존 연구들과는 달리, 본 논문에서는 RGB 컬러 영상만을 이용해 미지의 물체들, 즉 3차원 CAD 모델을 가지고 있지 않은 새로운 물체들을 탐지해내고, 이들의 자세를 추정해낼 수 있는 새로운 신경망 모델을 제안한다. 제안 모델에서는 최근 빠른 속도로 발전하고 있는 깊이 추정 기술을 이용함으로써, 깊이 측정 센서 없이도 물체 자세 추정에 필요한 깊이 지도를 컬러 영상에서 구해낼 수 있다. 본 논문에서는 벤치마크 데이터 집합을 이용한 실험을 통해, 제안 모델의 유용성을 평가한다.

Pedestrian and Vehicle Distance Estimation Based on Hard Parameter Sharing (하드 파라미터 쉐어링 기반의 보행자 및 운송 수단 거리 추정)

  • Seo, Ji-Won;Cha, Eui-Young
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.3
    • /
    • pp.389-395
    • /
    • 2022
  • Because of improvement of deep learning techniques, deep learning using computer vision such as classification, detection and segmentation has also been used widely at many fields. Expecially, automatic driving is one of the major fields that applies computer vision systems. Also there are a lot of works and researches to combine multiple tasks in a single network. In this study, we propose the network that predicts the individual depth of pedestrians and vehicles. Proposed model is constructed based on YOLOv3 for object detection and Monodepth for depth estimation, and it process object detection and depth estimation consequently using encoder and decoder based on hard parameter sharing. We also used attention module to improve the accuracy of both object detection and depth estimation. Depth is predicted with monocular image, and is trained using self-supervised training method.

Effects of Mirror-based Visual Effects on Chest Compression Quality in Cardiopulmonary Resuscitation

  • Yun, Seong-Woo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.24 no.11
    • /
    • pp.179-185
    • /
    • 2019
  • In this paper, We purpose the basic data for the success of effective CPR using mirror in order to increase the quality of chest compression during CPR. The subject of this study was an experimental study based on a randomized crossover design of 28 people who completed the BLS Health Care Provider, and collected data were analyzed by SPSS Ver. 23.0 for Win statistics program. As the research methods, depth, speed, compression to relaxation ratio, arm angle and easiness during the chest compression were measured. Taken together, the results of this study showed that using a mirror-based chest compression method for chest compressions in adult CPR could make chest compressions easier, in addition, the quality of breast compression was improved by improving the posture of the rescuers, such as the average depth of compression, compression to relaxation ratio, and arm angle. However, it is necessary to confirm the feasibility of clinical application through additional studies on various environmental factors and job groups for mirror-based chest compression method.

Image Space Occlusion Shading Model for Iso-surface Volume Rendering (등위면 볼륨렌더링을 위한 이미지 공간 폐색 쉐이딩 모델)

  • Kim, Seokyeon;You, Sangbong;Jang, Yun
    • Journal of the Korea Computer Graphics Society
    • /
    • v.20 no.4
    • /
    • pp.1-7
    • /
    • 2014
  • The volume rendering has become an important technique in many applications along with hardware development. Understanding and perception of volume visualization benefit from visual cues which are available from shading. Better visual cues can be obtained from global illumination models but it's huge amount of computation and extra GPU memory need cause a lack of interactivity. In this paper, in order to improve visual cues on volume rendering, we propose an image space occlusion shading model which requires no additional resources.

Image Contents Encryption Technique for Digital Hologram Broadcasting Service (디지털 홀로그램 방송을 위한 영상 콘텐츠의 암호화)

  • Ha, Jun;Choi, Hyun-Jun
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2013.05a
    • /
    • pp.818-819
    • /
    • 2013
  • This paper propose a contents security technique for digital holographic display service. Digital holographic video system assumes the existing service frame for 2-dimensional or 3-dimensional video, which includes data acquisition, processing, transmission, reception, and reconstruction. In this paper, we perform the encryption of RGB image and depth-map for such a system. The experimental results showed that encrypting only 0.048% of the entire data was enough to hide the constants of the RGB image and depth-map.

  • PDF

Information Types and Display Methods according to the Relation between Frequency of Exposure and Degree of Cognition (노출빈도와 인지도 관계에 따른 정보의 유형과 표현기법)

  • Han, Ji-Ae;You, Si-Cheon
    • Journal of Digital Convergence
    • /
    • v.10 no.10
    • /
    • pp.497-504
    • /
    • 2012
  • Information types and display methods according to the relation between frequency of exposure and degree of cognition was suggested by this study as a way to enhance effective communication by information in aspect of user cognition. First of all, we ascertained the relation between frequency of exposure and degree of cognition by literature research for cognitive psychology and cognitive engineering psychology, results are as follows based in it. First, we suggested information types and attributes for visualization as 'Framework' which helps designers understand cognitive demands of users. Specifically, there are 4 types(STM, STA, LTM, LTA) of information according to the relation between frequency of exposure and degree of cognition, cognitive characteristics for each types and 'attributes matrix for visualization' which is consisted of 14 attributes of high -quality information and resorted by the types. Second, we suggested a guideline for display methods according to depth of information in the design process of information contents. For display methods of STM, STA information as primary information, we suggested "Attribution theory of Distinctiveness", "Advance Organizer", "Progress Closure", "Affordance", for display methods of LTM information as multidimensional information, we suggested "Modularity", "Consistency", "Mimicry", "Mnemonic Device". We had found from this study that there are distinction of status for attributes of information visualization according to information types or depth, and various display methods by them.

3D Depth Information Extraction Algorithm Based on Motion Estimation in Monocular Video Sequence (단안 영상 시퀸스에서 움직임 추정 기반의 3차원 깊이 정보 추출 알고리즘)

  • Park, Jun-Ho;Jeon, Dae-Seong;Yun, Yeong-U
    • The KIPS Transactions:PartB
    • /
    • v.8B no.5
    • /
    • pp.549-556
    • /
    • 2001
  • The general problems of recovering 3D for 2D imagery require the depth information for each picture element form focus. The manual creation of those 3D models is consuming time and cost expensive. The goal in this paper is to simplify the depth estimation algorithm that extracts the depth information of every region from monocular image sequence with camera translation to implement 3D video in realtime. The paper is based on the property that the motion of every point within image which taken from camera translation depends on the depth information. Full-search motion estimation based on block matching algorithm is exploited at first step and ten, motion vectors are compensated for the effect by camera rotation and zooming. We have introduced the algorithm that estimates motion of object by analysis of monocular motion picture and also calculates the averages of frame depth and relative depth of region to the average depth. Simulation results show that the depth of region belongs to a near object or a distant object is in accord with relative depth that human visual system recognizes.

  • PDF

3-D Visualization of Reservoir Characteristics through GOCAD (GOCAD를 이용한 저류층 속성정보의 3차원 시각화 연구)

  • Gwak Sang-Hwan;Lee Doo Sung
    • Geophysics and Geophysical Exploration
    • /
    • v.4 no.3
    • /
    • pp.80-83
    • /
    • 2001
  • Four seismic reflection horizons in 3-D seismic data, coherence derived from the seismic data, and 38 well logs from the Boonsville Gas Filed in Texas were tried to be integrated and visualized in 3 dimensions. Time surface was constructed from pick times of the reflection horizons. Average velocities to each horizon at 38 well locations were calculated based on depth markers from the well logs and time picks from the 3-D seismic data. The time surface was transformed to depth surface through velocity interpolation. Coherence was calculated on the 3-D seismic data by semblance method. Spatial distribution of the coherence is captured easily in 3-D visualization. Comparing to a time-slice of seismic data, distinctive stratigraphic features could be correctly recognized on the 3-D visualization.

  • PDF

Single Image Dehazing Based on Depth Map Estimation via Generative Adversarial Networks (생성적 대립쌍 신경망을 이용한 깊이지도 기반 연무제거)

  • Wang, Yao;Jeong, Woojin;Moon, Young Shik
    • Journal of Internet Computing and Services
    • /
    • v.19 no.5
    • /
    • pp.43-54
    • /
    • 2018
  • Images taken in haze weather are characteristic of low contrast and poor visibility. The process of reconstructing clear-weather image from a hazy image is called dehazing. The main challenge of image dehazing is to estimate the transmission map or depth map for an input hazy image. In this paper, we propose a single image dehazing method by utilizing the Generative Adversarial Network(GAN) for accurate depth map estimation. The proposed GAN model is trained to learn a nonlinear mapping between the input hazy image and corresponding depth map. With the trained model, first the depth map of the input hazy image is estimated and used to compute the transmission map. Then a guided filter is utilized to preserve the important edge information of the hazy image, thus obtaining a refined transmission map. Finally, the haze-free image is recovered via atmospheric scattering model. Although the proposed GAN model is trained on synthetic indoor images, it can be applied to real hazy images. The experimental results demonstrate that the proposed method achieves superior dehazing results against the state-of-the-art algorithms on both the real hazy images and the synthetic hazy images, in terms of quantitative performance and visual performance.

ROS Configuration Method for Effective Control of Modular Service Manipulator (모듈형 서비스 매니퓰레이터의 제어를 위한 ROS 환경 설계 방법)

  • Koo, Mose;Kim, Sang-Hoon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.533-535
    • /
    • 2021
  • 본 연구에서는 서비스 역할을 수행하는 6축 모듈형 매니퓰레이터 개발을 목표로 하며, 최종 기술 사양에 따른 설계를 진행하는 과정에서 기구의 섬세한 동작을 효율적으로 제어하기 위해 로봇 제어 소프트웨어의 오픈소스 환경인 ROS를 사용한다. 매니퓰레이터의 동작 설계를 ROS 기반에서 제어하기 위해 중요한 기본 환경을 구축하였으며, 특히 로봇 모델링을 위한 시각화를 위해 URDF파일에 해당 매니퓰레이터의 필수 파라미터값들을 지정하여 적용하였고, 전체 동작 시나리오에 맞춰 매니퓰레이터가 특정 자세를 취할 경우의 역기구학적인 해석과 그에 따른 경로를 생성하도록 매니퓰레이터의 라이브러리인 MoveIt을 활용하여 시각적으로 표현하고 시뮬레이션을 수행하였다. 또한, 설계한 ROS 환경 설계 방법을 바탕으로 MCU와의 통신을 통해 모터의 실시간 각도 값을 제어하고, 3D 깊이 카메라의 거리정보와 이미지 정보의 융합을 통해 로봇의 서비스 내용의 개선을 기대할 수 있다.