• Title/Summary/Keyword: 깊이카메라

Search Result 473, Processing Time 0.028 seconds

A Real-time Hand Pose Recognition Method with Hidden Finger Prediction (은닉된 손가락 예측이 가능한 실시간 손 포즈 인식 방법)

  • Na, Min-Young;Choi, Jae-In;Kim, Tae-Young
    • Journal of Korea Game Society
    • /
    • v.12 no.5
    • /
    • pp.79-88
    • /
    • 2012
  • In this paper, we present a real-time hand pose recognition method to provide an intuitive user interface through hand poses or movements without a keyboard and a mouse. For this, the areas of right and left hands are segmented from the depth camera image, and noise removal is performed. Then, the rotation angle and the centroid point of each hand area are calculated. Subsequently, a circle is expanded at regular intervals from a centroid point of the hand to detect joint points and end points of the finger by obtaining the midway points of the hand boundary crossing. Lastly, the matching between the hand information calculated previously and the hand model of previous frame is performed, and the hand model is recognized to update the hand model for the next frame. This method enables users to predict the hidden fingers through the hand model information of the previous frame using temporal coherence in consecutive frames. As a result of the experiment on various hand poses with the hidden fingers using both hands, the accuracy showed over 95% and the performance indicated over 32 fps. The proposed method can be used as a contactless input interface in presentation, advertisement, education, and game applications.

Augmented Reality Authoring Tool with Marker & Gesture Interactive Features (마커 및 제스처 상호작용이 가능한 증강현실 저작도구)

  • Shim, Jinwook;Kong, Minje;Kim, Hayoung;Chae, Seungho;Jeong, Kyungho;Seo, Jonghoon;Han, Tack-Don
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.6
    • /
    • pp.720-734
    • /
    • 2013
  • In this paper, we suggest an augmented reality authoring tool system that users can easily make augmented reality contents using hand gesture and marker-based interaction methods. The previous augmented reality authoring tools are focused on augmenting a virtual object and to interact with this kind of augmented reality contents, user used the method utilizing marker or sensor. We want to solve this limited interaction method problem by applying marker based interaction method and gesture interaction method using depth sensing camera, Kinect. In this suggested system, user can easily develop simple form of marker based augmented reality contents through interface. Also, not just providing fragmentary contents, this system provides methods that user can actively interact with augmented reality contents. This research provides two interaction methods, one is marker based method using two markers and the other is utilizing marker occlusion. In addition, by recognizing and tracking user's bare hand, this system provides gesture interaction method which can zoom-in, zoom-out, move and rotate object. From heuristic evaluation about authoring tool and compared usability about marker and gesture interaction, this study confirmed a positive result.

A Study on Performance Improvement of Fruit Vegetables Automatic Grafting System (과채류 접목시스템 개선 연구)

  • Kang, Dong Hyeon;Lee, Si Young;Kim, Jong Koo;Park, Min Jung;Son, Jin Kwan;Yun, Sung-Wook;An, Se Woong;Jung, In Kyu
    • Journal of Bio-Environment Control
    • /
    • v.26 no.3
    • /
    • pp.215-220
    • /
    • 2017
  • This study was conducted to improve the insufficiency of fruit vegetable grafting system developed by National Institute of Agricultural Sciences, Rural Development Administration. When the rotary blade cut the stem of scions and rootstocks, the grafting failure at curved cutting surfaces happened. The cutting depth of a tomato seedling by a rotated cutter was calculated 0.11 mm even when the cutting arm length and the maximum stem diameter were 50 mm and 5 mm, respectively. Mathematical analysis and high-speed photography showed that there was no problem by cutting in straight the stem of scions and rootstocks. The compression test of seedling stems to design the optimal shape of gripper showed that stems were not completely restored when they were compressed above 0.8 mm and 0.6 mm in case of rootstocks and scion, respectively. This study found that the bending angle of stem of tomato seedlings at the grafting period was 10 degree on average. The optimal gripper finger was the edge finger type which could be precisely set center point by adjusting the distance between fingers. In addition, it was found that most of seedling could be grasped without damage when the finger-to-finger distances is set to 2.5 mm for scion and 3.0 mm for rootstocks and finger are coated by 1 mm-thick flexible material.

On the Study of Initializing Extended Depth of Focus Algorithm Parameters (Extended Depth of Focus 알고리듬 파라메타 초기설정에 관한 연구)

  • Yoo, Kyung-Moo;Joo, Hyo-Nam;Kim, Joon-Seek;Park, Duck-Chun;Choi, In-Ho
    • Journal of Broadcast Engineering
    • /
    • v.17 no.4
    • /
    • pp.625-633
    • /
    • 2012
  • Extended Depth of Focus (EDF) algorithms for extracting three-dimensional (3D) information from a set of optical image slices are studied by many researches recently. Due to the limited depth of focus of the microscope, only a small portion of the image slices are in focus. Most of the EDF algorithms try to find the in-focus area to generate a single focused image and a 3D depth image. Inherent to most image processing algorithms, the EDF algorithms need parameters to be properly initialized to perform successfully. In this paper, we select three popular transform-based EDF algorithms which are each based on pyramid, wavelet transform, and complex wavelet transform, and study the performance of the algorithms according to the initialization of its parameters. The parameters we considered consist of the number of levels used in the transform, the selection of the lowest level image, the window size used in high frequency filter, the noise reduction method, etc. Through extended simulation, we find a good relationship between the initialization of the parameters and the properties of both the texture and 3D ground truth images. Typically, we find that a proper initialization of the parameters improve the algorithm performance 3dB ~ 19dB over a default initialization in recovering the 3D information.

A Method for Body Keypoint Localization based on Object Detection using the RGB-D information (RGB-D 정보를 이용한 객체 탐지 기반의 신체 키포인트 검출 방법)

  • Park, Seohee;Chun, Junchul
    • Journal of Internet Computing and Services
    • /
    • v.18 no.6
    • /
    • pp.85-92
    • /
    • 2017
  • Recently, in the field of video surveillance, a Deep Learning based learning method has been applied to a method of detecting a moving person in a video and analyzing the behavior of a detected person. The human activity recognition, which is one of the fields this intelligent image analysis technology, detects the object and goes through the process of detecting the body keypoint to recognize the behavior of the detected object. In this paper, we propose a method for Body Keypoint Localization based on Object Detection using RGB-D information. First, the moving object is segmented and detected from the background using color information and depth information generated by the two cameras. The input image generated by rescaling the detected object region using RGB-D information is applied to Convolutional Pose Machines for one person's pose estimation. CPM are used to generate Belief Maps for 14 body parts per person and to detect body keypoints based on Belief Maps. This method provides an accurate region for objects to detect keypoints an can be extended from single Body Keypoint Localization to multiple Body Keypoint Localization through the integration of individual Body Keypoint Localization. In the future, it is possible to generate a model for human pose estimation using the detected keypoints and contribute to the field of human activity recognition.

Technology Status and Improvement Direction of Special Theaters in Korea by Format (국내 특수상영관 포맷별 기술현황과 개선방향)

  • Jung, Hyun-Jin
    • Journal of Korea Entertainment Industry Association
    • /
    • v.15 no.4
    • /
    • pp.73-87
    • /
    • 2021
  • Special theaters were created to provide a sense of immersion and spectacles due to differentiated screens, sound, seating facilities, and advanced services, and also expanded screens. The purpose of this study is to perform comparative analysis of the technical characteristics formats shown in special theaters(3D film, 4DX, IMAX, ScreenX, and VR) in order to identify and find ways to overcome the technological limitations in production. The various formats show differences in field of view depending on the exhibition technology and these differences affect the mise-en-scene, narrative, and editing of the film and consequently result in changes in the production environment and process. Therefore, directors and creators must understand the technological features and limitations of the new formats before making their approach. However, a new format encounters limitations on production sets due to the decline of technical education and succession. In situations where shooting with a special camera is essential, the particular characteristics of each format should be carefully considered from the planning stage but financial problems arise due to increase in production period and cost. To overcome these various obstacles, it is essential to first identify problems and present alternatives through in-depth research on the production set of each format. Finally, this research aims to explore the prototype of each format and analyze the current state of production technology with formats that have not been adapted to the market trends by combining with the other formats and showing that they can survive in new ways.

Assessment of Applicability of CNN Algorithm for Interpretation of Thermal Images Acquired in Superficial Defect Inspection Zones (포장층 이상구간에서 획득한 열화상 이미지 해석을 위한 CNN 알고리즘의 적용성 평가)

  • Jang, Byeong-Su;Kim, YoungSeok;Kim, Sewon ;Choi, Hyun-Jun;Yoon, Hyung-Koo
    • Journal of the Korean Geotechnical Society
    • /
    • v.39 no.10
    • /
    • pp.41-48
    • /
    • 2023
  • The presence of abnormalities in the subgrade of roads poses safety risks to users and results in significant maintenance costs. In this study, we aimed to experimentally evaluate the temperature distributions in abnormal areas of subgrade materials using infrared cameras and analyze the data with machine learning techniques. The experimental site was configured as a cubic shape measuring 50 cm in width, length, and depth, with abnormal areas designated for water and air. Concrete blocks covered the upper part of the site to simulate the pavement layer. Temperature distribution was monitored over 23 h, from 4 PM to 3 PM the following day, resulting in image data and numerical temperature values extracted from the middle of the abnormal area. The temperature difference between the maximum and minimum values measured 34.8℃ for water, 34.2℃ for air, and 28.6℃ for the original subgrade. To classify conditions in the measured images, we employed the image analysis method of a convolutional neural network (CNN), utilizing ResNet-101 and SqueezeNet networks. The classification accuracies of ResNet-101 for water, air, and the original subgrade were 70%, 50%, and 80%, respectively. SqueezeNet achieved classification accuracies of 60% for water, 30% for air, and 70% for the original subgrade. This study highlights the effectiveness of CNN algorithms in analyzing subgrade properties and predicting subsurface conditions.

Entropy-Based 6 Degrees of Freedom Extraction for the W-band Synthetic Aperture Radar Image Reconstruction (W-band Synthetic Aperture Radar 영상 복원을 위한 엔트로피 기반의 6 Degrees of Freedom 추출)

  • Hyokbeen Lee;Duk-jin Kim;Junwoo Kim;Juyoung Song
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.6_1
    • /
    • pp.1245-1254
    • /
    • 2023
  • Significant research has been conducted on the W-band synthetic aperture radar (SAR) system that utilizes the 77 GHz frequency modulation continuous wave (FMCW) radar. To reconstruct the high-resolution W-band SAR image, it is necessary to transform the point cloud acquired from the stereo cameras or the LiDAR in the direction of 6 degrees of freedom (DOF) and apply them to the SAR signal processing. However, there are difficulties in matching images due to the different geometric structures of images acquired from different sensors. In this study, we present the method to extract an optimized depth map by obtaining 6 DOF of the point cloud using a gradient descent method based on the entropy of the SAR image. An experiment was conducted to reconstruct a tree, which is a major road environment object, using the constructed W-band SAR system. The SAR image, reconstructed using the entropy-based gradient descent method, showed a decrease of 53.2828 in mean square error and an increase of 0.5529 in the structural similarity index, compared to SAR images reconstructed from radar coordinates.

Characteristics of Seafloor Morphology and Manganese Nodule Occurrence in the KODES area, NE Equatorial Pacific (태평양 한국심해환경연구(KODES) 지역 해저변 지형과 망간단괴 분포특성)

  • Jung, Hoi-Soo;Ko, Young-Tak;Chi, Sang-Bum;Kim, Hyun-Sub;Moon, Jai-Woon
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.4 no.4
    • /
    • pp.323-337
    • /
    • 1999
  • Seafloor morphology and manganese nodule occurrence were studied in the Korea Deep-sea Environmental Study (KODES) area, northeast equatorial Pacific, to understand their relationship. Study area is composed of three elongated valleys and hills with about 100~200 m height along NNE-SSW direction. Valley region is generally flat. However, hill region is very rugged with big cliffs of about 100m height and small depressions of several tens of meters depth. Tectonic movement along the Clarion-Clipperton fracture zone, consequent formation of elongated abyssal hills and Valleys, erosion of siliceous bottom sediments by bottom currents, and dissolution of carbonate sediments on the abyssal hills below CCD result in the rugged morphology. Manganese nodule occurrence is closely related to the morphology of the study area; mostly rounded-shaped manganese nodules with about 5 cm diameter are abundant on the flat valley region, whereas irregular shaped nodules (or manganese crust) with less than 5 cm to about 1 m diameter occur on the hill. These results supports the previous reports that nodule abundance, composition, and morphology are variable both on regional and local small scales on the seafloor even within some abundant nodule provinces depending on oceanographic characteristics such as bathymetric features, surface sediment type, sediment thickness, and so on. We suggest that such oceanographic characteristics affect interrelatedly on the formation of manganese nodules, and tectonic movement of the Pacific plate ultimately constrain the nodule occurrence. A potential mining place in the KODES area seems to be the valley region, which is elongated to the NNW-SSE direction with 3-4 km width.

  • PDF

Usefulness of Flow Composite Image in Raynaud Scan ($^{201}Tl$) ($^{201}Tl$을 이용한 레이노 검사에서 동적 Composite 영상의 유용성)

  • Kim, Dae-Yeon;Shin, Gyoo-Seol;Oh, Eun-Jung;Kim, Gun-Jae
    • The Korean Journal of Nuclear Medicine Technology
    • /
    • v.14 no.1
    • /
    • pp.101-104
    • /
    • 2010
  • Purpose: Raynaud scan is divided to flow, blood pool and local-delay image. Usually, we evaluate comparison through blood pool and local-delay image. We will evaluate about usability when comparative observe blood image and local-delay image in Raynaud scan that used $^{201}Tl$ as making flow image to one sheet of images. Materials and Methods: We have selected 29 Raynaud phenomenon patients aged 14~68 years who visited department of vascular surgery between Feb. 2008 and Aug. 2009. An intravenous injection $^{201}Tl$ of 111 MBq (3 mCi) to opposite side diagonal line limbs above an internal auditing department. Equipment used Philips gamma camera forte A-Z, and collimator used LEHR. Matrix size set up to each $64{\times}64$, $128{\times}128$, $256{\times}256$ and zoom factor used to full field. Protocol of dynamic is 2 second to 155 frames. Blood pool and delay count to 300 second. We set up ROI by a foundation to data acquired in PEGASYS processing program. Each results were analyzed with the SPSS 12.0 statistical software. Results: Each averages of count ratio (Rt / Lt) to have been given at composite image, a blood pool image, delay images analyzed at Raynaud phenomenon patients is $1.25{\pm}0.39$, $1.20{\pm}0.33$, $1.11{\pm}0.17$. The sample analysis results of blood pool image and delay image contented itself with p<0.029. Also, there don't have been each difference, and blood pool image, delay image regarding composite image was able to know. Conclusion: We were able to give help for comparison to evaluate a blood pool image and a local delay image at the Raynaud scan which used $^{201}Tl$ while making a flow image to one sheet image. Identification to be visual too was possible. If you are proceeded a researcher that there was further depth, you are more appropriate for, and you may get useful information.

  • PDF