• Title/Summary/Keyword: image Vision

Search Result 2,584, Processing Time 0.028 seconds

A Method of Hand Recognition for Virtual Hand Control of Virtual Reality Game Environment (가상 현실 게임 환경에서의 가상 손 제어를 위한 사용자 손 인식 방법)

  • Kim, Boo-Nyon;Kim, Jong-Ho;Kim, Tae-Young
    • Journal of Korea Game Society
    • /
    • v.10 no.2
    • /
    • pp.49-56
    • /
    • 2010
  • In this paper, we propose a control method of virtual hand by the recognition of a user's hand in the virtual reality game environment. We display virtual hand on the game screen after getting the information of the user's hand movement and the direction thru input images by camera. We can utilize the movement of a user's hand as an input interface for virtual hand to select and move the object. As a hand recognition method based on the vision technology, the proposed method transforms input image from RGB color space to HSV color space, then segments the hand area using double threshold of H, S value and connected component analysis. Next, The center of gravity of the hand area can be calculated by 0 and 1 moment implementation of the segmented area. Since the center of gravity is positioned onto the center of the hand, the further apart pixels from the center of the gravity among the pixels in the segmented image can be recognized as fingertips. Finally, the axis of the hand is obtained as the vector of the center of gravity and the fingertips. In order to increase recognition stability and performance the method using a history buffer and a bounding box is also shown. The experiments on various input images show that our hand recognition method provides high level of accuracy and relatively fast stable results.

Automatic Classification Algorithm for Raw Materials using Mean Shift Clustering and Stepwise Region Merging in Color (컬러 영상에서 평균 이동 클러스터링과 단계별 영역 병합을 이용한 자동 원료 분류 알고리즘)

  • Kim, SangJun;Kwak, JoonYoung;Ko, ByoungChul
    • Journal of Broadcast Engineering
    • /
    • v.21 no.3
    • /
    • pp.425-435
    • /
    • 2016
  • In this paper, we propose a classification model by analyzing raw material images recorded using a color CCD camera to automatically classify good and defective agricultural products such as rice, coffee, and green tea, and raw materials. The current classifying agricultural products mainly depends on visual selection by skilled laborers. However, classification ability may drop owing to repeated labor for a long period of time. To resolve the problems of existing human dependant commercial products, we propose a vision based automatic raw material classification combining mean shift clustering and stepwise region merging algorithm. In this paper, the image is divided into N cluster regions by applying the mean-shift clustering algorithm to the foreground map image. Second, the representative regions among the N cluster regions are selected and stepwise region-merging method is applied to integrate similar cluster regions by comparing both color and positional proximity to neighboring regions. The merged raw material objects thereby are expressed in a 2D color distribution of RG, GB, and BR. Third, a threshold is used to detect good and defective products based on color distribution ellipse for merged material objects. From the results of carrying out an experiment with diverse raw material images using the proposed method, less artificial manipulation by the user is required compared to existing clustering and commercial methods, and classification accuracy on raw materials is improved.

Alternative Tracing Method for Moving Object Using Reference Template in Real-time Image - Focusing on Parking Management System (참조 템플릿 기반 실시간 이동체 영상을 이용한 대안적 탐지 방안 - 주차관리시스템을 대상으로)

  • Joo, Yong Jin;Kang, Lee Seul;Hahm, Chang Hahk
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.32 no.5
    • /
    • pp.495-503
    • /
    • 2014
  • As the number of vehicles has been sharply increases, the significance of safety and effective operation issues in the parking lot is being emphasized, which takes a part of the transportation system. Recently, there have been several studies for the parking management by detecting moving object, however, recognizing numbers of fast-moving vehicles simultaneously in the picture is still a challenging problem. The parking lot in public area, or large-sized buildings has clear parking section, whereas the sensor system is configured to monitor a plurality of parking spaces. Therefore, by considering those parking lots, we suggested to develop the real-time parking availability information system by applying the real-time image processing techniques. with the help of template matching. Following the study, we wanted to provide the alternative method for parking management system through the reference template makers by recognizing movements of parked vehicles with the size and shape, regardless of direct detecting of driving movements. In addition, we evaluated the applicability and performances of the information system, presented in this study, and implemented a prototype system to simulate the parking statuses of each floor. In fat, it was possible to manage and analyze statistics about the total number of parking spaces and the number of vehicles parked through real-time video flames. We expected that the result of the study will be advanced, following the user-friendliness and cost reduction in operating parking management system and giving information by efficient analysis of parking situation.

3D Reconstruction using a Moving Planar Mirror (움직이는 평면거울을 이용한 3차원 물체 복원)

  • 장경호;이동훈;정순기
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.11
    • /
    • pp.1543-1550
    • /
    • 2004
  • Modeling from images is a cost-effective means of obtaining 3D geometric models. These models can be effectively constructed from classical Structure from Motion algorithm. However, it's too difficult to reconstruct whole scenes using SFM method since general sites contain a very complex shapes and brilliant colours. To overcome this difficulty, the current paper proposes a new reconstruction method based on a moving Planar mirror. We devise the mirror posture instead of scene itself as a cue for reconstructing the geometry That implies that the geometric cues are inserted into the scene by compulsion. With this method, we can obtain the geometric details regardless of the scene complexity. For this purpose, we first capture image sequences through the moving mirror containing the interested scene, and then calibrate the camera through the mirror's posture. Since the calibration results are still inaccurate due to the detection error, the camera pose is revised using frame-correspondence of the comer points that are easily obtained using the initial camera posture. Finally, 3D information is computed from a set of calibrated image sequences. We validate our approach with a set of experiments on some complex objects.

Multi-modal Emotion Recognition using Semi-supervised Learning and Multiple Neural Networks in the Wild (준 지도학습과 여러 개의 딥 뉴럴 네트워크를 사용한 멀티 모달 기반 감정 인식 알고리즘)

  • Kim, Dae Ha;Song, Byung Cheol
    • Journal of Broadcast Engineering
    • /
    • v.23 no.3
    • /
    • pp.351-360
    • /
    • 2018
  • Human emotion recognition is a research topic that is receiving continuous attention in computer vision and artificial intelligence domains. This paper proposes a method for classifying human emotions through multiple neural networks based on multi-modal signals which consist of image, landmark, and audio in a wild environment. The proposed method has the following features. First, the learning performance of the image-based network is greatly improved by employing both multi-task learning and semi-supervised learning using the spatio-temporal characteristic of videos. Second, a model for converting 1-dimensional (1D) landmark information of face into two-dimensional (2D) images, is newly proposed, and a CNN-LSTM network based on the model is proposed for better emotion recognition. Third, based on an observation that audio signals are often very effective for specific emotions, we propose an audio deep learning mechanism robust to the specific emotions. Finally, so-called emotion adaptive fusion is applied to enable synergy of multiple networks. The proposed network improves emotion classification performance by appropriately integrating existing supervised learning and semi-supervised learning networks. In the fifth attempt on the given test set in the EmotiW2017 challenge, the proposed method achieved a classification accuracy of 57.12%.

Expression and Reader Cognition of Japanese Comics Character (일본 만화 캐릭터의 표정과 독자 인지)

  • Yoon, Jang-Won
    • The Journal of the Korea Contents Association
    • /
    • v.7 no.2
    • /
    • pp.246-254
    • /
    • 2007
  • As for comics and animation, the specific gravity came to become still larger in all the art fields together with the importance in various image media now which is useful and goes the time of the 21st century new media. Especially the demand of users to the vision culture which develops day by day, Sensitivity Engineering Department is trying to realize the necessity for a sensitivity design acutely together. The influence of the comics which have toxicity most also in Japanese culture in a geographical position like South Korea on it, and animation is the actual condition in the reason which has reached from youth universally to the layer for years, to be inquired systematic to a Korean comics language. This research was conducted as we thought sufficient study on various situations are required, and among them for the research of expressions of cartoons's characters, we've divided the expressions of characters that comes out in Japanese cartoons into categories of 'happiness, anger, sadness, pleasure' and 'fear, astonishment and dislike' and based on these categories, we've drawn out the minimum elements to express emotions in cartoon and prepared image-map by relating them with languages that express emotions of people and based on this, we've made a calculating tools on how our readers would recognize the expression languages. Samples of Japanese cartoons of which we've chosen for the purpose of drawing out the elements of expressions were limited to only published cartoons and we've made a foot steps for expression analysis of animation characters in the future.

Textuality and Vision : Visual Narrative of Ancient Chinese Literature Art Focused on Narratology's Viewpoint (중국 고대예술의 도상서사와 시각문화 연구 -회화의 이시동도법과 만화의 칸의 상호 해석-)

  • Jo, Jeong-rae;Huang, Kuo-Li
    • The Journal of the Korea Contents Association
    • /
    • v.16 no.9
    • /
    • pp.779-790
    • /
    • 2016
  • This study is to exhibit the iconographic narrative and visual culture of ancient Chinese art. The focus of the study is the composite integration of literature and graphic forms, in particular the heterochronous expression of different scenarios of scenes occurring in different time periods in pictures of ancient art. The unity of their origins with picture narration and comic art creation is the fusion of our modern times. The ancient Chinese understanding of visual art includes the traditional style of images and their symbolic meanings. Among artistic narrative expression, imagery contemplation and visual presentation have significance. Artistic thinking is inseparable from visual articulation. It is a rational thought process through creative language interpretation in visual media of imagery narratives. The characteristics of ancient imagery thinking and the way of presenting sequential incidents in the form pictures is a creative space of time. This is the spatial thinking of modern comic art, which is demonstrated through acceptance in artistic styles. Image narration needs new forms and media styles, including integrating with cultural values as aesthetic communication is necessary.

Real-Time Hand Pose Tracking and Finger Action Recognition Based on 3D Hand Modeling (3차원 손 모델링 기반의 실시간 손 포즈 추적 및 손가락 동작 인식)

  • Suk, Heung-Il;Lee, Ji-Hong;Lee, Seong-Whan
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.12
    • /
    • pp.780-788
    • /
    • 2008
  • Modeling hand poses and tracking its movement are one of the challenging problems in computer vision. There are two typical approaches for the reconstruction of hand poses in 3D, depending on the number of cameras from which images are captured. One is to capture images from multiple cameras or a stereo camera. The other is to capture images from a single camera. The former approach is relatively limited, because of the environmental constraints for setting up multiple cameras. In this paper we propose a method of reconstructing 3D hand poses from a 2D input image sequence captured from a single camera by means of Belief Propagation in a graphical model and recognizing a finger clicking motion using a hidden Markov model. We define a graphical model with hidden nodes representing joints of a hand, and observable nodes with the features extracted from a 2D input image sequence. To track hand poses in 3D, we use a Belief Propagation algorithm, which provides a robust and unified framework for inference in a graphical model. From the estimated 3D hand pose we extract the information for each finger's motion, which is then fed into a hidden Markov model. To recognize natural finger actions, we consider the movements of all the fingers to recognize a single finger's action. We applied the proposed method to a virtual keypad system and the result showed a high recognition rate of 94.66% with 300 test data.

The Analysis of Evergreen Tree Area Using UAV-based Vegetation Index (UAV 기반 식생지수를 활용한 상록수 분포면적 분석)

  • Lee, Geun-Sang
    • Journal of Cadastre & Land InformatiX
    • /
    • v.47 no.1
    • /
    • pp.15-26
    • /
    • 2017
  • The decrease of green space according to the urbanization has caused many environmental problems as the destruction of habitat, air pollution, heat island effect. With interest growing in natural view recently, proper management of evergreen tree which is lived even the winter season has been on the rise importantly. This study analyzed the distribution area of evergreen tree using vegetation index based on unmanned aerial vehicle (UAV). Firstly, RGB and NIR+RG camera were loaded in fixed-wing UAV and image mosaic was achieved using GCPs based on Pix4d SW. And normalized differences vegetation index (NDVI) and soil adjusted vegetation index (SAVI) was calculated by band math function from acquired ortho mosaic image. validation points were applied to evaluate accuracy of the distribution of evergreen tree for each range value and analysis showed that kappa coefficient marked the highest as 0.822 and 0.816 respectively in "NDVI > 0.5" and "SAVI > 0.7". The area of evergreen tree in "NDVI > 0.5" and "SAVI > 0.7" was $11,824m^2$ and $15,648m^2$ respectively, that was ratio of 4.8% and 6.3% compared to total area. It was judged that UAV could supply the latest and high resolution information to vegetation works as urban environment, air pollution, climate change, and heat island effect.

Contour Extraction Method using p-Snake with Prototype Energy (원형에너지가 추가된 p-Snake를 이용한 윤곽선 추출 기법)

  • Oh, Seung-Taek;Jun, Byung-Hwan
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.4
    • /
    • pp.101-109
    • /
    • 2014
  • It is an essential element for the establishment of image processing related systems to find the exact contour from the image of an arbitrary object. In particular, if a vision system is established to inspect the products in the automated production process, it is very important to detect the contours for standardized shapes such lines and curves. In this paper, we propose a prototype adaptive dynamic contour model, p-Snake with improved contour extraction algorithms by adding the prototype energy. The proposed method is to find the initial contour by applying the existing Snake algorithm after Sobel operation is performed for prototype analysis. Next, the final contour of the object is detected by analyzing prototypes such as lines and circles, defining prototype energy and using it as an additional energy item in the existing Snake function on the basis of information on initial contour. We performed experiments on 340 images obtained by using an environment that duplicated the background of an industrial site. It was found that even if objects are not clearly distinguished from the background due to noise and lighting or the edges being insufficiently visible in the images, the contour can be extracted. In addition, in the case of similarity which is the measure representing how much it matches the prototype, the prototype similarity of contour extracted from the proposed p-ACM is superior to that of ACM by 9.85%.