• Title/Summary/Keyword: Computer Vision

Real Time Eye and Gaze Tracking (실시간 눈과 시선 위치 추적)

  • Cho, Hyeon-Seob;Kim, Hee-Sook
    • Journal of the Korea Academia-Industrial cooperation Society / v.6 no.2 / pp.195-201 / 2005
  • This paper describes preliminary results from developing a computer vision system, based on active IR illumination, for real-time gaze tracking with an interactive graphic display. Most existing gaze tracking techniques assume a static head to work well and require a cumbersome calibration process for each person. In contrast, our gaze tracker performs robust and accurate gaze estimation without per-person calibration and under rather significant head movement. This is made possible by a new gaze calibration procedure that identifies the mapping from pupil parameters to screen coordinates using Generalized Regression Neural Networks (GRNN). With GRNN, the mapping does not have to be an analytical function, and head movement is explicitly accounted for by the gaze mapping function. Furthermore, the mapping function can generalize to individuals not used in training. The effectiveness of the gaze tracker is demonstrated by preliminary experiments involving a gaze-contingent interactive graphic display.

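The GRNN mapping described in the abstract above can be illustrated with a minimal sketch. A GRNN predicts by taking a Gaussian-weighted average of the training targets, so no analytical mapping function is needed. The feature layout, the sample values, and the names `grnn_predict` and `sigma` are assumptions for illustration only; they are not taken from the paper.

```python
import math

def grnn_predict(train_x, train_y, query, sigma=0.5):
    """Predict a target as the Gaussian-weighted mean of the training targets."""
    weights = []
    for x in train_x:
        sq_dist = sum((a - b) ** 2 for a, b in zip(x, query))
        weights.append(math.exp(-sq_dist / (2.0 * sigma ** 2)))
    total = sum(weights)
    if total == 0.0:
        raise ValueError("query is too far from every training sample for this sigma")
    dims = len(train_y[0])
    return tuple(
        sum(w * y[d] for w, y in zip(weights, train_y)) / total
        for d in range(dims)
    )

# Hypothetical training pairs: pupil feature vector -> screen (x, y) in pixels.
train_x = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
train_y = [(0.0, 0.0), (1920.0, 0.0), (0.0, 1080.0), (1920.0, 1080.0)]

# A query that is equally far from all four samples maps to their mean.
centre = grnn_predict(train_x, train_y, (0.5, 0.5))
```

The smoothing parameter `sigma` is the only value to tune: small values make the prediction follow the nearest training sample, large values average over many samples.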

Gesture Recognition Using Stereo Tracking Initiator and HMM for Tele-Operation (스테레오 영상 추적 자동초기화와 HMM을 이용한 원격 작업용 제스처 인식)

  • Jeong, Ji-Won;Lee, Yong-Beom;Jin, Seong-Il
    • The Transactions of the Korea Information Processing Society / v.6 no.8 / pp.2262-2270 / 1999
  • In this paper, we describe a gesture recognition algorithm that uses a computer vision sensor and a hidden Markov model (HMM). Automatic hand region extraction is proposed for initializing the tracking of tele-operation gestures. For this, distance information (a disparity map) obtained by stereo matching of the initial left and right images is used to isolate the hand region from the scene. The PDOE (positive difference of edges) feature images adopted here were found to be robust against noise and background brightness. The KNU/KAERI (K/K) gesture instruction set is defined for tele-operation in nuclear power plants. A composite recognition model, constructed by concatenating three gesture instruction models (pre-orders, basic orders, and post-orders), is proposed and identified using a discrete HMM. Our experimental results show that sequences of more than two consecutive orders are correctly recognized at a rate above 97%.

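As a rough illustration of recognition with a discrete HMM, the sketch below scores a sequence of quantized feature symbols under several models with the forward algorithm and picks the most likely one. The two toy models, their probabilities, and the names `forward_likelihood` and `classify` are assumptions for illustration; the paper's composite pre-order/basic-order/post-order model is not reproduced here.

```python
def forward_likelihood(obs, start, trans, emit):
    """Return P(obs | model) for a discrete HMM using the forward algorithm."""
    if not obs:
        raise ValueError("observation sequence is empty")
    states = range(len(start))
    # Initialise with the start distribution and the first emitted symbol.
    alpha = [start[s] * emit[s][obs[0]] for s in states]
    for symbol in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in states) * emit[s][symbol]
            for s in states
        ]
    return sum(alpha)

def classify(obs, models):
    """Pick the model name with the highest likelihood for the observations."""
    return max(models, key=lambda name: forward_likelihood(obs, *models[name]))

# Hypothetical left-to-right models over two symbols (0 and 1).
start = [1.0, 0.0]
trans = [[0.5, 0.5], [0.0, 1.0]]
models = {
    "raise": (start, trans, [[0.9, 0.1], [0.1, 0.9]]),  # mostly 0s, then 1s
    "lower": (start, trans, [[0.1, 0.9], [0.9, 0.1]]),  # mostly 1s, then 0s
}

label = classify([0, 0, 1, 1], models)
```

For long sequences the products underflow, so a practical version would work in log space or rescale `alpha` at each step.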

Fast Image Stitching Based on Improved SURF Algorithm Using Meaningful Features (의미 있는 특징점을 이용한 향상된 SURF 알고리즘 기반의 고속 이미지 스티칭 기법)

  • Ahn, Hyo-Chang;Rhee, Sang-Burm
    • The KIPS Transactions:PartB / v.19B no.2 / pp.93-98 / 2012
  • High-resolution images are now easy to capture with high-performance digital cameras and are used in a variety of fields. In particular, image stitching, which aligns and joins a pair of images, has been actively researched. Image stitching is used for military purposes, such as satellite and reconnaissance aircraft imagery, and in computer vision applications such as medical imaging and mapping. In this paper, we propose a fast image stitching method based on an improved SURF algorithm that uses only meaningful features during image matching, after features have been extracted from scenery images. Features are extracted from each image to find corresponding points. The meaningful features are then found by removing erroneous features, such as those caused by noise, and are used as corresponding points in image matching. The total processing time of image stitching is reduced because less time is spent searching for corresponding points. In our results, feature matching and image stitching run faster than with previous algorithms, and the method produces natural-looking stitched images.
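The abstract above does not say how the meaningful features are chosen, so the following is only a sketch of the general idea: discard weak features first, then match the remaining descriptors. It assumes each feature is a hypothetical `(response, descriptor)` pair and swaps in a nearest-neighbour ratio test as the matching rule; neither choice is confirmed by the paper.

```python
import math

def keep_strong(features, min_response):
    """Keep only features whose detector response reaches the threshold."""
    return [f for f in features if f[0] >= min_response]

def ratio_match(desc_a, desc_b, ratio=0.75):
    """Match descriptors by nearest neighbour, rejecting ambiguous matches."""
    matches = []
    for i, a in enumerate(desc_a):
        dists = sorted((math.dist(a, b), j) for j, b in enumerate(desc_b))
        # Accept only if the best match is clearly better than the second best.
        if len(dists) >= 2 and dists[0][0] < ratio * dists[1][0]:
            matches.append((i, dists[0][1]))
    return matches

# Hypothetical features: (response, 2-D descriptor). Real SURF descriptors
# have 64 or 128 dimensions.
left = [(0.9, (0.0, 0.0)), (0.1, (3.0, 3.0)), (0.8, (10.0, 10.0))]
right = [(0.7, (10.0, 10.5)), (0.9, (0.0, 0.1)), (0.6, (50.0, 50.0))]

left_desc = [d for _, d in keep_strong(left, 0.5)]
right_desc = [d for _, d in keep_strong(right, 0.5)]
pairs = ratio_match(left_desc, right_desc)
```

Filtering before matching is what saves time: the brute-force search costs one distance computation per descriptor pair, so halving both feature sets cuts the work to about a quarter.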

A Curve Lane Detection Method using Lane Variation Vector and Cardinal Spline (차선 변화벡터와 카디널 스플라인을 이용한 곡선 차선 검출방법)

  • Heo, Hwan;Han, Gi-Tae
    • KIPS Transactions on Software and Data Engineering / v.3 no.7 / pp.277-284 / 2014
  • This paper proposes a curved lane detection method that is robust to lane variation. It applies a lane variation vector and a cardinal spline to inverse-perspective-transformed images and does not require camera parameters. The method sets an expected lane area in frame s and the following frame s+1, to which the inverse perspective transformation and the entire lane filter process are applied, and detects the lane area there. It then predicts the lane positions in subsequent frames from the lane variation vector computed from the detected lane areas. A scan area is set around each predicted lane position, new lane positions are detected within these areas, the lane variation vectors are updated with the detected positions, and the lanes are obtained by applying a cardinal spline to the control points inside the lane areas. The proposed method is designed for curved lanes, but it was also applied to straight lanes. It achieved a fast lane detection time of about 20 ms per frame.
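The cardinal spline step can be sketched as follows: given lane control points, it samples a smooth curve that passes through every one of them. The tension convention used here (tangent = (1 - tension) / 2 times the difference of the neighbouring points, so tension 0 is a Catmull-Rom spline), the end-point handling, and the sample control points are assumptions for illustration, not details from the paper.

```python
def cardinal_spline(points, tension=0.0, samples=8):
    """Sample a cardinal spline that passes through every control point."""
    if len(points) < 2:
        return [tuple(float(v) for v in p) for p in points]
    # Repeat the end points so the first and last segments have neighbours.
    padded = [points[0]] + list(points) + [points[-1]]
    scale = (1.0 - tension) / 2.0
    curve = []
    for i in range(1, len(padded) - 2):
        p0, p1, p2, p3 = padded[i - 1], padded[i], padded[i + 1], padded[i + 2]
        for step in range(samples):
            t = step / samples
            # Cubic Hermite basis functions.
            h00 = 2 * t**3 - 3 * t**2 + 1
            h10 = t**3 - 2 * t**2 + t
            h01 = -2 * t**3 + 3 * t**2
            h11 = t**3 - t**2
            curve.append(tuple(
                h00 * p1[k] + h10 * scale * (p2[k] - p0[k])
                + h01 * p2[k] + h11 * scale * (p3[k] - p1[k])
                for k in range(len(p1))
            ))
    # Close the curve on the last control point.
    curve.append(tuple(float(v) for v in points[-1]))
    return curve

# Hypothetical lane control points (x, y) in the bird's-eye-view image.
lane_points = [(100, 400), (110, 300), (135, 200), (180, 100)]
lane_curve = cardinal_spline(lane_points)
```

Because the spline interpolates its control points, every detected lane position lies exactly on the drawn curve, which is one reason this family of splines suits lane fitting.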

User Positioning Method Based on Image Similarity Comparison Using Single Camera (단일 카메라를 이용한 이미지 유사도 비교 기반의 사용자 위치추정)

  • Song, Jinseon;Hur, SooJung;Park, Yongwan;Choi, Jeonghee
    • The Journal of Korean Institute of Communications and Information Sciences / v.40 no.8 / pp.1655-1666 / 2015
  • In this paper, a user-position estimation method using a single camera is proposed for both indoor and outdoor environments. Conventionally, GPS-based and RF-based estimation methods have been widely studied for outdoor and indoor environments, respectively. Each method is useful in only one of the two environments. This study therefore adopts a vision-based approach that is applicable to both. Since distance or position cannot be extracted from a single still image, reference images pre-stored in an image database are used to identify the current position from a single still image captured by a single camera. Each reference image is tagged with the position at which it was captured. To find the reference image most similar to the current image, the SURF algorithm is used for feature extraction, and outliers among the extracted features are discarded using the RANSAC algorithm. The performance of the proposed method is evaluated for two buildings and their surroundings, covering both indoor and outdoor environments.
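The RANSAC outlier rejection mentioned above can be sketched with a deliberately simple motion model. This example assumes the matched points differ by a pure 2-D translation, which needs only one sample per hypothesis; a real system would more likely fit a homography or fundamental matrix. The function name, the threshold, and the data are hypothetical.

```python
import random

def ransac_translation(pairs, threshold=2.0, iterations=100, seed=0):
    """Return the largest set of point pairs that agree on one translation."""
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(iterations):
        # Hypothesis: the shift implied by one randomly chosen match.
        (ax, ay), (bx, by) = rng.choice(pairs)
        dx, dy = bx - ax, by - ay
        inliers = [
            pair for pair in pairs
            if abs(pair[1][0] - pair[0][0] - dx) <= threshold
            and abs(pair[1][1] - pair[0][1] - dy) <= threshold
        ]
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    return best_inliers

# Hypothetical matches: eight consistent with a (5, -3) shift, two wrong.
good = [((i, 2 * i), (i + 5, 2 * i - 3)) for i in range(8)]
bad = [((0, 0), (40, 40)), ((1, 1), (-30, 12))]
inliers = ransac_translation(good + bad)
```

The number of surviving inliers can then serve as a similarity score between the query image and each reference image.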

SIFT based Image Similarity Search using an Edge Image Pyramid and an Interesting Region Detection (윤곽선 이미지 피라미드와 관심영역 검출을 이용한 SIFT 기반 이미지 유사성 검색)

  • Yu, Seung-Hoon;Kim, Deok-Hwan;Lee, Seok-Lyong;Chung, Chin-Wan;Kim, Sang-Hee
    • Journal of KIISE:Databases / v.35 no.4 / pp.345-355 / 2008
  • Among the various shape descriptors, SIFT is widely used in computer vision applications such as object recognition, motion tracking, and 3D reconstruction. However, SIFT is not easy to apply directly to image similarity search because it uses many high-dimensional keypoint vectors. In this paper, we present a SIFT-based image similarity search method that uses an edge image pyramid and interesting region detection. The proposed method uses the edge image pyramid to extract keypoints that are invariant to image contrast, scale, and rotation, and uses the Hough transform to remove many unnecessary keypoints from the image. The proposed Hough transform can detect elliptical objects, so it can be used to find interesting regions. Experimental results demonstrate that the retrieval performance of the proposed method is about 20% better than that of traditional SIFT in average recall.

Analyzing and Solving GuessWhat?! (GuessWhat?! 문제에 대한 분석과 파훼)

  • Lee, Sang-Woo;Han, Cheolho;Heo, Yujung;Kang, Wooyoung;Jun, Jaehyun;Zhang, Byoung-Tak
    • Journal of KIISE / v.45 no.1 / pp.30-35 / 2018
  • GuessWhat?! is a game in which two machine players, a questioner and an answerer, ask and answer yes-no-N/A questions about an object in the image that is known only to the answerer, after which the questioner chooses the correct object. GuessWhat?! has received much attention in the field of deep learning and artificial intelligence as a testbed for cutting-edge research on the interplay of computer vision and dialogue systems. In this study, we discuss the objective function and characteristics of the GuessWhat?! game. In addition, we propose a simple rule-based solver for GuessWhat?!. Although a human needs four or five questions on average to solve this problem, the proposed method outperforms state-of-the-art deep learning methods using only two questions, and exceeds human performance using five questions.

Digital Mirror using Particle System based on Motion Detection (움직임 감지 기반의 파티클 시스템을 이용한 디지털 거울)

  • Lim, Chan;Yun, Jae-Sun
    • The Journal of the Korea Contents Association / v.11 no.11 / pp.62-69 / 2011
  • The development of sensing technology and the progress of digital media have created a new art genre called interactive media art. The digital mirror, which works through the convergence of computer vision technology and video art, presents a reconstructed visual image of the spectator through various media. Because of their aesthetic appeal and high accessibility to spectators, many types of digital mirrors have been introduced. However, most digital mirrors present visual images unrelated to the degree of the spectator's participation, which hinders the spectator's continued participation and interaction. This paper proposes a digital mirror driven by the spectator's movements, which are read by a particle system synchronized with a motion detection algorithm based on image difference analysis. The work extracts the spectator's movement data through image processing, designs a particle system that changes with this data, and renders the reconstructed image of the spectator.
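Motion detection by image difference, as described above, can be sketched in a few lines: compare two grayscale frames pixel by pixel and measure how much of the image changed. The threshold, the frame values, and the mapping from motion to a particle count (`particle_count`) are hypothetical choices for illustration, not the paper's design.

```python
def motion_mask(prev_frame, curr_frame, threshold=30):
    """Mark pixels whose grayscale value changed by more than the threshold."""
    return [
        [abs(c - p) > threshold for p, c in zip(prev_row, curr_row)]
        for prev_row, curr_row in zip(prev_frame, curr_frame)
    ]

def motion_ratio(mask):
    """Return the fraction of pixels marked as moving."""
    total = sum(len(row) for row in mask)
    moving = sum(cell for row in mask for cell in row)
    return moving / total if total else 0.0

def particle_count(ratio, max_particles=500):
    """Hypothetical mapping: more motion emits more particles."""
    return round(ratio * max_particles)

# Two tiny grayscale frames (values 0-255); the middle column changes.
prev_frame = [[10, 10, 10], [10, 10, 10]]
curr_frame = [[10, 200, 10], [10, 210, 15]]

mask = motion_mask(prev_frame, curr_frame)
emitted = particle_count(motion_ratio(mask))
```

Tying the particle count to the motion ratio is one way to make the display respond to how actively the spectator participates.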

A Low Cost 3D Skin Wrinkle Reconstruction System Based on Stereo Semi-Dense Matching (반 밀집 정합에 기반한 저가형 3차원 주름 데이터 복원)

  • Zhang, Qian;WhangBo, Taeg-Keun
    • Journal of Internet Computing and Services / v.10 no.4 / pp.25-33 / 2009
  • In this paper, we propose a new system for recovering 3D wrinkle data from stereo images. 3D reconstruction from stereo images or video is a popular research focus and has been applied to cultural heritage, buildings, and other scenes. Its goals are object measurement, scene depth calculation, and 3D data acquisition. Our research faces several challenges. First, it is hard to capture wrinkle images with full information because of lighting effects, the non-rigid nature of skin, and camera performance. We therefore design a dedicated computer vision system that captures wrinkle images with a long-length camera lens. Second, it is difficult to obtain dense stereo data because segmentation and corner detection are hard on skin texture images. We therefore focus on a semi-dense stereo matching algorithm for recovering wrinkle depth. Our system is much cheaper than a 3D scanner, and it is more flexible than physical-modeling-based methods while still achieving high performance.

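Once semi-dense matching has produced disparities, depth follows from the standard relation for a rectified stereo pair, Z = f * B / d. The sketch below assumes a rectified pair, a focal length in pixels, a baseline in millimetres, and `None` for pixels without a reliable match; these values and names are illustrative, not taken from the paper.

```python
def depth_from_disparity(disparity, focal_px, baseline_mm):
    """Convert a semi-dense disparity map to depth in millimetres.

    Pixels without a valid match (None or non-positive disparity) stay None.
    """
    return [
        [
            focal_px * baseline_mm / d if d is not None and d > 0 else None
            for d in row
        ]
        for row in disparity
    ]

# Hypothetical semi-dense map: only some pixels have a disparity.
disparity = [[25.0, 0.0], [None, 50.0]]
depth = depth_from_disparity(disparity, focal_px=1000.0, baseline_mm=50.0)
```

Depth is inversely proportional to disparity, so small disparity errors matter most for distant points; for close-range skin imaging the disparities are large and the depth resolution is correspondingly finer.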

A Study on Tracking a Moving Object using Photogrammetric Techniques - Focused on a Soccer Field Model - (사진측량기법을 이용한 이동객체 추적에 관한 연구 - 축구장 모형을 중심으로 -)

  • Bae, Sang-Keun;Kim, Byung-Guk;Jung, Jae-Seung
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.24 no.2 / pp.217-226 / 2006
  • Extracting and tracking objects are fundamental and important steps in digital image processing and computer vision, and many algorithms for them have been developed. In this research, a method is proposed for tracking a moving object with a pair of CCD cameras and calculating its coordinates. A 1/100 scale miniature of a soccer field was built to test the developed algorithms. Candidates were first selected from the acquired images using the RGB value of the moving object (a soccer ball), and the object was then extracted from among the candidates using its size (MBR size). The image coordinates of the moving object are then obtained. The real-time position of the moving object is tracked within the boundary of its expected motion, which is centered on the moving object. The 3D position of the moving object can be obtained by performing relative orientation, absolute orientation, and space intersection on the pair of CCD camera images.
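The two-stage extraction described above (colour candidates first, then a size check) might look like the following sketch. It assumes that MBR means minimum bounding rectangle, that candidates are 4-connected groups of pixels close to the target colour, and that the image is a nested list of RGB tuples; the function name, tolerance, and size limits are hypothetical.

```python
def find_object(image, target_rgb, tol, min_side, max_side):
    """Return bounding boxes of colour-matched regions with a plausible size."""
    rows, cols = len(image), len(image[0])
    # Stage 1: pixels whose colour is within tol of the target per channel.
    matches = {
        (r, c)
        for r in range(rows)
        for c in range(cols)
        if all(abs(image[r][c][k] - target_rgb[k]) <= tol for k in range(3))
    }
    boxes = []
    while matches:
        # Grow one 4-connected candidate region from an arbitrary seed.
        stack = [matches.pop()]
        component = []
        while stack:
            r, c = stack.pop()
            component.append((r, c))
            for nb in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
                if nb in matches:
                    matches.remove(nb)
                    stack.append(nb)
        # Stage 2: keep the candidate only if its bounding box fits the size.
        rs = [p[0] for p in component]
        cs = [p[1] for p in component]
        height = max(rs) - min(rs) + 1
        width = max(cs) - min(cs) + 1
        if min_side <= height <= max_side and min_side <= width <= max_side:
            boxes.append((min(rs), min(cs), max(rs), max(cs)))
    return boxes

# Hypothetical 6x6 image: a 2x2 white "ball" and one white noise pixel.
black, white = (0, 0, 0), (255, 255, 255)
image = [[black] * 6 for _ in range(6)]
for r, c in ((1, 1), (1, 2), (2, 1), (2, 2), (5, 5)):
    image[r][c] = white

ball_boxes = find_object(image, white, tol=10, min_side=2, max_side=3)
```

The size check is what removes isolated pixels and large same-coloured regions (such as field markings) that pass the colour test alone.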