• 제목/요약/키워드: Geometric computer vision

검색결과 68건 처리시간 0.024초

3D Shape Descriptor for Segmenting Point Cloud Data

  • Park, So Young;Yoo, Eun Jin;Lee, Dong-Cheon;Lee, Yong Wook
    • 한국측량학회지
    • /
    • 제30권6_2호
    • /
    • pp.643-651
    • /
    • 2012
  • Object recognition belongs to high-level processing that is one of the difficult and challenging tasks in computer vision. Digital photogrammetry based on the computer vision paradigm has begun to emerge in the middle of 1980s. However, the ultimate goal of digital photogrammetry - intelligent and autonomous processing of surface reconstruction - is not achieved yet. Object recognition requires a robust shape description about objects. However, most of the shape descriptors aim to apply 2D space for image data. Therefore, such descriptors have to be extended to deal with 3D data such as LiDAR(Light Detection and Ranging) data obtained from ALS(Airborne Laser Scanner) system. This paper introduces extension of chain code to 3D object space with hierarchical approach for segmenting point cloud data. The experiment demonstrates effectiveness and robustness of the proposed method for shape description and point cloud data segmentation. Geometric characteristics of various roof types are well described that will be eventually base for the object modeling. Segmentation accuracy of the simulated data was evaluated by measuring coordinates of the corners on the segmented patch boundaries. The overall RMSE(Root Mean Square Error) is equivalent to the average distance between points, i.e., GSD(Ground Sampling Distance).

DISCRETE TORSION AND NUMERICAL DIFFERENTIATION OF BINORMAL VECTOR FIELD OF A SPACE CURVE

  • Jeon, Myung-Jin
    • 한국수학교육학회지시리즈B:순수및응용수학
    • /
    • 제12권4호
    • /
    • pp.275-287
    • /
    • 2005
  • Geometric invariants are basic tools for geometric processing and computer vision. In this paper, we give a linear approximation for the differentiation of the binormal vector field of a space curve by using the forward and backward differences of discrete binormal vectors. Two kind of discrete torsion, say, back-ward torsion $T_b$ and forward torsion $T_f$ can be defined by the dot product of the (backward and forward) discrete differentiation of binormal vectors that are linear approximations of torsion. Using Frenet formula and Taylor series expansion, we give error estimations for the discrete torsions. We also give numerical tests for a curve. Notably the average of $T_b$ and $T_f$ looks more stable in errors.

  • PDF

A New Solution for Projective Reconstruction Based on Coupled Line Cameras

  • Lee, Joo-Haeng
    • ETRI Journal
    • /
    • 제35권5호
    • /
    • pp.939-942
    • /
    • 2013
  • We provide a new solution for the projective reconstruction problem based on coupled line cameras (CLCs) and their geometric properties. The proposed solution is composed of a series of optimized steps, and each step is more efficient than those of the initial solution proposed in [1]. We also give a new determinant condition for rectangle determination, which leads to less ambiguity in implementation. The key steps of the proposed solution can be represented with more compact analytic equations due to the intuitive geometric interpretations of the projective reconstruction problem based on CLCs: the center of projection corresponds to the intersection point of the two solution circles of each line camera involved.

컴퓨터 시각(視覺)에 의거한 측정기술(測定技術) 및 측정오차(測定誤差)의 분석(分析)과 보정(補正) (Computer Vision Based Measurement, Error Analysis and Calibration)

  • 황헌;이충호
    • Journal of Biosystems Engineering
    • /
    • 제17권1호
    • /
    • pp.65-78
    • /
    • 1992
  • When using a computer vision system for a measurement, the geometrically distorted input image usually restricts the site and size of the measuring window. A geometrically distorted image caused by the image sensing and processing hardware degrades the accuracy of the visual measurement and prohibits the arbitrary selection of the measuring scope. Therefore, an image calibration is inevitable to improve the measuring accuracy. A calibration process is usually done via four steps such as measurement, modeling, parameter estimation, and compensation. In this paper, the efficient error calibration technique of a geometrically distorted input image was developed using a neural network. After calibrating a unit pixel, the distorted image was compensated by training CMLAN(Cerebellar Model Linear Associator Network) without modeling the behavior of any system element. The input/output training pairs for the network was obtained by processing the image of the devised sampled pattern. The generalization property of the network successfully compensates the distortion errors of the untrained arbitrary pixel points on the image space. The error convergence of the trained network with respect to the network control parameters were also presented. The compensated image through the network was then post processed using a simple DDA(Digital Differential Analyzer) to avoid the pixel disconnectivity. The compensation effect was verified using known sized geometric primitives. A way to extract directly a real scaled geometric quantity of the object from the 8-directional chain coding was also devised and coded. Since the developed calibration algorithm does not require any knowledge of modeling system elements and estimating parameters, it can be applied simply to any image processing system. Furthermore, it efficiently enhances the measurement accuracy and allows the arbitrary sizing and locating of the measuring window. The applied and developed algorithms were coded as a menu driven way using MS-C language Ver. 6.0, PC VISION PLUS library functions, and VGA graphic functions.

  • PDF

Vision-based Real-time Lane Detection and Tracking for Mobile Robots in a Constrained Track Environment

  • Kim, Young-Ju
    • 한국컴퓨터정보학회논문지
    • /
    • 제24권11호
    • /
    • pp.29-39
    • /
    • 2019
  • 실생활에서의 모바일 로봇 응용이 증가하면서 저비용의 자율 주행 기능이 요구되고 있다. 본 논문은 모바일 로봇의 실내 주행 여건을 고려한 제한된 트랙을 가정하고, 제한된 트랙에서 모바일 로봇의 자율 주행을 지원하는 비젼 기반 실시간 차선 검출 및 추적 시스템을 제안한다. 다양한 형태의 차선 처리와 동작 파리미터의 사전 조정 등을 고려하여 다중 동작 모드를 가진 시스템 구조와 상태 기계를 설계하였으며, 파라미터 조정 모드에서 차선 두께의 기하학적 특성을 바탕으로 컬러 필터의 임계값을 동적으로 조정하고, 곡선 트랙의 불안정 입력 모드와 직선 트랙의 안정 입력 모드에서 차선의 기하학적 그리고 시간적 특성을 바탕으로 차선 특징 픽셀을 적응적으로 추출하고 최소제곱법으로 차선 모형을 추정한다. 추정된 차선 모형으로 트랙 중앙선을 산출하고 움직임 모형을 단순화시켜 선형 칼만 필터를 통해 추적한다. 주행 실험에서 저성능의 로봇 구성에서도 실시간 처리를 통해 제한된 트랙에서 정상적으로 자율 주행이 이루어짐을 확인하였다.

가우시안 잡음에서 변형된 LLAH 알고리즘의 성능 분석 (Performance Analysis of Modified LLAH Algorithm under Gaussian Noise)

  • 류호섭;박한훈
    • 한국멀티미디어학회논문지
    • /
    • 제18권8호
    • /
    • pp.901-908
    • /
    • 2015
  • Methods of detecting, describing, matching image features, like corners and blobs, have been actively studied as a fundamental step for image processing and computer vision applications. As one of feature description/matching methods, LLAH(Locally Likely Arrangement Hashing) describes image features based on the geometric relationship between their neighbors, and thus is suitable for scenes with poor texture. This paper presents a modified LLAH algorithm, which includes the image features themselves for robustly describing the geometric relationship unlike the original LLAH, and employes a voting-based feature matching scheme that makes feature description much simpler. Then, this paper quantitatively analyzes its performance with synthetic images in the presence of Gaussian noise.

손의 기하학적인 특성을 적용한 실시간 비전 기반 증강현실 시스템 (A Real-time Augmented Reality System using Hand Geometric Characteristics based on Computer Vision)

  • 최희선;정다운;최종수
    • 한국멀티미디어학회논문지
    • /
    • 제15권3호
    • /
    • pp.323-335
    • /
    • 2012
  • 본 논문에서는 손을 이용한 컴퓨터 비전 기술 기반의 증강 현실 시스템을 제안한다. 입력 영상에 가상의 물체를 정합하기 위해서는 정확한 특징점 추출과 추적 기술이 중요하다. 기존의 마커를 이용한 증강현실 시스템은 매우 안정성이 있지만 마커가 없이는 증강이 불가능하고 증강된 물체를 조작하는데 제한적인 상황을 제공하는 경향이 있다. 제안한 시스템은 손의 기하학적인 특성을 고려하여 적응적 최적 타원 검출 방법을 통해 손끝점을 특징점으로 추출한다. 그리고 손바닥 중심점을 기준으로 한 최단 거리 검출 방법을 이용하여 손끝점의 움직임을 추출해 가상의 객체를 안정적으로 정합한다. 실험을 통하여 특징점 추출이 약 82.0%의 정확도를 보였고 특징점 판별과 추적 성능 평가에서는 단지 약 1.8%와 2.0%의 오류를 보였다. 또한 제안한 시스템이 가상 객체의 안정적인 증강 측면에서, 효과적으로 카메라 사영 행렬을 획득하여 마커를 이용한 시스템을 대체할 수 있음을 확인하였다.

레이저 구조광을 이용한 3차원 컴퓨터 시각 형상정보 연속 측정 시스템 개발 (Development of the Computer Vision based Continuous 3-D Feature Extraction System via Laser Structured Lighting)

  • 임동혁;황헌
    • Journal of Biosystems Engineering
    • /
    • 제24권2호
    • /
    • pp.159-166
    • /
    • 1999
  • A system to extract continuously the real 3-D geometric fearture information from 2-D image of an object, which is fed randomly via conveyor has been developed. Two sets of structured laser lightings were utilized. And the laser structured light projection image was acquired using the camera from the signal of the photo-sensor mounted on the conveyor. Camera coordinate calibration matrix was obtained, which transforms 2-D image coordinate information into 3-D world space coordinate using known 6 points. The maximum error after calibration showed 1.5 mm within the height range of 103mm. The correlation equation between the shift amount of the laser light and the height was generated. Height information estimated after correlation showed the maximum error of 0.4mm within the height range of 103mm. An interactive 3-D geometric feature extracting software was developed using Microsoft Visual C++ 4.0 under Windows system environment. Extracted 3-D geometric feature information was reconstructed into 3-D surface using MATLAB.

  • PDF

Transformation Based Walking Speed Normalization for Gait Recognition

  • Kovac, Jure;Peer, Peter
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제7권11호
    • /
    • pp.2690-2701
    • /
    • 2013
  • Humans are able to recognize small number of people they know well by the way they walk. This ability represents basic motivation for using human gait as the means for biometric identification. Such biometric can be captured at public places from a distance without subject's collaboration, awareness or even consent. Although current approaches give encouraging results, we are still far from effective use in practical applications. In general, methods set various constraints to circumvent the influence factors like changes of view, walking speed, capture environment, clothing, footwear, object carrying, that have negative impact on recognition results. In this paper we investigate the influence of walking speed variation to different visual based gait recognition approaches and propose normalization based on geometric transformations, which mitigates its influence on recognition results. With the evaluation on MoBo gait dataset we demonstrate the benefits of using such normalization in combination with different types of gait recognition approaches.

표고 외관 특징점의 자동 추출 및 측정 (Automatic Extraction and Measurement of Visual Features of Mushroom (Lentinus edodes L.))

  • 황헌;이용국
    • 생물환경조절학회지
    • /
    • 제1권1호
    • /
    • pp.37-51
    • /
    • 1992
  • Quantizing and extracting visual features of mushroom(Lentinus edodes L.) are crucial to the sorting and grading automation, the growth state measurement, and the dried performance indexing. A computer image processing system was utilized for the extraction and measurement of visual features of front and back sides of the mushroom. The image processing system is composed of the IBM PC compatible 386DK, ITEX PCVISION Plus frame grabber, B/W CCD camera, VGA color graphic monitor, and image output RGB monitor. In this paper, an automatic thresholding algorithm was developed to yield the segmented binary image representing skin states of the front and back sides. An eight directional Freeman's chain coding was modified to solve the edge disconnectivity by gradually expanding the mask size of 3$\times$3 to 9$\times$9. A real scaled geometric quantity of the object was directly extracted from the 8-directional chain element. The external shape of the mushroom was analyzed and converted to the quantitative feature patterns. Efficient algorithms for the extraction of the selected feature patterns and the recognition of the front and back side were developed. The developed algorithms were coded in a menu driven way using MS_C language Ver.6.0, PC VISION PLUS library fuctions, and VGA graphic functions.

  • PDF