• 제목/요약/키워드: Object Pose

검색결과 205건 처리시간 0.027초

Object detection in financial reporting documents for subsequent recognition

  • Sokerin, Petr;Volkova, Alla;Kushnarev, Kirill
    • International journal of advanced smart convergence
    • /
    • 제10권1호
    • /
    • pp.1-11
    • /
    • 2021
  • Document page segmentation is an important step in building a quality optical character recognition module. The study examined already existing work on the topic of page segmentation and focused on the development of a segmentation model that has greater functional significance for application in an organization, as well as broad capabilities for managing the quality of the model. The main problems of document segmentation were highlighted, which include a complex background of intersecting objects. As classes for detection, not only classic text, table and figure were selected, but also additional types, such as signature, logo and table without borders (or with partially missing borders). This made it possible to pose a non-trivial task of detecting non-standard document elements. The authors compared existing neural network architectures for object detection based on published research data. The most suitable architecture was RetinaNet. To ensure the possibility of quality control of the model, a method based on neural network modeling using the RetinaNet architecture is proposed. During the study, several models were built, the quality of which was assessed on the test sample using the Mean average Precision metric. The best result among the constructed algorithms was shown by a model that includes four neural networks: the focus of the first neural network on detecting tables and tables without borders, the second - seals and signatures, the third - pictures and logos, and the fourth - text. As a result of the analysis, it was revealed that the approach based on four neural networks showed the best results in accordance with the objectives of the study on the test sample in the context of most classes of detection. The method proposed in the article can be used to recognize other objects. A promising direction in which the analysis can be continued is the segmentation of tables; the areas of the table that differ in function will act as classes: heading, cell with a name, cell with data, empty cell.

ASM의 성능향상을 위한 형태 정렬 방식 제안 (Proposing Shape Alignment for an Improved Active Shape Model)

  • 한희일
    • 한국멀티미디어학회논문지
    • /
    • 제15권1호
    • /
    • pp.63-70
    • /
    • 2012
  • 본 논문에서는 ASM(active shape model)의 성능을 향상시키기 위하여 형태(shape) 정렬 방법과 이차원 특징벡터 추출 방법을 제안한다. 기존 알고리즘은 입력 이미지의 중간 검출 랜드마크와 기준 모델 간의 정렬을 위하여 스케일, 회전, 이동 정보 만을 이용한다. 하지만 위의 평면적인 정보 만으로는 얼굴과 같이 입체적인 물체의 포즈 변화나 삼차원적인 움직임 등을 제대로 반영할 수 없다. 이를 개선하기 위하여 자유도를 증가시킴으로써 형태의 복잡한 변화에 보다 강인한 형태정렬 방식을 제안한다. 또한, 멀티스케일로 이차원 프로파일을 구하고 이들의 공분산 행렬을 trimming하여 검출속도를 향상시키는 방법을 제안한다. 비교적 다양한 포즈로 촬영한 얼굴 이미지 데이터베이스를 이용하여 제안 알고리즘의 형태 검출 성능을 확인한다.

다층 뉴럴네트워크를 이용한 애자 스탠드에서의 볼트 구멍의 중심위치 인식 (Recognition of the Center Position of Bolt Hole in the Stand of Insulator Using Multilayer Neural Network)

  • 안경관;표성만
    • 제어로봇시스템학회논문지
    • /
    • 제9권4호
    • /
    • pp.304-309
    • /
    • 2003
  • Uninterrupted power supply has become indispensable during the maintenance task of active electric power lines as a result of today's highly information-oriented society and increasing demand of electric utilities. The maintenance task has the risk of electric shock and the danger of falling from high place. Therefore it is necessary to realize an autonomous robot system. In order to realize these tasks autonomously, the three dimensional position of target object such as electric line and the stand of insulator must be recognized accurately and rapidly. The approaching of an insulator and the wrenching of a nut task is selected as the typical task of the maintenance of active electric power distribution lines in this paper. Image recognition by multilayer neural network and optimal target position calculation method are newly proposed in order to recognize the center 3 dimensional position of the bolt hole in the stand of insulator. By the proposed image recognition method, it is proved that the center 3 dimensional position of the bolt hole can be recognized rapidly and accurately without regard to the pose of the stand of insulator. Finally the approaching and wrenching task is automatically realized using 6-link electro-hydraulic manipulators.

Absolute Positioning System for Mobile Robot Navigation in an Indoor Environment (ICCAS 2004)

  • Yun, Jae-Mu;Park, Jin-Woo;Choi, Ho-Seek;Lee, Jang-Myung
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2004년도 ICCAS
    • /
    • pp.1448-1451
    • /
    • 2004
  • Position estimation is one of the most important functions for the mobile robot navigating in the unstructured environment. Most of previous localization schemes estimate current position and pose of mobile robot by applying various localization algorithms with the information obtained from sensors which are set on the mobile robot, or by recognizing an artificial landmark attached on the wall, or objects of the environment as natural landmark in the indoor environment. Several drawbacks about them have been brought up. To compensate the drawbacks, a new localization method that estimates the absolute position of the mobile robot by using a fixed camera on the ceiling in the corridor is proposed. And also, it can improve the success rate for position estimation using the proposed method, which calculates the real size of an object. This scheme is not a relative localization, which decreases the position error through algorithms with noisy sensor data, but a kind of absolute localization. The effectiveness of the proposed localization scheme is demonstrated through the experiments.

  • PDF

Multiple-Shot Person Re-identification by Features Learned from Third-party Image Sets

  • Zhao, Yanna;Wang, Lei;Zhao, Xu;Liu, Yuncai
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제9권2호
    • /
    • pp.775-792
    • /
    • 2015
  • Person re-identification is an important and challenging task in computer vision with numerous real world applications. Despite significant progress has been made in the past few years, person re-identification remains an unsolved problem. This paper presents a novel appearance-based approach to person re-identification. The approach exploits region covariance matrix and color histograms to capture the statistical properties and chromatic information of each object. Robustness against low resolution, viewpoint changes and pose variations is achieved by a novel signature, that is, the combination of Log Covariance Matrix feature and HSV histogram (LCMH). In order to further improve re-identification performance, third-party image sets are utilized as a common reference to sufficiently represent any image set with the same type. Distinctive and reliable features for a given image set are extracted through decision boundary between the specific set and a third-party image set supervised by max-margin criteria. This method enables the usage of an existing dataset to represent new image data without time-consuming data collection and annotation. Comparisons with state-of-the-art methods carried out on benchmark datasets demonstrate promising performance of our method.

마커 없는 증강현실을 위한 실시간 카메라 추적 (Real-Time Camera Tracking for Markerless Augmented Reality)

  • 오주현;손광훈
    • 방송공학회논문지
    • /
    • 제16권4호
    • /
    • pp.614-623
    • /
    • 2011
  • 본 논문에서는 방송용 증강현실 시스템을 위한 실시간 카메라 추적 알고리듬을 제안한다. SURF(speeded up robust features) 알고리듬을 이용하여 추적을 초기화하며, 안정적인 실시간 카메라 추적을 위해 다층(multi-scale) 구조를 사용한다. 미리 알려져 있지 않고 시간에 따라 변하는 조명 환경에서의 특징 추적을 위해 정규상호상관도(normalized cross correlation, NCC)를 사용한다. 방송제작에는 줌 렌즈를 장착한 카메라가 사용되기 때문에 카메라의 초점거리를 온라인으로 추정할 필요가 있다. 카메라의 회전과 이동으로 이루어진 외부 포즈(pose) 변수와 함께 내부 변수인 초점거리를 목적함수에 포함시켜 함께 최적화한다. 실험결과는 제안한 온라인 카메라 보정 기법에 의해 카메라의 초점거리가 정확히 구해지는 것을 보여준다.

가상 현실 어플리케이션을 위한 관성과 시각기반 하이브리드 트래킹 (Hybrid Inertial and Vision-Based Tracking for VR applications)

  • 구재필;안상철;김형곤;김익재;구열회
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 2003년도 학술회의 논문집 정보 및 제어부문 A
    • /
    • pp.103-106
    • /
    • 2003
  • In this paper, we present a hybrid inertial and vision-based tracking system for VR applications. One of the most important aspects of VR (Virtual Reality) is providing a correspondence between the physical and virtual world. As a result, accurate and real-time tracking of an object's position and orientation is a prerequisite for many applications in the Virtual Environments. Pure vision-based tracking has low jitter and high accuracy but cannot guarantee real-time pose recovery under all circumstances. Pure inertial tracking has high update rates and full 6DOF recovery but lacks long-term stability due to sensor noise. In order to overcome the individual drawbacks and to build better tracking system, we introduce the fusion of vision-based and inertial tracking. Sensor fusion makes the proposal tracking system robust, fast, accurate, and low jitter and noise. Hybrid tracking is implemented with Kalman Filter that operates in a predictor-corrector manner. Combining bluetooth serial communication module gives the system a full mobility and makes the system affordable, lightweight energy-efficient. and practical. Full 6DOF recovery and the full mobility of proposal system enable the user to interact with mobile device like PDA and provide the user with natural interface.

  • PDF

KNOWLEDGE-BASED BOUNDARY EXTRACTION OF MULTI-CLASSES OBJECTS

  • Park, Hae-Chul;Shin, Ho-Chul;Lee, Jin-Sung;Cho, Ju-Hyun;Kim, Seong-Dae
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2003년도 하계종합학술대회 논문집 Ⅳ
    • /
    • pp.1968-1971
    • /
    • 2003
  • We propose a knowledge-based algorithm for extracting an object boundary from low-quality image like the forward looking infrared image. With the multi-classes training data set, the global shape is modeled by multispace KL(MKL)[1] and curvature model. And the objective function for fitting the deformable boundary template represented by the shape model to true boundary in an input image is formulated by Bales rule. Simulation results show that our method has more accurateness in case of multi-classes training set and performs better in the sense of computation cost than point distribution model(PDM)[2]. It works well in distortion under the noise, pose variation and some kinds of occlusions.

  • PDF

Dynamic Manipulation of a Virtual Object in Marker-less AR system Based on Both Human Hands

  • Chun, Jun-Chul;Lee, Byung-Sung
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제4권4호
    • /
    • pp.618-632
    • /
    • 2010
  • This paper presents a novel approach to control the augmented reality (AR) objects robustly in a marker-less AR system by fingertip tracking and hand pattern recognition. It is known that one of the promising ways to develop a marker-less AR system is using human's body such as hand or face for replacing traditional fiducial markers. This paper introduces a real-time method to manipulate the overlaid virtual objects dynamically in a marker-less AR system using both hands with a single camera. The left bare hand is considered as a virtual marker in the marker-less AR system and the right hand is used as a hand mouse. To build the marker-less system, we utilize a skin-color model for hand shape detection and curvature-based fingertip detection from an input video image. Using the detected fingertips the camera pose are estimated to overlay virtual objects on the hand coordinate system. In order to manipulate the virtual objects rendered on the marker-less AR system dynamically, a vision-based hand control interface, which exploits the fingertip tracking for the movement of the objects and pattern matching for the hand command initiation, is developed. From the experiments, we can prove that the proposed and developed system can control the objects dynamically in a convenient fashion.

대학 교양환경 교육자료의 개발과 적용에 관한 연구 (A Study on the Development and Application of Environmental Education Program in Liberal Arts.)

  • 성정희
    • 한국환경교육학회지:환경교육
    • /
    • 제15권1호
    • /
    • pp.1-17
    • /
    • 2002
  • The aim of this study is to establish an object of environmental education in liberal arts, and to develop a teaming program and search for the most effective environmental teaching method. At first this study analyzed the current situations and problems of the present environmental education in the liberal arts. As a result of this analysis, I found that, most of environmental educations have been conducted mainly by an approach of natural science, inevitably they should have limits in which students can't have holistic view in solve the environmental problem. Due to the fact that, many students were attending lectures, teaching methods were limited in the forms lecture and video tapes. As I applied educational programs with various teaching methods for students in order to change cognition and value toward environment, I found that there was no significant difference of cognition even after applying the programs. This may be interpreted as, most students already had very sound and sustainable environmental view. But some programs with teaching method using role play, debate, cyber-debate lead students to have interest in environments, thus actively participating in the class. These methods, taking into consideration, the hundreds of enrolled students, seem to pose a problem in actual application. The most important matter is, how to develop a cognition and value toward environment into environmental behavior. Therefore, in the future, aim is to study what determines the factors for causing environmental behavior from a cognition and value of the environment, and a development of programs in this regard will be necessary.

  • PDF