• Title/Summary/Keyword: Marker Recognition

Small Marker Detection with Attention Model in Robotic Applications (로봇시스템에서 작은 마커 인식을 하기 위한 사물 감지 어텐션 모델)

  • Kim, Minjae;Moon, Hyungpil
    • The Journal of Korea Robotics Society / v.17 no.4 / pp.425-430 / 2022
  • As robots are considered one of the mainstream drivers of digital transformation, robots with machine vision have become a major area of study, providing the ability to check what robots see and to make decisions based on it. However, it is difficult to find a small object in an image, mainly because of a weakness of most visual recognition networks: they are mostly convolutional neural networks, which usually consider only local features. We therefore build a model that considers not only local features but also global features. In this paper, we propose a method for detecting a small marker on an object using deep learning and an algorithm that considers global features by combining the Transformer's self-attention technique with a convolutional neural network. We suggest a self-attention model with a new definition of Query, Key, and Value so that the model learns global features, and a simplified equation that removes the position vector and classification token, which make the model heavy and slow. Finally, we show that our model achieves higher mAP than the state-of-the-art model YOLOR.
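
The abstract does not give the authors' redefinition of Query, Key, and Value, so the following is only a minimal sketch of plain scaled dot-product self-attention over the positions of a flattened CNN feature map, with no positional vector and no classification token. The identity Q/K/V projections, the function names, and the toy `feature_map` are assumptions made for illustration, not the paper's model.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(features):
    """Scaled dot-product self-attention.

    features: list of N vectors (one per spatial position), each of length d.
    Assumption: Q, K and V are the features themselves (identity projections);
    no positional encoding and no class token are added.
    """
    d = len(features[0])
    scale = math.sqrt(d)
    weights = []
    for query in features:
        scores = [sum(a * b for a, b in zip(query, key)) / scale
                  for key in features]
        weights.append(softmax(scores))
    # Each output vector is a weighted mix of ALL positions (global context).
    output = [[sum(w * value[j] for w, value in zip(row, features))
               for j in range(d)]
              for row in weights]
    return output, weights


# Hypothetical 2x2 feature map with 2 channels, flattened to 4 positions.
feature_map = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
attended, attn = self_attention(feature_map)
```

Every output position mixes information from all other positions, which is the global-feature property the abstract contrasts with the local receptive field of a convolution.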

Performance improvement for marker-less object recognition through OpenCV mobile library (모바일 기반 OpenCV 라이브러리를 이용한 마커리스 객체 인식 성능 향상)

  • Jung, Hyeon-Sub;Yin, Xiyuan;Kim, Shin-Dug
    • Proceedings of the Korean Society of Computer Information Conference / 2013.07a / pp.61-64 / 2013
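  • This paper proposes software-level methods for improving the performance of marker-less object recognition using the mobile OpenCV library. After running tests with an existing marker-less algorithm, we analyze the factors that degrade performance and present an appropriate solution for each situation. The improvements fall into three broad categories: program code improvement, marker-less algorithm code improvement, and sensor-assisted performance improvement. For program code improvement, we analyze the test results, optimize the function that consumes the most execution time, and limit the number of feature points to an optimal value. For marker-less algorithm code improvement, we rewrite the code with parallel processing techniques, only on mobile devices that support parallel processing. Finally, sensor-assisted improvement compensates for the loss of quality that occurs when real-time work units are processed in batches. The proposed methods are shown to be effective in improving performance for mobile real-time augmented reality services.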

Development of Multi Card Touch based Interactive Arcade Game System (멀티 카드 터치기반 인터랙티브 아케이드 게임 시스템 구현)

  • Lee, Dong-Hoon;Jo, Jae-Ik;Yun, Tae-Soo
    • Journal of Korea Entertainment Industry Association / v.5 no.2 / pp.87-95 / 2011
  • Recently, tangible game environments have drawn attention owing to the development of various interactive interfaces. In this paper, we propose a multi-card, touch-based interactive arcade system that uses a marker recognition interface and a multi-touch interaction interface. In our system, each card's location and orientation are recognized through a DI-based recognition algorithm. In addition, the user's hand gesture tracking information is provided through various interaction metaphors. The system gives users higher engagement and a new experience. Our system can therefore be used in tangible arcade game machines.

The Bullet Launcher with A Pneumatic System to Detect Objects by Unique Markers

  • Jasmine Aulia;Zahrah Radila;Zaenal Afif Azhary;Aulia M. T. Nasution;Detak Yan Pratama;Katherin Indriawati;Iyon Titok Sugiarto;Wildan Panji Tresna
    • Journal of information and communication convergence engineering / v.21 no.3 / pp.252-260 / 2023
  • A bullet launcher can be developed as a smart instrument, especially for use in the military sector, that can track, identify, detect, mark, lock onto, and shoot a target by implementing an image-processing system. In this research, the application of an object recognition system, laser encoding as a unique marker, 2-dimensional movement, and a pneumatic shooter has been studied intensively. The results showed that the object recognition system could detect various colors, patterns, sizes, and laser blinking. The average error of the object distance measured with the camera is ±4%, ±5%, and ±6% for circle, square, and triangle shapes, respectively. Meanwhile, the average accuracy of shots on objects is 95.24% and 85.71% in indoor and outdoor conditions, respectively. The average prototype response time is 1.11 s. Moreover, the highest shooting accuracy, 98.32%, was obtained at 50 cm.

Gesture Recognition Algorithm by Analyzing Direction Change of Trajectory (궤적의 방향 변화 분석에 의한 제스처 인식 알고리듬)

  • Park Jahng-Hyon;Kim Minsoo
    • Journal of the Korean Society for Precision Engineering / v.22 no.4 / pp.121-127 / 2005
  • As intelligent robots come into widespread use, communication between them and human beings becomes necessary. Gesture recognition is currently being studied as a means of better communication. According to previous research, however, gesture recognition appears to require not only complicated algorithms but also a separate training process to achieve high recognition rates. This study suggests a gesture recognition algorithm based on a computer vision system that is relatively simple and more efficient in recognizing various human gestures. After tracing the hand gesture using a marker, direction changes of the gesture trajectory were analyzed to determine a simple gesture code that carries the minimal information needed for recognition. A map is developed to recognize gestures that can be expressed with different gesture codes. Through the use of numerical and geometrical trajectories, the advantages and disadvantages of the suggested algorithm were determined.
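
The idea of reducing a marker trajectory to a code of direction changes can be sketched as follows. This is a hedged illustration only: the 8-direction quantization, the function name `direction_code`, and the sample trajectory are assumptions, and the abstract does not describe the authors' actual coding scheme or gesture map.

```python
import math


def direction_code(points, num_directions=8):
    """Reduce a trajectory to the sequence of its direction changes.

    points: list of (x, y) marker positions in tracking order.
    Each segment is quantized into one of `num_directions` sectors
    (0 = +x, counting counter-clockwise); consecutive repeats are
    collapsed so that only changes of direction remain.
    """
    sector = 2 * math.pi / num_directions
    codes = []
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        dx, dy = x1 - x0, y1 - y0
        if dx == 0 and dy == 0:
            continue  # marker did not move; no direction to encode
        angle = math.atan2(dy, dx) % (2 * math.pi)
        code = int((angle + sector / 2) // sector) % num_directions
        if not codes or codes[-1] != code:
            codes.append(code)
    return codes


# Hypothetical "L"-shaped gesture: move along +x, then along +y.
l_shape = [(0, 0), (1, 0), (2, 0), (2, 1), (2, 2)]
gesture = direction_code(l_shape)  # -> [0, 2]
```

A lookup table from code sequences to gesture names would play the role of the map mentioned in the abstract; several code sequences can point to the same gesture.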

Automatic Processing of Predicative Nouns for Korean Semantic Recognition. (한국어 의미역 인식을 위한 서술성 명사의 자동처리 연구)

  • Lee, Sukeui;Im, Su-Jong
    • Korean Linguistics / v.80 / pp.151-175 / 2018
  • This paper proposes a method of semantic recognition to improve the extraction of correct answers in a Q&A system through machine learning. For this purpose, the semantic recognition method is described based on the distribution of predicative nouns. Predicative noun vocabularies and sentences were collected from Wikipedia documents. The predicative nouns are classified into types by analyzing the environments in which they appear in sentences. This paper proposes a semantic recognition method for predicative nouns to which rules can be applied. Chapter 2 reviews previous studies on predicative nouns. Chapter 3 explains how predicative nouns are distributed. In this paper, all predicative nouns that cannot be processed by rules are excluded; accordingly, predicative noun forms combined with the case marker '의' were excluded. In Chapter 4, we extracted 728 sentences composed of 10,575 words from Wikipedia. A semantic analysis engine tool from ETRI was used to present the predicative nouns that can be handled in semantic recognition.

A Study on the Straight Path Prediction Technology of White LED Marker-based AGV in Indoor Environment (실내 환경에서 White LED 마커 기반 무인 운반차의 직진경로 예측 기술 연구)

  • Woo, Deok gun;vinayagam, Mariappan;Kim, Young min;Cha, Jae sang
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.17 no.5 / pp.48-54 / 2018
  • With the era of the Fourth Industrial Revolution, smart factories are emerging. In the era of multi-product, small-scale production, the use of unmanned transportation vehicles that carry and arrange goods in the work space is rapidly increasing. Conventional unmanned vehicles detect their position using the guided-line method and the position-based method for indoor location recognition and movement. These methods have the disadvantages of high initial cost and maintenance. In this paper, to address these disadvantages, a method of predicting the straight path of the unmanned vehicle through a Kalman filter is verified using the white LED markers of the warehouse, together with the position data and image data from white LED marker recognition. Through this, the reliability of straight-line movement, which accounts for most movement in the lattice structure, is secured. It is also expected that the reliance on additional position sensors will be reduced.
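
Straight-path prediction with a Kalman filter can be sketched as a one-dimensional constant-velocity filter fed with marker-derived positions. The abstract does not state the authors' state model or noise settings, so the state layout (position, velocity), the variances, and the function names below are assumptions for illustration only.

```python
def kalman_track(measurements, dt=1.0, process_var=1e-4, meas_var=0.25):
    """1-D constant-velocity Kalman filter.

    measurements: positions along the straight path (e.g. derived from
    recognized LED markers). Returns the final (position, velocity) estimate.
    """
    pos, vel = measurements[0], 0.0
    # Covariance matrix P = [[p00, p01], [p10, p11]].
    p00, p01, p10, p11 = 1.0, 0.0, 0.0, 1.0
    for z in measurements[1:]:
        # Predict: x = F x, P = F P F^T + Q, with F = [[1, dt], [0, 1]].
        pos += dt * vel
        n00 = p00 + dt * (p01 + p10) + dt * dt * p11 + process_var
        n01 = p01 + dt * p11
        n10 = p10 + dt * p11
        n11 = p11 + process_var
        p00, p01, p10, p11 = n00, n01, n10, n11
        # Update with a position-only measurement, H = [1, 0].
        innovation_var = p00 + meas_var
        k0, k1 = p00 / innovation_var, p10 / innovation_var
        residual = z - pos
        pos += k0 * residual
        vel += k1 * residual
        # P = (I - K H) P
        n00 = (1 - k0) * p00
        n01 = (1 - k0) * p01
        n10 = p10 - k1 * p00
        n11 = p11 - k1 * p01
        p00, p01, p10, p11 = n00, n01, n10, n11
    return pos, vel


def predict_ahead(pos, vel, steps, dt=1.0):
    """Extrapolate the straight path `steps` intervals into the future."""
    return pos + vel * steps * dt


# Hypothetical noise-free track: the vehicle advances 2 units per step.
track = [2.0 * t for t in range(30)]
est_pos, est_vel = kalman_track(track)
next_pos = predict_ahead(est_pos, est_vel, steps=1)
```

Between marker sightings, `predict_ahead` supplies the expected position, which is how a filter of this kind could reduce reliance on additional position sensors.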

A Study on Object Control in Mobile Augmented Reality Using Indoor Location Based Service (실내 위치기반 서비스를 이용한 모바일 증강현실에서의 객체 제어에 관한 연구)

  • Yoon, Chang-Pyo;Lee, Hae-Jun;Lee, Dae-Sung
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2017.05a / pp.288-290 / 2017
  • Recently, interest in and demand for augmented reality (AR) content have been increasing. Generally, when AR content is served in an outdoor environment, position information from the GPS signal is used to control the display of objects on the AR screen, or a marker based on the image of the object is used. However, this location information cannot be used in an indoor environment. If the service is provided using only markers, marker recognition becomes unstable because of moving obstacles in the vicinity. In addition, information displayed on the AR screen is not fixed at a specific position but moves with the movement of the camera. In this paper, we study an object control method for displaying objects on the AR screen using iBeacon-based indoor location recognition together with specific markers.

Color Recognition and Phoneme Pattern Segmentation of Hangeul Using Augmented Reality (증강현실을 이용한 한글의 색상 인식과 자소 패턴 분리)

  • Shin, Seong-Yoon;Choi, Byung-Seok;Rhee, Yang-Won
    • Journal of the Korea Society of Computer and Information / v.15 no.6 / pp.29-35 / 2010
  • As inexpensive video equipment becomes prevalent and the uses of video diversify, augmented reality can add supplementary images to real-world images and video. Although many augmented reality techniques have appeared recently, attempts at accurate character recognition are still under way. In this paper, characters are recognized through visual marker recognition, and the character color that matches the marker color is found. The recognized characters are then shown on the screen. By applying a phoneme pattern segmentation algorithm based on horizontal projection, we propose segmenting the phonemes to match the six types of Hangul representation. The experiments on phoneme segmentation using augmented reality show the result at each step, and the detection rate was found to be above 90%.
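
Segmentation by horizontal projection can be illustrated with a small sketch: count the foreground pixels in each row of a binarized character image and cut wherever the count drops to zero. The function names, the `min_pixels` threshold, and the toy image are assumptions; the abstract does not specify how the authors map the resulting bands onto the six Hangul layout types.

```python
def horizontal_projection(binary_image):
    """Number of foreground (1) pixels in each row of a binary image."""
    return [sum(row) for row in binary_image]


def split_rows(binary_image, min_pixels=1):
    """Return (top, bottom) row ranges separated by empty rows.

    A row belongs to a band when it holds at least `min_pixels`
    foreground pixels; gaps between bands are candidate cut lines
    between vertically stacked character components.
    """
    profile = horizontal_projection(binary_image)
    bands, start = [], None
    for y, count in enumerate(profile):
        if count >= min_pixels and start is None:
            start = y
        elif count < min_pixels and start is not None:
            bands.append((start, y - 1))
            start = None
    if start is not None:
        bands.append((start, len(profile) - 1))
    return bands


# Hypothetical 5x4 glyph: an upper component, a gap, and a lower stroke.
glyph = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]
bands = split_rows(glyph)  # -> [(0, 1), (3, 3)]
```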

Evaluation of Marker Images based on Analysis of Feature Points for Effective Augmented Reality (효과적인 증강현실 구현을 위한 특징점 분석 기반의 마커영상 평가 방법)

  • Lee, Jin-Young;Kim, Jongho
    • Journal of the Korea Academia-Industrial cooperation Society / v.20 no.9 / pp.49-55 / 2019
  • This paper presents a marker image evaluation method based on analysis of object distribution in images and classification of images with repetitive patterns for effective marker-based augmented reality (AR) system development. We measure the variance of feature point coordinates to distinguish marker images that are vulnerable to occlusion, since object distribution affects object tracking performance according to partial occlusion in the images. Moreover, we propose a method to classify images suitable for object recognition and tracking based on the fact that the distributions of descriptor vectors among general images and repetitive-pattern images are significantly different. Comprehensive experiments for marker images confirm that the proposed marker image evaluation method distinguishes images vulnerable to occlusion and repetitive-pattern images very well. Furthermore, we suggest that scale-invariant feature transform (SIFT) is superior to speeded up robust features (SURF) in terms of object tracking in marker images. The proposed method provides users with suitability information for various images, and it helps AR systems to be realized more effectively.
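
The variance measurement described in this abstract can be sketched as below: feature points bunched in one region give a small coordinate variance, which suggests that partially covering that region would hide most of them. The normalization by image size, the `min_ratio` threshold, and the sample points are assumptions for illustration, not values from the paper.

```python
from statistics import pvariance


def coordinate_variance(points):
    """Population variance of the x and y coordinates of feature points."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return pvariance(xs), pvariance(ys)


def is_occlusion_prone(points, width, height, min_ratio=0.02):
    """Flag a marker image whose feature points are tightly clustered.

    Variances are normalized by the squared image dimensions so that the
    (hypothetical) threshold `min_ratio` does not depend on resolution.
    """
    var_x, var_y = coordinate_variance(points)
    return var_x / width ** 2 < min_ratio or var_y / height ** 2 < min_ratio


# Hypothetical feature points detected in two 640x480 marker images.
clustered = [(100, 100), (102, 101), (101, 103), (99, 98)]
spread = [(50, 40), (600, 60), (80, 430), (590, 440), (320, 240)]

clustered_flag = is_occlusion_prone(clustered, 640, 480)  # -> True
spread_flag = is_occlusion_prone(spread, 640, 480)        # -> False
```

In practice the points would come from a detector such as SIFT or SURF, the two descriptors the abstract compares.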