• Title/Summary/Keyword: Object Recognition Technology

Search Result 471, Processing Time 0.028 seconds

Building Information-rich Maps for Intuitive Human Interface Using Networked Knowledge Base

  • Ryu, Jae-Kwan;Kanayama, Chie;Chong, Nak-Young
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2005.06a
    • /
    • pp.1887-1891
    • /
    • 2005
  • Despite significant advances in multimedia transferring technologies in various fields of robotics, it is sometimes quite difficult for the operator to fully understand the context of 3D remote environments from 2D image feedback. Particularly, in the remote control of mobile robots, the recognition of the object associated with the task is very important, because the operator has to control the robot safely in various situations not through trial and error. Therefore, it is necessary to provide the operator with 3D volumetric models of the object and object-related information as well such as locations, shape, size, material properties, and so on. Thus, in this paper, we propose a vision-based human interface system that provides an interactive, information-rich map through network-based information brokering. The system consists of an object recognition part, a 3D map building part, a networked knowledge base part, and a control part of the mobile robot.

  • PDF

Recognition and Tracking of Moving Objects Using Label-merge Method Based on Fuzzy Clustering Algorithm (퍼지 클러스터링 알고리즘 기반의 라벨 병합을 이용한 이동물체 인식 및 추적)

  • Lee, Seong Min;Seong, Il;Joo, Young Hoon
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.67 no.2
    • /
    • pp.293-300
    • /
    • 2018
  • We propose a moving object extraction and tracking method for improvement of animal identification and tracking technology. First, we propose a method of merging separated moving objects into a moving object by using FCM (Fuzzy C-Means) clustering algorithm to solve the problem of moving object loss caused by moving object extraction process. In addition, we propose a method of extracting data from a moving object and a method of counting moving objects to determine the number of clusters in order to satisfy the conditions for performing FCM clustering algorithm. Then, we propose a method to continuously track merged moving objects. In the proposed method, color histograms are extracted from feature information of each moving object, and the histograms are continuously accumulated so as not to react sensitively to noise or changes, and the average is obtained and stored. Thereafter, when a plurality of moving objects are overlapped and separated, the stored color histogram is compared with each other to correctly recognize each moving object. Finally, we demonstrate the feasibility and applicability of the proposed algorithms through some experiments.

Recognition of Bill Form using Feature Pyramid Network (FPN(Feature Pyramid Network)을 이용한 고지서 양식 인식)

  • Kim, Dae-Jin;Hwang, Chi-Gon;Yoon, Chang-Pyo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.4
    • /
    • pp.523-529
    • /
    • 2021
  • In the era of the Fourth Industrial Revolution, technological changes are being applied in various fields. Automation digitization and data management are also in the field of bills. There are more than tens of thousands of forms of bills circulating in society and bill recognition is essential for automation, digitization and data management. Currently in order to manage various bills, OCR technology is used for character recognition. In this time, we can increase the accuracy, when firstly recognize the form of the bill and secondly recognize bills. In this paper, a logo that can be used as an index to classify the form of the bill was recognized as an object. At this time, since the size of the logo is smaller than that of the entire bill, FPN was used for Small Object Detection among deep learning technologies. As a result, it was possible to reduce resource waste and increase the accuracy of OCR recognition through the proposed algorithm.

Character Recognition and Search for Media Editing (미디어 편집을 위한 인물 식별 및 검색 기법)

  • Park, Yong-Suk;Kim, Hyun-Sik
    • Journal of Broadcast Engineering
    • /
    • v.27 no.4
    • /
    • pp.519-526
    • /
    • 2022
  • Identifying and searching for characters appearing in scenes during multimedia video editing is an arduous and time-consuming process. Applying artificial intelligence to labor-intensive media editing tasks can greatly reduce media production time, improving the creative process efficiency. In this paper, a method is proposed which combines existing artificial intelligence based techniques to automate character recognition and search tasks for video editing. Object detection, face detection, and pose estimation are used for character localization and face recognition and color space analysis are used to extract unique representation information.

Object Recognition Technology using LiDAR Sensor for Obstacle Detection of Agricultural Autonomous Robot (LiDAR 센서 활용 객체 인식기술이 적용된 농업용 자율주행 이송 로봇 개발)

  • Kim, Jong-Sil;Ju, Yeong-Tae;Kim, Eung-Kon
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.16 no.3
    • /
    • pp.565-570
    • /
    • 2021
  • Agriculture in South Korea is losing productivity due to the lack of manpower as aging population increases. To overcome this, the agricultural robot market is growing rapidly, and research is being conducted on remote control and autonomous driving of agricultural robots. This work designs the appearance and structure of agricultural robots and implements the devices and control systems for driving. By utilizing and optimizing LiDAR sensors, we applied object recognition technology, which is an essential function for autonomous driving. This can reduce labor costs and improve productivity of transportation tasks that require the most labor in agriculture.

Development of Low-Cost Vision-based Eye Tracking Algorithm for Information Augmented Interactive System

  • Park, Seo-Jeon;Kim, Byung-Gyu
    • Journal of Multimedia Information System
    • /
    • v.7 no.1
    • /
    • pp.11-16
    • /
    • 2020
  • Deep Learning has become the most important technology in the field of artificial intelligence machine learning, with its high performance overwhelming existing methods in various applications. In this paper, an interactive window service based on object recognition technology is proposed. The main goal is to implement an object recognition technology using this deep learning technology to remove the existing eye tracking technology, which requires users to wear eye tracking devices themselves, and to implement an eye tracking technology that uses only usual cameras to track users' eye. We design an interactive system based on efficient eye detection and pupil tracking method that can verify the user's eye movement. To estimate the view-direction of user's eye, we initialize to make the reference (origin) coordinate. Then the view direction is estimated from the extracted eye pupils from the origin coordinate. Also, we propose a blink detection technique based on the eye apply ratio (EAR). With the extracted view direction and eye action, we provide some augmented information of interest without the existing complex and expensive eye-tracking systems with various service topics and situations. For verification, the user guiding service is implemented as a proto-type model with the school map to inform the location information of the desired location or building.

Object detection technology trend and development direction using deep learning

  • Kwak, NaeJoung;Kim, DongJu
    • International Journal of Advanced Culture Technology
    • /
    • v.8 no.4
    • /
    • pp.119-128
    • /
    • 2020
  • Object detection is an important field of computer vision and is applied to applications such as security, autonomous driving, and face recognition. Recently, as the application of artificial intelligence technology including deep learning has been applied in various fields, it has become a more powerful tool that can learn meaningful high-level, deeper features, solving difficult problems that have not been solved. Therefore, deep learning techniques are also being studied in the field of object detection, and algorithms with excellent performance are being introduced. In this paper, a deep learning-based object detection algorithm used to detect multiple objects in an image is investigated, and future development directions are presented.

Proposal for License Plate Recognition Using Synthetic Data and Vehicle Type Recognition System (가상 데이터를 활용한 번호판 문자 인식 및 차종 인식 시스템 제안)

  • Lee, Seungju;Park, Gooman
    • Journal of Broadcast Engineering
    • /
    • v.25 no.5
    • /
    • pp.776-788
    • /
    • 2020
  • In this paper, a vehicle type recognition system using deep learning and a license plate recognition system are proposed. In the existing system, the number plate area extraction through image processing and the character recognition method using DNN were used. These systems have the problem of declining recognition rates as the environment changes. Therefore, the proposed system used the one-stage object detection method YOLO v3, focusing on real-time detection and decreasing accuracy due to environmental changes, enabling real-time vehicle type and license plate character recognition with one RGB camera. Training data consists of actual data for vehicle type recognition and license plate area detection, and synthetic data for license plate character recognition. The accuracy of each module was 96.39% for detection of car model, 99.94% for detection of license plates, and 79.06% for recognition of license plates. In addition, accuracy was measured using YOLO v3 tiny, a lightweight network of YOLO v3.

Improvement of self-mixing semiconductor laser range finder and its application to range-image recognition of slowly moving object

  • Suzuki, Takashi;Shinohara, Shigenobu;Yoshida, Hirofumi;Ikeda, Hiroaki;Saitoh, Yasuhiro;Nishide, Ken-Ichi;Sumi, Masao
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1992.10b
    • /
    • pp.388-393
    • /
    • 1992
  • An infrared range finder using a self-mixing laser diode (SM-LD), which has been proposed and developed by the Authors, can measure not only a range of a moving target but its velocity simultaneously. In this paper, described is that the precise mode-hop pulse train can be obtained by employing a new signal processing circuit even when the backscattered light returning into the SM-LD is much more weaker. As a result, the distance to a tilted square sheet made from aluminium or white paper, which is placed 10 cm through 60 cm from the SM-LD, is measured with accuracy of a few percent even when the tilting angle is less than 75 degrees or 85 degrees, respectively. And in this paper, described is the range-image recognition of a plane object under the condition of standstill. The output laser beam is scanned by scanning two plane mirrors-equipped with each stepping motor. And we succeeded in the acquisition of the range-image of a plane object in a few tens of seconds. Furthermore, described is a feasibility study about the range-image recognition of a slowly moving plane object.

  • PDF

Analyzing DNN Model Performance Depending on Backbone Network (백본 네트워크에 따른 사람 속성 검출 모델의 성능 변화 분석)

  • Chun-Su Park
    • Journal of the Semiconductor & Display Technology
    • /
    • v.22 no.2
    • /
    • pp.128-132
    • /
    • 2023
  • Recently, with the development of deep learning technology, research on pedestrian attribute recognition technology using deep neural networks has been actively conducted. Existing pedestrian attribute recognition techniques can be obtained in such a way as global-based, regional-area-based, visual attention-based, sequential prediction-based, and newly designed loss function-based, depending on how pedestrian attributes are detected. It is known that the performance of these pedestrian attribute recognition technologies varies greatly depending on the type of backbone network that constitutes the deep neural networks model. Therefore, in this paper, several backbone networks are applied to the baseline pedestrian attribute recognition model and the performance changes of the model are analyzed. In this paper, the analysis is conducted using Resnet34, Resnet50, Resnet101, Swin-tiny, and Swinv2-tiny, which are representative backbone networks used in the fields of image classification, object detection, etc. Furthermore, this paper analyzes the change in time complexity when inferencing each backbone network using a CPU and a GPU.

  • PDF