• Title/Summary/Keyword: vision-based recognition

Search Result 633, Processing Time 0.031 seconds

A method of improving the quality of 3D images acquired from RGB-depth camera (깊이 영상 카메라로부터 획득된 3D 영상의 품질 향상 방법)

  • Park, Byung-Seo;Kim, Dong-Wook;Seo, Young-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.5
    • /
    • pp.637-644
    • /
    • 2021
  • In general, in the fields of computer vision, robotics, and augmented reality, the importance of 3D space and 3D object detection and recognition technology has emerged. In particular, since it is possible to acquire RGB images and depth images in real time through an image sensor using Microsoft Kinect method, many changes have been made to object detection, tracking and recognition studies. In this paper, we propose a method to improve the quality of 3D reconstructed images by processing images acquired through a depth-based (RGB-Depth) camera on a multi-view camera system. In this paper, a method of removing noise outside an object by applying a mask acquired from a color image and a method of applying a combined filtering operation to obtain the difference in depth information between pixels inside the object is proposed. Through each experiment result, it was confirmed that the proposed method can effectively remove noise and improve the quality of 3D reconstructed image.

Three-Dimensional Convolutional Vision Transformer for Sign Language Translation (수어 번역을 위한 3차원 컨볼루션 비전 트랜스포머)

  • Horyeor Seong;Hyeonjoong Cho
    • The Transactions of the Korea Information Processing Society
    • /
    • v.13 no.3
    • /
    • pp.140-147
    • /
    • 2024
  • In the Republic of Korea, people with hearing impairments are the second-largest demographic within the registered disability community, following those with physical disabilities. Despite this demographic significance, research on sign language translation technology is limited due to several reasons including the limited market size and the lack of adequately annotated datasets. Despite the difficulties, a few researchers continue to improve the performacne of sign language translation technologies by employing the recent advance of deep learning, for example, the transformer architecture, as the transformer-based models have demonstrated noteworthy performance in tasks such as action recognition and video classification. This study focuses on enhancing the recognition performance of sign language translation by combining transformers with 3D-CNN. Through experimental evaluations using the PHOENIX-Wether-2014T dataset [1], we show that the proposed model exhibits comparable performance to existing models in terms of Floating Point Operations Per Second (FLOPs).

Monovision Charging Terminal Docking Method for Unmanned Automatic Charging of Autonomous Mobile Robots (자율이동로봇의 무인 자동 충전을 위한 모노비전 방식의 충전단자 도킹 방법)

  • Keunho Park;Juhwan Choi;Seonhyeong Kim;Dongkil Kang;Haeseong Jo;Joonsoo Bae
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.47 no.3
    • /
    • pp.95-103
    • /
    • 2024
  • The diversity of smart EV(electric vehicle)-related industries is increasing due to the growth of battery-based eco-friendly electric vehicle component material technology, and labor-intensive industries such as logistics, manufacturing, food, agriculture, and service have invested in and studied automation for a long time. Accordingly, various types of robots such as autonomous mobile robots and collaborative robots are being utilized for each process to improve industrial engineering such as optimization, productivity management, and work management. The technology that should accompany this unmanned automobile industry is unmanned automatic charging technology, and if autonomous mobile robots are manually charged, the utility of autonomous mobile robots will not be maximized. In this paper, we conducted a study on the technology of unmanned charging of autonomous mobile robots using charging terminal docking and undocking technology using an unmanned charging system composed of hardware such as a monocular camera, multi-joint robot, gripper, and server. In an experiment to evaluate the performance of the system, the average charging terminal recognition rate was 98%, and the average charging terminal recognition speed was 0.0099 seconds. In addition, an experiment was conducted to evaluate the docking and undocking success rate of the charging terminal, and the experimental results showed an average success rate of 99%.

A Study on Swarm Robot-Based Invader-Enclosing Technique on Multiple Distributed Object Environments

  • Ko, Kwang-Eun;Park, Seung-Min;Park, Jun-Heong;Sim, Kwee-Bo
    • Journal of Electrical Engineering and Technology
    • /
    • v.6 no.6
    • /
    • pp.806-816
    • /
    • 2011
  • Interest about social security has recently increased in favor of safety for infrastructure. In addition, advances in computer vision and pattern recognition research are leading to video-based surveillance systems with improved scene analysis capabilities. However, such video surveillance systems, which are controlled by human operators, cannot actively cope with dynamic and anomalous events, such as having an invader in the corporate, commercial, or public sectors. For this reason, intelligent surveillance systems are increasingly needed to provide active social security services. In this study, we propose a core technique for intelligent surveillance system that is based on swarm robot technology. We present techniques for invader enclosing using swarm robots based on multiple distributed object environment. The proposed methods are composed of three main stages: location estimation of the object, specified object tracking, and decision of the cooperative behavior of the swarm robots. By using particle filter, object tracking and location estimation procedures are performed and a specified enclosing point for the swarm robots is located on the interactive positions in their coordinate system. Furthermore, the cooperative behaviors of the swarm robots are determined via the result of path navigation based on the combination of potential field and wall-following methods. The results of each stage are combined into the swarm robot-based invader-enclosing technique on multiple distributed object environments. Finally, several simulation results are provided to further discuss and verify the accuracy and effectiveness of the proposed techniques.

Augmented Reality Interface Using Efficient Hand Gesture Recognition (효율적인 손동작 인식을 이용한 증강현실 인터페이스)

  • Choi, Jun-Yeong;Park, Han-Hoon;Park, Jong-Il
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.91-96
    • /
    • 2008
  • 증강현실(Augmented Reality)을 위한 효과적인 비전 기반 인터페이스 개발은 꾸준히 진행되어 왔으나, 대부분 환경적 제약을 받거나, 특수한 장비 혹은 복잡한 모델을 요구한다. 예를 들어, 마커를 이용하면 구현 상의 편의성과 정확성을 보장하지만, 일반적으로 마커는 환경과 대비되는 모양을 가지기 때문에, 사용자에게 거부감을 줄 수 있으며 무엇보다 복잡한 인터랙션에는 적용되기 힘들다. 한편, 손동작을 이용할 경우, 자연스럽고 다양한 인터랙션을 수행할 수 있지만, 색을 이용한 손동작 인식은 복잡한 환경에서 인식률이 크게 저하되고, 3 차원 모델 기반의 손동작 인식은 많은 연산량을 필요로 한다는 문제점을 가진다. 이로 인해 지금까지 제안된 방법을 증강현실 시스템에 적용하는 데는 한계가 있다. 본 논문에서는 기본적으로 손동작을 이용한 인터페이스를 제안하는데, 손동작 인식을 위한 알고리즘을 효율적으로 개선함으로써, 복잡한 환경에서 적은 연산량으로 자연스러운 인터랙션을 제공하고자 한다. 제안방법은 손목에 컬러 밴드를 착용하고, 색 정보를 이용하여 손을 포함하는 최소 영역을 용이하게 검출함으로써, 손 동작 인식률이 좋아지도록 하였다. 제안된 인터페이스는 손의 자연스러운 움직임을 감지해서 손의 모양과 동작에 따라서 가상의 물체를 자연스럽게 제어할 수 있도록 해 준다. 예를 들어, 손이 지정한 위치에 가상의 물체를 나타내고, 가상의 물체를 잡고 다양한 조작을 하는 등의 제어를 할 수 있다. 다양한 환경에서의 실험 및 사용자 평가를 통해 제안된 인터페이스의 유용성을 검증하였다.

  • PDF

Motion-Recognizing Game Controller with Tactile Feedback (동작인식 및 촉감제공 게임 컨트롤러)

  • Jeon, Seok-Hee;Kim, Sang-Ki;Park, Gun-Hyuk;Han, Gab-Jong;Lee, Sung-Kil;Choi, Seung-Moon;Choi, Seung-Jin;Eoh, Hong-Jun
    • 한국HCI학회:학술대회논문집
    • /
    • 2008.02a
    • /
    • pp.1-6
    • /
    • 2008
  • This paper proposes a game controller that provides user motion input and tactile feedback display, in addition to the traditional button-type input. The device utilizes both an accelerometer and an infrared camera in order to estimate 3D position and to recognize user motion. The information from the accelerometer and the camera are integrated for better performance. Various tactile sensations are presented using a voice-coil type vibrator. We apply the proposed controller to a motion-based game and validate its usability.

  • PDF

A Study of Evaluation of the Feature from Cooccurrence Matrix and Appropriate Applicable Resolution

  • Seo, Byoung-Jun;Kwon, Oh-Hyoung;Kim, Yong-Il
    • Proceedings of the KSRS Conference
    • /
    • 1999.11a
    • /
    • pp.8-12
    • /
    • 1999
  • Since the advent of high resolution satellite image, possibilities of applying various human interpretation mechanism to these images have increased. Also many studies about these possibilities in many fields such as computer vision, pattern recognition, artificial intellegence and remote sensing have been done. In this field of these studies, texture is defined as a kind of quantity related to spatial distribution of brightness and tone and also plays an important role for interpretation of images. Especially, methods of obtaining texture by statistical model have been studied intensively. Among these methods, texture measurement method based on cooccurrence matrix is highly estimated because it is easy to calculate texture features compared with other methods. In addition, these results in high classification accuracy when this is applied to satellite images and aerial photos. But in the existing studies using cooccurrence matrix, features have been chosen arbitrarily without considering feature variation. And not enough studies have been implemented for appropriate resolution selection in which cooccurrence matrix can extract texture. Therefore, this study reviews the concept of cooccurrence matrix as a texture measurement method, evaluates usefulness of several features obtained from cooccurrence matrix, and proposes appropriate resolution by investigating variance trend of several features.

  • PDF

A Study of Long-term Development Plan of Korea Research Institute for Library and Information ("도서관연구소" 중장기 발전방안 연구)

  • Yoon, Hee-Yoon
    • Journal of Korean Library and Information Science Society
    • /
    • v.39 no.2
    • /
    • pp.5-27
    • /
    • 2008
  • The purpose of this paper Is to present a long-term development plan for the KRILI(Korea Research Institute for Library and Information). To do so, authors analysed a various library research institutes and related organizations in U.S.A., U.K., Czech Republic, Hungary, and Japan. And using the SWOT analysis, author identified and described the current status and also surveyed recognition of library and information science faculties as to desirable roles and phase of the KRILI. Based on the results of analysis and survey, this paper suggested a long-term plan(vision and objectives, strategic issues, desirable phase and organization system, growth and development model, Internal cooperative operating systems and external research cooperation system, etc.) of the KRILI.

  • PDF

Development of automatic die bonder system for semiconductor parts assembly (반도체 소자용 자동 die bonding system의 개발)

  • 변증남;오상록;서일홍;유범재;안태영;김재옥
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1988.10a
    • /
    • pp.353-359
    • /
    • 1988
  • In this paper, the design and implementation of a multi-processor based die bonder machine for the semiconductor will be described. This is a final research results carried out for two years from June, 1986 to July, 1988. The mechanical system consists of three subsystems such as bonding head module, wafer feeding module, and lead frame feeding module. The overall control system consists of the following three subsystems each of which employs a 16 bit microprocessor MC 68000 : (i) supervisory control system, (ii) visual recognition / inspection system and (iii) the display system. Specifically, the supervisory control system supervises the whole sequence of die bonder machine, performs a self-diagnostics while it controls the bonding head module according to the prespecified bonding cycle. The vision system recognizes the die to inspect the die quality and deviation / orientation of a die with respect to a reference position, while it controls the wafer feeding module. Finally, the display system performs a character display, image display ans various error messages to communicate with operator. Lead frame feeding module is controlled by this subsystem. It is reported that the proposed control system were applied to an engineering sample and tested in real-time, and the results are sucessful as an engineering sample phase.

  • PDF

A Design of Color-identifying Multi Vehicle Controller for Material Delivery Using Adaptive Fuzzy Controller (적응 퍼지제어기를 이용한 컬러식별 Multi Vehicle의 물류이송을 위한 다중제어기 설계)

  • Kim, Hun-Mo
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.18 no.5
    • /
    • pp.42-49
    • /
    • 2001
  • In This paper, we present a collaborative method for material delivery using a distributed vehicle agents system. Generally used AGV(Autonomous Guided Vehicle) systems in FA(Factory Automation) require extraordinary facilities like guidepaths and landmarks and have numerous limitations for application in different environments. Moreover in the case of controlling multi vehicles, the necessity for developing corporation abilities like loading and unloading materials between vehicles including different types is increasing nowadays for automation of material flow. Thus to compensate and improve the functions of AGV, it is important to endow vehicles with the intelligence to recognize environments and goods and to determine the goal point to approach. In this study we propose an interaction method between hetero-type vehicles and adaptive fuzzy logic controllers for sensor-based path planning methods and material identifying methods which recognizes color. For the purpose of carrying materials to the goal, simple color sensor is used instead of intricate vision system to search for material and recognize its color in order to determine the goal point to transfer it to. The technique for the proposed method will be demonstrated by experiment.

  • PDF