• Title/Summary/Keyword: Shape Recognition Algorithm


A Study on Development and Application of Real Time Vision Algorithm for Inspection Process Automation (검사공정 자동화를 위한 실시간 비전알고리즘 개발 및 응용에 관한 연구)

  • Back, Seung-Hak;Hwang, Won-Jun;Shin, Haeng-Bong;Choi, Young-Sik;Park, Dae-Yeong
    • Journal of the Korean Society of Industry Convergence / v.19 no.1 / pp.42-49 / 2016
  • This study proposes a robot vision system, based on non-contact inspection technology, for detecting faults in weld condition and part shape. The main focus is real-time automatic inspection of machined parts while the robot is moving. For this purpose, an automatic test instrument inspects precision components using the vision system. Pattern recognition is applied to the condition and appearance of precision-machined parts to distinguish good parts from defective ones. To realize an integrated real-time automation system for the precision-parts manufacturing process, a robot vision system was designed for the integrated system controller, and its reliability was verified through experiments. The paper concludes that robot vision technology for non-contact inspection of precision components and machine parts is useful for factory automation (FA).

Pill Identification Algorithm Based on Deep Learning Using Imprinted Text Feature (음각 정보를 이용한 딥러닝 기반의 알약 식별 알고리즘 연구)

  • Seon Min, Lee;Young Jae, Kim;Kwang Gi, Kim
    • Journal of Biomedical Engineering Research / v.43 no.6 / pp.441-447 / 2022
  • In this paper, we propose a pill identification model that uses engraved-text features together with image features such as shape and color, and compare it with an identification model that does not use engraved-text features, to verify whether improving the recognition rate of the engraved text can improve identification performance. The data consisted of 100 classes with 10 images per class. The engraved-text features were acquired through Keras OCR based on deep learning and a 1D CNN, and the image features were acquired through a 2D CNN. According to the identification results, the accuracy of the text recognition model was 90%. The accuracies of the comparative model and the proposed model were 91.9% and 97.6%, respectively. The accuracy, precision, recall, and F1-score of the proposed model were better than those of the comparative model, with statistical significance. As a result, we confirmed that expanding the range of features improved the performance of the identification model.
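
The abstract above reports accuracy, precision, recall, and F1-score for a 100-class problem. As a minimal sketch of how such figures can be computed for a multi-class classifier, the following uses macro-averaging. The averaging scheme, the function name `macro_metrics`, and the toy labels are assumptions made for illustration; the abstract does not say how the authors averaged per-class scores.

```python
from collections import Counter


def macro_metrics(y_true, y_pred):
    """Return (accuracy, macro precision, macro recall, macro F1).

    Macro-averaging is an assumption here; the abstract does not
    state which averaging scheme was used.
    """
    classes = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    accuracy = sum(tp.values()) / len(y_true)

    precisions, recalls, f1s = [], [], []
    for c in classes:
        p = tp[c] / (tp[c] + fp[c]) if tp[c] + fp[c] else 0.0
        r = tp[c] / (tp[c] + fn[c]) if tp[c] + fn[c] else 0.0
        f1 = 2 * p * r / (p + r) if p + r else 0.0
        precisions.append(p)
        recalls.append(r)
        f1s.append(f1)

    n = len(classes)
    return accuracy, sum(precisions) / n, sum(recalls) / n, sum(f1s) / n


if __name__ == "__main__":
    # Hypothetical labels for two pill classes, "a" and "b".
    print(macro_metrics(["a", "a", "b", "b"], ["a", "b", "b", "b"]))
```

Macro-averaging weights every class equally, which is a reasonable default when each class has the same number of images, as in the dataset described above.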

An Improved Approach for 3D Hand Pose Estimation Based on a Single Depth Image and Haar Random Forest

  • Kim, Wonggi;Chun, Junchul
    • KSII Transactions on Internet and Information Systems (TIIS) / v.9 no.8 / pp.3136-3150 / 2015
  • Vision-based 3D tracking of an articulated human hand is one of the major issues in human-computer interaction applications and in understanding the control of robot hands. This paper presents an improved approach for tracking and recovering the 3D position and orientation of a human hand using the Kinect sensor. The basic idea of the proposed method is to solve an optimization problem that minimizes the discrepancy in 3D shape between an actual hand observed by Kinect and a hypothesized 3D hand model. Since each 3D hand pose has 23 degrees of freedom, minimizing the 3D shape discrepancy between an observed hand and a 3D hand model imposes an excessive computational burden on hand articulation tracking. To address this, we first created a 3D hand model that represents the hand with 17 different parts. Second, a Random Forest classifier was trained on synthetic depth images generated by animating the developed 3D hand model, and was then used for Haar-like feature-based classification rather than per-pixel classification. The classification results were used to estimate the joint positions of the hand skeleton. Through experiments, we showed that the proposed method improves hand part recognition rates and runs at 20-30 fps. The results confirmed its practical use in classifying hand areas, and it successfully tracked and recovered the 3D hand pose in real time.
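
The abstract above mentions Haar-like feature-based classification on depth images. As a rough illustration of what a Haar-like feature is, the sketch below computes a two-rectangle response from an integral image (summed-area table). The function names, the feature layout, and the toy depth patch are assumptions; the paper's actual feature set and Random Forest configuration are not given in the abstract and are not reproduced here.

```python
def integral_image(img):
    """Summed-area table with an extra leading row and column of zeros."""
    height, width = len(img), len(img[0])
    ii = [[0] * (width + 1) for _ in range(height + 1)]
    for y in range(height):
        row_sum = 0
        for x in range(width):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, top, left, height, width):
    """Sum of the pixels in a rectangle, using four table lookups."""
    bottom, right = top + height, left + width
    return ii[bottom][right] - ii[top][right] - ii[bottom][left] + ii[top][left]


def haar_two_rect_horizontal(ii, top, left, height, width):
    """Left-half sum minus right-half sum; width is assumed to be even."""
    half = width // 2
    left_sum = rect_sum(ii, top, left, height, half)
    right_sum = rect_sum(ii, top, left + half, height, half)
    return left_sum - right_sum


if __name__ == "__main__":
    # Hypothetical 2x4 depth patch with a near region (1) and a far region (5).
    depth = [[1, 1, 5, 5],
             [1, 1, 5, 5]]
    ii = integral_image(depth)
    print(haar_two_rect_horizontal(ii, 0, 0, 2, 4))
```

Because each rectangle sum costs four lookups regardless of its size, responses like this are cheap to evaluate, which is consistent with the real-time frame rates the abstract reports.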

Design of Computer Vision Interface by Recognizing Hand Motion (손동작 인식에 의한 컴퓨터 비전 인터페이스 설계)

  • Yun, Jin-Hyun;Lee, Chong-Ho
    • Journal of the Institute of Electronics Engineers of Korea CI / v.47 no.3 / pp.1-10 / 2010
  • As various interfacing devices for computers are being developed, a new HCI method using hand motion input is introduced. This interface is a vision-based approach that uses a single camera to detect and track hand movements. In previous research, only skin color was used to detect and track the hand location. In our design, however, skin color and shape information are considered together, which improves hand detection. We propose a primary orientation edge descriptor for obtaining edge information. This method uses only one hand model, so no training time is needed. For efficient processing, the system consists of a detecting part and a tracking part. In the tracking part, the system is quite robust to the orientation of the hand. The system is applied to recognizing handwritten numbers in script style using the DNAC algorithm. The proposed algorithm reaches a recognition rate of 82% in detecting the hand region and 90% in recognizing a written number in script style.

A Study on Lip-reading Enhancement Using Time-domain Filter (시간영역 필터를 이용한 립리딩 성능향상에 관한 연구)

  • 신도성;김진영;최승호
    • The Journal of the Acoustical Society of Korea / v.22 no.5 / pp.375-382 / 2003
  • Bimodal lip-reading aims to improve the speech recognition rate in noisy environments. Detecting the correct lip image is the most important step. Stable performance is hard to achieve in a dynamic environment, however, because many factors degrade lip-reading performance: illumination changes, the speaker's pronunciation habits, variation in lip shape, and rotation or size changes of the lips. In this paper, we propose IIR filtering in the time domain for stable performance. Digital filtering in the time domain is well suited to removing speech noise and improving recognition performance. While lip-reading on the whole lip image produces a massive amount of data, applying Principal Component Analysis (PCA) as a preprocessing step reduces the data quantity by extracting features without loss of image information. To observe speech recognition performance using image information only, we ran a recognition experiment on 22 words chosen from those usable in an in-car service. A Hidden Markov Model was used as the recognition algorithm to compare recognition performance on these words. As a result, while the recognition rate of lip-reading using PCA is 64%, applying the time-domain filter to lip-reading raises the recognition rate to 72.4%.
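
The abstract above proposes IIR filtering in the time domain to stabilize lip features. As a minimal sketch of the idea, the following applies a first-order IIR low-pass filter, y[n] = alpha * x[n] + (1 - alpha) * y[n-1], to each feature channel across video frames. The filter order, the coefficient `alpha`, and the function names are assumptions; the abstract does not specify the filter design the authors used.

```python
def iir_lowpass(sequence, alpha=0.5):
    """First-order IIR low-pass: y[n] = alpha*x[n] + (1 - alpha)*y[n-1].

    The first output is initialised to the first input sample.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    out = []
    prev = None
    for x in sequence:
        prev = x if prev is None else alpha * x + (1.0 - alpha) * prev
        out.append(prev)
    return out


def smooth_features(frames, alpha=0.5):
    """Filter each feature channel over time.

    `frames` is a list of per-frame feature vectors, for example
    PCA coefficients of the lip image (a hypothetical input here).
    """
    if not frames:
        return []
    channels = [iir_lowpass(channel, alpha) for channel in zip(*frames)]
    return [list(frame) for frame in zip(*channels)]


if __name__ == "__main__":
    # Three frames with two feature channels each (toy values).
    print(smooth_features([[0, 2], [1, 2], [1, 2]], alpha=0.5))
```

A smaller `alpha` smooths more strongly but also lags further behind real lip movement, so the value would need tuning against recognition accuracy.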

Real Time Multiple Vehicle Detection Using Neural Network with Local Orientation Coding and PCA

  • Kang, Jeong-Gwan;Oh, Se-Young
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2003.09a / pp.636-639 / 2003
  • In this paper, we present a robust method for detecting other vehicles from a forward-looking CCD camera in a moving vehicle. The system uses edge and shape information to detect other vehicles. The algorithm consists of three steps: lane detection, vehicle candidate generation, and vehicle verification. First, after detecting the lane by template matching, we divide the road into three parts: left lane, front lane, and right lane. Second, we set the region of interest (ROI) using the lane position information and extract a vehicle candidate from the ROI. Third, we use the local orientation coding (LOC) edge image of the vehicle candidate as input to a pretrained neural network for vehicle recognition. Experimental results from highway scenes show the robustness and effectiveness of this method.

A Study on Automated Outer Diameter Measurement System for Axisymmetric Automotive Part (자동차용 축대칭 형상 부품 외경 자동측정시스템에 관한 연구)

  • Ban, Kap-Soo;Bae, Jun-Young
    • Journal of the Korean Society of Manufacturing Process Engineers / v.12 no.3 / pp.61-68 / 2013
  • An automatic measurement system is required because various factors in manual systems increase production cycle time and cost. This paper presents a machine-vision-based prototype measurement system for axisymmetric automotive parts, which are generally measured manually and which have very tight tolerances on each machined surface. The measurement system moves the optical lens along the profile of the part to minimize measurement cycle time. The main interest of this paper is the development of an optimal algorithm for measuring the outside diameter of the parts that can be applied to various hardware combinations. The whole system is implemented on Windows XP and its corresponding environment.

Projected Local Binary Pattern based Two-Wheelers Detection using Adaboost Algorithm

  • Lee, Yeunghak;Kim, Taesun;Shim, Jaechang
    • Journal of Multimedia Information System / v.1 no.2 / pp.119-126 / 2014
  • We propose a system for detecting people riding bicycles, based on a modified projected local binary pattern (PLBP), for vision-based intelligent vehicles. The projection method is robust to rotation and reduces the dimensionality of the original image. Local binary pattern (LBP) features are fast to compute and simple to implement for object recognition and texture classification. Moreover, we use uniform patterns to remove noise. This paper proposes a modified LBP method and a projection vector whose weights vary according to the local shape and area in the image. Our system also retains the simplicity of the traditional formulation while being more discriminative. Experimental results show that a detection system for people riding bicycles and motorcycles based on the proposed PLBP features achieves a higher detection accuracy than traditional features.
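
The abstract above builds on local binary patterns and uniform patterns. As a minimal sketch of those two building blocks only, the following computes the standard 8-neighbour LBP code of a pixel, tests whether a code is uniform (at most two 0/1 transitions around the circle), and accumulates a 59-bin histogram with one shared bin for non-uniform codes. The function names and the toy image are assumptions, and the paper's projection step and local weighting are not reproduced here because the abstract does not define them.

```python
def lbp_code(img, y, x):
    """8-neighbour LBP code of an interior pixel (clockwise from top-left)."""
    center = img[y][x]
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    code = 0
    for bit, (dy, dx) in enumerate(offsets):
        if img[y + dy][x + dx] >= center:
            code |= 1 << bit
    return code


def is_uniform(code):
    """True if the circular 8-bit pattern has at most two bit transitions."""
    rotated = ((code >> 1) | ((code & 1) << 7)) & 0xFF
    return bin(code ^ rotated).count("1") <= 2


# The uniform 8-bit codes each get their own histogram bin.
UNIFORM_CODES = [c for c in range(256) if is_uniform(c)]
BIN_OF = {code: index for index, code in enumerate(UNIFORM_CODES)}


def uniform_lbp_histogram(img):
    """Histogram over uniform codes, plus one final bin for all other codes."""
    other_bin = len(UNIFORM_CODES)
    hist = [0] * (other_bin + 1)
    for y in range(1, len(img) - 1):
        for x in range(1, len(img[0]) - 1):
            hist[BIN_OF.get(lbp_code(img, y, x), other_bin)] += 1
    return hist


if __name__ == "__main__":
    # Hypothetical 3x3 patch: bright top and right, dark bottom and left.
    patch = [[5, 5, 5],
             [1, 3, 5],
             [1, 1, 1]]
    print(lbp_code(patch, 1, 1), is_uniform(lbp_code(patch, 1, 1)))
```

Pooling all non-uniform codes into one bin is what gives uniform LBP its noise tolerance: irregular patterns, which are often caused by noise, no longer spread across many bins.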

Design of Edge Class for Digital Image Processing (디지털 영상 처리를 위한 에지 클래스의 설계)

  • 이강호;안용학;김학춘
    • Journal of the Korea Society of Computer and Information / v.9 no.2 / pp.49-56 / 2004
  • In this paper, we design an edge class that can process digital images effectively. Edges carry important information, including points of shape information, for object detection or recognition in a digital image. It is therefore very important to manage edges effectively after edge detection so that they can be used in a variety of ways in digital image processing. Environments that use existing digital image processing systems are limited in usability and speed. We design an edge class that can manage detected edges, and we analyze existing methods according to their edge detection algorithm.

Advanced Technologies in Blockchain, Machine Learning, and Big Data

  • Park, Ji Su;Park, Jong Hyuk
    • Journal of Information Processing Systems / v.16 no.2 / pp.239-245 / 2020
  • Blockchain, machine learning, and big data are among the key components of the future IT track. These technologies are used in various fields; hence their increasing application. This paper discusses the technologies developed in various research fields, such as data representation, Blockchain application, 3D shape recognition and classification, query method, classification method, and search algorithm, to provide insights into the future paradigm. In this paper, we present a summary of 18 high-quality accepted articles following a rigorous review process in the fields of Blockchain, machine learning, and big data.