• Title/Summary/Keyword: Real-time Segmentation

Search Result 284, Processing Time 0.031 seconds

An Automatic Weight Measurement of Rope Using Computer Vision

  • Joo, Ki-See
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.2 no.1
    • /
    • pp.141-146
    • /
    • 1998
  • Recently, the computer vision such as part measurement, and product inspection is very popular to achieve the factory automation since the labor cost is dramatically increasing. In this paper, the diameter and the length of rope are measured by CCD camera which is orthogonally mounted on the ceiling. Two parameters which are the diameter and the length of rope are used to measure the weight of rope. If the weight of rope is reached to predetermined weight, the information is transmitted to PLC(programmable logic control) to cut the rope on the wheel. The cutting machine cuts the rope according to the information obtained from the CCD camera. To measure the diameter and length of rope on real time, the searching space for image segmentation is restricted the predetermined area according to the camera calibration position. Finally, to estimate the weight of rope, the knowledge base system which depends on the diameter, the length of rope, and weight relation between these information are constructed according to diameters of rope. This method contributes to achieve the factory automation, and reduce the production cost since the operators are unnecessary to measure the weight of rope by try-and-error method.

  • PDF

A Vision-Based Method to Find Fingertips in a Closed Hand

  • Chaudhary, Ankit;Vatwani, Kapil;Agrawal, Tushar;Raheja, J.L.
    • Journal of Information Processing Systems
    • /
    • v.8 no.3
    • /
    • pp.399-408
    • /
    • 2012
  • Hand gesture recognition is an important area of research in the field of Human Computer Interaction (HCI). The geometric attributes of the hand play an important role in hand shape reconstruction and gesture recognition. That said, fingertips are one of the important attributes for the detection of hand gestures and can provide valuable information from hand images. Many methods are available in scientific literature for fingertips detection with an open hand but very poor results are available for fingertips detection when the hand is closed. This paper presents a new method for the detection of fingertips in a closed hand using the corner detection method and an advanced edge detection algorithm. It is important to note that the skin color segmentation methodology did not work for fingertips detection in a closed hand. Thus the proposed method applied Gabor filter techniques for the detection of edges and then applied the corner detection algorithm for the detection of fingertips through the edges. To check the accuracy of the method, this method was tested on a vast number of images taken with a webcam. The method resulted in a higher accuracy rate of detections from the images. The method was further implemented on video for testing its validity on real time image capturing. These closed hand fingertips detection would help in controlling an electro-mechanical robotic hand via hand gesture in a natural way.

Low-Quality Banknote Serial Number Recognition Based on Deep Neural Network

  • Jang, Unsoo;Suh, Kun Ha;Lee, Eui Chul
    • Journal of Information Processing Systems
    • /
    • v.16 no.1
    • /
    • pp.224-237
    • /
    • 2020
  • Recognition of banknote serial number is one of the important functions for intelligent banknote counter implementation and can be used for various purposes. However, the previous character recognition method is limited to use due to the font type of the banknote serial number, the variation problem by the solid status, and the recognition speed issue. In this paper, we propose an aspect ratio based character region segmentation and a convolutional neural network (CNN) based banknote serial number recognition method. In order to detect the character region, the character area is determined based on the aspect ratio of each character in the serial number candidate area after the banknote area detection and de-skewing process is performed. Then, we designed and compared four types of CNN models and determined the best model for serial number recognition. Experimental results showed that the recognition accuracy of each character was 99.85%. In addition, it was confirmed that the recognition performance is improved as a result of performing data augmentation. The banknote used in the experiment is Indian rupee, which is badly soiled and the font of characters is unusual, therefore it can be regarded to have good performance. Recognition speed was also enough to run in real time on a device that counts 800 banknotes per minute.

Object Detection using Multiple Color Normalization and Moving Color Information (다중색상정규화와 움직임 색상정보를 이용한 물체검출)

  • Kim, Sang-Hoon
    • The KIPS Transactions:PartB
    • /
    • v.12B no.7 s.103
    • /
    • pp.721-728
    • /
    • 2005
  • This paper suggests effective object detection system for moving objects with specified color and motion information. The proposed detection system includes the object extraction and definition process which uses MCN(Multiple Color Normalization) and MCWUPC(Moving Color Weighted Unmatched Pixel Count) computation to decide the existence of moving object and object segmentation technique using signature information is used to exactly extract the objects with high probability. Finally, real time detection system is implemented to verify the effectiveness of the technique and experiments show that the success rate of object tracking is more than $89\%$ of total 120 image frames.

Traffic Signal Detection and Recognition Using a Color Segmentation in a HSI Color Model (HSI 색상 모델에서 색상 분할을 이용한 교통 신호등 검출과 인식)

  • Jung, Min Chul
    • Journal of the Semiconductor & Display Technology
    • /
    • v.21 no.4
    • /
    • pp.92-98
    • /
    • 2022
  • This paper proposes a new method of the traffic signal detection and the recognition in an HSI color model. The proposed method firstly converts a ROI image in the RGB model to in the HSI model to segment the color of a traffic signal. Secondly, the segmented colors are dilated by the morphological processing to connect the traffic signal light and the signal light case and finally, it extracts the traffic signal light and the case by the aspect ratio using the connected component analysis. The extracted components show the detection and the recognition of the traffic signal lights. The proposed method is implemented using C language in Raspberry Pi 4 system with a camera module for a real-time image processing. The system was fixedly installed in a moving vehicle, and it recorded a video like a vehicle black box. Each frame of the recorded video was extracted, and then the proposed method was tested. The results show that the proposed method is successful for the detection and the recognition of traffic signals.

Detecting Numeric and Character Areas of Low-quality License Plate Images using YOLOv4 Algorithm (YOLOv4 알고리즘을 이용한 저품질 자동차 번호판 영상의 숫자 및 문자영역 검출)

  • Lee, Jeonghwan
    • Journal of Korea Society of Digital Industry and Information Management
    • /
    • v.18 no.4
    • /
    • pp.1-11
    • /
    • 2022
  • Recently, research on license plate recognition, which is a core technology of an intelligent transportation system(ITS), is being actively conducted. In this paper, we propose a method to extract numbers and characters from low-quality license plate images by applying the YOLOv4 algorithm. YOLOv4 is a one-stage object detection method using convolution neural network including BACKBONE, NECK, and HEAD parts. It is a method of detecting objects in real time rather than the previous two-stage object detection method such as the faster R-CNN. In this paper, we studied a method to directly extract number and character regions from low-quality license plate images without additional edge detection and image segmentation processes. In order to evaluate the performance of the proposed method we experimented with 500 license plate images. In this experiment, 350 images were used for training and the remaining 150 images were used for the testing process. Computer simulations show that the mean average precision of detecting number and character regions on vehicle license plates was about 93.8%.

Image Dehazing Enhancement Algorithm Based on Mean Guided Filtering

  • Weimin Zhou
    • Journal of Information Processing Systems
    • /
    • v.19 no.4
    • /
    • pp.417-426
    • /
    • 2023
  • To improve the effect of image restoration and solve the image detail loss, an image dehazing enhancement algorithm based on mean guided filtering is proposed. The superpixel calculation method is used to pre-segment the original foggy image to obtain different sub-regions. The Ncut algorithm is used to segment the original image, and it outputs the segmented image until there is no more region merging in the image. By means of the mean-guided filtering method, the minimum value is selected as the value of the current pixel point in the local small block of the dark image, and the dark primary color image is obtained, and its transmittance is calculated to obtain the image edge detection result. According to the prior law of dark channel, a classic image dehazing enhancement model is established, and the model is combined with a median filter with low computational complexity to denoise the image in real time and maintain the jump of the mutation area to achieve image dehazing enhancement. The experimental results show that the image dehazing and enhancement effect of the proposed algorithm has obvious advantages, can retain a large amount of image detail information, and the values of information entropy, peak signal-to-noise ratio, and structural similarity are high. The research innovatively combines a variety of methods to achieve image dehazing and improve the quality effect. Through segmentation, filtering, denoising and other operations, the image quality is effectively improved, which provides an important reference for the improvement of image processing technology.

Identifying Puddles based on Intensity Measurement using LiDAR

  • Minyoung Lee;Ji-Chul Kim;Moo Hyun Cha;Hanmin Lee;Sooyong Lee
    • Journal of Sensor Science and Technology
    • /
    • v.32 no.5
    • /
    • pp.267-274
    • /
    • 2023
  • LiDAR, one of the most important sensing methods used in mobile robots and cars with assistive/autonomous driving functions, is used to locate surrounding obstacles or to build maps. For real-time path generation, the detection of potholes or puddles on the driving surface is crucial. To achieve this, we used the coordinates of the reflection points provided by LiDAR as well as the intensity information to classify water areas, which was achieved by applying a linear regression method to the intensity distribution. The rationale for using the LiDAR index as an input variable for linear regression is presented, and we demonstrated that it is not affected by errors in the distance measurement value. Because of LiDAR vertical scanning, if the reflective surface is not uniform, it is divided into different groups according to the intensity distribution, and a mathematical basis for this is presented. Through experiments in an outdoor driving area, we could distinguish between flat ground, potholes, and puddles, and kinematic analysis was performed to calculate the maximum width that could be crossed for a given vehicle body size and wheel radius.

Aluminum Car Door Defect Detection by Using Multi-frame Image Segmentation Techniques (다중 프레임 이미지 분할 기술을 사용한 알루미늄 자동차 도어 결함 감지)

  • Ugur Ercelik;HaoYu Chen;Longfei Li;Kyungbaek Kim
    • Annual Conference of KIPS
    • /
    • 2024.10a
    • /
    • pp.626-629
    • /
    • 2024
  • AI-based image detection technology offers a promising solution for identifying defects in car doors, significantly improving efficiency compared to traditional human inspections. This paper introduces an advanced automatic defect detection system utilizing camera-recorded datasets and trained models to identify defects in aluminum car doors. Unlike previous models focused on aluminum castings, this is the first application specifically targeting car doors. Despite progress in defect detection research, challenges such as data imbalance, complex defect characteristics, and limited research on aluminum car doors persist. To address these issues, we propose the LKADenseNet201 model, enhancing the DenseNet201 architecture with a large kernel attention mechanism.While doing this, we focus on 3 important issues: image augmentation, channel attention and model evaluation.Our image processing process mainly include image augmentation. With image augmentation, we aimed to make data diversity suitable for the real world by obtaining data from different angles and to eliminate the imbalance between defect and normal images. This improvement boosts the model's ability to perceive contextual features and increases computational efficiency, essential for detailed spatial understanding and time-critical tasks. Our approach not only enhances operator efficiency but also moves towards automating the inspection process.

Study on object detection and distance measurement functions with Kinect for windows version 2 (키넥트(Kinect) 윈도우 V2를 통한 사물감지 및 거리측정 기능에 관한 연구)

  • Niyonsaba, Eric;Jang, Jong-Wook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.6
    • /
    • pp.1237-1242
    • /
    • 2017
  • Computer vision is coming more interesting with new imaging sensors' new capabilities which enable it to understand more its surrounding environment by imitating human vision system with artificial intelligence techniques. In this paper, we made experiments with Kinect camera, a new depth sensor for object detection and distance measurement functions, most essential functions in computer vision such as for unmanned or manned vehicles, robots, drones, etc. Therefore, Kinect camera is used here to estimate the position or the location of objects in its field of view and measure the distance from them to its depth sensor in an accuracy way by checking whether that the detected object is real object or not to reduce processing time ignoring pixels which are not part of real object. Tests showed promising results with such low-cost range sensor, Kinect camera which can be used for object detection and distance measurement which are fundamental functions in computer vision applications for further processing.