• Title/Summary/Keyword: Faster R-CNN

Search Result 90, Processing Time 0.026 seconds

Scale-aware Faster R-CNN for Caltech Pedestrian Detection (Caltech 보행자 감지를 위한 Scale-aware Faster R-CNN)

  • Byambajav, Batkhuu;Alikhanov, Jumabek;Jo, Geun-Sik
    • Annual Conference of KIPS
    • /
    • 2016.10a
    • /
    • pp.506-509
    • /
    • 2016
  • We present real-time pedestrian detection that exploit accuracy of Faster R-CNN network. Faster R-CNN has shown to success at PASCAL VOC multi-object detection tasks, and their ability to operate on raw pixel input without the need to design special features is very engaging. Therefore, in this work we apply and adjust Faster R-CNN to single object detection, which is pedestrian detection. The drawback of Faster R-CNN is its failure when object size is small. Previously, small sized object problem was solved by Scale-aware Network. We incorporate Scale-aware Network to Faster R-CNN. This made our method Scale-aware Faster R-CNN (DF R-CNN) that is both fast and very accurate. We separated Faster R-CNN networks into two sub-network, that is one for large-size objects and another one for small-size objects. The resulting approach achieves a 28.3% average miss rate on the Caltech Pedestrian detection benchmark, which is competitive with the other best reported results.

Performance Comparison of the Optimizers in a Faster R-CNN Model for Object Detection of Metaphase Chromosomes (중기 염색체 객체 검출을 위한 Faster R-CNN 모델의 최적화기 성능 비교)

  • Jung, Wonseok;Lee, Byeong-Soo;Seo, Jeongwook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.23 no.11
    • /
    • pp.1357-1363
    • /
    • 2019
  • In this paper, we compares the performance of the gredient descent optimizers of the Faster Region-based Convolutional Neural Network (R-CNN) model for the chromosome object detection in digital images composed of human metaphase chromosomes. In faster R-CNN, the gradient descent optimizer is used to minimize the objective function of the region proposal network (RPN) module and the classification score and bounding box regression blocks. The gradient descent optimizer. Through performance comparisons among these four gradient descent optimizers in our experiments, we found that the Adamax optimizer could achieve the mean average precision (mAP) of about 52% when considering faster R-CNN with a base network, VGG16. In case of faster R-CNN with a base network, ResNet50, the Adadelta optimizer could achieve the mAP of about 58%.

Deep Learning Method for Improving Contamination Dectection of Xoray Inspection System (X-ray 이물검출기의 이물 검출 향상을 위한 딥러닝 방법)

  • Lim, Byung Hey;Jeong, Seung Su;Yu, Yun Seop
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.460-462
    • /
    • 2021
  • Food basically must have nutrition and safety. Recently, a number of symptoms of food poisoning occurred in a kindergarten in Ansan, where food safety was suspected. Therefore, the safety of food is more demanding. In this paper, we propose a method to inprove the detector to secure food safety. The proposed method is to learn through the network of convolution neural network (CNN) and Faster region-CNN (Faster R-CNN) and test the images of normal and foreign products. As a result of testing through a deep learning model, the method that used Faster R-CNN in parallel with the existing foreign body detector algorithm showed better detection rate than other methods.

  • PDF

Real-Time License Plate Detection Based on Faster R-CNN (Faster R-CNN 기반의 실시간 번호판 검출)

  • Lee, Dongsuk;Yoon, Sook;Lee, Jaehwan;Park, Dong Sun
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.5 no.11
    • /
    • pp.511-520
    • /
    • 2016
  • Automatic License Plate Detection (ALPD) is a key technology for a efficient traffic control. It is used to improve work efficiency in many applications such as toll payment systems and parking and traffic management. Until recently, the hand-crafted features made for image processing are used to detect license plates in most studies. It has the advantage in speed. but can degrade the detection rate with respect to various environmental changes. In this paper, we propose a way to utilize a Faster Region based Convolutional Neural Networks (Faster R-CNN) and a Conventional Convolutional Neural Networks (CNN), which improves the computational speed and is robust against changed environments. The module based on Faster R-CNN is used to detect license plate candidate regions from images and is followed by the module based on CNN to remove False Positives from the candidates. As a result, we achieved a detection rate of 99.94% from images captured under various environments. In addition, the average operating speed is 80ms/image. We implemented a fast and robust Real-Time License Plate Detection System.

Application of deep learning technique for battery lead tab welding error detection (배터리 리드탭 압흔 오류 검출의 딥러닝 기법 적용)

  • Kim, YunHo;Kim, ByeongMan
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.27 no.2
    • /
    • pp.71-82
    • /
    • 2022
  • In order to replace the sampling tensile test of products produced in the tab welding process, which is one of the automotive battery manufacturing processes, vision inspectors are currently being developed and used. However, the vision inspection has the problem of inspection position error and the cost of improving it. In order to solve these problems, there are recent cases of applying deep learning technology. As one such case, this paper tries to examine the usefulness of applying Faster R-CNN, one of the deep learning technologies, to existing product inspection. The images acquired through the existing vision inspection machine are used as training data and trained using the Faster R-CNN ResNet101 V1 1024x1024 model. The results of the conventional vision test and Faster R-CNN test are compared and analyzed based on the test standards of 0% non-detection and 10% over-detection. The non-detection rate is 34.5% in the conventional vision test and 0% in the Faster R-CNN test. The over-detection rate is 100% in the conventional vision test and 6.9% in Faster R-CNN. From these results, it is confirmed that deep learning technology is very useful for detecting welding error of lead tabs in automobile batteries.

Comparison of Region-based CNN Methods for Defects Detection on Metal Surface (금속 표면의 결함 검출을 위한 영역 기반 CNN 기법 비교)

  • Lee, Minki;Seo, Kisung
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.67 no.7
    • /
    • pp.865-870
    • /
    • 2018
  • A machine vision based industrial inspection includes defects detection and classification. Fast inspection is a fundamental problem for many applications of real-time vision systems. It requires little computation time and localizing defects robustly with high accuracy. Deep learning technique have been known not to be suitable for real-time applications. Recently a couple of fast region-based CNN algorithms for object detection are introduced, such as Faster R-CNN, and YOLOv2. We apply these methods for an industrial inspection problem. Three CNN based detection algorithms, VOV based CNN, Faster R-CNN, and YOLOv2, are experimented for defect detection on metal surface. The results for inspection time and various performance indices are compared and analysed.

Recognition of GUI Widgets Utilizing Translational Embeddings based on Relational Learning (트랜슬레이션 임베딩 기반 관계 학습을 이용한 GUI 위젯 인식)

  • Park, Min-Su;Seok, Ho-Sik
    • Journal of IKEEE
    • /
    • v.22 no.3
    • /
    • pp.693-699
    • /
    • 2018
  • CNN based object recognitions have reported splendid results. However, the recognition of mobile apps raises an interesting challenge that recognition performance of similar widgets is not consistent. In order to improve the performance, we propose a noble method utilizing relations between input widgets. The recognition process flows from the Faster R-CNN based recognition to enhancement using a relation recognizer. The relations are represented as vector translation between objects in a relation space. Experiments on 323 apps show that our method significantly enhances the Faster R-CNN only approach.

A Method of Eye and Lip Region Detection using Faster R-CNN in Face Image (초고속 R-CNN을 이용한 얼굴영상에서 눈 및 입술영역 검출방법)

  • Lee, Jeong-Hwan
    • Journal of the Korea Convergence Society
    • /
    • v.9 no.8
    • /
    • pp.1-8
    • /
    • 2018
  • In the field of biometric security such as face and iris recognition, it is essential to extract facial features such as eyes and lips. In this paper, we have studied a method of detecting eye and lip region in face image using faster R-CNN. The faster R-CNN is an object detection method using deep running and is well known to have superior performance compared to the conventional feature-based method. In this paper, feature maps are extracted by applying convolution, linear rectification process, and max pooling process to facial images in order. The RPN(region proposal network) is learned using the feature map to detect the region proposal. Then, eye and lip detector are learned by using the region proposal and feature map. In order to examine the performance of the proposed method, we experimented with 800 face images of Korean men and women. We used 480 images for the learning phase and 320 images for the test one. Computer simulation showed that the average precision of eye and lip region detection for 50 epoch cases is 97.7% and 91.0%, respectively.

Rear-Approaching Vehicle Detection Research using Region of Interesting based on Faster R-CNN (Faster R-CNN 기반의 관심영역 유사도를 이용한 후방 접근차량 검출 연구)

  • Lee, Yeung-Hak;Kim, Joong-Soo;Shim, Jae-Chnag
    • Journal of IKEEE
    • /
    • v.23 no.1
    • /
    • pp.235-241
    • /
    • 2019
  • In this paper, we propose a new algorithm to detect rear-approaching vehicle using the frame similarity of ROI(Region of Interest) based on deep learning algorithm for use in agricultural machinery systems. Since the vehicle detection system for agricultural machinery needs to detect only a vehicle approaching from the rear. we use Faster R-CNN model that shows excellent accuracy rate in deep learning for vehicle detection. And we proposed an algorithm that uses the frame similarity for ROI using constrained conditions. Experimental results show that the proposed method has a detection rate of 99.9% and reduced the false positive values.

Performance Analysis of Detecting buried pipelines in GPR images using Faster R-CNN (Faster R-CNN을 활용한 GPR 영상에서의 지하배관 위치추적 성능분석)

  • Ko, Hyoung-Yong;Kim, Nam-gi
    • Journal of Convergence for Information Technology
    • /
    • v.9 no.5
    • /
    • pp.21-26
    • /
    • 2019
  • Various pipes are buried in the city as needed, such as water pipes, gas pipes and hydrogen pipes. As the time passes, buried pipes becomes aged due to crack, etc. these pipes has the risk of accidents such as explosion and leakage. To prevent the risks, many pipes are repaired or replaced, but the location of the pipes can also be changed. Failure to identify the location of the altered pipe may cause an accident by touching the pipe. In this paper, we propose a method to detect buried pipes by gathering the GPR images by using GPR and Learning with Faster R-CNN. Then experiments was carried out by raw data sets and data sets augmentation applied to increase the amount of images.