Browse > Article
http://dx.doi.org/10.14346/JKOSOS.2019.34.6.50

A Comparative Study on Performance of Deep Learning Models for Vision-based Concrete Crack Detection according to Model Types  

Kim, Byunghyun (Department of Civil Engineering, University of Seoul))
Kim, Geonsoon (Institute for Industrial Technology, University of Seoul)
Jin, Soomin (Department of Civil Engineering, University of Seoul))
Cho, Soojin (Department of Civil Engineering, University of Seoul))
Publication Information
Journal of the Korean Society of Safety / v.34, no.6, 2019 , pp. 50-57 More about this Journal
Abstract
In this study, various types of deep learning models that have been proposed recently are classified according to data input / output types and analyzed to find the deep learning model suitable for constructing a crack detection model. First the deep learning models are classified into image classification model, object segmentation model, object detection model, and instance segmentation model. ResNet-101, DeepLab V2, Faster R-CNN, and Mask R-CNN were selected as representative deep learning model of each type. For the comparison, ResNet-101 was implemented for all the types of deep learning model as a backbone network which serves as a main feature extractor. The four types of deep learning models were trained with 500 crack images taken from real concrete structures and collected from the Internet. The four types of deep learning models showed high accuracy above 94% during the training. Comparative evaluation was conducted using 40 images taken from real concrete structures. The performance of each type of deep learning model was measured using precision and recall. In the experimental result, Mask R-CNN, an instance segmentation deep learning model showed the highest precision and recall on crack detection. Qualitative analysis also shows that Mask R-CNN could detect crack shapes most similarly to the real crack shapes.
Keywords
crack detection; deep learning; image classification; object detection; semantic segmentation; instance segmentation;
Citations & Related Records
연도 인용수 순위
  • Reference
1 K. Jang et al., "Deep Learning-Based Autonomous Concrete Crack Evaluation through Hybrid Image Scanning", Structural Health Monitoring, Vol. 18, No. 5-6, pp. 1722-1737, 2019.   DOI
2 C. V. Dung and D. A. Le, "Autonomous Concrete Crack Detection Using Deep Fully Convolutional Neural Network", Automation in Construction, Vol. 99, pp. 52-58, 2019.   DOI
3 J. Schmidhuber, "Deep Learning in Neural Networks: An Overview", Neural Networks, Vol. 61, Elsevier Ltd, pp. 85-117, 2015.   DOI
4 Y. LeCun et al., "Gradient-Based Learning Applied to Document Recognition", Proceedings of the IEEE, Vol. 86, No. 11, pp. 2278-2323, 1998.   DOI
5 K. Fukushima, "Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position", Biological Cybernetics, Vol. 36, No. 4, Springer-Verlag, pp. 193-202, 1980.   DOI
6 A. Krizhevsky et al., "ImageNet Classification with Deep Convolutional Neural Networks", Advances In Neural Information Processing Systems, pp. 1-9, 2012.
7 K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition", International Conference on Learning Representations (ICRL), pp. 1-14, 2015.
8 K. He et al., "Deep Residual Learning for Image Recognition", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
9 S. Ren et al., "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, No. 6, pp. 1137-1149, 2017.   DOI
10 W. Liu et al., "SSD: Single Shot Multibox Detector", European Conference on Computer Vision, Vol. 9905 LNCS, pp. 21-37, 2016.
11 J. Redmon et al., "You Only Look Once: Unified, Real-Time Object Detection", Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 779-788, 2016.
12 V. Badrinarayanan et al., "SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, No. 12, pp. 2481-2495, 2017.   DOI
13 J. R. R. Uijlings et al., "Selective Search for Object Recognition", International Journal of Computer Vision, Vol. 104, No. 2, Springer US, pp. 154-1571, 2013, doi: 10.1007/s11263-013-0620-5.   DOI
14 L. C. Chen et al., "DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 40, No. 4, pp. 834-848, 2018.   DOI
15 O. Ronneberger et al., "U-Net: Convolutional Networks for Biomedical Image Segmentation", International Conference on Medical Image Computing and Computer-Assisted Intervention, Vol. 9351, pp. 234-241, 2015.
16 K. He et al., "Mask R-CNN", Proceedings of the IEEE International Conference on Computer Vision, pp. 2961-2969, 2017.
17 T. Y. Lin et al., Microsoft COCO: Common Objects in Context, 2014.
18 R. Girshick et al., "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation", The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580-587, 2014.
19 R. Girshick, "Fast R-CNN", Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, 2015.
20 J. Huang et al., "Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors", The IEEE Conference on Computer Vision and Pattern Recognition(CVPR), pp. 7310-7311, 2017.
21 GitHub - Waspinator/Pycococreator: Helper Functions to Create COCO Datasets. https://github.com/waspinator/pycococreator. Accessed 8 Aug. 2019.