Image Processing-based Object Recognition Approach for Automatic Operation of Cranes

  • Zhou, Ying (Department of Construction Management, Tsinghua University) ;
  • Guo, Hongling (Department of Construction Management, Tsinghua University) ;
  • Ma, Ling (Department of Construction Management, Tsinghua University) ;
  • Zhang, Zhitian (Department of Construction Management, Tsinghua University)
  • Published : 2020.12.07

Abstract

The construction industry is suffering from aging workers, frequent accidents, as well as low productivity. With the rapid development of information technologies in recent years, automatic construction, especially automatic cranes, is regarded as a promising solution for the above problems and attracting more and more attention. However, in practice, limited by the complexity and dynamics of construction environment, manual inspection which is time-consuming and error-prone is still the only way to recognize the search object for the operation of crane. To solve this problem, an image-processing-based automated object recognition approach is proposed in this paper, which is a fusion of Convolutional-Neutral-Network (CNN)-based and traditional object detections. The search object is firstly extracted from the background by the trained Faster R-CNN. And then through a series of image processing including Canny, Hough and Endpoints clustering analysis, the vertices of the search object can be determined to locate it in 3D space uniquely. Finally, the features (e.g., centroid coordinate, size, and color) of the search object are extracted for further recognition. The approach presented in this paper was implemented in OpenCV, and the prototype was written in Microsoft Visual C++. This proposed approach shows great potential for the automatic operation of crane. Further researches and more extensive field experiments will follow in the future.

Keywords

Acknowledgement

We would like to thank the National Natural Science Foundation of China (Grant No. 51578318, 51208282) as well as Tsinghua University-Glodon Joint Research Centre for Building Information Model (RCBIM) for supporting this research. Besides, thanks for the help provided by Zhubang Luo in computer programming.