Browse > Article
http://dx.doi.org/10.3837/tiis.2022.05.010

Deep Local Multi-level Feature Aggregation Based High-speed Train Image Matching  

Li, Jun (Beijing Key Laboratory of Work Safety Intelligent Monitoring, School of Electronic Engineering, Beijing University of Posts and Telecommunications)
Li, Xiang (Beijing Key Laboratory of Work Safety Intelligent Monitoring, School of Electronic Engineering, Beijing University of Posts and Telecommunications)
Wei, Yifei (Beijing Key Laboratory of Work Safety Intelligent Monitoring, School of Electronic Engineering, Beijing University of Posts and Telecommunications)
Wang, Xiaojun (Dublin City University)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.16, no.5, 2022 , pp. 1597-1610 More about this Journal
Abstract
At present, the main method of high-speed train chassis detection is using computer vision technology to extract keypoints from two related chassis images firstly, then matching these keypoints to find the pixel-level correspondence between these two images, finally, detection and other steps are performed. The quality and accuracy of image matching are very important for subsequent defect detection. Current traditional matching methods are difficult to meet the actual requirements for the generalization of complex scenes such as weather, illumination, and seasonal changes. Therefore, it is of great significance to study the high-speed train image matching method based on deep learning. This paper establishes a high-speed train chassis image matching dataset, including random perspective changes and optical distortion, to simulate the changes in the actual working environment of the high-speed rail system as much as possible. This work designs a convolutional neural network to intensively extract keypoints, so as to alleviate the problems of current methods. With multi-level features, on the one hand, the network restores low-level details, thereby improving the localization accuracy of keypoints, on the other hand, the network can generate robust keypoint descriptors. Detailed experiments show the huge improvement of the proposed network over traditional methods.
Keywords
Image matching; High-speed train; Multi-scale features; Artificial intelligence; Joint description and detection of local features;
Citations & Related Records
연도 인용수 순위
  • Reference
1 E. Rosten and T. Drummond, "Fusing point sand lines for high performance tracking," in Proc. of the IEEE International Conference on Computer Vision (ICCV 2005), vol. 1, no. 2, pp. 1508-1515, 2005.
2 S. Mutic, J. F. Dempsey and W. R. Bosch, "Multimodality image registration quality assurance for conformal three-dimensional treatment planning," International Journal of Radiation Oncology, Biology, Physics, vol.51, no.1, pp. 255-260, 2001.   DOI
3 S. Sun, J. Zhou, J. Wen, Y. Wei and X. Wang, "A dqn-based cache strategy for mobile edge networks," Computers, Materials & Continua, vol. 71, no.2, pp. 3277-3291, 2022.   DOI
4 Z. J. Zhang, "Research on application of dynamic image detection system for EMU vehicle faults (TEDS)," Railway Locomotive &Car, vol. 34, no. 4, pp. 82-84, 2014.   DOI
5 B. D. Lucas and T. Kanade, "An iterative image registration technique with an application to stereo vision," in Proc. of the 7th International Joint Conference on Artificial Intelligence Morgan Kaufmann Publishers Inc., 674-679, 1981.
6 D. Peng, "Anomaly detection algorithm for the bottom parts of high-speed trains based on the SURF feature of rail-side images," M.S. dissertation, Beijing Jiaotong University, China, Beijing, 2016.
7 Z. G. Qu, H. R. Sun and M. Zheng, "An efficient quantum image steganography protocol based on improved EMD algorithm," Quantum Information Processing, vol. 20, no. 53, pp. 1-29, 2021.   DOI
8 E. Rosten and T. Drummond, "Fusing point sand lines for high performance tracking," in Proc. of the IEEE International Conference on Computer Vision (ICCV 2005), vol. 1, no. 2, pp. 1508-1515, 2005.
9 D. G. Lowe, "Distinctive image features from scale invariant keypoints," International Journal of Computer Vision, vol. 60, pp. 91-110, 2004.   DOI
10 B. Herbert, T. Tuytelaars and L. V. Gool, "SURF: Speeded up robust features," in Proc. of the European Conference on Computer Vision (ECCV 2006), pp. 404-417, 2006.
11 D. Detone, T. Malisiewicz and A. Rabinovich, "Deep image homography estimation," arXiv preprint, arXiv: 1606.03798, 2016.
12 Rublee. E, "ORB: an efficient alternative to SIFT or SURF," in Proc. of IEEE International Conference on Computer Vision, Barcelona, Spain, pp. 2564-2571, 2011.
13 M. A. Fischler and R. C. Bolles, "Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography," Readings in Computer Vision, pp. 726-740, 1987.
14 B, Liu, "Research and thoughts on the application of the image detection system for operational faults of EMUs (TEDS)," China Railway, vol. 12, pp. 61-65, 2017.
15 R. Hartley and A. Zisserman, "Multiple view geometry in computer vision," Cambridge University Press, 2003.
16 L. Xiangchun, C. Zhan, S. Wei, L. Fenglei and Y. Yanxing, "Data matching of solar images super-resolution based on deep learning," Computers, Materials & Continua, vol. 68, no.3, pp. 4017-4029, 2021.   DOI
17 Z. G. Qu, Y. M. Huang and M. Zheng, "A novel coherence-based quantum steganalysis protocol," Quantum Information Processing, vol. 19, no. 362, pp. 1-19, 2020.   DOI
18 Y. F. Wei, F. Richard Yu, M. Song and Z. Han, "User scheduling and resource allocation in HetNets with hybrid energy supply: An actor-critic reinforcement learning approach," IEEE Transactions on Wireless Communications, vol. 17, no.1, pp. 680-692, Jan. 2018.   DOI
19 X. Z. Liu, L. Luo and Y. Zhang, "High-speed train image registration algorithm based on deep learning," Information Technology, vol. 45, no. 7, pp. 26-30, 2021.
20 J. Zhang, C. Wang and S. Liu, "Content-aware unsupervised deep homography estimation," in Proc. of European Conference on Computer Vision (ECCV 2020), pp. 653-669, 2020.
21 M. Dusmanu, I. Rocco and T. Pajdla, "D2-Net: A trainable CNN for joint detection and description of local features," in Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2019), pp. 8092-8101, 2019.
22 Z. Luo, L. Zhou, X. Bai, H. Chen and J. Zhang, "ASLFeat: Learning local features of accurate shape and localization," in Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020), pp. 6589-6589, 2020.
23 D. Detone, T. Malisiewicz and A. Rabinovich, "SuperPoint: Self-supervised interest point detection and description," in Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018), pp. 224-236, 2018.
24 Karami E, Prasad S and Shehata M, "Image matching using SIFT, SURF, BRIEF and ORB: performance comparison for distorted images," arXiv e-prints, arXiv: 1710.02726, 2017.
25 V. Balntas, K. Lenc, A. Vedaldi and K. Mikolajczyk, "HPatches: A benchmark and evaluation of handcrafted and learned local descriptors," in Proc. of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2017), pp. 5173-5182, 2017.
26 G. X. Xie, "Research and development of image-based train component integrity detection method and system," M.S. dissertation, Chang'an University, China, Xi'an, 2019.
27 S. F. Lu and Z. L, "Dynamic image comparison and analysis method for operation faults of EMU," Laser & Optoelectronics Progress, vol. 54, no. 9, pp. 301-307, 2017.
28 K. Mikolajczyk and C. Schmid, "A performance evaluation of local descriptors," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 10, pp. 1615-1630, 2005.   DOI