DOI QR코드

DOI QR Code

Vehicle-Level Traffic Accident Detection on Vehicle-Mounted Camera Based on Cascade Bi-LSTM

  • Son, Hyeon-Cheol (Dept. of Computer Engineering, Kumoh National Institute of Technology) ;
  • Kim, Da-Seul (Dept. of Computer Engineering, Kumoh National Institute of Technology) ;
  • Kim, Sung-Young (Dept. of Computer Engineering, Kumoh National Institute of Technology)
  • Received : 2020.12.14
  • Accepted : 2020.12.30
  • Published : 2020.12.31

Abstract

In this paper, we propose a traffic accident detection on vehicle-mounted camera. In the proposed method, the minimum bounding box coordinates the central coordinates on the bird's eye view and motion vectors of each vehicle object, and ego-motions of the vehicle equipped with dash-cam are extracted from the dash-cam video. By using extracted 4 kinds features as the input of Bi-LSTM (bidirectional LSTM), the accident probability (score) is predicted. To investigate the effect of each input feature on the probability of an accident, we analyze the performance of the detection the case of using a single feature input and the case of using a combination of features as input, respectively. And in these two cases, different detection models are defined and used. Bi-LSTM is used as a cascade, especially when a combination of the features is used as input. The proposed method shows 76.1% precision and 75.6% recall, which is superior to our previous work.

Keywords

Acknowledgement

This research was supported by the MSIT (Ministry of Science and ICT), Korea, under the ICT Consilience Creative program (IITP-2020-2011-1-00783) supervised by the IITP (Institute for Information & communications Technology Planning & Evaluation)

References

  1. H Zu, Y. Xie, l. Ma and J. Fu, "Vision-based real-time traffic accident detection," Proceeding of the 11th World Congress on Intelligent Control and Automation, Shenyang, 2014, pp. 1035-1038.
  2. E. P. Ijjina, D. Chand, S. Gupta and K. Goutham, "Computer Vision-based Accident Detection in Traffic Surveillance," 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Kanpur, India, 2019, pp. 1-6.
  3. H. Kim, S. Park and J. Paik, "Pre-Activated 3D CNN and Feature Pyramid Network for Traffic Accident Detection," 2020 IEEE International Conference on Consumer Electronics (ICCE), Las Vegas, NV, USA, 2020, pp. 1-3.
  4. K. He, G. Gkioxari, P. Dollar and R. Girshick, "Mask R-CNN," 2017 IEEE International Conference on Computer Vision (ICCV), Venice, 2017, pp. 2980-2988.
  5. Y. Yao, M. Xu, Y. Wang, D. J. Crandall and E. M. Atkins, "Unsupervised Traffic Accident Detection in First-Person Videos," 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Macau, China, 2019, pp. 273-280.
  6. J. Qiao, J. Fang, D. Yan and J. Xue, "Driving Accident Detection by Self-Supervised Adversarial Appearance-Motion Prediction in First-Person Videos," 2020 3rd International Conference on Unmanned Systems (ICUS), Harbin, 2020, pp. 1083-1088.
  7. I. Goodfellow, J. Pouget-Abadie and M. Mirza, "Generative adversarial networks," in Advances in Neural Information Processing Systems 27, 2014, pp. 2672-2680.
  8. F. H. Chan, Y. T. Chen, Y. Xiang, and M. Sun, "Anticipating accidents in dashcam videos," in ACCV 2016. ACCV 2016. Lecture Notes in Computer Science, 2017, Vol. 10114, pp. 136-153. https://doi.org/10.1007/978-3-319-54190-7_9
  9. D. Bahdanau, K. H. Cho, and Y. Bengio, "Neural machine translation by jointly learning to align and translate," in 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings, 2015.
  10. D. Kim, H. Son, S. Jong and S. Kim, "Traffic Accident Detection Based on Ego Motion and Object Tracking," in Journal of Advanced Information Technology and Convergence, 2020, pp. 15-23.
  11. H. Son, S. Jong, D. Kim, Y. Lee and S. Kim, "Traffic Accident Detection Using Bird's-Eye View and Vehicle Motion Vector," in Korean Society of Computer Information, 2020, pp. 71-72.
  12. X. Zhou, V. Koltun and P. Krahenbuhl, "Tracking Objects as Points," in Computer Vision - ECCV 2020, 2020.
  13. R. Mur-Artal and J. D. Tardos, "ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras," IEEE Transactions on Robotics, 2016.
  14. M. Schuster and K. K. Paliwal, "Bidirectional recurrent neural networks," in IEEE Transactions on Signal Processing, 1997, vol. 45, no. 11, pp. 2673-2681. https://doi.org/10.1109/78.650093