DOI QR코드

DOI QR Code

구면 파노라마 영상에서의 딥러닝 기반 객체 인식

Deep Learning Based Object Recognition in Spherical Panoramic Image

  • 정민석 (인천대학교 컴퓨터공학부) ;
  • 박종승 (인천대학교 컴퓨터공학부)
  • Jung, Minsuk (Dept. of Computer Science & Engineering, Incheon National University) ;
  • Park, Jong-Seung (Dept. of Computer Science & Engineering, Incheon National University)
  • 투고 : 2018.09.11
  • 심사 : 2018.10.19
  • 발행 : 2018.10.20

초록

영상 인식 기술은 평면 영상에 대해서 많이 연구되고 그 성능 또한 발전하고 있다. 그러나 평면 영상이 아닌 구면 파노라마 영상과 다양한 환경에서 주어지는 특수한 형태의 영상에 대한 인식은 평면과 다르게 기하학적인 왜곡으로 인해서 많은 어려움이 따른다. 본 논문에서는 평면 영상의 인식 기술에서 최근 각광받는 훈련을 통한 신경망 인식 기법이 구면 파노라마 영상의 인식에서도 쓰일 수 있음을 보인다. 또한 구면 영상에 대한 기존 신경망 모델의 인식률을 높이기 위해서 큐브맵 변환을 활용하는 방법을 제시한다.

A lot of research has been done on image recognition technique for planar images and the performance has also been improved. However, it is difficult to recognize objects in spherical panoramic images or images in special form which are given in various environments because of the spherical distortion given in different form from the planar case. In this paper, we show that the neural network recognition approach can be used for object recognition in spherical image and suggest a method of using cubemap transform in order to increase recognition accuracy in spherical image.

키워드

참고문헌

  1. Bo-sung Kim, Jong-Seung Park, "Matching between spherical panorama and planar image", Journal of the Korean Institute of Information Scientists and Engineers, Vol. 42, No.10, pp. 1322-1324, 2014.
  2. Javier Cruz-Mota, et al., "Scale invariant feature transform on the sphere: theory and applications", International Journal of Computer Vision, Vol. 98, No. 2, pp. 217-241, 2012. https://doi.org/10.1007/s11263-011-0505-4
  3. Jeong-Hyeon Park, Jong-Seung Park, "Planar texture replacement in spherical images using cubemap", Journal of Korea Game Society, Vol. 17, No. 6, pp. 153-164, 2017. https://doi.org/10.7583/JKGS.2017.17.6.153
  4. Christian Szegedy, et al., "Rethinking the inception architecture for computer vision", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2818-2826, 2016.
  5. Martin Abadi, et al., "TensorFlow: a system for large-scale machine learning", 12th USENIX Symposium on Operating Systems Design and Implementation, Vol. 16, pp. 265-283, 2016.
  6. Christian, Szegedy, et al. "Going deeper with convolutions", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-9, 2015.
  7. Christian Szegedy, et al., "Inception-v4, Inception-ResNet and the impact of residual connections on learning", AAAI, Vol. 4, pp. 4278-4284, 2017.
  8. Ross Girshick, et al., "Region-based convolutional networks for accurate object detection and segmentation", IEEE transactions on pattern analysis and machine intelligence, Vol. 38, No. 1, pp. 142-158, 2016. https://doi.org/10.1109/TPAMI.2015.2437384
  9. Ross Girshick, "Fast R-CNN", Proceedings of the IEEE International Conference on Computer Vision, pp. 1440-1448, 2015.