Browse > Article
http://dx.doi.org/10.9717/kmms.2021.24.8.1020

Facial Landmark Detection by Stacked Hourglass Network with Transposed Convolutional Layer  

Gu, Jungsu (Department of Digital Media, Graduate School, The Catholic University of Korea)
Kang, Ho Chul (Department of Media Technology and Media Contents, The Catholic University of Korea)
Publication Information
Abstract
Facial alignment is very important task for human life. And facial landmark detection is one of the instrumental methods in face alignment. We introduce the stacked hourglass networks with transposed convolutional layers for facial landmark detection. our method substitutes nearest neighbor upsampling for transposed convolutional layer. Our method returns better accuracy in facial landmark detection compared to stacked hourglass networks with nearest neighbor upsampling.
Keywords
Facial Landmark Detection; Face Alignment; Stacked Hourglass Network; Transposed Convolutional Layer; Computer Vision; CNN; Heatmap Based Detection; Bodypart detection;
Citations & Related Records
연도 인용수 순위
  • Reference
1 A. Bulat and G. Tzimiropoulos, "How Far are We from Solving the 2D & 3D Face Alignment problem? (and a Dataset of 230,000 3D Facial Landmarks)," Proceedings of Electrical and Electronics Engineers International Conference on Computer Vision (ICCV), pp. 1021- 1030, 2017.
2 I.J. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair et al., "Generative Adversarial Nets," Proceedings of the 27th International Conference on Neural Information Processing Systems (NIPS'14), Vol. 2, pp. 2672-2680, 2014.
3 P.N. Belhumeur, D.W. Jacobs, D.J. Kriegman, and N. Kumar, "Localizing Parts of Faces Using a Consensus of Exemplars," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 35, Issue 12, pp. 2930-2940, 2013.   DOI
4 J. Yang, Q. Liu, and K. Zhang, "Stacked Hourglass Network for Robust Facial Landmark Localisation," Proceedings of Electrical and Electronics Engineers Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 79-87, 2017.
5 X. Zhu and D. Ramanan, "Face Detection, Pose Estimation, and Landmark Localization in the Wild," IEEE Conference on Computer Vision and Pattern Recognition, pp. 2879-2886, 2012.
6 V. Le, J. Brandt, Z. Lin, L. Bourdev, and T.S. Huang, "Interactive Facial Feature Localization," European Conference on Computer Vision (ECCV) 2012, Part III, LNCS 7574, pp. 679-692, 2012
7 K Simonyan and A Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," Computational and Biological Learning Society, pp. 1-14, 2015.
8 W. Li and E. Lee, "Online Face Avatar Motion Control Based on Face Tracking," Journal of Korea Multimedia Society. Vol. 12, No. 6, pp. 804-814, 2009.
9 X.L. Huang, C.Z. Kim, and S.H. Choi, "An Automatic Strabismus Screening Method with Corneal Light Reflex Based on Image Processing," Journal of Korea Multimedia Society Vol. 24, No. 5, pp. 642-650, 2021.   DOI
10 J.N. Song, H.I. Kim, and Y.M. Ro. "Fast and Robust Face Detection Based on CNN in Wild Environment," Journal of Korea Multimedia Society Vol. 19. No. 8, pp. 1310-1319, 2016   DOI
11 C. Sagonas, G. Tzimiropoulos, S. Zafeiriou, and M. Pantic. "300 Faces in-the-Wild Challenge: The First Facial Landmark Localization Challenge," IEEE International Conference on Computer Vision Workshops, pp. 397-403, 2013.
12 I. Masi, Y. Wu, T. Hassner, and P, Natarajan, "Deep Face Recognition: A Survey," 31st SIBGRAPI Conference on Graphics, Patterns and Images, pp. 471-478, 2018.
13 A. Newell, K. Yang, and J. Deng, "Stacked Hourglass Networks for Human Pose Estimation," European Conference on Computer Vision (ECCV) 2016, Part VIII, LNCS 9912, pp. 483-499, 2016.
14 O. Ronneberger, P. Fischer, and T. Brox, "U-Net: Convolutional Networks for Biomedical Image Segmentation," International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer, Cham, pp. 234-241, 2015.
15 K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," Proceedings of the Institute of Electrical and Electronics Engineers(IEEE) Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.