Browse > Article
http://dx.doi.org/10.9717/kmms.2016.19.8.1310

Fast and Robust Face Detection based on CNN in Wild Environment  

Song, Junam (School of Electrical Engineering, KAIST)
Kim, Hyung-Il (School of Electrical Engineering, KAIST)
Ro, Yong Man (School of Electrical Engineering, KAIST)
Publication Information
Abstract
Face detection is the first step in a wide range of face applications. However, detecting faces in the wild is still a challenging task due to the wide range of variations in pose, scale, and occlusions. Recently, many deep learning methods have been proposed for face detection. However, further improvements are required in the wild. Another important issue to be considered in the face detection is the computational complexity. Current state-of-the-art deep learning methods require a large number of patches to deal with varying scales and the arbitrary image sizes, which result in an increased computational complexity. To reduce the complexity while achieving better detection accuracy, we propose a fully convolutional network-based face detection that can take arbitrarily-sized input and produce feature maps (heat maps) corresponding to the input image size. To deal with the various face scales, a multi-scale network architecture that utilizes the facial components when learning the feature maps is proposed. On top of it, we design multi-task learning technique to improve detection performance. Extensive experiments have been conducted on the FDDB dataset. The experimental results show that the proposed method outperforms state-of-the-art methods with the accuracy of 82.33% at 517 false alarms, while improving computational efficiency significantly.
Keywords
Face Detection; Deep Learning; Fully Convolutional Network; Heat Map; Facial Component; Face Bound Regression;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 P.F. Felzenszwalb, R.B. Girshick, D. Mc Allester, and D. Ramanan "Object Detection with Discriminatively Trained Part Based Models," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 32, No. 9, pp. 1627-1645, 2010.   DOI
2 J. Yan, Z. Lei, L.Wen, and S.Z. Li, "The Fastest Deformable Part Model for Object Detection," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 2497-2504, 2014.
3 J.T. Lee, H. Kang, and K.-T. Lim. "Moving Shadow Detection using Deep Learning and Markov Random Field," Journal of Korea Multimedia Society, Vol. 18, No. 12, pp. 1432-1438, 2015.   DOI
4 C.H. Lampert, M.B. Blaschko, and T. Hofmann. "Beyond Sliding Windows: Object Localization by Efficient Subwindow Search," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-8, 2008.
5 V. Jain, Vidit, and E. Learned-Miller, FDDB: A Benchmark for Face Detection in Unconstrained Settings, University of Massachusetts, Technical Report, UM-CS-2010-009, 2010.
6 H. Li, Z. Lin, X. Shen, J. Brandt, and G. Hua, "A Convolutional Neural Network Cascade for Face Detection," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 5325-5334, 2015.
7 A. Krizhevsky, I. Sutskever, and G.E. Hinton, "Imagenet Classification with Deep Convolutional Neural Networks," Proceeding of Advances in Neural Information Processing Systems, pp. 1097-1105, 2012.
8 Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, et. al, "Caffe: Convolutional Architecture for Fast Feature Embedding," Proceeding of ACM International Conference on Multimedia, pp. 675-678, 2014.
9 P. Viola and M.J. Jones, "Robust Real-time Face Detection," International Journal of Computer Vision, Vol. 57, No. 2, pp. 137-154, 2004.   DOI
10 M.K. Celebi, M.E. Celebi, and B. Smolka, Advances in Face Detection and Facial Image Analysis, Springer International Publishing, Switzerland, 2016.
11 S.H. Lee, J.I. Moon, H.-I. Kim, and Y.M. Ro. “Face Detection Using Multi-level Features for Privacy Protection in Large-scale Surveillance Video," Journal of Korea Multimedia Society, Vol. 18, No. 11, pp. 1268-1280, 2015.   DOI
12 R. Ranjan, V.M. Patel, and R. Chellappa, "A Deep Pyramid Deformable Part Model for Face Detection," Proceeding of IEEE Conference on Biometrics Theory, Applications and Systems, pp. 1-8, 2015.
13 Z. Liu, P. Luo, X. Wang, and X. Tang. "Deep Learning Face Attributes in the Wild," Proceeding of IEEE International Conference on Computer Vision, pp. 3730-3738, 2015.
14 D. Wang, J. Ynag, and Q. Liu, "Hierarchical Convolutional Neural Network for Face Detection," Proceeding of International Conference on Image and Graphics, pp. 373-384, 2015.
15 M. Köstinger, P. Wohlhart, P.M. Roth, and H. Bischof, "Annotated Facial Landmarks in the Wild: A Large-scale, Real-world Database for Facial Landmark Localization," Proceeding of IEEE International Conference on Computer Vision, pp. 2144-2151, 2011.
16 J. Deng, W. Dong, R. Socher, L.J. Li, K. Li, and L. FeiFei, "Imagenet: A Large-scale Hierarchical Image Database," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 248-255, 2009.
17 Z. Zhang, P. Luo, C.C. Loy, and X. Tang, "Facial Landmark Detection by Deep Multitask Learning," Proceeding of European Conference on Computer Vision, pp. 94-108, 2014.
18 N. Markus, M. Frljak, I.S. Pandzic, J. Ahlberg and R. Forchheimer, "Object Detection with Pixel Intensity Comparisons Organized in Decision Trees," ArXiv Preprint ArXiv:1305. 4537, 2014.
19 S. Zhan, Q.Q. Tao, and X.H. Li. "Face Detection Using Representation Learning," Journal of Neurocomputing, Vol. 187, No. C, pp. 19-26, 2015.
20 X. Shen, Z. Lin, J. Brandt, and Y. Wu. "Detecting and Aligning Faces by Image Retrieval," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 3460-3467, 2013.
21 S. Liao, A.K. Jain, and S.Z. Li, "A Fast and Accurate Unconstrained Face Detector," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 38, No. 2, pp. 211-223, 2015.   DOI