Browse > Article
http://dx.doi.org/10.9708/jksci.2020.25.07.047

Multi-scale face detector using anchor free method  

Lee, Dong-Ryeol (School of Computer Science and Engineering, Kangwon National University)
Kim, Yoon (Dept. of Computer Science and Engineering, Kangwon National University)
Abstract
In this paper, we propose one stage multi-scale face detector based Fully Convolution Network using anchor free method. Recently almost all state-of-the-art face detectors which predict location of faces using anchor-based methods rely on pre-defined anchor boxes. However this face detectors need to hyper-parameters and additional computation in training. The key idea of the proposed method is to eliminate hyper-parameters and additional computation using anchor free method. To do this, we apply two ideas. First, by eliminating the pre-defined set of anchor boxes, we avoid the additional computation and hyper-parameters related to anchor boxes. Second, our detector predicts location of faces using multi-feature maps to reduce foreground/background imbalance issue. Through Quantitative evaluation, the performance of the proposed method is evaluated and analyzed. Experimental results on the FDDB dataset demonstrate the effective of our proposed method.
Keywords
Face detection; Anchor free method; Multi-scale detection; Deep learning; Feature pyramid learning;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Tian, Z., Shen, C., Chen, H., & He, T. "Fcos: Fully convolutional one-stage object detection." In Proceedings of the IEEE International Conference on Computer Vision (pp. 9627-9636). 2019
2 Wang, C., Luo, Z., Lian, S., & Li, S. "Anchor Free Network for Multi-Scale Face Detection." In 2018 24th International Conference on Pattern Recognition (ICPR) (pp. 1554-1559). IEEE. August 2018
3 L. Huang, Y. Yang, Y. Deng, and Y. Yu, "Densebox: Unifying landmark localization with end to end object detection," arXiv preprint arXiv:1509.04874, 2015.
4 J. Yu, Y. Jiang, Z. Wang, Z. Cao, and T. Huang, "Unitbox: An advanced object detection network," in Proc. ACM Multimedia, 2016, pp. 516-520.
5 Xu, Y., Yan, W., Sun, H., Yang, G., & Luo, J. "CenterFace: Joint Face Detection and Alignment Using Face as Point." arXiv preprint arXiv:1911.03599. 2019
6 Yotam Abramson, Bruno Steux, and Hicham Ghorayeb. "Yet even faster (yef) real-time object detection." International Journal of Intelligent Systems Technologies and Applications, 2(2-3):102-112, 2007. 2   DOI
7 P. Hu and D. Ramanan, "Finding tiny faces," in Proc. CVPR, 2017, pp.1522-1530.
8 Zhang, Z., Shen, W., Qiao, S., Wang, Y., Wang, B., & Yuille, A. "Robust face detection via learning small faces on hard images". In The IEEE Winter Conference on Applications of Computer Vision (pp. 1361-1370). 2020
9 Yu Chen, Chunhua Shen, Xiu-Shen Wei, Lingqiao Liu, and Jian Yang. Adversarial "PoseNet: A structure-aware convolutional network for human pose estimation." In Proc. IEEE Int. Conf. Comp. Vis., 2017.
10 Sun, K., Xiao, B., Liu, D., & Wang, J. "Deep high-resolution representation learning for human pose estimation." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 5693-5703). 2019
11 K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556, 2014.
12 Lokesh Boominathan, Srinivas SS Kruthiventi, and R Venkatesh Babu. "Crowdnet: A deep convolutional network for dense crowd counting." In Proc. ACM Int. Conf. Multimedia, pages 640-644. ACM, 2016.
13 Sam, D. B., Peri, S. V., Kamath, A., & Babu, R. V. "Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection." arXiv preprint arXiv:1906.07538. 2019
14 Sindagi, V. A., & Patel, V. M. "Inverse attention guided deep crowd counting network." In 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) (pp. 1-8). IEEE. September 2019.
15 Jonathan Long, Evan Shelhamer, and Trevor Darrell. "Fully convolutional networks for semantic segmentation." In Proc. IEEE Conf. Comp. Vis. Patt. Recogn., pages 3431-3440, 2015
16 Zhang, Kaipeng, et al. "Joint face detection and alignment using multitask cascaded convolutional networks." IEEE Signal Processing Letters 23.10 (2016): 1499-1503.   DOI
17 Huang, Gary B., et al. "Labeled faces in the wild: A database forstudying face recognition in unconstrained environments." 2008.
18 Jain, Vidit, and Erik Learned-Miller. "Fddb: A benchmark for face detection" in unconstrained settings. Vol. 2. No. 6. UMass Amherst technical report, 2010.
19 B. Alexe, T. Deselaers, V. Ferrari, Measuring the objectness of image windows, in: Proceedings of the TPAMI, 2012.
20 J.R. Uijlings, K.E. Van De Sande, T. Gevers, A.W. Smeulders, Selective search for object recognition, in: Proceedings of the IJCV, 2013.
21 Liu, Li, et al. "Deep learning for generic object detection: A survey." International journal of computer vision 128.2 (2020): 261-318.   DOI
22 Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. "You only look once: Unified, real-time object detection." In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 779-788). 2016
23 J. Li, T. Wang, and Y. Zhang, "Face detection using surf cascade," in Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on, pp. 2183-2190, IEEE, 2011.
24 K.-K. Sung and T. Poggio. "Example-based learning for viewbased human face detection." IEEE Transactions on pattern analysis and machine intelligence, 20(1):39-51, 1998.   DOI
25 P. Viola and M. J. Jones. "Robust real-time face detection." International journal of computer vision, 57(2):137-154,2004   DOI
26 X. Zhu and D. Ramanan. "Face detection, pose estimation,and landmark localization in the wild." In Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on,pages 2879-2886. IEEE, 2012
27 Ren, S., He, K., Girshick, R., & Sun, J. "Faster r-cnn: Towards real-time object detection with region proposal networks." In Advances in neural information processing systems (pp. 91-99). 2015
28 Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y., & Berg, A. C. (2016, October). "Ssd: Single shot multibox detector." In European conference on computer vision (pp. 21-37). Springer, Cham.
29 Lin, T. Y., Dollar, P., Girshick, R., He, K., Hariharan, B., & Belongie, S. "Feature pyramid networks for object detection." In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2117-2125). 2017
30 Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollar. "Focal loss for dense object detection." In Proc. IEEE Conf. Comp. Vis. Patt. Recogn., pages 2980-2988,2017.
31 He, K., Gkioxari, G., Dollar, P., & Girshick, R. "Mask r-cnn." In Proceedings of the IEEE international conference on computer vision (pp. 2961-2969). 2017
32 Tang, X., Du, D. K., He, Z., & Liu, J. "Pyramidbox: A context-assisted single shot face detector." In Proceedings of the European Conference on Computer Vision (ECCV) (pp. 797-813). 2018
33 Zhang, Shifeng, et al. "Refineface: Refinement neural network for high performance face detection." IEEE Transactions on Pattern Analysis and Machine Intelligence (2020).
34 S. Yang, P. Luo, C.-C. Loy, and X. Tang. "Wider face: A face detection benchmark." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 5525-5533, 2016.
35 H. Jiang and E. Learned-Miller. "Face detection with the faster r-cnn." In Automatic Face & Gesture Recognition (FG 2017), 2017 12th IEEE International Conference on, pages 650-657. IEEE, 2017.
36 Wang, H., Li, Z., Ji, X., & Wang, Y. "Face r-cnn." arXiv preprint arXiv:1706.01061. 2017
37 Yang, W., & Jiachun, Z. "Real-time face detection based on YOLO." In 2018 1st IEEE International Conference on Knowledge Innovation and Invention (ICKII) (pp. 221-224). IEEE. September 2018
38 Deng, J., Guo, J., Zhou, Y., Yu, J., Kotsia, I., & Zafeiriou, S. "Retinaface: Single-stage dense face localisation in the wild." arXiv preprint arXiv:1905.00641. 2019
39 Cakiroglu, O., Ozer, C., & Gunsel, B. "Design of a Deep Face Detector by Mask R-CNN." In 2019 27th Signal Processing and Communications Applications Conference (SIU) (pp. 1-4). IEEE. April 2019
40 Li, J., Wang, Y., Wang, C., Tai, Y., Qian, J., Yang, J., ... & Huang, F. "DSFD: dual shot face detector." In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 5060-5069) 2019
41 Zhang, S., Zhu, X., Lei, Z., Shi, H., Wang, X., & Li, S. Z. "S3fd: Single shot scale-invariant face detector." In Proceedings of the IEEE International Conference on Computer Vision (pp. 192-201). 2017
42 Najibi, M., Samangouei, P., Chellappa, R., & Davis, L. S. "Ssh: Single stage headless face detector." In Proceedings of the IEEE International Conference on Computer Vision (pp. 4875-4884). 2017