Browse > Article
http://dx.doi.org/10.12673/jant.2020.24.5.430

Implementation of Face Recognition Pipeline Model using Caffe  

Park, Jin-Hwan (Department of Energy IT, Gachon University)
Kim, Chang-Bok (Department of Energy IT, Gachon University)
Abstract
The proposed model implements a model that improves the face prediction rate and recognition rate through learning with an artificial neural network using face detection, landmark and face recognition algorithms. After landmarking in the face images of a specific person, the proposed model use the previously learned Caffe model to extract face detection and embedding vector 128D. The learning is learned by building machine learning algorithms such as support vector machine (SVM) and deep neural network (DNN). Face recognition is tested with a face image different from the learned figure using the learned model. As a result of the experiment, the result of learning with DNN rather than SVM showed better prediction rate and recognition rate. However, when the hidden layer of DNN is increased, the prediction rate increases but the recognition rate decreases. This is judged as overfitting caused by a small number of objects to be recognized. As a result of learning by adding a clear face image to the proposed model, it is confirmed that the result of high prediction rate and recognition rate can be obtained. This research will be able to obtain better recognition and prediction rates through effective deep learning establishment by utilizing more face image data.
Keywords
Face detection; Face alignment; Embedding vector; Face recognition; Caffe;
Citations & Related Records
연도 인용수 순위
  • Reference
1 L. Yann, Y. Bengio, and G. Hinton, "Deep learning," Nature, Vol. 521, No. 7553, pp. 436-444, 2015.   DOI
2 V. Paul, and M. Jones, "Rapid object detection using a boosted cascade of simple features," in Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Kauai: Hi, Vol. 1, pp. 511-58, 2001.
3 S. S. Farface, M. Saberian, and L. J. Li, "Multi-view face detection using deep convolutional neural networks," in Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, Seattle: WA, pp.643-650, 2015.
4 H. Li, Z. Lin, X. Shen, J. Brandt, and G. Hua, "A convolutional neural network cascade for face detection," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston: MA, pp. 5325-5334, 2015.
5 R. Joseph and A. Farhadi, "Yolo9000: better, faster, stronger," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu: HI, pp. 7263-7271, 2017.
6 S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: towards real-time object detection with region proposal networks," Advances in Neural Information Processing Systems, Montreal: Canada, pp. 91-99, 2015.
7 K. Zhang, Z. Zhang, Z. Li, and Y. Qiao, "Joint face detection and alignment using multitask cascaded convolutional networks," IEEE Signal Processing Letters, Vol. 23, pp. 10, 1499-1503, 2016.   DOI
8 Y. Sun, X. Wang, and X. Tang, "Deep convolutional network cascade for facial point detection," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland : OR, pp. 3476-3483, 2013.
9 J. Zhang, S. Shan, M. Kan, and X. Chen, "Coarse-to-fine Auto-encoder networks (CFAN) for real-time face alignment," European Conference on Computer Vision, Zurich: Switzerland, pp.1-16, 2014.
10 Y. Taigman, M. Yang, M. Ranzato, L. Wolf, "Deepface: closing the gap to human-level performance in face verification," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus : Ohio, pp. 1701-1708, 2014.
11 Y. Sun, X. Wang, X. Tang, "Deep learning face representation from predicting 10,000 classes," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus: Ohio, pp. 1891-1898, 2014.
12 Y. Sun, X. Wang, X. Tang, "Deep learning face representation by joint identification-verification," Advances in Neural Information Processing Systems, Montreal: Canada, pp. 1988-1996, 2014.
13 E. A. Zanaty, S. H. Aljahdali, and R. J. Cripps, "Accurate support vector machines for data classification," International Journal of Rapid Manufacturing, Vol. 1, No. 2, pp. 114-127, 2009.   DOI
14 Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell, "Caffe: convolutional architecture for fast feature embedding," in Proceedings of the 22nd ACM International Conference on Multimedia, Orlando: Florida, pp. 675-678, 2014.