{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T02:23:00Z","timestamp":1760235780996,"version":"build-2065373602"},"reference-count":53,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2021,9,15]],"date-time":"2021-09-15T00:00:00Z","timestamp":1631664000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the Longyan University\u2019s Qi Mai Science and Technology Innovation Fund Project of Longyan City","award":["2017SHQM07"],"award-info":[{"award-number":["2017SHQM07"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>With the application of deep convolutional neural networks, the performance of computer vision tasks has been improved to a new level. The construction of a deeper and more complex network allows the face recognition algorithm to obtain a higher accuracy. However, the disadvantages of large computation and storage costs of neural networks limit the further popularization of the algorithm. To solve this problem, we have studied the unified and efficient neural network face recognition algorithm under the condition of a single camera; we propose that the complete face recognition process consists of four tasks: face detection, in vivo detection, keypoint detection, and face verification; combining the key algorithms of these four tasks, we propose a unified network model based on a deep separable convolutional structure\u2014UFaceNet. The model uses multisource data to carry out multitask joint training and uses the keypoint detection results to aid the learning of other tasks. 
It further introduces the attention mechanism through feature-level clipping and alignment to ensure the accuracy of the model, using the shared convolutional layer network among tasks to reduce the model's calculation amount and realize network acceleration. The learning goal of multi-tasking implicitly increases the amount of training data and the variety of data distributions, making it easier to learn features that generalize. The experimental results show that the UFaceNet model is better than other models in terms of calculation amount and number of parameters, with higher efficiency, and has some potential application areas.<\/jats:p>","DOI":"10.3390\/a14090268","type":"journal-article","created":{"date-parts":[[2021,9,15]],"date-time":"2021-09-15T04:50:28Z","timestamp":1631681428000},"page":"268","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["UFaceNet: Research on Multi-Task Face Recognition Algorithm Based on CNN"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5593-1780","authenticated-orcid":false,"given":"Huoyou","family":"Li","sequence":"first","affiliation":[{"name":"School of Mathematics and Information Engineering, Longyan University, Longyan 364012, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jianshiun","family":"Hu","sequence":"additional","affiliation":[{"name":"School of Mathematics and Information Engineering, Longyan University, Longyan 364012, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingwen","family":"Yu","sequence":"additional","affiliation":[{"name":"Information School, Xiamen University, Xiamen 361005, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ning","family":"Yu","sequence":"additional","affiliation":[{"name":"Information School, Xiamen University, Xiamen 361005, 
China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qingqiang","family":"Wu","sequence":"additional","affiliation":[{"name":"Information School, Xiamen University, Xiamen 361005, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,9,15]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Zamir, A.R., Sax, A., Shen, W., Guibas, L., Malik, J., and Savarese, S. (2018, January 18\u201323). Taskonomy: Disentangling Task Transfer Learning. Proceedings of the IEEE\/Cvf Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00391"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1153","DOI":"10.1080\/17474124.2019.1694903","article-title":"An overview of deep learning algorithms and water exchange in colonoscopy in improving adenoma detection","volume":"13","author":"Hsieh","year":"2019","journal-title":"Expert Rev. Gastroenterol. Hepatol."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Viola, P.A., Jones, M.J., and Snow, D. (2003, January 13\u201316). Detecting Pedestrians Using Patterns of Motion and Appearance. Proceedings of the IEEE International Conference on Computer Vision, Nice, France.","DOI":"10.1109\/ICCV.2003.1238422"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Felzenszwalb, P.F., Mcallester, D.A., and Ramanan, D. (2008, January 23\u201328). A discriminatively trained, multiscale, deformable part model. Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA.","DOI":"10.1109\/CVPR.2008.4587597"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Li, H., Lin, Z., Shen, X., Brandt, J., and Hua, G. (2015, January 7\u201312). A convolutional neural network cascade for face detection. 
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299170"},{"key":"ref_6","unstructured":"Huang, L., Yi, Y., Deng, Y., and Yu, Y. (2015). DenseBox: Unifying Landmark Localization with End to End Object Detection. arXiv."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Yang, S., Luo, P., Loy, C.-C., and Tang, X. (2015, January 7\u201313). From Facial Parts Responses to Face Detection: A Deep Learning Approach. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.419"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Jiang, H., and Learned-Miller, E. (June, January 30). Face Detection with the Faster R-CNN. Proceedings of the 2017 12th IEEE International Conference on Automatic Face and Gesture Recognition, Washington, DC, USA.","DOI":"10.1109\/FG.2017.82"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"102573","DOI":"10.1016\/j.dsp.2019.08.003","article-title":"Structure-constrained discriminative dictionary learning based on Schatten p-norm for face recognition","volume":"95","author":"Chang","year":"2019","journal-title":"Digit. Signal Process."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"115948","DOI":"10.1016\/j.image.2020.115948","article-title":"Context prior-based with residual learning for face detection: A deep convolutional encoder-decoder network","volume":"88","author":"Zhou","year":"2020","journal-title":"Signal Process.-Image Commun."},{"key":"ref_11","unstructured":"Kahm, O., and Damer, N. (2012, January 6\u20137). 2D face liveness detection: An overview. 
Proceedings of the International Conference of Biometrics Special Interest Group (BIOSIG), Darmstadt, Germany."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"746","DOI":"10.1109\/TIFS.2015.2400395","article-title":"Face Spoof Detection with Image Distortion Analysis","volume":"10","author":"Di","year":"2015","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1818","DOI":"10.1109\/TIFS.2016.2555286","article-title":"Face Spoofing Detection Using Color Texture Analysis","volume":"11","author":"Boulkenafet","year":"2017","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Tan, X., Yi, L., Liu, J., and Jiang, L. (2010). Face Liveness Detection from a Single Image with Sparse Low Rank Bilinear Discriminative Model. European Conference on Computer Vision, Proceedings of the 11th European Conference on Computer Vision, Heraklion, Crete, Greece, 5\u201311 September 2010, Springer.","DOI":"10.1007\/978-3-642-15567-3_37"},{"key":"ref_15","first-page":"49","article-title":"Face Anti-spoofing via Motion Magnification and Multifeature Videolet Aggregation","volume":"3","author":"Bharadwaj","year":"2016","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"762","DOI":"10.1109\/TIFS.2015.2406533","article-title":"Detection of Face Spoofing Using Visual Dynamics","volume":"10","author":"Tirunagaris","year":"2015","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1049\/iet-bmt.2012.0071","article-title":"Motion-Based Counter-Measures to Photo Attacks in Face Recognition","volume":"3","author":"Anjos","year":"2014","journal-title":"IET Biom."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Xu, Z., Shan, L., and Deng, W. (2015, January 3\u20136). 
Learning temporal features using LSTM-CNN architecture for face anti-spoofing. Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition (ACPR), Kuala Lumpur, Malaysia.","DOI":"10.1109\/ACPR.2015.7486482"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Atoum, Y., Liu, Y., Jourabloo, A., and Liu, X. (2017, January 1\u20134). Face Anti-Spoofing Using Patch and Depth-Based CNNs. Proceedings of the IEEE International Joint Conference on Biometrics, Denver, CO, USA.","DOI":"10.1109\/BTAS.2017.8272713"},{"key":"ref_20","first-page":"182","article-title":"Discriminative Representation Combinations for Accurate Face Spoofing Detection","volume":"85","author":"Song","year":"2018","journal-title":"Pattern Recognit."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Valstar, M., Martinez, B., Binefa, X., and Pantic, M. (2010, January 13\u201318). Facial point detection using boosted regression and graph models. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5539996"},{"key":"ref_22","first-page":"236","article-title":"Statistical Models of Appearance for computer vision","volume":"4322","author":"Cootes","year":"2004","journal-title":"Proc. SPIE\u2014Int. Soc. Opt. Eng."},{"key":"ref_23","first-page":"1078","article-title":"Cascaded pose regression","volume":"238","author":"Dollar","year":"2010","journal-title":"IEEE"},{"key":"ref_24","unstructured":"Dong, C., Ren, S., Wei, Y., Cao, X., and Sun, J. (2014). Joint Cascade Face Detection and Alignment. European Conference on Computer Vision, Proceedings of the 13th European Conference, Zurich, Switzerland, 6\u201312 September 2014, Springer."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Ren, S., Cao, X., Wei, Y., and Sun, J. (2014, January 23\u201328). Face Alignment at 3000 FPS via Regressing Local Binary Features. 
Proceedings of the Computer Vision & Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.218"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Kazemi, V., and Sullivan, J. (2014, January 23\u201328). One Millisecond Face Alignment with an Ensemble of Regression Trees. Proceedings of the IEEE Conference on Computer Vision & Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.241"},{"key":"ref_27","unstructured":"Yi, S., Wang, X., and Tang, X. (2013, January 23\u201328). Deep Convolutional Network Cascade for Facial Point Detection. Proceedings of the IEEE Conference on Computer Vision & Pattern Recognition, Portland, OR, USA."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zhou, E., Fan, H., Cao, Z., Jiang, Y., and Yin, Q. (2013, January 2\u20138). Extensive Facial Landmark Localization with Coarse-to-Fine Convolutional Network Cascade. Proceedings of the IEEE International Conference on Computer Vision Workshops, Sydney, Australia.","DOI":"10.1109\/ICCVW.2013.58"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1499","DOI":"10.1109\/LSP.2016.2603342","article-title":"Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks","volume":"23","author":"Zhang","year":"2016","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Kowalski, M., Naruniec, J., and Trzcinski, T. (2017, January 21\u201326). Deep Alignment Network: A Convolutional Neural Network for Robust Face Alignment. Proceedings of the IEEE Conference on Computer Vision & Pattern Recognition Workshops, Honolulu, HI, USA.","DOI":"10.1109\/CVPRW.2017.254"},{"key":"ref_31","unstructured":"Turk, M.A., and Pentland, A.P. (2011, January 3\u20136). Face recognition using eigenfaces. 
Proceedings of the International Conference on Computer Research & Development, Maui, HI, USA."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1016\/0262-8856(94)90007-8","article-title":"HMM-based architecture for face identification","volume":"12","author":"Samaria","year":"1994","journal-title":"Image Vis. Comput."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1162\/neco.1989.1.4.541","article-title":"Backpropagation Applied to Handwritten Zip Code Recognition","volume":"1","author":"Lecun","year":"2014","journal-title":"Neural Comput."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Taigman, Y., Ming, Y., Ranzato, M.A., and Wolf, L. (2014, January 23\u201328). DeepFace: Closing the Gap to Human-Level Performance in Face Verification. Proceedings of the IEEE Conference on Computer Vision & Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.220"},{"key":"ref_35","unstructured":"Yi, S., Wang, X., and Tang, X. (2015, January 7\u201312). Deeply learned face representations are sparse, selective, and robust. Proceedings of the Computer Vision & Pattern Recognition, Boston, MA, USA."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Schroff, F., Kalenichenko, D., and Philbin, J. (2015, January 7\u201312). FaceNet: A Unified Embedding for Face Recognition and Clustering. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298682"},{"key":"ref_37","first-page":"120","article-title":"A Light CNN for Deep Face Representation with Noisy Labels","volume":"99","author":"Xiang","year":"2015","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_38","first-page":"20","article-title":"Mask R-CNN","volume":"99","author":"He","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. 
Intell."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Liu, Z., Ping, L., Wang, X., and Tang, X. (2014). Deep Learning Face Attributes in the Wild. arXiv.","DOI":"10.1109\/ICCV.2015.425"},{"key":"ref_40","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv."},{"key":"ref_41","unstructured":"Ioffe, S., and Szegedy, C. (2015). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. arXiv."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Yang, S., Ping, L., Loy, C.C., and Tang, X. (2016, January 27\u201330). WIDER FACE: A Face Detection Benchmark. Proceedings of the IEEE Conference on Computer Vision & Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.596"},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/j.imavis.2016.01.002","article-title":"300 Faces In-The-Wild Challenge: Database and results","volume":"47","author":"Sagonas","year":"2016","journal-title":"Image Vis. Comput."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Sagonas, C., Tzimiropoulos, G., Zafeiriou, S., and Pantic, M. (2013, January 2\u20138). 300 Faces in-the-Wild Challenge: The First Facial Landmark Localization Challenge. Proceedings of the IEEE International Conference on Computer Vision Workshops, Sydney, Australia.","DOI":"10.1109\/ICCVW.2013.59"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Guo, Y., Lei, Z., Hu, Y., He, X., and Gao, J. (2016). MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition. 
arXiv.","DOI":"10.1007\/978-3-319-46487-9_6"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"2930","DOI":"10.1109\/TPAMI.2013.23","article-title":"Localizing parts of faces using a consensus of exemplars","volume":"35","author":"Belhumeur","year":"2013","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_47","unstructured":"Dong, Y., Zhen, L., Liao, S., and Li, S.Z. (2014). Learning Face Representation from Scratch. arXiv."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Trigerrgis, G., Snape, P., Nicolaou, M.A., Antonakos, E., and Zafeiriou, S. (2016, January 27\u201330). Mnemonic Descent Method: A Recurrent Process Applied for End-to-End Face Alignment. Proceedings of the IEEE Conference on Computer Vision & Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.453"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"1567","DOI":"10.1109\/LSP.2016.2608139","article-title":"Face Alignment Using K-Cluster Regression Forests with Weighted Splitting","volume":"23","author":"Kowalski","year":"2016","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Lee, D., Park, H., and Chang, D.Y. (2015, January 7\u201312). Face alignment using cascade Gaussian process regression trees. Proceedings of the Computer Vision & Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299048"},{"key":"ref_51","unstructured":"Cheng, L. (2015, January 7\u201312). Face Alignment by Coarse-to-Fine Shape Searching. Proceedings of the Computer Vision & Pattern Recognition, Boston, MA, USA."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Xiong, X., and Torre, F.D.L. (2013, January 23\u201328). Supervised Descent Method and Its Applications to Face Alignment. 
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA.","DOI":"10.1109\/CVPR.2013.75"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Leibe, B., Matas, J., Sebe, N., and Welling, M. (2016). Robust Facial Landmark Detection via Recurrent Attentive-Refinement Networks. Computer Vision\u2014ECCV 2016, Springer. Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-319-46478-7"}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/14\/9\/268\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:02:59Z","timestamp":1760166179000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/14\/9\/268"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,15]]},"references-count":53,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2021,9]]}},"alternative-id":["a14090268"],"URL":"https:\/\/doi.org\/10.3390\/a14090268","relation":{},"ISSN":["1999-4893"],"issn-type":[{"type":"electronic","value":"1999-4893"}],"subject":[],"published":{"date-parts":[[2021,9,15]]}}}