{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T04:55:38Z","timestamp":1769835338660,"version":"3.49.0"},"reference-count":42,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2018,11,1]],"date-time":"2018-11-01T00:00:00Z","timestamp":1541030400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>Race recognition (RR), which has many applications such as in surveillance systems, image\/video understanding, analysis, etc., is a difficult problem to solve completely. To contribute towards solving that problem, this article investigates using a deep learning model. An efficient Race Recognition Framework (RRF) is proposed that includes information collector (IC), face detection and preprocessing (FD&amp;P), and RR modules. For the RR module, this study proposes two independent models. The first model is RR using a deep convolutional neural network (CNN) (the RR-CNN model). The second model (the RR-VGG model) is a fine-tuning model for RR based on VGG, the famous trained model for object recognition. In order to examine the performance of our proposed framework, we perform an experiment on our dataset named VNFaces, composed specifically of images collected from Facebook pages of Vietnamese people, to compare the accuracy between RR-CNN and RR-VGG. The experimental results show that for the VNFaces dataset, the RR-VGG model with augmented input images yields the best accuracy at 88.87% while RR-CNN, an independent and lightweight model, yields 88.64% accuracy. The extension experiments conducted prove that our proposed models could be applied to other race dataset problems such as Japanese, Chinese, or Brazilian with over 90% accuracy; the fine-tuning RR-VGG model achieved the best accuracy and is recommended for most scenarios.<\/jats:p>","DOI":"10.3390\/sym10110564","type":"journal-article","created":{"date-parts":[[2018,11,1]],"date-time":"2018-11-01T11:31:47Z","timestamp":1541071907000},"page":"564","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":43,"title":["Race Recognition Using Deep Convolutional Neural Networks"],"prefix":"10.3390","volume":"10","author":[{"given":"Thanh","family":"Vo","sequence":"first","affiliation":[{"name":"Advanced Program in Computer Science, University of Science, VNU HCMC, Ho Chi Minh 700000, Vietnam"}]},{"given":"Trang","family":"Nguyen","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology, Ho Chi Minh City Open University, Ho Chi Minh 700000, Vietnam"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9354-3335","authenticated-orcid":false,"given":"C. T.","family":"Le","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology, Ho Chi Minh City University of Technology (HUTECH), Ho Chi Minh 700000, Vietnam"}]}],"member":"1968","published-online":{"date-parts":[[2018,11,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"973","DOI":"10.1007\/s00521-015-1913-0","article-title":"Face recognition using a permutation coding neural classifier","volume":"27","author":"Baidyk","year":"2016","journal-title":"Neural Comput. Appl."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1016\/j.eswa.2017.07.051","article-title":"SVAS: Surveillance Video Analysis System","volume":"89","author":"Kardas","year":"2017","journal-title":"Expert Syst. Appl."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1016\/j.compind.2017.04.005","article-title":"Respiration-based emotion recognition with deep learning","volume":"92\u201393","author":"Zhang","year":"2017","journal-title":"Comput. Ind."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"683","DOI":"10.1109\/TCSVT.2016.2589859","article-title":"Toward abnormal trajectory and event detection in video surveillance","volume":"27","author":"Cosar","year":"2017","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ahmed, E., Jones, M.J., and Marks, T.K. (2015, January 7\u201312). An improved deep learning architecture for person re-identification. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299016"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"2483","DOI":"10.1109\/TPAMI.2014.2321570","article-title":"Learning Race from face: A survey","volume":"36","author":"Fu","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1007\/s11257-016-9171-0","article-title":"Computational personality recognition in social media","volume":"26","author":"Farnadi","year":"2016","journal-title":"User Model. User-Adapt. Interact."},{"key":"ref_8","unstructured":"Nguyen, D.T., Joty, S.R., Imran, M., Sajjad, H., and Mitra, P. (2016). Applications of online deep learning for crisis response using social media information. arXiv, Available online: https:\/\/arxiv.org\/abs\/1610.01030."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1016\/j.eswa.2017.08.001","article-title":"MISNIS: An intelligent platform for twitter topic mining","volume":"89","author":"Carvalhoa","year":"2017","journal-title":"Expert Syst. Appl."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1007\/s00138-016-0817-z","article-title":"Smile detection in the wild with deep convolutional neural networks","volume":"28","author":"Chen","year":"2017","journal-title":"Mach. Vis. Appl."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1016\/j.eswa.2017.07.018","article-title":"Face alignment using a deep neural network with local feature learning and recurrent regression","volume":"89","author":"Parka","year":"2017","journal-title":"Expert Syst. Appl."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1016\/j.engappai.2017.08.010","article-title":"Deep learning to frame objects for visual target tracking","volume":"65","author":"Pang","year":"2017","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1016\/j.eswa.2016.04.032","article-title":"Human activity recognition with smartphone sensors using deep learning neural networks","volume":"59","author":"Ronao","year":"2016","journal-title":"Expert Syst. Appl."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1016\/j.engappai.2016.01.032","article-title":"Deep neural network for halftone image classification based on sparse auto-encoder","volume":"50","author":"Zhang","year":"2016","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1109\/MIS.2017.23","article-title":"Deep learning-based document modeling for personality detection from text","volume":"32","author":"Majumder","year":"2017","journal-title":"IEEE Intell. Syst."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1016\/j.knosys.2016.06.009","article-title":"Aspect extraction for opinion mining with a deep convolutional neural network","volume":"108","author":"Poria","year":"2016","journal-title":"Knowl. Based Syst."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1447","DOI":"10.1007\/s13042-016-0515-1","article-title":"Recognition of college students from Weibo with deep neural networks","volume":"8","author":"Yu","year":"2017","journal-title":"Int. J. Mach. Learn. Cybern."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1016\/j.knosys.2016.10.008","article-title":"Deep neural network framework and transformed MFCCs for speaker\u2019s age and gender classification","volume":"115","author":"Qawaqneh","year":"2017","journal-title":"Knowl. Based Syst."},{"key":"ref_20","unstructured":"Simonyan, K., and Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. arXiv, Available online: https:\/\/arxiv.org\/abs\/1409.1556."},{"key":"ref_21","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). ImageNet classification with deep convolutional neural networks. Proceedings of the 25th International Conference on Neural Information Processing Systems, Lake Tahoe, NV, USA."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-Fei, L. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1017","DOI":"10.1142\/S0218001407005818","article-title":"Performance evaluation of face recognition algorithms on Korean face database","volume":"21","author":"Roh","year":"2007","journal-title":"Int. J. Pattern Recognit. Artif. Intell."},{"key":"ref_24","unstructured":"Bastanfard, A., Nik, M.A., and Dehshibi, M.M. (2007, January 28\u201329). Iranian face database with age, pose and expression. Proceedings of the 2007 International Conference on Machine Vision, Islamabad, Pakistan."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1109\/TSMCA.2007.909557","article-title":"The CAS-PEAL large-scale Chinese face database and baseline evaluations","volume":"38","author":"Gao","year":"2008","journal-title":"IEEE Trans. Syst. Man Cybern. Part A Syst. Hum."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1901","DOI":"10.1109\/TPAMI.2015.2491929","article-title":"HCP: A flexible CNN framework for multi-label image classification","volume":"38","author":"Wei","year":"2016","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Li, H., Lin, Z., Shen, X., Brandt, J., and Hua, G. (2015, January 7\u201312). A convolutional neural network cascade for face detection. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299170"},{"key":"ref_28","unstructured":"He, K., Wang, Y., and Hopcroft, J.E. (2016, January 5\u201311). A powerful generative model using random weights for the deep image representation. Proceedings of the 30th Conference on Neural Information Processing Systems (NIPS), Barcelona, Spain."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Yang, W., Ouyang, W., Li, H., and Wang, X. (2016, January 27\u201330). End-to-end learning of deformable mixture of parts and deep convolutional neural networks for human pose estimation. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.335"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"3919","DOI":"10.1109\/TIP.2016.2579306","article-title":"DeepSaliency: Multi-task deep neural network model for salient object detection","volume":"25","author":"Li","year":"2016","journal-title":"IEEE Trans. Image Process."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Carreira, J., Agrawal, P., Fragkiadaki, K., and Malik, J. (2016, January 27\u201330). Human pose estimation with iterative error feedback. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.512"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1063","DOI":"10.1109\/TMM.2016.2638622","article-title":"Words matter: Scene text for image classification and retrieval","volume":"19","author":"Karaoglu","year":"2017","journal-title":"IEEE Trans. Multimed."},{"key":"ref_33","first-page":"388","article-title":"Deep feature transfer learning in combination with traditional features predicts survival among patients with lung adenocarcinoma","volume":"2","author":"Paul","year":"2016","journal-title":"Tomogr. J. Imaging Res."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"1285","DOI":"10.1109\/TMI.2016.2528162","article-title":"Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning","volume":"35","author":"Roth","year":"2016","journal-title":"IEEE Trans. Med. Imaging"},{"key":"ref_36","unstructured":"Sze, V., Chen, Y.H., Yang, T.J., and Emer, J.S. (2017). Efficient Processing of Deep Neural Networks: A Tutorial and Survey. arXiv, Available online: https:\/\/arxiv.org\/abs\/1703.09039."},{"key":"ref_37","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A Method for Stochastic Optimization. arXiv, Available online: https:\/\/arxiv.org\/abs\/1412.6980."},{"key":"ref_38","unstructured":"Viola, P., and Jones, M. (2001, January 8\u201314). Rapid object detection using a boosted cascade of simple features. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Kauai, HI, USA."},{"key":"ref_39","unstructured":"Lyons, M., Akamatsu, S., Kamachi, M., and Gyoba, J. (1998, January 14\u201316). Coding facial expressions with Gabor wavelets. Proceedings of the Third IEEE International Conference on Automatic Face and Gesture Recognition, Nara, Japan."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"1955","DOI":"10.1109\/TPAMI.2008.222","article-title":"Face photo-sketch synthesis and recognition","volume":"31","author":"Wang","year":"2009","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"902","DOI":"10.1016\/j.imavis.2009.11.005","article-title":"A new ranking method for principal components analysis and its application to face image analysis","volume":"28","author":"Thomaz","year":"2010","journal-title":"Image Vis. Comput."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"1263","DOI":"10.1109\/TKDE.2008.239","article-title":"Learning from imbalanced data","volume":"21","author":"He","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/10\/11\/564\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:27:22Z","timestamp":1760196442000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/10\/11\/564"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,11,1]]},"references-count":42,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2018,11]]}},"alternative-id":["sym10110564"],"URL":"https:\/\/doi.org\/10.3390\/sym10110564","relation":{},"ISSN":["2073-8994"],"issn-type":[{"value":"2073-8994","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,11,1]]}}}