{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,17]],"date-time":"2026-06-17T20:46:47Z","timestamp":1781729207710,"version":"3.54.5"},"reference-count":28,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2023,7,27]],"date-time":"2023-07-27T00:00:00Z","timestamp":1690416000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Hand gestures are an essential part of human-to-human communication and interaction and, therefore, of technical applications. The aim is increasingly to achieve interaction between humans and computers that is as natural as possible, for example, by means of natural language or hand gestures. In the context of human-machine interaction research, these methods are consequently being explored more and more. However, the realization of natural communication between humans and computers is a major challenge. In the field of hand gesture recognition, research approaches are being pursued that use additional hardware, such as special gloves, to classify gestures with high accuracy. Recently, deep learning techniques using artificial neural networks have been increasingly proposed for the problem of gesture recognition without using such tools. In this context, we explore the approach of convolutional neural network (CNN) in detail for the task of hand gesture recognition. CNN is a deep neural network that can be used in the fields of visual object processing and classification. The goal of this work is to recognize ten types of static hand gestures in front of complex backgrounds and different hand sizes based on raw images without the use of extra hardware. We achieved good results with a CNN network architecture consisting of seven layers. Through data augmentation and skin segmentation, a significant increase in the model\u2019s accuracy was achieved. On public benchmarks, two challenging datasets have been classified almost perfectly, with testing accuracies of 96.5% and 96.57%.<\/jats:p>","DOI":"10.3390\/a16080361","type":"journal-article","created":{"date-parts":[[2023,7,28]],"date-time":"2023-07-28T02:08:00Z","timestamp":1690510080000},"page":"361","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":20,"title":["Visual Static Hand Gesture Recognition Using Convolutional Neural Network"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-4241-9915","authenticated-orcid":false,"given":"Ahmed","family":"Eid","sequence":"first","affiliation":[{"name":"Institute of Neural Information Processing, Ulm University, 89081 Ulm, Germany"},{"name":"Computer Science Engineering Department, German University in Cairo, Cairo 11835, Egypt"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5118-0812","authenticated-orcid":false,"given":"Friedhelm","family":"Schwenker","sequence":"additional","affiliation":[{"name":"Institute of Neural Information Processing, Ulm University, 89081 Ulm, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2023,7,27]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Amirian, M., K\u00e4chele, M., Palm, G., and Schwenker, F. (June, January 30). Support vector regression of sparse dictionary-based features for view-independent action unit intensity estimation. Proceedings of the 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017), Washington, DC, USA.","DOI":"10.1109\/FG.2017.109"},{"key":"ref_2","unstructured":"Hihn, H., Meudt, S., and Schwenker, F. (2016). Artificial Neural Networks in Pattern Recognition (ANNPR 2016), Proceedings of the 7th IAPR TC3 Workshop, Ulm, Germany, 28\u201330 September 2016, Springer."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Neto, G.M.R., Junior, G.B., de Almeida, J.D.S., and de Paiva, A.C. (2018, January 27\u201329). Sign language recognition based on 3d convolutional neural networks. Proceedings of the 15th International Conference Image Analysis and Recognition (ICIAR 2018), P\u00f3voa de Varzim, Portugal.","DOI":"10.1007\/978-3-319-93000-8_45"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2719","DOI":"10.1007\/s10586-017-1435-x","article-title":"Hand gesture recognition based on convolution neural network","volume":"22","author":"Li","year":"2019","journal-title":"Clust. Comput."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1016\/j.engappai.2018.09.006","article-title":"American Sign Language alphabet recognition using Convolutional Neural Networks with multiview augmentation and inference fusion","volume":"76","author":"Tao","year":"2018","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Xing, K., Ding, Z., Jiang, S., Ma, X., Yang, K., Yang, C., Li, X., and Jiang, F. (2018, January 18\u201321). Hand gesture recognition based on deep learning method. Proceedings of the 2018 IEEE Third International Conference on Data Science in Cyberspace (DSC), Guangzhou, China.","DOI":"10.1109\/DSC.2018.00087"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1016\/j.ecoinf.2018.10.002","article-title":"Deep convolution neural network for image recognition","volume":"48","author":"Traore","year":"2018","journal-title":"Ecol. Inform."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1016\/j.eswa.2017.05.039","article-title":"Deep learning for biological image classification","volume":"85","author":"Affonso","year":"2017","journal-title":"Expert Syst. Appl."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Gao, Q., Liu, J., Ju, Z., Li, Y., Zhang, T., and Zhang, L. (2017, January 16\u201318). Static hand gesture recognition with parallel CNNs for space human-robot interaction. Proceedings of the International Conference on Intelligent Robotics and Applications, Wuhan, China.","DOI":"10.1007\/978-3-319-65289-4_44"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Oliveira, M., Chatbri, H., Little, S., Ferstl, Y., O\u2019Connor, N.E., and Sutherland, A. (December, January 29). Irish sign language recognition using principal component analysis and convolutional neural networks. Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA), Sydney, Australia.","DOI":"10.1109\/DICTA.2017.8227451"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"547","DOI":"10.12988\/ces.2018.8241","article-title":"Convolutional neural network with a dag architecture for control of a robotic arm by means of hand gestures","volume":"11","author":"Arenas","year":"2018","journal-title":"Contemp. Eng. Sci."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Sahoo, J.P., Ari, S., and Patra, S.K. (2019, January 16\u201318). Hand gesture recognition using PCA based deep CNN reduced features and SVM classifier. Proceedings of the 2019 IEEE International Symposium on Smart Electronic Systems (iSES) (Formerly iNiS), Rourkela, India.","DOI":"10.1109\/iSES47678.2019.00056"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"7957","DOI":"10.1007\/s00521-019-04691-y","article-title":"Deep learning-based sign language recognition system for static signs","volume":"32","author":"Wadhawan","year":"2020","journal-title":"Neural Comput. Appl."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"103464","DOI":"10.1016\/j.infrared.2020.103464","article-title":"Human hand gesture recognition with convolutional neural networks for K-12 double-teachers instruction mode classroom","volume":"111","author":"Wang","year":"2020","journal-title":"Infrared Phys. Technol."},{"key":"ref_15","first-page":"6296013","article-title":"Effective inertial hand gesture recognition using particle filtering based trajectory matching","volume":"2018","author":"Wang","year":"2018","journal-title":"J. Electr. Comput. Eng."},{"key":"ref_16","first-page":"51","article-title":"Hand gesture recognition using shape features","volume":"117","author":"Suguna","year":"2017","journal-title":"Int. J. Pure Appl. Math."},{"key":"ref_17","first-page":"90","article-title":"Hand gesture recognition using webcam","volume":"7","author":"Marium","year":"2017","journal-title":"Am. J. Intell. Syst."},{"key":"ref_18","first-page":"276","article-title":"Classification of hand gestures using Gabor filter with Bayesian and na\u00efve Bayes classifier","volume":"7","author":"Ashfaq","year":"2016","journal-title":"Int. J. Adv. Comput. Sci. Appl."},{"key":"ref_19","first-page":"39","article-title":"Hand gesture recognition using multiclass support vector machine","volume":"74","author":"Rahman","year":"2013","journal-title":"Int. J. Comput. Appl."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1186\/1687-6180-2012-36","article-title":"3D hand tracking using Kalman filter in depth space","volume":"2012","author":"Park","year":"2012","journal-title":"EURASIP J. Adv. Signal Process."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1110","DOI":"10.1109\/TMM.2013.2246148","article-title":"Robust part-based hand gesture recognition using kinect sensor","volume":"15","author":"Ren","year":"2013","journal-title":"IEEE Trans. Multimed."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Rao, J., Gao, T., Gong, Z., and Jiang, Z. (2009, January 21\u201322). Low cost hand gesture learning and recognition system based on hidden markov model. Proceedings of the 2009 Second International Symposium on Information Science and Engineering, Manchester, UK.","DOI":"10.1109\/ISISE.2009.53"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1007\/s11263-012-0560-5","article-title":"Attention based detection and recognition of hand postures against complex backgrounds","volume":"101","author":"Pisharady","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Mohanty, A., Rambhatla, S.S., and Sahay, R.R. (2017, January 10\u201312). Deep gesture: Static hand gesture recognition using CNN. Proceedings of the International Conference on Computer Vision and Image Processing, Roorkee, India.","DOI":"10.1007\/978-981-10-2107-7_41"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Marcel, S. (1999, January 15\u201320). Hand posture recognition in a body-face centered space. Proceedings of the CHI\u201999 Extended Abstracts on Human Factors in Computing Systems, Pittsburgh, PA, USA.","DOI":"10.1145\/632716.632901"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Dwina, N., Arnia, F., and Munadi, K. (2018, January 25\u201328). Skin segmentation based on improved thresholding method. Proceedings of the 2018 International ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI-NCON), Chiang Rai, Thailand.","DOI":"10.1109\/ECTI-NCON.2018.8378289"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40537-019-0197-0","article-title":"A survey on image data augmentation for deep learning","volume":"6","author":"Shorten","year":"2019","journal-title":"J. Big Data"},{"key":"ref_28","first-page":"1929","article-title":"Dropout: A simple way to prevent neural networks from overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J. Mach. Learn. Res."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/8\/361\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:20:51Z","timestamp":1760127651000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/8\/361"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,27]]},"references-count":28,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2023,8]]}},"alternative-id":["a16080361"],"URL":"https:\/\/doi.org\/10.3390\/a16080361","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,7,27]]}}}