{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,14]],"date-time":"2026-01-14T16:56:04Z","timestamp":1768409764725,"version":"3.49.0"},"reference-count":45,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2020,3,18]],"date-time":"2020-03-18T00:00:00Z","timestamp":1584489600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,3,18]],"date-time":"2020-03-18T00:00:00Z","timestamp":1584489600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61702195"],"award-info":[{"award-number":["61702195"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61811530281"],"award-info":[{"award-number":["61811530281"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61751202"],"award-info":[{"award-number":["61751202"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["U181320097"],"award-info":[{"award-number":["U181320097"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000266","name":"the Engineering and Physical Sciences Research Council","doi-asserted-by":"crossref","award":["EP\/S001913"],"award-info":[{"award-number":["EP\/S001913"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int. J. Fuzzy Syst."],"published-print":{"date-parts":[[2020,6]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Hand gesture is one of the most intuitive and natural ways for human to communicate with computers, and it has been widely adopted in many human\u2013computer interaction applications. However, it is still a challenging problem when confronted with complex background, illumination variation and occlusion in real-world scenarios. In this paper, a two-stage hand gesture recognition method is proposed to tackle these problems. At the first stage, hand pose estimation is developed to locate the hand keypoints using the convolutional pose machine, which can effectively localize hand keypoints even in a complex background. At the second stage, the Fuzzy Gaussian mixture models (FGMMs) are tailored to reject the nongesture patterns and classify the gestures based on the estimated hand keypoints. Extensive experiments are conducted to evaluate the performance of the proposed method, and the result demonstrates that the proposed algorithm is effective, robust, and satisfactory in real-time scenarios.<\/jats:p>","DOI":"10.1007\/s40815-020-00825-w","type":"journal-article","created":{"date-parts":[[2020,3,18]],"date-time":"2020-03-18T19:02:52Z","timestamp":1584558172000},"page":"1330-1341","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":39,"title":["Hand Gesture Recognition in Complex Background Based on Convolutional Pose Machine and Fuzzy Gaussian Mixture Models"],"prefix":"10.1007","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7025-6365","authenticated-orcid":false,"given":"Tong","family":"Zhang","sequence":"first","affiliation":[]},{"given":"Huifeng","family":"Lin","sequence":"additional","affiliation":[]},{"given":"Zhaojie","family":"Ju","sequence":"additional","affiliation":[]},{"given":"Chenguang","family":"Yang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,3,18]]},"reference":[{"issue":"12","key":"825_CR1","doi-asserted-by":"publisher","first-page":"2389","DOI":"10.1016\/S0031-3203(04)00165-7","volume":"37","author":"W Gao","year":"2004","unstructured":"Gao, W., Fang, G., Zhao, D., Chen, Y.: A chinese sign language recognition system based on sofm\/srn\/hmm. Pattern Recognit. 37(12), 2389\u20132402 (2004)","journal-title":"Pattern Recognit."},{"issue":"12","key":"825_CR2","doi-asserted-by":"publisher","first-page":"3941","DOI":"10.1007\/s00521-016-2294-8","volume":"28","author":"OK Oyedotun","year":"2017","unstructured":"Oyedotun, O.K., Khashman, A.: Deep learning in vision-based static hand gesture recognition. Neural Comput. Appl. 28(12), 3941\u20133951 (2017)","journal-title":"Neural Comput. Appl."},{"issue":"1","key":"825_CR3","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1049\/iet-cvi:20080006","volume":"3","author":"J Han","year":"2009","unstructured":"Han, J., Awad, G., Sutherland, A.: Automatic skin segmentation and tracking in sign language recognition. Iet Comput. Vis. 3(1), 24\u201335 (2009)","journal-title":"Iet Comput. Vis."},{"key":"825_CR4","doi-asserted-by":"publisher","first-page":"1947","DOI":"10.1016\/j.neucom.2007.12.035","volume":"71","author":"C-C Chang","year":"2008","unstructured":"Chang, C.-C., Liu, C.-Y., Tai, W.-K.: Feature alignment approach for hand posture recognition based on curvature scale space. Neurocomputing 71, 1947\u20131953 (2008)","journal-title":"Neurocomputing"},{"issue":"1","key":"825_CR5","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1109\/TFUZZ.2016.2554152","volume":"25","author":"G Lai","year":"2017","unstructured":"Lai, G., Liu, Z., Zhang, Y., Chen, C.P., Xie, S., Liu, Y.: Fuzzy adaptive inverse compensation method to tracking control of uncertain nonlinear systems with generalized actuator dead zone. IEEE Trans. Fuzzy Syst. 25(1), 191\u2013204 (2017)","journal-title":"IEEE Trans. Fuzzy Syst."},{"key":"825_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/j.ins.2015.03.067","volume":"315","author":"L Liu","year":"2015","unstructured":"Liu, L., Chen, C.P., Zhou, Y., You, X.: A new weighted mean filter with a two-phase detector for removing impulse noise. Inf. Sci. 315, 1\u201316 (2015)","journal-title":"Inf. Sci."},{"issue":"1","key":"825_CR7","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1109\/TFUZZ.2014.2310491","volume":"23","author":"Z Liu","year":"2015","unstructured":"Liu, Z., Wang, F., Zhang, Y., Chen, X., Chen, C.P.: Adaptive tracking control for a class of nonlinear systems with a fuzzy dead-zone input. IEEE Trans. Fuzzy Syst. 23(1), 193\u2013204 (2015)","journal-title":"IEEE Trans. Fuzzy Syst."},{"issue":"3","key":"825_CR8","doi-asserted-by":"publisher","first-page":"567","DOI":"10.1016\/S0031-3203(02)00072-9","volume":"36","author":"X Yin","year":"2003","unstructured":"Yin, X., Xie, M.: Estimation of the fundamental matrix from uncalibrated stereo hand images for 3d hand gesture recognition. Pattern Recognit. 36(3), 567\u2013584 (2003)","journal-title":"Pattern Recognit."},{"issue":"3","key":"825_CR9","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1007\/s11263-012-0560-5","volume":"101","author":"PK Pisharady","year":"2013","unstructured":"Pisharady, P.K., Vadakkepat, P., Loh, A.P.: Attention based detection and recognition of hand postures against complex backgrounds. Int. J. Comput. Vis. 101(3), 403\u2013419 (2013)","journal-title":"Int. J. Comput. Vis."},{"key":"825_CR10","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1016\/j.patrec.2013.10.010","volume":"50","author":"F Dominio","year":"2014","unstructured":"Dominio, F., Donadeo, M., Zanuttigh, P.: Combining multiple depth-based descriptors for hand gesture recognition. Pattern Recognit. Lett. 50, 101\u2013111 (2014)","journal-title":"Pattern Recognit. Lett."},{"key":"825_CR11","doi-asserted-by":"crossref","unstructured":"Wei, S.-E., Ramakrishna, V., Kanade, T., Sheikh, Y.: Convolutional pose machines. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 4724\u20134732 (2016)","DOI":"10.1109\/CVPR.2016.511"},{"issue":"3","key":"825_CR12","doi-asserted-by":"publisher","first-page":"1146","DOI":"10.1016\/j.patcog.2011.08.028","volume":"45","author":"Z Ju","year":"2012","unstructured":"Ju, Z., Liu, H.: Fuzzy gaussian mixture models. Pattern Recognit. 45(3), 1146\u20131158 (2012)","journal-title":"Pattern Recognit."},{"key":"825_CR13","unstructured":"Kovac, J., Peer, P., Solina, F.: Human skin color clustering for face detection. In: EUROCON 2003. Computer as a tool. The IEEE region 8, vol.\u00a02, IEEE, pp. 144\u2013148 (2003)"},{"key":"825_CR14","doi-asserted-by":"crossref","unstructured":"Qian, C., Sun, X., Wei, Y., Tang, X., Sun, J.: Realtime and robust hand tracking from depth. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1106\u20131113 (2014)","DOI":"10.1109\/CVPR.2014.145"},{"key":"825_CR15","doi-asserted-by":"crossref","unstructured":"Van\u00a0den Bergh, M., Van\u00a0Gool, L.: Combining rgb and tof cameras for real-time 3d hand gesture interaction. In: Proceedings of the 2011 IEEE workshop on applications of computer vision (WACV), IEEE, pp. 66\u201372 (2011)","DOI":"10.1109\/WACV.2011.5711485"},{"issue":"1","key":"825_CR16","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1023\/A:1013200319198","volume":"46","author":"MJ Jones","year":"2002","unstructured":"Jones, M.J., Rehg, J.M.: Statistical color models with application to skin detection. Int. J. Comput. Vis. 46(1), 81\u201396 (2002)","journal-title":"Int. J. Comput. Vis."},{"key":"825_CR17","doi-asserted-by":"crossref","unstructured":"Peng, X., Wang, L., Cai, Z., Qiao, Y.: Action and gesture temporal spotting with super vector representation. In: Workshop at the European conference on computer vision, Springer, pp. 518\u2013527 (2014)","DOI":"10.1007\/978-3-319-16178-5_36"},{"key":"825_CR18","doi-asserted-by":"crossref","unstructured":"Shotton, J., Fitzgibbon, A., Cook, M., Sharp, T., Finocchio, M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: proceedings of the 2011 IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp. 1297\u20131304 (2011)","DOI":"10.1109\/CVPR.2011.5995316"},{"key":"825_CR19","doi-asserted-by":"crossref","unstructured":"Kang, B., Tan, K.-H., Jiang, N., Tai, H.-S., Treffer, D., Nguyen, T.: Hand segmentation for hand-object interaction from depth map. In: Proceedings of the 2017 IEEE global conference on signal and information processing (GlobalSIP), IEEE, pp. 259\u2013263 (2017)","DOI":"10.1109\/GlobalSIP.2017.8308644"},{"issue":"2","key":"825_CR20","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1016\/0010-4485(95)00026-7","volume":"28","author":"CP Chen","year":"1996","unstructured":"Chen, C.P., Xie, S.: Freehand drawing system using a fuzzy logic concept. Comput. Aided Des. 28(2), 77\u201389 (1996)","journal-title":"Comput. Aided Des."},{"key":"825_CR21","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1016\/j.neucom.2015.09.127","volume":"198","author":"J Zhou","year":"2016","unstructured":"Zhou, J., Chen, L., Chen, C.P., Zhang, Y., Li, H.-X.: Fuzzy clustering with the entropy of attribute weights. Neurocomputing 198, 125\u2013134 (2016)","journal-title":"Neurocomputing"},{"issue":"8","key":"825_CR22","doi-asserted-by":"publisher","first-page":"2202","DOI":"10.1016\/j.patcog.2013.01.033","volume":"46","author":"SP Priyal","year":"2013","unstructured":"Priyal, S.P., Bora, P.K.: A robust static hand gesture recognition system using geometry based normalizations and krawtchouk moments. Pattern Recognit. 46(8), 2202\u20132219 (2013)","journal-title":"Pattern Recognit."},{"key":"825_CR23","unstructured":"Lienhart, R., Maydt, J.: An extended set of haar-like features for rapid object detection. In: Proceedings of the 2002 international conference on image processing. 2002. vol.\u00a01, IEEE, pp. I\u2013I (2002)"},{"issue":"11","key":"825_CR24","doi-asserted-by":"publisher","first-page":"3592","DOI":"10.1109\/TIM.2011.2161140","volume":"60","author":"NH Dardas","year":"2011","unstructured":"Dardas, N.H., Georganas, N.D.: Real-time hand gesture detection and recognition using bag-of-features and support vector machine techniques. IEEE Trans. Instrum. Measur. 60(11), 3592\u20133607 (2011)","journal-title":"IEEE Trans. Instrum. Measur."},{"key":"825_CR25","unstructured":"Krizhevsky, A., Sutskever, I., Hinton, G.\u00a0E.: Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp. 1097\u20131105 (2012)"},{"key":"825_CR26","doi-asserted-by":"crossref","unstructured":"Liang, C., Song, Y., Zhang, Y.: Hand gesture recognition using view projection from point cloud. In: Proceedings of the 2016 IEEE international conference on image processing (ICIP), IEEE, pp. 4413\u20134417 (2016)","DOI":"10.1109\/ICIP.2016.7533194"},{"key":"825_CR27","doi-asserted-by":"crossref","unstructured":"Ramakrishna, V., Munoz, D., Hebert, M., Bagnell, J.\u00a0A., Sheikh, Y.: Pose machines: articulated pose estimation via inference machines. In: European conference on computer vision, Springer, pp. 33\u201347 (2014)","DOI":"10.1007\/978-3-319-10605-2_3"},{"key":"825_CR28","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3431\u20133440 (2015)","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"825_CR29","unstructured":"Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition, arXiv preprint arXiv:1409.1556"},{"key":"825_CR30","unstructured":"Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the thirteenth international conference on artificial intelligence and statistics, pp. 249\u2013256 (2010)"},{"key":"825_CR31","first-page":"279","volume-title":"Modular learning in neural networks","author":"DH Ballard","year":"1987","unstructured":"Ballard, D.H.: Modular learning in neural networks, pp. 279\u2013284. AAAI, Menlo Park (1987)"},{"issue":"2","key":"825_CR32","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1109\/72.279181","volume":"5","author":"Y Bengio","year":"1994","unstructured":"Bengio, Y., Simard, P., Frasconi, P.: Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 5(2), 157\u2013166 (1994)","journal-title":"IEEE Trans. Neural Netw."},{"issue":"3","key":"825_CR33","first-page":"1","volume":"5","author":"DE Rumelhart","year":"1988","unstructured":"Rumelhart, D.E., Hinton, G.E., Williams, R.J., et al.: Learning representations by back-propagating errors. Cogn. Model. 5(3), 1 (1988)","journal-title":"Cogn. Model."},{"key":"825_CR34","doi-asserted-by":"crossref","unstructured":"Zivkovic, Z.: Improved adaptive gaussian mixture model for background subtraction. In: Proceedings of the 17th international conference on pattern recognition, 2004. ICPR 2004. vol.\u00a02, IEEE, pp. 28\u201331 (2004)","DOI":"10.1109\/ICPR.2004.1333992"},{"issue":"1\u20133","key":"825_CR35","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1006\/dspr.1999.0361","volume":"10","author":"DA Reynolds","year":"2000","unstructured":"Reynolds, D.A., Quatieri, T.F., Dunn, R.B.: Speaker verification using adapted gaussian mixture models. Digit. Signal Process. 10(1\u20133), 19\u201341 (2000)","journal-title":"Digit. Signal Process."},{"issue":"1","key":"825_CR36","doi-asserted-by":"publisher","first-page":"72","DOI":"10.1109\/89.365379","volume":"3","author":"DA Reynolds","year":"1995","unstructured":"Reynolds, D.A., Rose, R.C., et al.: Robust text-independent speaker identification using gaussian mixture speaker models. IEEE Trans. Speech Audio Process. 3(1), 72\u201383 (1995)","journal-title":"IEEE Trans. Speech Audio Process."},{"issue":"8","key":"825_CR37","doi-asserted-by":"publisher","first-page":"2632","DOI":"10.1007\/s40815-019-00740-9","volume":"21","author":"Y Gao","year":"2019","unstructured":"Gao, Y., Wang, D., Pan, J., Wang, Z., Chen, B.: A novel fuzzy c-means clustering algorithm using adaptive norm. Int. J. Fuzzy Syst. 21(8), 2632\u20132649 (2019)","journal-title":"Int. J. Fuzzy Syst."},{"key":"825_CR38","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","volume":"39","author":"AP Dempster","year":"1977","unstructured":"Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B (Methodological) 39, 1\u201338 (1977)","journal-title":"J R Stat Soc Ser B (Methodological)"},{"issue":"3","key":"825_CR39","doi-asserted-by":"publisher","first-page":"433","DOI":"10.1109\/3477.764879","volume":"29","author":"K Krishna","year":"1999","unstructured":"Krishna, K., Murty, M.N.: Genetic k-means algorithm. IEEE Trans Syst Man Cybern Part B (Cybernetics) 29(3), 433\u2013439 (1999)","journal-title":"IEEE Trans Syst Man Cybern Part B (Cybernetics)"},{"key":"825_CR40","doi-asserted-by":"crossref","unstructured":"Ju, Z., Liu, H.: Applying fuzzy em algorithm with a fast convergence to GMMS. In: Proceedings of the 2010 IEEE international conference on fuzzy systems (FUZZ), IEEE, pp. 1\u20136 (2010)","DOI":"10.1109\/FUZZY.2010.5584456"},{"issue":"2\u20133","key":"825_CR41","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1016\/0098-3004(84)90020-7","volume":"10","author":"JC Bezdek","year":"1984","unstructured":"Bezdek, J.C., Ehrlich, R., Full, W.: FCM: the fuzzy c-means clustering algorithm. Comput. Geosci. 10(2\u20133), 191\u2013203 (1984)","journal-title":"Comput. Geosci."},{"issue":"1","key":"825_CR42","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1007\/s40815-017-0411-1","volume":"20","author":"X Zhao","year":"2018","unstructured":"Zhao, X., Li, Y., Zhao, Q.: A fuzzy clustering approach for complex color image segmentation based on gaussian model with interactions between color planes and mixture gaussian model. Int. J. Fuzzy Syst. 20(1), 309\u2013317 (2018)","journal-title":"Int. J. Fuzzy Syst."},{"issue":"4","key":"825_CR43","doi-asserted-by":"publisher","first-page":"1026","DOI":"10.1007\/s40815-018-00604-8","volume":"21","author":"H Lin","year":"2019","unstructured":"Lin, H., Zhang, T., Chen, Z., Song, H., Yang, C.: Adaptive fuzzy Gaussian mixture models for shape approximation in robot grasping. Int. J. Fuzzy Syst. 21(4), 1026\u20131037 (2019)","journal-title":"Int. J. Fuzzy Syst."},{"key":"825_CR44","unstructured":"Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M.,Ghemawat, S., Irving, G., Isard, M., et\u00a0al.: Tensorflow: a system for large-scale machine learning. In: 12th $$\\{\\text{USENIX}\\}$$ symposium on operating systems design and implementation ($$\\{\\text{ OSDI }\\}$$ 16), pp. 265\u2013283 (2016)"},{"key":"825_CR45","doi-asserted-by":"crossref","unstructured":"Zimmermann, C., Brox, T.: Learning to estimate 3d hand pose from single RGB images. In: Proceedings of the IEEE international conference on computer vision, pp. 4903\u20134911 (2017)","DOI":"10.1109\/ICCV.2017.525"}],"container-title":["International Journal of Fuzzy Systems"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s40815-020-00825-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s40815-020-00825-w\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s40815-020-00825-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,2]],"date-time":"2024-08-02T11:47:33Z","timestamp":1722599253000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s40815-020-00825-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,18]]},"references-count":45,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2020,6]]}},"alternative-id":["825"],"URL":"https:\/\/doi.org\/10.1007\/s40815-020-00825-w","relation":{},"ISSN":["1562-2479","2199-3211"],"issn-type":[{"value":"1562-2479","type":"print"},{"value":"2199-3211","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,3,18]]},"assertion":[{"value":"4 March 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 December 2019","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 February 2020","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 March 2020","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}