{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T11:54:15Z","timestamp":1777550055863,"version":"3.51.4"},"reference-count":47,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2020,12,13]],"date-time":"2020-12-13T00:00:00Z","timestamp":1607817600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the National Natural Science Foundation of China Enterprise Innovation and Development Joint Fund","award":["U19B2004"],"award-info":[{"award-number":["U19B2004"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Spatiotemporal graph convolution has made significant progress in skeleton-based action recognition in recent years. Most of the existing graph convolution methods take all the joints of the human skeleton as the overall modeling graph, ignoring the differences in the movement patterns of various parts of the human, and cannot well connect the relationship between the different parts of the human skeleton. To capture the unique features of different parts of human skeleton data and the correlation of different parts, we propose two new graph convolution methods: the whole graph convolution network (WGCN) and the part graph convolution network (PGCN). WGCN learns the whole scale skeleton spatiotemporal features according to the movement patterns and physical structure of the human skeleton. PGCN divides the human skeleton graph into several subgraphs to learn the part scale spatiotemporal features. Moreover, we propose an adaptive fusion module that combines the two features for multiple complementary adaptive fusion to obtain more effective skeleton features. By coupling these proposals, we build a whole and part adaptive fusion graph convolution neural network (WPGCN) that outperforms previous state-of-the-art methods on three large-scale datasets: NTU RGB+D 60, NTU RGB+D 120, and Kinetics Skeleton 400.<\/jats:p>","DOI":"10.3390\/s20247149","type":"journal-article","created":{"date-parts":[[2020,12,13]],"date-time":"2020-12-13T23:39:36Z","timestamp":1607902776000},"page":"7149","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Whole and Part Adaptive Fusion Graph Convolutional Networks for Skeleton-Based Action Recognition"],"prefix":"10.3390","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1210-0618","authenticated-orcid":false,"given":"Qi","family":"Zuo","sequence":"first","affiliation":[{"name":"School of Electronic Information, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lian","family":"Zou","sequence":"additional","affiliation":[{"name":"School of Electronic Information, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cien","family":"Fan","sequence":"additional","affiliation":[{"name":"School of Electronic Information, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dongqian","family":"Li","sequence":"additional","affiliation":[{"name":"School of Electronic Information, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hao","family":"Jiang","sequence":"additional","affiliation":[{"name":"School of Electronic Information, Wuhan University, Wuhan 430072, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yifeng","family":"Liu","sequence":"additional","affiliation":[{"name":"National Engineering Laboratory for Risk Perception and Prevention (NEL-RPP), Beijing 100041, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,12,13]]},"reference":[{"key":"ref_1","unstructured":"Song, B., Gaur, U., Zhu, Y., and Roy-Chowdhury, A. (2011). A \u201cstring of feature graphs\u201d model for recognition of complex activities in natural videos. Proceedings of the 2011 IEEE International Conference on Computer Vision (ICCV 2011), Barcelona, Spain, 6\u201313 November 2011, IEEE Computer Society."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Wang, H., and Wang, L. (2017, January 21\u201326). Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.387"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Li, C., Zhong, Q., Xie, D., and Pu, S. (2018, January 13\u201319). Co-Occurrence Feature Learning from Skeleton Data for Action Recognition and Detection with Hierarchical Aggregation. Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence IJCAI-18, Stockholm, Sweden.","DOI":"10.24963\/ijcai.2018\/109"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1272","DOI":"10.1109\/JPROC.2002.801449","article-title":"Integrating perceptual and cognitive modeling for adaptive and intelligent human-computer interaction","volume":"90","author":"Duric","year":"2002","journal-title":"Proc. IEEE"},{"key":"ref_5","unstructured":"Feichtenhofer, C., Fan, H., Malik, J., and He, K. (November, January 27). SlowFast Networks for Video Recognition. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Korea."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Tran, D., Wang, H., Torresani, L., Ray, J., LeCun, Y., and Paluri, M. (2018, January 18\u201323). A Closer Look at Spatiotemporal Convolutions for Action Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00675"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., and Li, F.-F. (2014, January 23\u201328). Large-Scale Video Classification with Convolutional Neural Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.223"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"454","DOI":"10.1007\/s11263-017-0998-6","article-title":"Lie-X: Depth Image Based Articulated Object Pose Estimation, Tracking, and Action Recognition on Lie Groups","volume":"123","author":"Xu","year":"2017","journal-title":"Int. J. Comput. Vis."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Vemulapalli, R., Arrate, F., and Chellappa, R. (2014, January 23\u201328). Human Action Recognition by Representing 3D Skeletons as Points in a Lie Group. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.82"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Zhou, Q., Yu, S., Wu, X., Gao, Q., Li, C., and Xu, Y. (2009, January 18\u201322). HMMs-based human action recognition for an intelligent household surveillance robot. Proceedings of the 2009 IEEE International Conference on Robotics and Biomimetics (ROBIO), Guilin, China.","DOI":"10.1109\/ROBIO.2009.5420459"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Fernando, B., Gavves, E., Oramas, J.M., Ghodrati, A., and Tuytelaars, T. (2015, January 7\u201312). Modeling Video Evolution for Action Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299176"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Hu, J.F., Zheng, W.S., Lai, J., and Zhang, J. (2015, January 7\u201312). Jointly Learning Heterogeneous Features for RGB-D Activity Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299172"},{"key":"ref_13","unstructured":"Du, Y., Wang, W., and Wang, L. (2015, January 7\u201312). Hierarchical Recurrent Neural Network for Skeleton Based Action Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Song, S., Lan, C., Xing, J., Zeng, W., and Liu, J. (2016). An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data. arXiv.","DOI":"10.1609\/aaai.v31i1.11212"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Li, S., Li, W., Cook, C., Zhu, C., and Gao, Y. (2018, January 18\u201323). Independently Recurrent Neural Network (IndRNN): Building a Longer and Deeper RNN. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00572"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zhang, P., Lan, C., Xing, J., Zeng, W., Xue, J., and Zheng, N. (2017, January 22\u201329). View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition From Skeleton Data. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.233"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Si, C., Jing, Y., Wang, W., Wang, L., and Tan, T. (2018, January 8\u201314). Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack Learning. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01246-5_7"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1044","DOI":"10.1109\/LSP.2018.2841649","article-title":"Ensemble One-Dimensional Convolution Neural Networks for Skeleton-Based Action Recognition","volume":"25","author":"Xu","year":"2018","journal-title":"IEEE Signal Process. Lett."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Li, Y., Xia, R., Liu, X., and Huang, Q. (2019, January 8\u201312). Learning Shape-Motion Representations from Geometric Algebra Spatio-Temporal Model for Skeleton-Based Action Recognition. Proceedings of the 2019 IEEE International Conference on Multimedia and Expo (ICME), Shanghai, China.","DOI":"10.1109\/ICME.2019.00187"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Caetano, C., Sena, J., Br\u00e9mond, F., Dos Santos, J.A., and Schwartz, W.R. (2019, January 18\u201321). SkeleMotion: A New Representation of Skeleton Joint Sequences based on Motion Information for 3D Action Recognition. Proceedings of the 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Taipei, Taiwan.","DOI":"10.1109\/AVSS.2019.8909840"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wang, J., Nie, X., Xia, Y., Wu, Y., and Zhu, S.C. (2014, January 23\u201328). Cross-view Action Modeling, Learning and Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.339"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Liang, D., Fan, G., Lin, G., Chen, W., Pan, X., and Zhu, H. (2019, January 16\u201320). Three-Stream Convolutional Neural Network with Multi-Task and Ensemble Learning for 3D Action Recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Long Beach, CA, USA.","DOI":"10.1109\/CVPRW.2019.00123"},{"key":"ref_23","unstructured":"Minh, T.L., Inoue, N., and Shinoda, K. (2018). A Fine-to-Coarse Convolutional Neural Network for 3D Human Action Recognition. arXiv."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Monti, F., Boscaini, D., Masci, J., Rodola, E., Svoboda, J., and Bronstein, M.M. (2017, January 21\u201326). Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.576"},{"key":"ref_25","unstructured":"Niepert, M., Ahmed, M., and Kutzkov, K. (2016, January 19\u201324). Learning Convolutional Neural Networks for Graphs. Proceedings of the International Conference on Machine Learning, New York, NY, USA."},{"key":"ref_26","unstructured":"Cortes, C., Lawrence, N.D., Lee, D.D., Sugiyama, M., and Garnett, R. (2015). Convolutional Networks on Graphs for Learning Molecular Fingerprints. Advances in Neural Information Processing Systems 28, Curran Associates, Inc."},{"key":"ref_27","unstructured":"Kipf, T.N., and Welling, M. (2016). Semi-Supervised Classification with Graph Convolutional Networks. arXiv."},{"key":"ref_28","unstructured":"Lee, D.D., Sugiyama, M., Luxburg, U.V., Guyon, I., and Garnett, R. (2016). Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering. Advances in Neural Information Processing Systems 29, Curran Associates, Inc."},{"key":"ref_29","unstructured":"Bruna, J., Zaremba, W., Szlam, A., and Lecun, Y. (2013). Spectral Networks and Locally Connected Networks on Graphs. arXiv."},{"key":"ref_30","unstructured":"Henaff, M., Bruna, J., and LeCun, Y. (2015). Deep Convolutional Networks on Graph-Structured Data. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Yan, S., Xiong, Y., and Lin, D. (2018). Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition. arXiv.","DOI":"10.1609\/aaai.v32i1.12328"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Shi, L., Zhang, Y., Cheng, J., and Lu, H. (2019, January 16\u201320). Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01230"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Li, M., Chen, S., Chen, X., Zhang, Y., Wang, Y., and Tian, Q. (2019, January 16\u201320). Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00371"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Shahroudy, A., Liu, J., Ng, T.T., and Wang, G. (2016, January 27\u201330). NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.115"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"2684","DOI":"10.1109\/TPAMI.2019.2916873","article-title":"NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding","volume":"42","author":"Liu","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_36","unstructured":"Kay, W., Carreira, J., Simonyan, K., Zhang, B., Hillier, C., Vijayanarasimhan, S., Viola, F., Green, T., Back, T., and Natsev, P. (2017). The kinetics human action video dataset. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Shi, L., Zhang, Y., Cheng, J., and Lu, H. (2019). Skeleton-Based Action Recognition with Multi-Stream Adaptive Graph Convolutional Networks. arXiv.","DOI":"10.1109\/CVPR.2019.00810"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Shi, L., Zhang, Y., Cheng, J., and Lu, H. (2019, January 16\u201320). Skeleton-Based Action Recognition with Directed Graph Neural Networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00810"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Si, C., Chen, W., Wang, W., Wang, L., and Tan, T. (2019, January 16\u201320). An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00132"},{"key":"ref_40","unstructured":"Gao, J., He, T., Zhou, X., and Ge, S. (2019). Focusing and Diffusion: Bidirectional Attentive Graph Convolutional Networks for Skeleton-based Action Recognition. arXiv."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Liu, Z., Zhang, H., Chen, Z., Wang, Z., and Ouyang, W. (2020, January 13\u201319). Disentangling and Unifying Graph Convolutions for Skeleton-Based Action Recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00022"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Cheng, K., Zhang, Y., He, X., Chen, W., Cheng, J., and Lu, H. (2020, January 13\u201319). Skeleton-Based Action Recognition with Shift Graph Convolutional Network. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00026"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Chan, W., Tian, Z., and Wu, Y. (2020). GAS-GCN: Gated Action-Specific Graph Convolutional Networks for Skeleton-Based Action Recognition. Sensors, 20.","DOI":"10.3390\/s20123499"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Li, F., Li, J., Zhu, A., Xu, Y., Yin, H., and Hua, G. (2020). Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition. Sensors, 20.","DOI":"10.3390\/s20185260"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Cao, Z., Simon, T., Wei, S.E., and Sheikh, Y. (2017, January 21\u201326). Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.143"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Zhang, P., Lan, C., Zeng, W., Xing, J., Xue, J., and Zheng, N. (2020, January 13\u201319). Semantics-Guided Neural Networks for Efficient Skeleton-Based Human Action Recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00119"},{"key":"ref_47","first-page":"8561","article-title":"Spatio-Temporal Graph Routing for Skeleton-Based Action Recognition","volume":"33","author":"Li","year":"2019","journal-title":"Proc. AAAI Conf. Artif. Intell."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/24\/7149\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:44:36Z","timestamp":1760179476000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/24\/7149"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,13]]},"references-count":47,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2020,12]]}},"alternative-id":["s20247149"],"URL":"https:\/\/doi.org\/10.3390\/s20247149","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,12,13]]}}}