{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,13]],"date-time":"2026-05-13T20:13:35Z","timestamp":1778703215750,"version":"3.51.4"},"reference-count":18,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2024,12,11]],"date-time":"2024-12-11T00:00:00Z","timestamp":1733875200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computers"],"abstract":"<jats:p>This paper aims to enhance the effectiveness of table tennis coaching and player performance analysis through human action recognition by using deep learning. In the field of video analysis, human action recognition has emerged as a highly researched area. Beyond post-session analysis, it has the potential for real-time applications, such as providing instant feedback or comparing ideal motions with actual player movements. However, the complexity of human actions presents significant challenges. To address these issues, in this paper, we combine the latest computer vision and deep learning algorithms to accurately identify and classify a few table tennis strokes in human action recognition. Through an in-depth review of existing methods, we develop a high-precision offline method for player action recognition. Our experimental results show that the proposed method achieves an average accuracy of 99.85% in recognizing six distinct table tennis actions based on our own dataset.<\/jats:p>","DOI":"10.3390\/computers13120332","type":"journal-article","created":{"date-parts":[[2024,12,11]],"date-time":"2024-12-11T03:14:58Z","timestamp":1733886898000},"page":"332","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Player Performance Analysis in Table Tennis Through Human Action Recognition"],"prefix":"10.3390","volume":"13","author":[{"given":"Kangnan","family":"Dong","sequence":"first","affiliation":[{"name":"Department of Computer Science and Software Engineering, Auckland University of Technology, Auckland 1010, New Zealand"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7443-3285","authenticated-orcid":false,"given":"Wei Qi","family":"Yan","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Software Engineering, Auckland University of Technology, Auckland 1010, New Zealand"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,12,11]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Arnab, A., Dehghani, M., Heigold, G., Sun, C., Lu\u010di\u0107, M., and Schmid, C. (2021, January 10\u201317). ViViT: A video vision transformer. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00676"},{"key":"ref_2","unstructured":"Bertasius, G., Wang, H., and Torresani, L. (2021, January 18\u201324). Is space-time attention all you need for video understanding?. Proceedings of the International Conference on Machine Learning, Virtual."},{"key":"ref_3","unstructured":"Summerfield, M. (2015). Rapid GUI Programming with Python and Qt: The Definitive Guide to PyQt Programming, Pearson Education."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Carreira, J., and Zisserman, A. (2017, January 21\u201326). Quo vadis, action recognition? A new model and the kinetics dataset. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.502"},{"key":"ref_5","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., and Houlsby, N. (2021, January 3\u20137). An image is worth 16\u00d716 words: Transformers for image recognition at scale. Proceedings of the International Conference on Learning Representations, Virtual."},{"key":"ref_6","unstructured":"Lugaresi, C., Tang, J., Nash, H., McClanahan, C., Uboweja, E., Hays, M., and Grundmann, M. (2019). MediaPipe: A framework for building perception pipelines. arXiv."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Kulkarni, K.M., and Shenoy, S. (2021, January 19\u201325). Table tennis stroke recognition using two-dimensional human pose estimation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPRW53098.2021.00515"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Zhou, H. (2023). Computational Analysis of Table Tennis Games from Real-Time Videos Using Deep Learning. [Master\u2019s Thesis, Auckland University of Technology].","DOI":"10.1007\/978-981-97-0376-0_6"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Tran, D., Bourdev, L., Fergus, R., Torresani, L., and Paluri, M. (2015, January 7\u201313). Learning spatiotemporal features with 3D convolutional networks. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.510"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1007\/s11263-012-0594-8","article-title":"Dense trajectories and motion boundary descriptors for action recognition","volume":"103","author":"Wang","year":"2016","journal-title":"Int. J. Comput. Vis."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1109\/TPAMI.2012.59","article-title":"3D convolutional neural networks for human action recognition","volume":"35","author":"Ji","year":"2013","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L. (2018, January 18\u201323). MobileNetV2: Inverted Residuals and Linear Bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_14","unstructured":"Touvron, H., Cord, M., Dei, T., Matthey, L., El-Nouby, A., Douze, M., Massa, F., Sablayrolles, A., and J\u00e9gou, H. (2021, January 18\u201324). Training data-efficient image transformers & distillation through attention. Proceedings of the International Conference on Machine Learning, Virtual."},{"key":"ref_15","unstructured":"Zhang, C., Wei, X., Wang, Y., Jin, R., and Yan, S. (2021, January 10\u201317). TimeSformer: Temporally-coupled Transformer for video action recognition. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Liu, Z., Ning, J., Cao, Y., Wei, Y., Zhang, Z., Lin, S., and Hu, H. (2022, January 18\u201324). Video Swin Transformer. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00320"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Girdhar, R., and Ramanan, D. (2017, January 21\u201326). ActionVLAD: Learning Spatio-temporal Aggregations for Action Classification. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.337"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Yan, W. (2023). Computational Methods for Deep Learning: Theory, Algorithms, and Implementations, Springer.","DOI":"10.1007\/978-981-99-4823-9"}],"container-title":["Computers"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-431X\/13\/12\/332\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:51:55Z","timestamp":1760115115000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-431X\/13\/12\/332"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,11]]},"references-count":18,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["computers13120332"],"URL":"https:\/\/doi.org\/10.3390\/computers13120332","relation":{},"ISSN":["2073-431X"],"issn-type":[{"value":"2073-431X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,11]]}}}