{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T15:05:06Z","timestamp":1777734306412,"version":"3.51.4"},"reference-count":146,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2024,4,18]],"date-time":"2024-04-18T00:00:00Z","timestamp":1713398400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100000780","name":"European Union\u2019s Horizon Europe Project \u201cSestosenso\u201d","doi-asserted-by":"publisher","award":["101070310"],"award-info":[{"award-number":["101070310"]}],"id":[{"id":"10.13039\/501100000780","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["MAKE"],"abstract":"<jats:p>Human activity recognition (HAR) remains an essential field of research with increasing real-world applications ranging from healthcare to industrial environments. As the volume of publications in this domain continues to grow, staying abreast of the most pertinent and innovative methodologies can be challenging. This survey provides a comprehensive overview of the state-of-the-art methods employed in HAR, embracing both classical machine learning techniques and their recent advancements. We investigate a plethora of approaches that leverage diverse input modalities including, but not limited to, accelerometer data, video sequences, and audio signals. Recognizing the challenge of navigating the vast and ever-growing HAR literature, we introduce a novel methodology that employs large language models to efficiently filter and pinpoint relevant academic papers. This not only reduces manual effort but also ensures the inclusion of the most influential works. We also provide a taxonomy of the examined literature to enable scholars to have rapid and organized access when studying HAR approaches. Through this survey, we aim to inform researchers and practitioners with a holistic understanding of the current HAR landscape, its evolution, and the promising avenues for future exploration.<\/jats:p>","DOI":"10.3390\/make6020040","type":"journal-article","created":{"date-parts":[[2024,4,18]],"date-time":"2024-04-18T10:30:52Z","timestamp":1713436252000},"page":"842-876","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":76,"title":["A Comprehensive Survey on Deep Learning Methods in Human Activity Recognition"],"prefix":"10.3390","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2333-6469","authenticated-orcid":false,"given":"Michail","family":"Kaseris","sequence":"first","affiliation":[{"name":"Department of Supply Chain Management, International Hellenic University, Kanellopoulou 2, 60132 Katerini, Greece"},{"name":"Information Technologies Institute (ITI) Center of Research and Technology Hellas (CERTH), 57001 Thessaloniki, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2882-2914","authenticated-orcid":false,"given":"Ioannis","family":"Kostavelis","sequence":"additional","affiliation":[{"name":"Department of Supply Chain Management, International Hellenic University, Kanellopoulou 2, 60132 Katerini, Greece"},{"name":"Information Technologies Institute (ITI) Center of Research and Technology Hellas (CERTH), 57001 Thessaloniki, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3911-7527","authenticated-orcid":false,"given":"Sotiris","family":"Malassiotis","sequence":"additional","affiliation":[{"name":"Information Technologies Institute (ITI) Center of Research and Technology Hellas (CERTH), 57001 Thessaloniki, Greece"}]}],"member":"1968","published-online":{"date-parts":[[2024,4,18]]},"reference":[{"key":"ref_1","first-page":"100046","article-title":"Deep learning based human activity recognition (HAR) using wearable sensor data","volume":"1","author":"Gupta","year":"2021","journal-title":"Int. J. Inf. Manag. Data Insights"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Diraco, G., Rescio, G., Caroppo, A., Manni, A., and Leone, A. (2023). Human Action Recognition in Smart Living Services and Applications: Context Awareness, Data Availability, Personalization, and Privacy. Sensors, 23.","DOI":"10.20944\/preprints202306.0672.v1"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Shuvo, M.M.H., Ahmed, N., Nouduri, K., and Palaniappan, K. (2020, January 13\u201315). A Hybrid Approach for Human Activity Recognition with Support Vector Machine and 1D Convolutional Neural Network. Proceedings of the 2020 IEEE Applied Imagery Pattern Recognition Workshop (AIPR), Washington, DC, USA.","DOI":"10.1109\/AIPR50011.2020.9425332"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Rojanavasu, P., Jantawong, P., Jitpattanakul, A., and Mekruksavanich, S. (2023, January 22\u201325). Improving Inertial Sensor-based Human Activity Recognition using Ensemble Deep Learning. Proceedings of the 2023 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI DAMT & NCON), Phuket, Thailand.","DOI":"10.1109\/ECTIDAMTNCON57770.2023.10139689"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Muhoza, A.C., Bergeret, E., Brdys, C., and Gary, F. (2023, January 20\u201323). Multi-Position Human Activity Recognition using a Multi-Modal Deep Convolutional Neural Network. Proceedings of the 2023 8th International Conference on Smart and Sustainable Technologies (SpliTech), Split, Croatia.","DOI":"10.23919\/SpliTech58164.2023.10193600"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Tao, S., Goh, W.L., and Gao, Y. (2023, January 11\u201313). A Convolved Self-Attention Model for IMU-based Gait Detection and Human Activity Recognition. Proceedings of the 2023 IEEE 5th International Conference on Artificial Intelligence Circuits and Systems (AICAS), Hangzhou, China.","DOI":"10.1109\/AICAS57966.2023.10168654"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Hassler, A.P., Menasalvas, E., Garc\u00eda-Garc\u00eda, F.J., Rodr\u00edguez-Ma\u00f1as, L., and Holzinger, A. (2019). Importance of medical data preprocessing in predictive modeling and risk factor discovery for the frailty syndrome. BMC Med. Inform. Decis. Mak., 19.","DOI":"10.1186\/s12911-019-0747-6"},{"key":"ref_8","first-page":"2505414","article-title":"Deformable convolutional networks for multimodal human activity recognition using wearable sensors","volume":"71","author":"Xu","year":"2022","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"30509","DOI":"10.1007\/s11042-020-09004-3","article-title":"Vision-based human activity recognition: A survey","volume":"79","author":"Beddiar","year":"2020","journal-title":"Multimed. Tools Appl."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1192","DOI":"10.1109\/SURV.2012.110112.00192","article-title":"A survey on human activity recognition using wearable sensors","volume":"15","author":"Lara","year":"2012","journal-title":"IEEE Commun. Surv. Tutorials"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"88","DOI":"10.3390\/computers2020088","article-title":"A review on video-based human activity recognition","volume":"2","author":"Ke","year":"2013","journal-title":"Computers"},{"key":"ref_12","first-page":"100142","article-title":"Transfer learning enhanced vision-based human activity recognition: A decade-long analysis","volume":"3","author":"Ray","year":"2023","journal-title":"Int. J. Inf. Manag. Data Insights"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1016\/j.cogsys.2022.10.003","article-title":"Recent trends in human activity recognition\u2013A comparative study","volume":"77","author":"Singh","year":"2023","journal-title":"Cogn. Syst. Res."},{"key":"ref_14","first-page":"1","article-title":"A survey on deep learning for human activity recognition","volume":"54","author":"Gu","year":"2021","journal-title":"Acm Comput. Surv. (CSUR)"},{"key":"ref_15","unstructured":"Hussain, Z., Sheng, M., and Zhang, W.E. (2019). Different approaches for human activity recognition: A survey. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"698","DOI":"10.1016\/j.procs.2019.08.100","article-title":"Human activity recognition: A survey","volume":"155","author":"Jobanputra","year":"2019","journal-title":"Procedia Comput. Sci."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"e1254","DOI":"10.1002\/widm.1254","article-title":"Recent trends in machine learning for human activity recognition\u2014A survey","volume":"8","author":"Roy","year":"2018","journal-title":"Wiley Interdiscip. Rev. Data Min. Knowl. Discov."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"107561","DOI":"10.1016\/j.patcog.2020.107561","article-title":"Sensor-based and vision-based human activity recognition: A comprehensive survey","volume":"108","author":"Dang","year":"2020","journal-title":"Pattern Recognit."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"28","DOI":"10.3389\/frobt.2015.00028","article-title":"A review of human activity recognition methods","volume":"2","author":"Vrigkas","year":"2015","journal-title":"Front. Robot. AI"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"4145","DOI":"10.1007\/s00521-022-07937-4","article-title":"Toward human activity recognition: A survey","volume":"35","author":"Saleem","year":"2023","journal-title":"Neural Comput. Appl."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Morshed, M.G., Sultana, T., Alam, A., and Lee, Y.K. (2023). Human Action Recognition: A Taxonomy-Based Survey, Updates, and Opportunities. Sensors, 23.","DOI":"10.3390\/s23042182"},{"key":"ref_22","unstructured":"Hinton, G.E., and Roweis, S. (2002). Stochastic neighbor embedding. Adv. Neural Inf. Process. Syst., 15."},{"key":"ref_23","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"1709","DOI":"10.1109\/TAES.2018.2799758","article-title":"Deep convolutional autoencoder for radar-based classification of similar aided and unaided human activities","volume":"54","year":"2018","journal-title":"IEEE Trans. Aerosp. Electron. Syst."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"915","DOI":"10.1016\/j.asoc.2017.09.027","article-title":"Real-time human activity recognition from accelerometer data using Convolutional Neural Networks","volume":"62","author":"Ignatov","year":"2018","journal-title":"Appl. Soft Comput."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"979","DOI":"10.1109\/JBHI.2017.2734803","article-title":"Automatic recognition of activities of daily living utilizing insole-based and wrist-worn wearable sensors","volume":"22","author":"Hegde","year":"2017","journal-title":"IEEE J. Biomed. Health Inform."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1118","DOI":"10.1109\/JSAC.2017.2679658","article-title":"Device-free human activity recognition using commercial WiFi devices","volume":"35","author":"Wang","year":"2017","journal-title":"IEEE J. Sel. Areas Commun."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.jnca.2017.12.010","article-title":"Device-free human localization and tracking with UHF passive RFID tags: A data-driven approach","volume":"104","author":"Ruan","year":"2018","journal-title":"J. Netw. Comput. Appl."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1016\/j.compag.2018.01.008","article-title":"Monitoring drinking behavior in bucket-fed dairy calves using an ear-attached tri-axial accelerometer: A pilot study","volume":"145","author":"Rol","year":"2018","journal-title":"Comput. Electron. Agric."},{"key":"ref_30","first-page":"1","article-title":"RFID systems in healthcare settings and activity of daily living in smart homes: A review. E-Health Telecommun","volume":"6","author":"Alsinglawi","year":"2017","journal-title":"Syst. Netw."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1109\/MWC.2019.1800405","article-title":"When RFID meets deep learning: Exploring cognitive intelligence for activity identification","volume":"26","author":"Fan","year":"2019","journal-title":"IEEE Wirel. Commun."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1016\/j.jbi.2018.09.002","article-title":"Examining sensor-based physical activity recognition and monitoring for healthcare using inter- net of things: A systematic review","volume":"87","author":"Qi","year":"2018","journal-title":"J. Biomed. Inform."},{"key":"ref_33","first-page":"75","article-title":"Recognizing multi-resident activities in non-intrusive sensor-based smart homes by formal concept analysis","volume":"318","author":"Hao","year":"2018","journal-title":"Neuro-Computing"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s12652-015-0294-7","article-title":"Ambient and smartphone sensor assisted ADL recognition in multi-inhabitant smart environments","volume":"7","author":"Roy","year":"2016","journal-title":"J. Ambient Intell. Humaniz. Comput."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1016\/j.patcog.2016.08.003","article-title":"Robust human activity recognition from depth video using spatiotemporal multi-fused features","volume":"61","author":"Jalal","year":"2017","journal-title":"Pattern Recognit."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"3941","DOI":"10.1007\/s00521-016-2294-8","article-title":"Deep learning in vision-based static hand gesture recognition","volume":"28","author":"Oyedotun","year":"2017","journal-title":"Neural Comput. Appl."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1016\/j.imavis.2017.01.010","article-title":"Going deeper into action recognition: A survey","volume":"60","author":"Herath","year":"2017","journal-title":"Image Vis. Comput."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1016\/j.cviu.2016.10.010","article-title":"Detecting anomalous events in videos by learning deep representations of appearance and motion","volume":"156","author":"Xu","year":"2017","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"5115","DOI":"10.1109\/JSEN.2018.2830743","article-title":"Vision-based human action classification using adaptive boosting algorithm","volume":"18","author":"Zerrouki","year":"2018","journal-title":"IEEE Sens. J."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"176241","DOI":"10.1109\/ACCESS.2020.3026276","article-title":"Human4d: A human-centric multimodal dataset for motions and immersive media","volume":"8","author":"Chatzitofis","year":"2020","journal-title":"IEEE Access"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1109\/TPAMI.2013.248","article-title":"Human3. 6m: Large scale datasets and predictive methods for 3d human sensing in natural environments","volume":"36","author":"Ionescu","year":"2013","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_42","unstructured":"Mahmood, N., Ghorbani, N., Troje, N.F., Pons-Moll, G., and Black, M.J. (November, January 27). AMASS: Archive of motion capture as surface shapes. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Zeng, M., Nguyen, L.T., Yu, B., Mengshoel, O.J., Zhu, J., Wu, P., and Zhang, J. (2014, January 6\u20137). Convolutional neural networks for human activity recognition using mobile sensors. Proceedings of the 6th International Conference on Mobile Computing, Applications and Services, Austin, TX, USA.","DOI":"10.4108\/icst.mobicase.2014.257786"},{"key":"ref_44","unstructured":"Yang, J., Nguyen, M.N., San, P.P., Li, X., and Krishnaswamy, S. (2015, January 25\u201331). Deep convolutional neural networks on multichannel time series for human activity recognition. Proceedings of the Ijcai, Buenos Aires, Argentina."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Ha, S., Yun, J.M., and Choi, S. (2015, January 9\u201312). Multi-modal convolutional neural networks for activity recognition. Proceedings of the 2015 IEEE International Conference on Systems, Man, and Cybernetics, Kowloon Tong, Hong Kong, China.","DOI":"10.1109\/SMC.2015.525"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1162\/neco.2006.18.7.1527","article-title":"A fast learning algorithm for deep belief nets","volume":"18","author":"Hinton","year":"2006","journal-title":"Neural Comput."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Ha, S., and Choi, S. (2016, January 24\u201329). Convolutional neural networks for human activity recognition using multiple accelerometer and gyroscope sensors. Proceedings of the 2016 International Joint Conference on Neural Networks (IJCNN), Vancouver, BC, Canada.","DOI":"10.1109\/IJCNN.2016.7727224"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Edel, M., and K\u00f6ppe, E. (2016, January 4\u20137). Binarized-blstm-rnn based human activity recognition. Proceedings of the 2016 International Conference on Indoor Positioning and Indoor Navigation (IPIN), Alcala de Henares, Spain.","DOI":"10.1109\/IPIN.2016.7743581"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3090076","article-title":"Ensembles of deep lstm learners for activity recognition using wearables","volume":"1","author":"Guan","year":"2017","journal-title":"Proc. ACM Interact. Mob. Wearable Ubiquitous Technol."},{"key":"ref_50","unstructured":"Hammerla, N.Y., Halloran, S., and Pl\u00f6tz, T. (2016). Deep, convolutional, and recurrent models for human activity recognition using wearables. arXiv."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1007\/s10015-017-0422-x","article-title":"Deep recurrent neural network for mobile human activity recognition with high throughput","volume":"23","author":"Inoue","year":"2018","journal-title":"Artif. Life Robot."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Maurya, R., Teo, T.H., Chua, S.H., Chow, H.C., and Wey, I.C. (2022, January 19\u201322). Complex Human Activities Recognition Based on High Performance 1D CNN Model. Proceedings of the 2022 IEEE 15th International Symposium on Embedded Multicore\/Many-Core Systems-on-Chip (MCSoC), Penang, Malaysia.","DOI":"10.1109\/MCSoC57363.2022.00059"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Liang, Y., Feng, K., and Ren, Z. (2023, January 26\u201328). Human Activity Recognition Based on Transformer via Smart-phone Sensors. Proceedings of the 2023 IEEE 3rd International Conference on Computer Communication and Artificial Intelligence (CCAI), Taiyuan, China.","DOI":"10.1109\/CCAI57533.2023.10201297"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Aswal, V., Sreeram, V., Kuchik, A., Ahuja, S., and Patel, H. (2020, January 13\u201315). Real-time human activity generation using bidirectional long short term memory networks. Proceedings of the 2020 4th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India.","DOI":"10.1109\/ICICCS48265.2020.9121096"},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"16852","DOI":"10.1109\/JSEN.2021.3077563","article-title":"Physique-based human activity recognition using ensemble learning and smartphone sensors","volume":"21","author":"Choudhury","year":"2021","journal-title":"IEEE Sens. J."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"4137","DOI":"10.1109\/ACCESS.2022.3140373","article-title":"Convae-lstm: Convolutional autoencoder long short-term memory network for smartphone-based human activity recognition","volume":"10","author":"Thakur","year":"2022","journal-title":"IEEE Access"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"15211","DOI":"10.1109\/JSEN.2022.3186086","article-title":"Hierarchical activity recognition based on belief functions theory in body sensor networks","volume":"22","author":"Dong","year":"2022","journal-title":"IEEE Sens. J."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"7265","DOI":"10.1109\/JSEN.2020.2978772","article-title":"The layer-wise training convolutional neural networks using local loss for sensor-based human activity recognition","volume":"20","author":"Teng","year":"2020","journal-title":"IEEE Sens. J."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"12355","DOI":"10.1109\/JSEN.2023.3267243","article-title":"Semi-Supervised Generative Adversarial Networks with Temporal Convolutions for Human Activity Recognition","volume":"23","author":"Zilelioglu","year":"2023","journal-title":"IEEE Sens. J."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Mekruksavanich, S., Jantawong, P., Hnoohom, N., and Jitpattanakul, A. (2022, January 22\u201325). A novel deep bigru-resnet model for human activity recognition using smartphone sensors. Proceedings of the 2022 19th International Joint Conference on Computer Science and Software Engineering (JCSSE), Bangkok, Thailand.","DOI":"10.1109\/JCSSE54890.2022.9836276"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Dubey, A., Lyons, N., Santra, A., and Pandey, A. (2022, January 12\u201314). XAI-BayesHAR: A novel Framework for Human Activity Recognition with Integrated Uncertainty and Shapely Values. Proceedings of the 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA), Nassau, Bahamas.","DOI":"10.1109\/ICMLA55696.2022.00205"},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"2509313","DOI":"10.1109\/TIM.2023.3265128","article-title":"An Efficient Diverse-branch Convolution Scheme for Sensor-Based Human Activity Recognition","volume":"72","author":"Han","year":"2023","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Stolovas, I., Su\u00e1rez, S., Pereyra, D., De Izaguirre, F., and Cabrera, V. (2021, January 24\u201326). Human activity recognition using machine learning techniques in a low-resource embedded system. Proceedings of the 2021 IEEE URUCON, Montevideo, Uruguay.","DOI":"10.1109\/URUCON53396.2021.9647236"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Khatun, M.A., Yousuf, M.A., and Moni, M.A. (2023, January 20\u201321). Deep CNN-GRU Based Human Activity Recognition with Automatic Feature Extraction Using Smartphone and Wearable Sensors. Proceedings of the 2023 International Conference on Electrical, Computer and Communication Engineering (ECCE), Kolkata, India.","DOI":"10.1109\/ECCE57851.2023.10101550"},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"3893","DOI":"10.1109\/TCSI.2020.3011984","article-title":"A partially binarized hybrid neural network system for low-power and resource constrained human activity recognition","volume":"67","author":"Russo","year":"2020","journal-title":"IEEE Trans. Circuits Syst. I Regul. Pap."},{"key":"ref_66","doi-asserted-by":"crossref","first-page":"2106","DOI":"10.1109\/TIE.2022.3161812","article-title":"Multiscale deep feature learning for human activity recognition using wearable sensors","volume":"70","author":"Tang","year":"2022","journal-title":"IEEE Trans. Ind. Electron."},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"218898","DOI":"10.1109\/ACCESS.2020.3041822","article-title":"Sensor-based human activity recognition using deep stacked multilayered perceptron model","volume":"8","author":"Rustam","year":"2020","journal-title":"IEEE Access"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Wang, Z., Chen, S., Yang, W., and Xu, Y. (2021, January 6\u201311). Environment-independent wi-fi human activity recognition with adversarial network. Proceedings of the ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada.","DOI":"10.1109\/ICASSP39728.2021.9413590"},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Hsieh, C.F., Chen, Y.C., Hsieh, C.Y., and Ku, M.L. (2020, January 28\u201330). Device-free indoor human activity recognition using Wi-Fi RSSI: Machine learning approaches. Proceedings of the 2020 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-Taiwan), Taoyuan, Taiwan.","DOI":"10.1109\/ICCE-Taiwan49838.2020.9258097"},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Salehinejad, H., Hasanzadeh, N., Djogo, R., and Valaee, S. (2023, January 4\u201310). Joint Human Orientation-Activity Recognition Using WIFI Signals for Human-Machine Interaction. Proceedings of the ICASSP 2023\u20142023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece.","DOI":"10.1109\/ICASSP49357.2023.10094880"},{"key":"ref_71","doi-asserted-by":"crossref","first-page":"4628","DOI":"10.1109\/JIOT.2020.3026732","article-title":"Data augmentation and dense-LSTM for human activity recognition using WiFi signal","volume":"8","author":"Zhang","year":"2020","journal-title":"IEEE Internet Things J."},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Ding, X., Jiang, T., Li, Y., Xue, W., and Zhong, Y. (2020, January 7\u201311). Device-free location-independent human activity recognition using transfer learning based on CNN. Proceedings of the 2020 IEEE International Conference on Communications Workshops (ICC Workshops), Dublin, Ireland.","DOI":"10.1109\/ICCWorkshops49005.2020.9145092"},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Khan, D., and Ho, I.W.H. (July, January 14). Deep learning of CSI for efficient device-free human activity recognition. Proceedings of the 2021 IEEE 7th World Forum on Internet of Things (WF-IoT), New Orleans, LA, USA.","DOI":"10.1109\/WF-IoT51360.2021.9595661"},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Zeeshan, M., Pandey, A., and Kumar, S. (2022, January 4\u20138). CSI-based device-free joint activity recognition and localization using Siamese networks. Proceedings of the 2022 14th International Conference on COMmunication Systems & NETworkS (COMSNETS), Bangalore, India.","DOI":"10.1109\/COMSNETS53615.2022.9668391"},{"key":"ref_75","doi-asserted-by":"crossref","unstructured":"Xiang, F., Nie, X., Cui, C., Nie, W., and Dong, X. (2023, January 6\u20138). Radar-based human activity recognition using two-dimensional feature extraction. Proceedings of the 2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE), Guangzhou, China.","DOI":"10.1109\/ICCECE58074.2023.10135278"},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Guo, Z., Guendel, R.G., Yarovoy, A., and Fioranelli, F. (2023, January 1\u20135). Point Transformer-Based Human Activity Recognition Using High-Dimensional Radar Point Clouds. Proceedings of the 2023 IEEE Radar Conference (RadarConf23), San Antonio, TX, USA.","DOI":"10.1109\/RadarConf2351548.2023.10149679"},{"key":"ref_77","doi-asserted-by":"crossref","unstructured":"Werthen-Brabants, L., Bhavanasi, G., Couckuyt, I., Dhaene, T., and Deschrijver, D. (2022, January 28\u201330). Quantifying uncertainty in real time with split BiRNN for radar human activity recognition. Proceedings of the 2022 19th European Radar Conference (EuRAD), Milan, Italy.","DOI":"10.23919\/EuRAD54643.2022.9924932"},{"key":"ref_78","doi-asserted-by":"crossref","unstructured":"McQuire, J., Watson, P., Wright, N., Hiden, H., and Catt, M. (2023, January 2\u20135). A Data Efficient Vision Transformer for Robust Human Activity Recognition from the Spectrograms of Wearable Sensor Data. Proceedings of the 2023 IEEE Statistical Signal Processing Workshop (SSP), Hanoi, Vietnam.","DOI":"10.1109\/SSP53291.2023.10208059"},{"key":"ref_79","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1038\/s41597-020-0563-y","article-title":"A database of human gait performance on irregular and uneven surfaces collected by wearable sensors","volume":"7","author":"Luo","year":"2020","journal-title":"Sci. Data"},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Reiss, A., and Stricker, D. (2012, January 6\u20138). Creating and benchmarking a new dataset for physical activity monitoring. Proceedings of the 5th International Conference on PErvasive Technologies Related to Assistive Environments, Heraklion, Crete, Greece.","DOI":"10.1145\/2413097.2413148"},{"key":"ref_81","doi-asserted-by":"crossref","unstructured":"Reiss, A., and Stricker, D. (2012, January 18\u201322). Introducing a new benchmarked dataset for activity monitoring. Proceedings of the 2012 16th International Symposium on Wearable Computers, Newcastle, UK.","DOI":"10.1109\/ISWC.2012.13"},{"key":"ref_82","doi-asserted-by":"crossref","unstructured":"Qin, W., and Wu, H.N. (2022, January 25\u201327). Switching GMM-HMM for Complex Human Activity Modeling and Recognition. Proceedings of the 2022 China Automation Congress (CAC), Xiamen, China.","DOI":"10.1109\/CAC57257.2022.10055662"},{"key":"ref_83","doi-asserted-by":"crossref","unstructured":"Bhuiyan, R.A., Amiruzzaman, M., Ahmed, N., and Islam, M.R. (2020, January 21\u201323). Efficient frequency domain feature extraction model using EPS and LDA for human activity recognition. Proceedings of the 2020 3rd IEEE International Conference on Knowledge Innovation and Invention (ICKII), Kaohsiung, Taiwan.","DOI":"10.1109\/ICKII50300.2020.9318786"},{"key":"ref_84","doi-asserted-by":"crossref","first-page":"25363","DOI":"10.1109\/JIOT.2022.3196170","article-title":"A hybrid attention-based deep neural network for simultaneous multi-sensor pruning and human activity recognition","volume":"9","author":"Zhou","year":"2022","journal-title":"IEEE Internet Things J."},{"key":"ref_85","doi-asserted-by":"crossref","unstructured":"Li, W., Feng, X., He, Z., and Zheng, H. (2021, January 10\u201313). Human activity recognition based on data fusion of fmcw radar and image. Proceedings of the 2021 7th International Conference on Computer and Communications (ICCC), Tianjin, China.","DOI":"10.1109\/ICCC54389.2021.9674543"},{"key":"ref_86","doi-asserted-by":"crossref","first-page":"174105","DOI":"10.1109\/ACCESS.2020.3025938","article-title":"Human daily activity recognition performed using wearable inertial sensors combined with deep learning algorithms","volume":"8","author":"Yen","year":"2020","journal-title":"IEEE Access"},{"key":"ref_87","doi-asserted-by":"crossref","unstructured":"Chowdhury, A.I., Ashraf, M., Islam, A., Ahmed, E., Jaman, M.S., and Rahman, M.M. (2020, January 22\u201324). hActNET: An improved neural network based method in recognizing human activities. Proceedings of the 2020 4th International Symposium on Multidisciplinary Studies and Innovative Technologies (ISMSIT), Istanbul, Turkey.","DOI":"10.1109\/ISMSIT50672.2020.9254992"},{"key":"ref_88","doi-asserted-by":"crossref","unstructured":"Psychoula, I., Singh, D., Chen, L., Chen, F., Holzinger, A., and Ning, H. (2018, January 8\u201312). Users\u2019 privacy concerns in IoT based applications. Proceedings of the 2018 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Cloud & Big Data Computing, Internet of People and Smart City Innovation (SmartWorld\/SCALCOM\/UIC\/ATC\/CBDCom\/IOP\/SCI), Guangzhou, China.","DOI":"10.1109\/SmartWorld.2018.00317"},{"key":"ref_89","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017, January 4\u20139). Attention is all you need. Proceedings of the Advancements in Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_90","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An image is worth 16x16 words: Transformers for image recognition at scale. arXiv."},{"key":"ref_91","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_92","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_93","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_94","doi-asserted-by":"crossref","unstructured":"He, K., Chen, X., Xie, S., Li, Y., Doll\u00e1r, P., and Girshick, R. (2022, January 18\u201324). Masked autoencoders are scalable vision learners. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01553"},{"key":"ref_95","doi-asserted-by":"crossref","unstructured":"Keren, G., and Schuller, B. (2016, January 24\u201329). Convolutional RNN: An enhanced model for extracting features from sequential data. Proceedings of the 2016 International Joint Conference on Neural Networks (IJCNN), Vancouver, BC, Canada.","DOI":"10.1109\/IJCNN.2016.7727636"},{"key":"ref_96","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_97","doi-asserted-by":"crossref","unstructured":"Gundu, S., and Syed, H. (2023). Vision-Based HAR in UAV Videos Using Histograms and Deep Learning Techniques. Sensors, 23.","DOI":"10.3390\/s23052569"},{"key":"ref_98","doi-asserted-by":"crossref","unstructured":"Islam, M.M., and Iqbal, T. (2020, January 25\u201329). Hamlet: A hierarchical multimodal attention-based human activity recognition algorithm. Proceedings of the 2020 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA.","DOI":"10.1109\/IROS45743.2020.9340987"},{"key":"ref_99","doi-asserted-by":"crossref","unstructured":"Gupta, D., Singh, A.K., Gupta, N., and Vishwakarma, D.K. (2023, January 9\u201311). SDL-Net: A Combined CNN & RNN Human Activity Recognition Model. Proceedings of the 2023 International Conference in Advances in Power, Signal, and Information Technology (APSIT), Bhubaneswar, India.","DOI":"10.1109\/APSIT58554.2023.10201657"},{"key":"ref_100","doi-asserted-by":"crossref","first-page":"143996","DOI":"10.1109\/ACCESS.2020.3013406","article-title":"Fusion mechanisms for human activity recognition using automated machine learning","volume":"8","author":"Popescu","year":"2020","journal-title":"IEEE Access"},{"key":"ref_101","doi-asserted-by":"crossref","unstructured":"Kumar, K.V., Harikiran, J., and Chandana, B.S. (2022, January 12\u201314). Human Activity Recognition with Privacy Preserving using Deep Learning Algorithms. Proceedings of the 2022 2nd International Conference on Artificial Intelligence and Signal Processing (AISP), Vijayawada, India.","DOI":"10.1109\/AISP53593.2022.9760596"},{"key":"ref_102","doi-asserted-by":"crossref","unstructured":"Bukht, T.F.N., Rahman, H., and Jalal, A. (2023, January 20\u201322). A Novel Framework for Human Action Recognition Based on Features Fusion and Decision Tree. Proceedings of the 2023 4th International Conference on Advancements in Computational Sciences (ICACS), Lahore, Pakistan.","DOI":"10.1109\/ICACS55311.2023.10089752"},{"key":"ref_103","doi-asserted-by":"crossref","unstructured":"Mutegeki, R., and Han, D.S. (2020, January 19\u201321). A CNN-LSTM approach to human activity recognition. Proceedings of the 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), Fukuoka, Japan.","DOI":"10.1109\/ICAIIC48513.2020.9065078"},{"key":"ref_104","doi-asserted-by":"crossref","unstructured":"Razmah, M., Prabha, R., Divya, B., Sridevi, S., and Naveen, A. (2022, January 8\u20139). LSTM Method for Human Activity Recognition of Video Using PSO Algorithm. Proceedings of the 2022 International Conference on Power, Energy, Control and Transmission Systems (ICPECTS), Chennai, India.","DOI":"10.1109\/ICPECTS56089.2022.10046783"},{"key":"ref_105","doi-asserted-by":"crossref","first-page":"160635","DOI":"10.1109\/ACCESS.2021.3132559","article-title":"Maximum entropy Markov model for human activity recognition using depth camera","volume":"9","author":"Alrashdi","year":"2021","journal-title":"IEEE Access"},{"key":"ref_106","doi-asserted-by":"crossref","unstructured":"Ahad, M.A.R., Antar, A.D., Ahmed, M., Ahad, M.A.R., Antar, A.D., and Ahmed, M. (2021). IoT Sensor-Based Activity Recognition: Human Activity Recognition, Springer.","DOI":"10.1007\/978-3-030-51379-5"},{"key":"ref_107","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1145\/2972413.2972425","article-title":"Activity recognition on smart devices: Dealing with diversity in the wild","volume":"20","author":"Blunck","year":"2016","journal-title":"Getmobile Mob. Comput. Commun."},{"key":"ref_108","unstructured":"Torres, R.L.S., Ranasinghe, D.C., Shi, Q., and Sample, A.P. (May, January 30). Sensor enabled wearable RFID technology for mitigating the risk of falls near beds. Proceedings of the 2013 IEEE International Conference on RFID (RFID), Orlando, FL, USA."},{"key":"ref_109","doi-asserted-by":"crossref","first-page":"87","DOI":"10.3233\/AIS-160372","article-title":"Human activity recognition using multisensor data fusion based on reservoir computing","volume":"8","author":"Palumbo","year":"2016","journal-title":"J. Ambient. Intell. Smart Environ."},{"key":"ref_110","unstructured":"Anguita, D., Ghio, A., Oneto, L., Parra, X., and Reyes-Ortiz, J.L. (2013, January 24\u201326). A public domain dataset for human activity recognition using smartphones. Proceedings of the Esann, Bruges, Belgium."},{"key":"ref_111","doi-asserted-by":"crossref","first-page":"754","DOI":"10.1016\/j.neucom.2015.07.085","article-title":"Transition-aware human activity recognition using smartphones","volume":"171","author":"Oneto","year":"2016","journal-title":"Neurocomputing"},{"key":"ref_112","doi-asserted-by":"crossref","first-page":"563","DOI":"10.1007\/s00779-011-0415-z","article-title":"Personalization and user verification in wearable systems using biometric walking patterns","volume":"16","author":"Casale","year":"2012","journal-title":"Pers. Ubiquitous Comput."},{"key":"ref_113","doi-asserted-by":"crossref","first-page":"2033","DOI":"10.1016\/j.patrec.2012.12.014","article-title":"The Opportunity challenge: A benchmark database for on-body sensor-based activity recognition","volume":"34","author":"Chavarriaga","year":"2013","journal-title":"Pattern Recognit. Lett."},{"key":"ref_114","doi-asserted-by":"crossref","first-page":"5460","DOI":"10.3390\/s130505460","article-title":"Activity recognition using hybrid generative\/discriminative models on home environments using binary sensors","volume":"13","author":"Sanchis","year":"2013","journal-title":"Sensors"},{"key":"ref_115","doi-asserted-by":"crossref","unstructured":"Ba\u00f1os, O., Damas, M., Pomares, H., Rojas, I., T\u00f3th, M.A., and Amft, O. (2012, January 5\u20138). A benchmark dataset to evaluate sensor displacement in activity recognition. Proceedings of the 2012 ACM Conference on Ubiquitous Computing, Pittsburgh, PA, USA.","DOI":"10.1145\/2370216.2370437"},{"key":"ref_116","doi-asserted-by":"crossref","first-page":"3605","DOI":"10.1016\/j.patcog.2010.04.019","article-title":"Comparative study on classifying human activities with miniature inertial and magnetic sensors","volume":"43","author":"Altun","year":"2010","journal-title":"Pattern Recognit."},{"key":"ref_117","doi-asserted-by":"crossref","first-page":"1451","DOI":"10.1007\/s00521-013-1364-4","article-title":"An experimental characterization of reservoir computing in ambient assisted living applications","volume":"24","author":"Bacciu","year":"2014","journal-title":"Neural Comput. Appl."},{"key":"ref_118","doi-asserted-by":"crossref","unstructured":"Banos, O., Garcia, R., Holgado-Terriza, J.A., Damas, M., Pomares, H., Rojas, I., Saez, A., and Villalonga, C. (2014, January 2\u20135). mHealthDroid: A novel framework for agile development of mobile health applications. Proceedings of the Ambient Assisted Living and Daily Activities: 6th International Work-Conference, IWAAL 2014, Belfast, UK. Proceedings 6.","DOI":"10.1007\/978-3-319-13105-4_14"},{"key":"ref_119","doi-asserted-by":"crossref","first-page":"133190","DOI":"10.1109\/ACCESS.2019.2940729","article-title":"Smartphone and smartwatch-based biometrics using activities of daily living","volume":"7","author":"Weiss","year":"2019","journal-title":"IEEE Access"},{"key":"ref_120","doi-asserted-by":"crossref","unstructured":"Schmidt, P., Reiss, A., Duerichen, R., Marberger, C., and Van Laerhoven, K. (2018, January 16\u201320). Introducing wesad, a multimodal dataset for wearable stress and affect detection. Proceedings of the 20th ACM international conference on multimodal interaction, Boulder, CO, USA.","DOI":"10.1145\/3242969.3242985"},{"key":"ref_121","doi-asserted-by":"crossref","unstructured":"Schuldt, C., Laptev, I., and Caputo, B. (2004, January 26). Recognizing human actions: A local SVM approach. Proceedings of the 17th International Conference on Pattern Recognition, ICPR 2004, Cambridge, UK.","DOI":"10.1109\/ICPR.2004.1334462"},{"key":"ref_122","doi-asserted-by":"crossref","unstructured":"Ballan, L., Bertini, M., Del Bimbo, A., Seidenari, L., and Serra, G. (October, January 27). Effective codebooks for human action categorization. Proceedings of the 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, Kyoto, Japan.","DOI":"10.1109\/ICCVW.2009.5457658"},{"key":"ref_123","unstructured":"Li, W., Wong, Y., Liu, A.A., Li, Y., Su, Y.T., and Kankanhalli, M. (2016). Multi-camera action dataset (MCAD): A dataset for studying non-overlapped cross-camera action recognition. arXiv."},{"key":"ref_124","unstructured":"Wang, J., Liu, Z., Wu, Y., and Yuan, J. (2012, January 16\u201321). Mining actionlet ensemble for action recognition with depth cameras. Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA."},{"key":"ref_125","doi-asserted-by":"crossref","first-page":"971","DOI":"10.1007\/s00138-012-0450-4","article-title":"Recognizing 50 human action categories of web videos","volume":"24","author":"Reddy","year":"2013","journal-title":"Mach. Vis. Appl."},{"key":"ref_126","doi-asserted-by":"crossref","unstructured":"Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., and Serre, T. (2011, January 6\u201313). HMDB: A large video database for human motion recognition. Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126543"},{"key":"ref_127","doi-asserted-by":"crossref","unstructured":"Marszalek, M., Laptev, I., and Schmid, C. (2009, January 20\u201325). Actions in context. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206557"},{"key":"ref_128","doi-asserted-by":"crossref","first-page":"2247","DOI":"10.1109\/TPAMI.2007.70711","article-title":"Actions as space-time shapes","volume":"29","author":"Gorelick","year":"2007","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_129","doi-asserted-by":"crossref","unstructured":"Yao, B., Jiang, X., Khosla, A., Lin, A.L., Guibas, L., and Fei-Fei, L. (2011, January 6\u201313). Human action recognition by learning bases of action attributes and parts. Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126386"},{"key":"ref_130","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1016\/j.cviu.2006.07.013","article-title":"Free viewpoint action recognition using motion history volumes","volume":"104","author":"Weinl","year":"2006","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_131","doi-asserted-by":"crossref","unstructured":"Stein, S., and McKenna, S.J. (2013, January 8\u201312). Combining embedded accelerometers with computer vision for recognizing food preparation activities. Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing, Zurich, Switzerland.","DOI":"10.1145\/2493432.2493482"},{"key":"ref_132","doi-asserted-by":"crossref","unstructured":"Nghiem, A.T., Bremond, F., Thonnat, M., and Valentin, V. (2007, January 5\u20137). ETISEO, performance evaluation for video surveillance systems. Proceedings of the 2007 IEEE Conference on Advanced Video and Signal Based Surveillance, London, UK.","DOI":"10.1109\/AVSS.2007.4425357"},{"key":"ref_133","doi-asserted-by":"crossref","unstructured":"Niebles, J.C., Chen, C.W., and Fei-Fei, L. (2010, January 5\u201311). Modeling temporal structure of decomposable motion segments for activity classification. Proceedings of the Computer Vision\u2013ECCV 2010: 11th European Conference on Computer Vision, Heraklion, Crete, Greece. Proceedings, Part II 11.","DOI":"10.1007\/978-3-642-15552-9_29"},{"key":"ref_134","unstructured":"Ryoo, M.S., and Aggarwal, J.K. (2010, January 23\u201326). UT-interaction dataset, ICPR contest on semantic description of human activities (SDHA). Proceedings of the IEEE International Conference on Pattern Recognition Workshops, Istanbul, Turkey."},{"key":"ref_135","doi-asserted-by":"crossref","unstructured":"Chen, C.-C., and Aggarwal, J.K. (2009, January 8\u20139). Recognizing human action from a far field of view. Proceedings of the 2009 Workshop on Motion and Video Computing (WMVC), Snowbird, UT, USA.","DOI":"10.1109\/WMVC.2009.5399231"},{"key":"ref_136","doi-asserted-by":"crossref","unstructured":"Caba Heilbron, F., Escorcia, V., Ghanem, B., and Carlos Niebles, J. (2015, January 7\u201312). Activitynet: A large-scale video benchmark for human activity understanding. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298698"},{"key":"ref_137","unstructured":"Kay, W., Carreira, J., Simonyan, K., Zhang, B., Hillier, C., Vijayanarasimhan, S., Viola, F., Green, T., Back, T., and Natsev, P. (2017). The kinetics human action video dataset. arXiv."},{"key":"ref_138","first-page":"1","article-title":"A dataset of 101 human action classes from videos in the wild","volume":"2","author":"Soomro","year":"2012","journal-title":"Cent. Res. Comput. Vis."},{"key":"ref_139","doi-asserted-by":"crossref","unstructured":"Liu, J., Luo, J., and Shah, M. (2009, January 20\u201325). Recognizing realistic actions from videos \u201cin the wild\u201d. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206744"},{"key":"ref_140","doi-asserted-by":"crossref","unstructured":"Berenson, D., Abbeel, P., and Goldberg, K. (2012, January 14\u201318). A robot path planning framework that learns from experience. Proceedings of the 2012 IEEE International Conference on Robotics and Automation, St. Paul, MN, USA.","DOI":"10.1109\/ICRA.2012.6224742"},{"key":"ref_141","doi-asserted-by":"crossref","unstructured":"Martinez, J., Black, M.J., and Romero, J. (2017, January 21\u201326). On human motion prediction using recurrent neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.497"},{"key":"ref_142","doi-asserted-by":"crossref","first-page":"6096","DOI":"10.1109\/TIP.2021.3089380","article-title":"PVRED: A position-velocity recurrent encoder-decoder for human motion prediction","volume":"30","author":"Wang","year":"2021","journal-title":"IEEE Trans. Image Process."},{"key":"ref_143","doi-asserted-by":"crossref","unstructured":"Cao, Z., Gao, H., Mangalam, K., Cai, Q.Z., Vo, M., and Malik, J. (2020, January 23\u201328). Long-term human motion prediction with scene context. Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK. Proceedings, Part I 16.","DOI":"10.1007\/978-3-030-58452-8_23"},{"key":"ref_144","doi-asserted-by":"crossref","unstructured":"Aksan, E., Kaufmann, M., Cao, P., and Hilliges, O. (2021, January 1\u20133). A spatio-temporal transformer for 3d human motion prediction. Proceedings of the 2021 International Conference on 3D Vision (3DV), Virtual Conference.","DOI":"10.1109\/3DV53792.2021.00066"},{"key":"ref_145","doi-asserted-by":"crossref","unstructured":"Medjaouri, O., and Desai, K. (2022, January 18\u201324). Hr-stan: High-resolution spatio-temporal attention network for 3d human motion prediction. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LO, USA.","DOI":"10.1109\/CVPRW56347.2022.00286"},{"key":"ref_146","doi-asserted-by":"crossref","unstructured":"Tanberk, S., T\u00fckel, D.B., and Uysal, M. (2020, January 15\u201317). A Simple AI-Powered Video Analytics Framework for Human Motion Imitation. Proceedings of the 2020 Innovations in Intelligent Systems and Applications Conference (ASYU), Istanbul, Turkey.","DOI":"10.1109\/ASYU50717.2020.9259895"}],"container-title":["Machine Learning and Knowledge Extraction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-4990\/6\/2\/40\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:30:03Z","timestamp":1760106603000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-4990\/6\/2\/40"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,18]]},"references-count":146,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2024,6]]}},"alternative-id":["make6020040"],"URL":"https:\/\/doi.org\/10.3390\/make6020040","relation":{},"ISSN":["2504-4990"],"issn-type":[{"value":"2504-4990","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,18]]}}}