{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,31]],"date-time":"2025-12-31T00:24:20Z","timestamp":1767140660662,"version":"build-2238731810"},"reference-count":42,"publisher":"Springer Science and Business Media LLC","issue":"14","license":[{"start":{"date-parts":[[2023,1,20]],"date-time":"2023-01-20T00:00:00Z","timestamp":1674172800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,1,20]],"date-time":"2023-01-20T00:00:00Z","timestamp":1674172800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Appl Intell"],"published-print":{"date-parts":[[2023,7]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Predicting human motion based on past observed motion is one of the challenging issues in computer vision and graphics. Existing research works are dealing with this issue by using discriminative models and showing the results for cases that follow a homogeneous distribution (in distribution) and not discussing the issues of the domain shift problem, where training and testing data follow a heterogeneous (out of distribution) problem, which is the reality when such models are used in practice. However, recent research proposed addressing domain shift issues by augmenting the discriminative model with a generative model and obtained better results. In the present investigation, we propose regularizing the extended network by inserting linear layers to minimize the rank of the latent space and train the entire end-to-end network. We regularize the network to strengthen the model to deal effectively with domain shift scenarios. Both training and testing data come from different distribution sets; to deal with this, we toughen our network by adding the extra linear layers to the network encoder. We tested our model with the benchmark datasets, CMU Motion Capture and Human3.6M, and proved that our model outperforms 14 OoD actions of H3.6M and 7 OoD actions of CMU MoCap in terms of the Euclidean distance calculated between predicted and ground truth joint angle values. Our average results of 14 OoD actions for short-term (80, 160, 320, 400) are 0.34, 0.6, 0.96, 1.07, and for CMU MoCap of 7 OoD actions for short-term and long term (80, 160, 320, 400, 1000) are 0.28, 0.45, 0.77, 0.89, 1.46. All these results are much better than the other state-of-the-art results.<\/jats:p>","DOI":"10.1007\/s10489-022-04419-x","type":"journal-article","created":{"date-parts":[[2023,1,20]],"date-time":"2023-01-20T02:47:18Z","timestamp":1674182838000},"page":"18027-18040","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Implicit regularization of a deep augmented neural network model for human motion prediction"],"prefix":"10.1007","volume":"53","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7022-290X","authenticated-orcid":false,"given":"Gaurav Kumar","family":"Yadav","sequence":"first","affiliation":[]},{"given":"Mohamed","family":"Abdel-Nasser","sequence":"additional","affiliation":[]},{"given":"Hatem A.","family":"Rashwan","sequence":"additional","affiliation":[]},{"given":"Domenec","family":"Puig","sequence":"additional","affiliation":[]},{"given":"G. C.","family":"Nandi","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,1,20]]},"reference":[{"key":"4419_CR1","doi-asserted-by":"crossref","unstructured":"Gui L-Y, Zhang K, Wang Y-X, Liang X, Moura JM, Veloso M (2018) Teaching robots to predict human motion. In: 2018 IEEE\/RSJ international conference on intelligent robots and systems (IROS). IEEE, pp 562\u2013567","DOI":"10.1109\/IROS.2018.8594452"},{"key":"4419_CR2","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1111\/epi.14050","volume":"59","author":"EE Geertsema","year":"2018","unstructured":"Geertsema EE, Thijs RD, Gutter T, Vledder B, Arends JB, Leijten FS, Visser GH, Kalitzin SN (2018) Automated video-based detection of nocturnal convulsive seizures in a residential care setting. Epilepsia 59:53\u201360","journal-title":"Epilepsia"},{"key":"4419_CR3","doi-asserted-by":"crossref","unstructured":"Shirai A, Geslin E, Richir S (2007) Wiimedia: motion analysis methods and applications using a consumer video game controller. In: Proceedings of the 2007 ACM SIGGRAPH symposium on video games, pp 133\u2013140","DOI":"10.1145\/1274940.1274966"},{"key":"4419_CR4","unstructured":"Rofougaran AR, Rofougaran M, Seshadri N, Ibrahim BB, Walley J, Karaoguz J (2018) Game console and gaming object with motion prediction modeling and methods for use therewith. Google Patents, US Patent 9,943.760"},{"key":"4419_CR5","doi-asserted-by":"crossref","unstructured":"Zhang B, Zhong J, Cai W (2022) A data-driven approach for pedestrian intention prediction in large public places. In: SIGSIM Conference on principles of advanced discrete simulation, pp 33\u201336","DOI":"10.1145\/3518997.3531022"},{"issue":"3","key":"4419_CR6","doi-asserted-by":"publisher","first-page":"3018","DOI":"10.1007\/s10489-021-02562-5","volume":"52","author":"Q Ma","year":"2022","unstructured":"Ma Q, Zou Q, Huang Y, Wang N (2022) Dynamic pedestrian trajectory forecasting with lstm-based delaunay triangulation. Appl Intell 52(3):3018\u20133028","journal-title":"Appl Intell"},{"key":"4419_CR7","doi-asserted-by":"crossref","unstructured":"Hsu Y. -C., Shen Y, Jin H, Kira Z (2020) Generalized odin: detecting out-of-distribution image without learning from out-of-distribution data. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 10951\u201310960","DOI":"10.1109\/CVPR42600.2020.01096"},{"key":"4419_CR8","doi-asserted-by":"crossref","unstructured":"Singh D, Srivastava R (2022) Graph neural network with rnns based trajectory prediction of dynamic agents for autonomous vehicle. Appl Intell 1\u201316","DOI":"10.1007\/s10489-021-03120-9"},{"key":"4419_CR9","doi-asserted-by":"publisher","first-page":"103453","DOI":"10.1016\/j.trc.2021.103453","volume":"134","author":"A Kalatian","year":"2022","unstructured":"Kalatian A, Farooq B (2022) A context-aware pedestrian trajectory prediction framework for automated vehicles. Transportation Research Part C: Emerging Technologies 134:103453","journal-title":"Transportation Research Part C: Emerging Technologies"},{"key":"4419_CR10","doi-asserted-by":"publisher","first-page":"141635","DOI":"10.1109\/ACCESS.2021.3119629","volume":"9","author":"S Dafrallah","year":"2021","unstructured":"Dafrallah S, Amine A, Mousset S, Bensrhair A (2021) Monocular pedestrian orientation recognition based on capsule network for a novel collision warning system. IEEE Access 9:141635\u2013141650","journal-title":"IEEE Access"},{"key":"4419_CR11","unstructured":"Bourached A, Griffiths R. -R., Gray R, Jha A, Nachev P (2020) Generative model-enhanced human motion prediction. Applied AI Letters"},{"key":"4419_CR12","doi-asserted-by":"crossref","unstructured":"Mao W, Liu M, Salzmann M, Li H (2019) Learning trajectory dependencies for human motion prediction. In: Proceedings of the IEEE\/CVF international conference on computer vision, pp 9489\u20139497","DOI":"10.1109\/ICCV.2019.00958"},{"key":"4419_CR13","first-page":"14736","volume":"33","author":"L Jing","year":"2020","unstructured":"Jing L, Zbontar J, et al. (2020) Implicit rank-minimizing autoencoder. Adv Neural Inf Process Syst 33:14736\u201314746","journal-title":"Adv Neural Inf Process Syst"},{"issue":"7","key":"4419_CR14","doi-asserted-by":"publisher","first-page":"1325","DOI":"10.1109\/TPAMI.2013.248","volume":"36","author":"C Ionescu","year":"2014","unstructured":"Ionescu C, Papava D, Olaru V, Sminchisescu C (2014) Human3.6m: large scale datasets and predictive methods for 3d human sensing in natural environments. IEEE Trans Pattern Anal Mach Intell 36 (7):1325\u20131339","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"4419_CR15","unstructured":"CMU Graphics Lab Motion Capture Database. http:\/\/mocap.cs.cmu.edu\/"},{"key":"4419_CR16","doi-asserted-by":"crossref","unstructured":"Li M, Chen S, Zhao Y, Zhang Y, Wang Y, Tian Q (2020) Dynamic multiscale graph neural networks for 3d skeleton based human motion prediction. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 214\u2013223","DOI":"10.1109\/CVPR42600.2020.00029"},{"key":"4419_CR17","doi-asserted-by":"crossref","unstructured":"Butepage J, Black MJ, Kragic D, Kjellstrom H (2017) Deep representation learning for human motion prediction and classification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6158\u20136166","DOI":"10.1109\/CVPR.2017.173"},{"key":"4419_CR18","doi-asserted-by":"crossref","unstructured":"Fragkiadaki K, Levine S, Felsen P, Malik J (2015) Recurrent network models for human dynamics. In: Proceedings of the IEEE international conference on computer vision, pp 4346\u20134354","DOI":"10.1109\/ICCV.2015.494"},{"key":"4419_CR19","doi-asserted-by":"crossref","unstructured":"Mao W, Liu M, Salzmann M (2020) History repeats itself: human motion prediction via motion attention. In: European conference on computer vision. Springer, pp 474\u2013489","DOI":"10.1007\/978-3-030-58568-6_28"},{"issue":"6","key":"4419_CR20","doi-asserted-by":"publisher","first-page":"6769","DOI":"10.1007\/s10489-021-02764-x","volume":"52","author":"Y Yu","year":"2022","unstructured":"Yu Y, Tian N, Hao X, Ma T, Yang C (2022) Human motion prediction with gated recurrent unit model of multi-dimensional input. Appl Intell 52(6):6769\u20136781","journal-title":"Appl Intell"},{"issue":"3","key":"4419_CR21","doi-asserted-by":"publisher","first-page":"478","DOI":"10.1109\/JSTSP.2020.2987728","volume":"14","author":"C Zhang","year":"2020","unstructured":"Zhang C, Yang Z, He X, Deng L (2020) Multimodal intelligence: representation learning, information fusion, and applications. IEEE J Sel Top Signal Process 14(3):478\u2013493","journal-title":"IEEE J Sel Top Signal Process"},{"issue":"7","key":"4419_CR22","doi-asserted-by":"publisher","first-page":"5132","DOI":"10.1007\/s10489-020-02049-9","volume":"51","author":"A Aldhubri","year":"2021","unstructured":"Aldhubri A, Lasheng Y, Mohsen F, Al-Qatf M (2021) Variational autoencoder bayesian matrix factorization (vabmf) for collaborative filtering. Appl Intell 51(7):5132\u20135145","journal-title":"Appl Intell"},{"key":"4419_CR23","first-page":"5081","volume":"33","author":"R Lopez","year":"2020","unstructured":"Lopez R, Boyeau P, Yosef N, Jordan M, Regier J (2020) Decision-making with auto-encoding variational bayes. Adv Neural Inf Process Syst 33:5081\u20135092","journal-title":"Adv Neural Inf Process Syst"},{"key":"4419_CR24","unstructured":"Zietlow D, Rolinek M, Martius G (2021) Demystifying inductive biases for (beta-) vae based architectures. In: International conference on machine learning. PMLR, pp 12945\u201312954"},{"key":"4419_CR25","unstructured":"Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: International conference on machine learning. PMLR, pp 1597\u20131607"},{"key":"4419_CR26","unstructured":"Liang S, Li Y, Srikant R (2018) Enhancing the reliability of out-of-distribution image detection in neural networks. In: International conference on learning representations"},{"key":"4419_CR27","unstructured":"Hendrycks D, Mazeika M, Dietterich T (2018) Deep anomaly detection with outlier exposure. In: International conference on learning representations"},{"key":"4419_CR28","doi-asserted-by":"crossref","unstructured":"Gustafsson FK, Danelljan M, Schon TB (2020) Evaluating scalable bayesian deep learning methods for robust computer vision. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition workshops, pp 318\u2013319","DOI":"10.1109\/CVPRW50498.2020.00167"},{"key":"4419_CR29","unstructured":"Lee K, Lee K, Lee H, Shin J (2018) A simple unified framework for detecting out-of-distribution samples and adversarial attacks. Advances in Neural Information Processing Systems 31"},{"issue":"23","key":"4419_CR30","doi-asserted-by":"publisher","first-page":"11537","DOI":"10.1073\/pnas.1820226116","volume":"116","author":"AM Saxe","year":"2019","unstructured":"Saxe AM, McClelland JL, Ganguli S (2019) A mathematical theory of semantic development in deep neural networks. Proc Natl Acad Sci 116(23):11537\u201311546","journal-title":"Proc Natl Acad Sci"},{"key":"4419_CR31","doi-asserted-by":"crossref","unstructured":"Gunasekar S, Woodworth B, Bhojanapalli S, Neyshabur B, Srebro N (2018) Implicit regularization in matrix factorization. In: 2018 information theory and applications workshop (ITA). IEEE, pp 1\u201310","DOI":"10.1109\/ITA.2018.8503198"},{"issue":"1","key":"4419_CR32","first-page":"2822","volume":"19","author":"D Soudry","year":"2018","unstructured":"Soudry D, Hoffer E, Nacson MS, Gunasekar S, Srebro N (2018) The implicit bias of gradient descent on separable data. The Journal of Machine Learning Research 19(1):2822\u20132878","journal-title":"The Journal of Machine Learning Research"},{"key":"4419_CR33","unstructured":"Gidel G, Bach F, Lacoste-Julien S (2019) Implicit regularization of discrete gradient dynamics in linear neural networks. Adv Neural Inf Process Syst 32"},{"issue":"7","key":"4419_CR34","doi-asserted-by":"publisher","first-page":"1325","DOI":"10.1109\/TPAMI.2013.248","volume":"36","author":"C Ionescu","year":"2013","unstructured":"Ionescu C, Papava D, Olaru V, Sminchisescu C (2013) Human3. 6m: large scale datasets and predictive methods for 3d human sensing in natural environments. IEEE Trans Pattern Anal Mach Intell 36 (7):1325\u20131339","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"4419_CR35","doi-asserted-by":"crossref","unstructured":"Yadav GK, Nandi G (2020) Development of adaptive sampling based strategy for human activity predictions using sequential networks. In: 2020 IEEE 4th conference on information & communication technology (CICT). IEEE, pp 1\u20136","DOI":"10.1109\/CICT51604.2020.9312097"},{"key":"4419_CR36","doi-asserted-by":"crossref","unstructured":"Lian J, Ren W, Li L, Zhou Y, Zhou B (2022) Ptp-stgcn: pedestrian trajectory prediction based on a spatio-temporal graph convolutional neural network. Appl Intell 1\u201317","DOI":"10.1007\/s10489-022-03524-1"},{"key":"4419_CR37","doi-asserted-by":"crossref","unstructured":"Martinez J, Black MJ, Romero J (2017) On human motion prediction using recurrent neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2891\u20132900","DOI":"10.1109\/CVPR.2017.497"},{"key":"4419_CR38","doi-asserted-by":"crossref","unstructured":"Li D, Rodriguez C, Yu X, Li H (2020) Word-level deep sign language recognition from video: a new large-scale dataset and methods comparison. In: Proceedings of the IEEE\/CVF winter conference on applications of computer vision, pp 1459\u20131469","DOI":"10.1109\/WACV45572.2020.9093512"},{"key":"4419_CR39","doi-asserted-by":"crossref","unstructured":"Myronenko A (2018) 3d mri brain tumor segmentation using autoencoder regularization. In: International MICCAI brainlesion workshop. Springer, pp 311\u2013320","DOI":"10.1007\/978-3-030-11726-9_28"},{"key":"4419_CR40","unstructured":"Paszke A, Gross S, Chintala S, Chanan G, Yang E, DeVito Z, Lin Z, Desmaison A, Antiga L, Lerer A (2017) Automatic differentiation in pytorch"},{"key":"4419_CR41","doi-asserted-by":"crossref","unstructured":"Zhang Z (2018) Improved adam optimizer for deep neural networks. In: 2018 IEEE\/ACM 26th international symposium on quality of Service (IWQoS). Ieee, pp 1\u20132","DOI":"10.1109\/IWQoS.2018.8624183"},{"key":"4419_CR42","doi-asserted-by":"crossref","unstructured":"Lebailly T, Kiciroglu S, Salzmann M, Fua P, Wang W (2020) Motion prediction using temporal inception module. In: Proceedings of the asian conference on computer vision","DOI":"10.1007\/978-3-030-69532-3_39"}],"updated-by":[{"DOI":"10.1007\/s10489-024-06148-9","type":"correction","label":"Correction","source":"publisher","updated":{"date-parts":[[2025,3,6]],"date-time":"2025-03-06T00:00:00Z","timestamp":1741219200000}}],"container-title":["Applied Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-022-04419-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10489-022-04419-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-022-04419-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,6]],"date-time":"2025-03-06T03:36:56Z","timestamp":1741232216000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10489-022-04419-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,20]]},"references-count":42,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2023,7]]}},"alternative-id":["4419"],"URL":"https:\/\/doi.org\/10.1007\/s10489-022-04419-x","relation":{},"ISSN":["0924-669X","1573-7497"],"issn-type":[{"value":"0924-669X","type":"print"},{"value":"1573-7497","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,1,20]]},"assertion":[{"value":"19 December 2022","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 January 2023","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 March 2025","order":3,"name":"change_date","label":"Change Date","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Correction","order":4,"name":"change_type","label":"Change Type","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"A Correction to this paper has been published:","order":5,"name":"change_details","label":"Change Details","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"https:\/\/doi.org\/10.1007\/s10489-024-06148-9","URL":"https:\/\/doi.org\/10.1007\/s10489-024-06148-9","order":6,"name":"change_details","label":"Change Details","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no conflicts of interest to declare.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"<!--Emphasis Type='Bold' removed-->Conflict of Interests"}}]}}