{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T09:29:18Z","timestamp":1768901358124,"version":"3.49.0"},"reference-count":54,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2024,4,15]],"date-time":"2024-04-15T00:00:00Z","timestamp":1713139200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,4,15]],"date-time":"2024-04-15T00:00:00Z","timestamp":1713139200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62373116"],"award-info":[{"award-number":["62373116"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62163007"],"award-info":[{"award-number":["62163007"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018555","name":"Science and Technology Program of Guizhou Province","doi-asserted-by":"publisher","award":["QKHZC [2023]118"],"award-info":[{"award-number":["QKHZC [2023]118"]}],"id":[{"id":"10.13039\/501100018555","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018555","name":"Science and Technology Program of Guizhou Province","doi-asserted-by":"publisher","award":["PTRC[2020]6007-2"],"award-info":[{"award-number":["PTRC[2020]6007-2"]}],"id":[{"id":"10.13039\/501100018555","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100018555","name":"Science and Technology Program of Guizhou Province","doi-asserted-by":"publisher","award":["[2021]439"],"award-info":[{"award-number":["[2021]439"]}],"id":[{"id":"10.13039\/501100018555","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2024,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Keyframe extraction can effectively help users quickly understand video content. Generally, keyframes should be representative of the video content and simultaneously be diverse to reduce redundancy. Aiming to find the features of frames and filter out representative frames of the video, we propose a method of keyframe recommendation based on feature intercross and fusion (KFRFIF). The method is inspired by the implied relations between keyframe-extraction problem and recommendation problem. First, we investigate the application of a recommendation framework to the keyframe extraction problem. Second, the architecture of the proposed KFRFIF is put forward. Then, an algorithm for extracting intra-frame image features based on the combination of multiple image descriptors is proposed. An algorithm for extracting inter-frame distance features based on the combination of multiple distance calculation methods is designed. Moreover, A recommendation model based on feature intercross and fusion is put forward. An ablation study is further performed to verify the effectiveness of the submodule. Ultimately, the experimental results on four datasets with five outstanding approaches indicate the superior performance of our approach.<\/jats:p>","DOI":"10.1007\/s40747-024-01417-z","type":"journal-article","created":{"date-parts":[[2024,4,15]],"date-time":"2024-04-15T09:01:59Z","timestamp":1713171719000},"page":"4955-4971","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["Keyframe recommendation based on feature intercross and fusion"],"prefix":"10.1007","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8761-5195","authenticated-orcid":false,"given":"Guanci","family":"Yang","sequence":"first","affiliation":[]},{"given":"Zonglin","family":"He","sequence":"additional","affiliation":[]},{"given":"Zhidong","family":"Su","sequence":"additional","affiliation":[]},{"given":"Yang","family":"Li","sequence":"additional","affiliation":[]},{"given":"Bingqi","family":"Hu","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2024,4,15]]},"reference":[{"key":"1417_CR1","doi-asserted-by":"publisher","first-page":"3137","DOI":"10.1109\/TIP.2015.2438550","volume":"24","author":"W Wang","year":"2015","unstructured":"Wang W, Shen J, Li X, Porikli F (2015) Robust video object cosegmentation. IEEE Trans Image Process 24:3137\u20133148. https:\/\/doi.org\/10.1109\/TIP.2015.2438550","journal-title":"IEEE Trans Image Process"},{"key":"1417_CR2","doi-asserted-by":"crossref","unstructured":"Venugopala PS, Nayak AA, Sarojadevi H, Chiplunkar NN (2015) Various challenges in video watermarking for android mobile devices. In: 2015 IEEE int. conf. inf. process. ICIP, pp 248\u2013253","DOI":"10.1109\/INFOP.2015.7489388"},{"key":"1417_CR3","doi-asserted-by":"publisher","first-page":"355","DOI":"10.1109\/TIP.2016.2627801","volume":"26","author":"X Lu","year":"2017","unstructured":"Lu X, Zheng X, Li X (2017) Latent semantic minimal hashing for image retrieval. IEEE Trans Image Process 26:355\u2013368. https:\/\/doi.org\/10.1109\/TIP.2016.2627801","journal-title":"IEEE Trans Image Process"},{"key":"1417_CR4","doi-asserted-by":"publisher","first-page":"109943","DOI":"10.1109\/ACCESS.2021.3101938","volume":"9","author":"A Castro","year":"2021","unstructured":"Castro A, Villagra VA, Garcia P, Rivera D, Toledo D (2021) An ontological-based model to data governance for big data. IEEE Access 9:109943\u2013109959. https:\/\/doi.org\/10.1109\/ACCESS.2021.3101938","journal-title":"IEEE Access"},{"key":"1417_CR5","doi-asserted-by":"publisher","first-page":"2161","DOI":"10.1109\/TMM.2016.2614233","volume":"18","author":"Y-G Jiang","year":"2016","unstructured":"Jiang Y-G, Wang J, Wang Q, Liu W, Ngo C-W (2016) Hierarchical visualization of video search results for topic-based browsing. IEEE Trans Multimed 18:2161\u20132170. https:\/\/doi.org\/10.1109\/TMM.2016.2614233","journal-title":"IEEE Trans Multimed"},{"key":"1417_CR6","doi-asserted-by":"publisher","first-page":"1251","DOI":"10.1109\/TCSVT.2014.2302554","volume":"24","author":"P Sidiropoulos","year":"2014","unstructured":"Sidiropoulos P, Mezaris V, Kompatsiaris I (2014) Video tomographs and a base detector selection strategy for improving large-scale video concept detection. IEEE Trans Circuits Syst Video Technol 24:1251\u20131264. https:\/\/doi.org\/10.1109\/TCSVT.2014.2302554","journal-title":"IEEE Trans Circuits Syst Video Technol"},{"key":"1417_CR7","doi-asserted-by":"publisher","first-page":"2799","DOI":"10.1109\/TIP.2018.2890749","volume":"28","author":"Z Tu","year":"2019","unstructured":"Tu Z, Li H, Zhang D, Dauwels J, Li B, Yuan J (2019) Action-stage emphasized spatiotemporal VLAD for video action recognition. IEEE Trans Image Process 28:2799\u20132812. https:\/\/doi.org\/10.1109\/TIP.2018.2890749","journal-title":"IEEE Trans Image Process"},{"key":"1417_CR8","doi-asserted-by":"publisher","first-page":"1299","DOI":"10.1007\/s41870-023-01180-3","volume":"15","author":"R Roselinkiruba","year":"2023","unstructured":"Roselinkiruba R, Saranya Jothi C, Tamil Thendral M, Hemalatha R (2023) Secure video steganography using key frame and region selection technique. Int J Inf Technol 15:1299\u20131308. https:\/\/doi.org\/10.1007\/s41870-023-01180-3","journal-title":"Int J Inf Technol"},{"key":"1417_CR9","doi-asserted-by":"publisher","first-page":"15429","DOI":"10.1007\/s11042-020-10390-x","volume":"80","author":"R Mounika Bommisetty","year":"2021","unstructured":"Mounika Bommisetty R, Khare A, Siddiqui TJ, Palanisamy P (2021) Fusion of gradient and feature similarity for keyframe extraction. Multimed Tools Appl 80:15429\u201315467. https:\/\/doi.org\/10.1007\/s11042-020-10390-x","journal-title":"Multimed Tools Appl"},{"key":"1417_CR10","doi-asserted-by":"publisher","unstructured":"Thakre KS, Rajurkar AM, Manthalkar RR (2016) Video partitioning and secured keyframe extraction of MPEG video. In: 1ST Int. conf. inf. secur. priv., vol 78, pp 790\u2013798. https:\/\/doi.org\/10.1016\/j.procs.2016.02.058","DOI":"10.1016\/j.procs.2016.02.058"},{"key":"1417_CR11","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1007\/s00530-019-00642-8","volume":"26","author":"RM Bommisetty","year":"2020","unstructured":"Bommisetty RM, Prakash O, Khare A (2020) Keyframe extraction using Pearson correlation coefficient and color moments. Multimed Syst 26:267\u2013299. https:\/\/doi.org\/10.1007\/s00530-019-00642-8","journal-title":"Multimed Syst"},{"key":"1417_CR12","doi-asserted-by":"crossref","unstructured":"Sun B, Kong D, Wang S, Li J (2018) Keyframe extraction for human motion capture data based on affinity propagation. In: 2018 IEEE 9th annu. inf. technol. electron. mob. commun. conf. IEMCON, pp 107\u2013112","DOI":"10.1109\/IEMCON.2018.8614862"},{"key":"1417_CR13","doi-asserted-by":"publisher","first-page":"52","DOI":"10.1016\/j.patrec.2016.01.027","volume":"72","author":"A Ioannidis","year":"2016","unstructured":"Ioannidis A, Chasanis V, Likas A (2016) Weighted multi-view key-frame extraction. Pattern Recognit Lett 72:52\u201361. https:\/\/doi.org\/10.1016\/j.patrec.2016.01.027","journal-title":"Pattern Recognit Lett"},{"key":"1417_CR14","doi-asserted-by":"publisher","first-page":"522","DOI":"10.1016\/j.patcog.2014.08.002","volume":"48","author":"S Mei","year":"2015","unstructured":"Mei S, Guan G, Wang Z, Wan S, He M, Feng DD (2015) Video summarization via minimum sparse reconstruction. Pattern Recognit 48:522\u2013533. https:\/\/doi.org\/10.1016\/j.patcog.2014.08.002","journal-title":"Pattern Recognit"},{"key":"1417_CR15","doi-asserted-by":"publisher","first-page":"1589","DOI":"10.1109\/TIE.2016.2610946","volume":"64","author":"G Xia","year":"2017","unstructured":"Xia G, Sun H, Niu X, Zhang G, Feng L (2017) Keyframe extraction for human motion capture data based on joint kernel sparse representation. IEEE Trans Ind Electron 64:1589\u20131599. https:\/\/doi.org\/10.1109\/TIE.2016.2610946","journal-title":"IEEE Trans Ind Electron"},{"issue":"1","key":"1417_CR16","doi-asserted-by":"publisher","first-page":"4943","DOI":"10.1007\/s00371-022-02639-3","volume":"39","author":"Y Liu","year":"2022","unstructured":"Liu Y, Chen L, Lin Z (2022) Keyframe extraction for motion capture data via pose saliency and reconstruction error. Vis Comput 39(1):4943\u20134953. https:\/\/doi.org\/10.1007\/s00371-022-02639-3","journal-title":"Vis Comput"},{"issue":"34","key":"1417_CR17","doi-asserted-by":"publisher","first-page":"24513","DOI":"10.1007\/s00521-021-06322-x","volume":"35","author":"RS Kiziltepe","year":"2021","unstructured":"Kiziltepe RS, Gan JQ, Escobar JJ (2021) A novel keyframe extraction method for video classification using deep neural networks. Neural Comput. Appl 35(34):24513\u201324524. https:\/\/doi.org\/10.1007\/s00521-021-06322-x","journal-title":"Neural Comput. Appl"},{"key":"1417_CR18","doi-asserted-by":"crossref","unstructured":".Mahasseni B, Lam M, Todorovic S (2017) Unsupervised video summarization with adversarial lstm networks. In: Proc. IEEE conf. comput. vis. pattern recognit., pp 202\u2013211","DOI":"10.1109\/CVPR.2017.318"},{"key":"1417_CR19","doi-asserted-by":"crossref","unstructured":"Kar A, Rai N, Sikka K, Sharma G (2017) Adascan: adaptive scan pooling in deep convolutional neural networks for human action recognition in videos. In: Proc. IEEE conf. comput. vis. pattern recognit., pp 3376\u20133385","DOI":"10.1109\/CVPR.2017.604"},{"key":"1417_CR20","doi-asserted-by":"publisher","first-page":"4455","DOI":"10.1109\/JIOT.2019.2950469","volume":"7","author":"K Muhammad","year":"2020","unstructured":"Muhammad K, Hussain T, Tanveer M, Sannino G, de Albuquerque VHC (2020) Cost-effective video summarization using deep CNN with hierarchical weighted fusion for IoT surveillance networks. IEEE Internet Things J 7:4455\u20134463. https:\/\/doi.org\/10.1109\/JIOT.2019.2950469","journal-title":"IEEE Internet Things J"},{"key":"1417_CR21","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1016\/j.ecoinf.2013.09.003","volume":"23","author":"GGL Priya","year":"2014","unstructured":"Priya GGL, Domnic S (2014) Shot based keyframe extraction for ecological video indexing and retrieval. Ecol Inf 23:107\u2013117. https:\/\/doi.org\/10.1016\/j.ecoinf.2013.09.003","journal-title":"Ecol Inf"},{"key":"1417_CR22","doi-asserted-by":"publisher","first-page":"2730","DOI":"10.1109\/TIP.2011.2143421","volume":"20","author":"M Omidyeganeh","year":"2011","unstructured":"Omidyeganeh M, Ghaemmaghami S, Shirmohammadi S (2011) Video keyframe analysis using a segment-based statistical metric in a visually sensitive parametric space. IEEE Trans Image Process 20:2730\u20132737. https:\/\/doi.org\/10.1109\/TIP.2011.2143421","journal-title":"IEEE Trans Image Process"},{"key":"1417_CR23","doi-asserted-by":"publisher","DOI":"10.1002\/cav.1976","author":"C Xu","year":"2021","unstructured":"Xu C, Yu W, Li Y, Lu X, Wang M, Yang X (2021) Key frame extraction for human motion capture data via multiple binomial fitting. Comput Animat Virtual Worlds. https:\/\/doi.org\/10.1002\/cav.1976","journal-title":"Comput Animat Virtual Worlds"},{"key":"1417_CR24","doi-asserted-by":"publisher","first-page":"280","DOI":"10.1049\/iet-cvi.2015.0237","volume":"10","author":"M Fei","year":"2016","unstructured":"Fei M, Jiang W, Mao W, Song Z (2016) New fusional framework combining sparse selection and clustering for key frame extraction. IET Comput Vis 10:280\u2013288. https:\/\/doi.org\/10.1049\/iet-cvi.2015.0237","journal-title":"IET Comput Vis"},{"key":"1417_CR25","doi-asserted-by":"publisher","first-page":"3597","DOI":"10.1109\/TCSII.2021.3076112","volume":"68","author":"Y Zhou","year":"2021","unstructured":"Zhou Y, Zhang X, Ding F (2021) Hierarchical estimation approach for RBF-AR models with regression weights based on the increasing data length. IEEE Trans Circuits Syst II Expr Briefs 68:3597\u20133601. https:\/\/doi.org\/10.1109\/TCSII.2021.3076112","journal-title":"IEEE Trans Circuits Syst II Expr Briefs"},{"key":"1417_CR26","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1016\/j.neucom.2017.04.065","volume":"266","author":"J Li","year":"2017","unstructured":"Li J, Yao T, Ling Q, Mei T (2017) Detecting shot boundary with sparse coding for video summarization. Neurocomputing 266:66\u201378. https:\/\/doi.org\/10.1016\/j.neucom.2017.04.065","journal-title":"Neurocomputing"},{"key":"1417_CR27","doi-asserted-by":"publisher","first-page":"3967","DOI":"10.1109\/TCSVT.2020.3044600","volume":"31","author":"M Ma","year":"2021","unstructured":"Ma M, Mei S, Wan S, Wang Z, Feng DD, Bennamoun M (2021) Similarity based block sparse subset selection for video summarization. IEEE Trans Circuits Syst Video Technol 31:3967\u20133980. https:\/\/doi.org\/10.1109\/TCSVT.2020.3044600","journal-title":"IEEE Trans Circuits Syst Video Technol"},{"key":"1417_CR28","doi-asserted-by":"publisher","first-page":"47","DOI":"10.1016\/j.inffus.2022.10.015","volume":"91","author":"Y Li","year":"2023","unstructured":"Li Y, Yang G, Su Z, Li S, Wang Y (2023) Human activity recognition based on multienvironment sensor data. Inf Fusion 91:47\u201363. https:\/\/doi.org\/10.1016\/j.inffus.2022.10.015","journal-title":"Inf Fusion"},{"key":"1417_CR29","doi-asserted-by":"publisher","first-page":"926","DOI":"10.3390\/sym6040926","volume":"6","author":"Q Zhang","year":"2014","unstructured":"Zhang Q, Zhang S, Zhou D (2014) Keyframe extraction from human motion capture data based on a multiple population genetic algorithm. Symmetry-Basel 6:926\u2013937. https:\/\/doi.org\/10.3390\/sym6040926","journal-title":"Symmetry-Basel"},{"key":"1417_CR30","doi-asserted-by":"publisher","unstructured":"Yan X, Gilani SZ, Qin H, Feng M, Zhang L, Mian A (2018) Deep keyframe detection in human action videos. https:\/\/doi.org\/10.48550\/arXiv.1804.10021","DOI":"10.48550\/arXiv.1804.10021"},{"key":"1417_CR31","doi-asserted-by":"publisher","DOI":"10.1007\/s41870-024-01733-0","author":"A Banerjee","year":"2024","unstructured":"Banerjee A, Kumar E, Ravinder M (2024) Particle swarm optimized deep spatio-temporal features for efficient video retrieval. Int J Inf Technol. https:\/\/doi.org\/10.1007\/s41870-024-01733-0","journal-title":"Int J Inf Technol"},{"key":"1417_CR32","doi-asserted-by":"publisher","unstructured":"Kuehne H, Jhuang H, Garrote E, Poggio T, Serre T (2011) HMDB: a large video database for human motion recognition. In: 2011 international conference on computer vision, pp 2556\u20132563. https:\/\/doi.org\/10.1109\/ICCV.2011.6126543","DOI":"10.1109\/ICCV.2011.6126543"},{"key":"1417_CR33","doi-asserted-by":"publisher","unstructured":"Soomro K, Zamir AR, Shah M (2012) UCF101: a dataset of 101 human actions classes from videos in the wild. https:\/\/doi.org\/10.48550\/arXiv.1212.0402","DOI":"10.48550\/arXiv.1212.0402"},{"key":"1417_CR34","first-page":"505","volume-title":"Comput. Vis.\u2014ECCV 214","author":"M Gygli","year":"2014","unstructured":"Gygli M, Grabner H, Riemenschneider H, Van Gool L (2014) Creating summaries from user videos. In: Fleet D, Pajdla T, Schiele B, Tuytelaars T (eds) Comput. Vis.\u2014ECCV 214. Springer International Publishing, Cham, pp 505\u2013520"},{"key":"1417_CR35","doi-asserted-by":"publisher","unstructured":"Song Y, Vallmitjana J, Stent A, Jaimes A (2015) TVSum: summarizing web videos using titles. In: 2015 IEEE conf. comput. vis. pattern recognit. CVPR, pp 5179\u20135187. https:\/\/doi.org\/10.1109\/CVPR.2015.7299154","DOI":"10.1109\/CVPR.2015.7299154"},{"key":"1417_CR36","doi-asserted-by":"publisher","first-page":"7021","DOI":"10.1007\/s11042-023-15859-z","volume":"83","author":"AA Pandian","year":"2024","unstructured":"Pandian AA, Maheswari S (2024) A keyframe selection for summarization of informative activities using clustering in surveillance videos. Multimed Tools Appl 83:7021\u20137034. https:\/\/doi.org\/10.1007\/s11042-023-15859-z","journal-title":"Multimed Tools Appl"},{"key":"1417_CR37","doi-asserted-by":"publisher","unstructured":"Mo CA, Hu K, Long C, Wang Z (2023) Continuous intermediate token learning with implicit motion manifold for keyframe based motion interpolation. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition. CVPR, pp 13894\u201313903. https:\/\/doi.org\/10.48550\/arXiv.2303.14926","DOI":"10.48550\/arXiv.2303.14926"},{"key":"1417_CR38","doi-asserted-by":"publisher","first-page":"1923","DOI":"10.1109\/TCYB.2017.2718579","volume":"48","author":"X Li","year":"2018","unstructured":"Li X, Zhao B, Lu X (2018) Key frame extraction in the summary space. IEEE Trans Cybern 48:1923\u20131934. https:\/\/doi.org\/10.1109\/TCYB.2017.2718579","journal-title":"IEEE Trans Cybern"},{"key":"1417_CR39","doi-asserted-by":"crossref","unstructured":"Kuncheva LI, Yousefi P, Almeida J (2017) Comparing keyframe summaries of egocentric videos: closest-to-centroid baseline. In: Proc. 2017 seventh int. conf. image process. theory tools appl. IPTA 2017","DOI":"10.1109\/IPTA.2017.8310123"},{"key":"1417_CR40","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1016\/j.aei.2017.09.007","volume":"34","author":"K Kamal","year":"2017","unstructured":"Kamal K, Qayyum R, Mathavan S, Zafar T (2017) Wood defects classification using laws texture energy measures and supervised learning approach. Adv Eng Inf 34:125\u2013135. https:\/\/doi.org\/10.1016\/j.aei.2017.09.007","journal-title":"Adv Eng Inf"},{"key":"1417_CR41","doi-asserted-by":"publisher","unstructured":"Hannane R, Elboushaki A, Afdel K (2016) Efficient video summarization based on motion SIFT-distribution histogram. In: 2016 13TH int. conf. comput. graph. imaging vis. CGIV., pp 312\u2013317. https:\/\/doi.org\/10.1109\/CGiV.2016.67","DOI":"10.1109\/CGiV.2016.67"},{"key":"1417_CR42","doi-asserted-by":"publisher","unstructured":"Tafannum F, Shopnil MNS, Salsabil A, Ahmed N, Alam MGR, Reza MT (2021) Demystifying black-box learning models of rumor detection from social media posts. In: 2021 IEEE 12th annu. ubiquitous comput. electron. mob. commun. conf. UEMCON, pp 358\u2013364. https:\/\/doi.org\/10.1109\/UEMCON53757.2021.9666567","DOI":"10.1109\/UEMCON53757.2021.9666567"},{"key":"1417_CR43","doi-asserted-by":"publisher","first-page":"5446","DOI":"10.1109\/TKDE.2021.3050407","volume":"34","author":"C Chen","year":"2022","unstructured":"Chen C, Li D, Yan J, Yang X (2022) Modeling dynamic user preference via dictionary learning for sequential recommendation. IEEE Trans Knowl Data Eng 34:5446\u20135458. https:\/\/doi.org\/10.1109\/TKDE.2021.3050407","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"1417_CR44","doi-asserted-by":"publisher","unstructured":"Mao X, Mitra S, Swaminathan V (2017) Feature selection for FM-based context-aware recommendation systems. In: 2017 IEEE int. symp. multimed. ISM., pp 252\u2013255. https:\/\/doi.org\/10.1109\/ISM.2017.42","DOI":"10.1109\/ISM.2017.42"},{"key":"1417_CR45","doi-asserted-by":"publisher","first-page":"41342","DOI":"10.1109\/ACCESS.2020.2977231","volume":"8","author":"N Wen","year":"2020","unstructured":"Wen N, Zhang F (2020) Extended factorization machines for sequential recommendation. IEEE Access 8:41342\u201341350. https:\/\/doi.org\/10.1109\/ACCESS.2020.2977231","journal-title":"IEEE Access"},{"key":"1417_CR46","doi-asserted-by":"publisher","unstructured":"Lian J, Zhou X, Zhang F, Chen Z, Xie X, Sun G (2018) xDeepFM: combining explicit and implicit feature interactions for recommender systems. In: Proc. 24th ACM SIGKDD int. conf. knowl. discov. data min., pp 1754\u20131763. https:\/\/doi.org\/10.1145\/3219819.3220023","DOI":"10.1145\/3219819.3220023"},{"key":"1417_CR47","doi-asserted-by":"publisher","DOI":"10.1016\/j.bspc.2022.104206","volume":"79","author":"Y Wang","year":"2023","unstructured":"Wang Y, Yang G, Li S, Li Y, He L, Liu D (2023) Arrhythmia classification algorithm based on multi-head self-attention mechanism. Biomed Signal Process Control 79:104206. https:\/\/doi.org\/10.1016\/j.bspc.2022.104206","journal-title":"Biomed Signal Process Control"},{"key":"1417_CR48","doi-asserted-by":"publisher","unstructured":"Nimmagadda P, Sudhakar K, Rajasekar P (2023) Perceptual video summarization using keyframes extraction technique. In: 2023 3rd international conference on innovative practices in technology and management (ICIPTM). IEEE, pp 1\u20134. https:\/\/doi.org\/10.1109\/ICIPTM57143.2023.10118236","DOI":"10.1109\/ICIPTM57143.2023.10118236"},{"key":"1417_CR49","doi-asserted-by":"publisher","first-page":"948","DOI":"10.1109\/TIP.2020.3039886","volume":"30","author":"W Zhu","year":"2021","unstructured":"Zhu W, Lu J, Li J, Zhou J (2021) DSNet: a flexible detect-to-summarize network for video summarization. IEEE Trans Image Process 30:948\u2013962. https:\/\/doi.org\/10.1109\/TIP.2020.3039886","journal-title":"IEEE Trans Image Process"},{"key":"1417_CR50","unstructured":"Ji Z, Xiong K, Pang Y, Li X (2018) Video summarization with attention-based encoder\u2013decoder networks. http:\/\/arxiv.org\/abs\/1708.09545. Accessed December 18, 2023"},{"key":"1417_CR51","doi-asserted-by":"publisher","unstructured":"Apostolidis E, Balaouras G, Mezaris V, Patras I (2021) Combining global and local attention with positional encoding for video summarization. In: 2021 IEEE int. symp. multimed. ISM, IEEE, Naple, Italy, pp 226\u2013234. https:\/\/doi.org\/10.1109\/ISM52913.2021.00045","DOI":"10.1109\/ISM52913.2021.00045"},{"key":"1417_CR52","doi-asserted-by":"publisher","unstructured":"Song W, Shi C, Xiao Z, Duan Z, Xu Y, Zhang M, Tang J (2019) AutoInt: automatic feature interaction learning via self-attentive neural networks. In: Proc. 28th ACM int. conf. inf. knowl. manag., pp 1161\u20131170. https:\/\/doi.org\/10.1145\/3357384.3357925","DOI":"10.1145\/3357384.3357925"},{"key":"1417_CR53","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2023.106374","volume":"123","author":"L Li","year":"2023","unstructured":"Li L, Yang G, Li Y, Zhu D, He L (2023) Abnormal sitting posture recognition based on multi-scale spatiotemporal features of skeleton graph. Eng Appl Artif Intell 123:106374. https:\/\/doi.org\/10.1016\/j.engappai.2023.106374","journal-title":"Eng Appl Artif Intell"},{"issue":"2","key":"1417_CR54","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1049\/bme2.1211012","volume":"12","author":"G Yang","year":"2023","unstructured":"Yang G, Yang S, Luo K, Lan S, He L, Li Y (2023) Detection of non-suicidal self-injury based on spatiotemporal features of indoor activities. IET Biom 12(2):91\u2013101. https:\/\/doi.org\/10.1049\/bme2.1211012","journal-title":"IET Biom"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01417-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-024-01417-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-024-01417-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,17]],"date-time":"2024-07-17T17:18:23Z","timestamp":1721236703000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-024-01417-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,15]]},"references-count":54,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2024,8]]}},"alternative-id":["1417"],"URL":"https:\/\/doi.org\/10.1007\/s40747-024-01417-z","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,15]]},"assertion":[{"value":"7 September 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 March 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 April 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}