{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T07:24:05Z","timestamp":1740122645713,"version":"3.37.3"},"reference-count":61,"publisher":"Springer Science and Business Media LLC","issue":"8","license":[{"start":{"date-parts":[[2024,4,1]],"date-time":"2024-04-01T00:00:00Z","timestamp":1711929600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"},{"start":{"date-parts":[[2024,4,1]],"date-time":"2024-04-01T00:00:00Z","timestamp":1711929600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.springernature.com\/gp\/researchers\/text-and-data-mining"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Appl Intell"],"published-print":{"date-parts":[[2024,4]]},"DOI":"10.1007\/s10489-024-05460-8","type":"journal-article","created":{"date-parts":[[2024,5,8]],"date-time":"2024-05-08T12:01:46Z","timestamp":1715169706000},"page":"6198-6212","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Clustering-based multi-featured self-supervised learning for human activities and video retrieval"],"prefix":"10.1007","volume":"54","author":[{"given":"Muhammad Hafeez","family":"Javed","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zeng","family":"Yu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Taha M.","family":"Rajeh","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fahad","family":"Rafique","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7090-0325","authenticated-orcid":false,"given":"Tianrui","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,5,8]]},"reference":[{"issue":"4","key":"5460_CR1","first-page":"1254","volume":"8","author":"S Ramasamy Ramamurthy","year":"2018","unstructured":"Ramasamy Ramamurthy S, Roy N (2018) Recent trends in machine learning for human activity recognition-a survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 8(4):1254","journal-title":"Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery"},{"issue":"8","key":"5460_CR2","doi-asserted-by":"publisher","first-page":"1577","DOI":"10.1080\/09720529.2019.1696552","volume":"22","author":"MK Jangir","year":"2019","unstructured":"Jangir MK, Singh K (2019) Hargrurnn: Human activity recognition using inertial body sensor gated recurrent units recurrent neural network. Journal of Discrete Mathematical Sciences and Cryptography 22(8):1577\u20131587","journal-title":"Journal of Discrete Mathematical Sciences and Cryptography"},{"key":"5460_CR3","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1016\/j.neucom.2018.05.042","volume":"311","author":"R Saini","year":"2018","unstructured":"Saini R, Kumar P, Roy PP, Dogra DP (2018) A novel framework of continuous human-activity recognition using kinect. Neurocomputing 311:99\u2013111","journal-title":"Neurocomputing"},{"issue":"4","key":"5460_CR4","doi-asserted-by":"publisher","first-page":"1157","DOI":"10.1007\/s13042-021-01441-2","volume":"13","author":"MH Javed","year":"2022","unstructured":"Javed MH, Yu Z, Li T, Rajeh TM, Rafique F, Waqar S (2022) Hybrid two-stream dynamic cnn for view adaptive human action recognition using ensemble learning. Int J Mach Learn Cybern 13(4):1157\u20131166","journal-title":"Int J Mach Learn Cybern"},{"key":"5460_CR5","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1016\/j.comcom.2023.08.007","volume":"210","author":"R Barbosa","year":"2023","unstructured":"Barbosa R, Ogobuchi OD, Joy OO, Saadi M, Rosa RL, Otaibi SA, Rodr\u00edguez DZ (2023) Iot based real-time traffic monitoring system using images sensors by sparse deep learning algorithm. Comput Commun 210:321\u2013330. https:\/\/doi.org\/10.1016\/j.comcom.2023.08.007","journal-title":"Comput Commun"},{"issue":"11","key":"5460_CR6","doi-asserted-by":"publisher","first-page":"4037","DOI":"10.1109\/TPAMI.2020.2992393","volume":"43","author":"L Jing","year":"2020","unstructured":"Jing L, Tian Y (2020) Self-supervised visual feature learning with deep neural networks: A survey. IEEE Trans Pattern Anal Mach Intell 43(11):4037\u20134058","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"5460_CR7","doi-asserted-by":"crossref","unstructured":"Koupai AK, Bocus MJ, Santos-Rodriguez R, Piechocki RJ, McConville R (2022) Self-supervised multimodal fusion transformer for passive activity recognition. IET Wireless Sensor Systems","DOI":"10.1049\/wss2.12044"},{"key":"5460_CR8","doi-asserted-by":"publisher","first-page":"1377","DOI":"10.1109\/LSP.2022.3181849","volume":"29","author":"Y Teng","year":"2022","unstructured":"Teng Y, Song C, Wu B (2022) Learning social relationship from videos via pre-trained multimodal transformer. IEEE Signal Process Lett 29:1377\u20131381","journal-title":"IEEE Signal Process Lett"},{"key":"5460_CR9","doi-asserted-by":"publisher","first-page":"4067","DOI":"10.1109\/TMM.2021.3112814","volume":"24","author":"S Chang","year":"2021","unstructured":"Chang S, Li Y, Shen S, Feng J, Zhou Z (2021) Contrastive attention for video anomaly detection. IEEE Trans Multimedia 24:4067\u20134076","journal-title":"IEEE Trans Multimedia"},{"key":"5460_CR10","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2022.105331","volume":"116","author":"L Xi","year":"2022","unstructured":"Xi L, Yun Z, Liu H, Wang R, Huang X, Fan H (2022) Semi-supervised time series classification model with self-supervised learning. Eng Appl Artif Intell 116:105331. https:\/\/doi.org\/10.1016\/j.engappai.2022.105331","journal-title":"Eng Appl Artif Intell"},{"issue":"2","key":"5460_CR11","doi-asserted-by":"publisher","first-page":"1030","DOI":"10.1109\/JIOT.2020.3009358","volume":"8","author":"A Saeed","year":"2020","unstructured":"Saeed A, Salim FD, Ozcelebi T, Lukkien J (2020) Federated self-supervised learning of multisensor representations for embedded intelligence. IEEE Internet Things J 8(2):1030\u20131040","journal-title":"IEEE Internet Things J"},{"key":"5460_CR12","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.118965","volume":"213","author":"P Li","year":"2023","unstructured":"Li P, Cao J, Ye X (2023) Prototype contrastive learning for point-supervised temporal action detection. Expert Syst Appl 213:118965","journal-title":"Expert Syst Appl"},{"key":"5460_CR13","doi-asserted-by":"publisher","first-page":"244","DOI":"10.1016\/j.neunet.2022.09.029","volume":"156","author":"H Zhang","year":"2022","unstructured":"Zhang H, Zhao S, Qiang W, Chen Y, Jing L (2022) Feature extraction framework based on contrastive learning with adaptive positive and negative samples. Neural Netw 156:244\u2013257","journal-title":"Neural Netw"},{"key":"5460_CR14","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2022.109020","volume":"133","author":"X Zhang","year":"2023","unstructured":"Zhang X, Li Q, Quan Z, Yang W (2023) Pyramid geometric consistency learning for semantic segmentation. Pattern Recogn 133:109020","journal-title":"Pattern Recogn"},{"key":"5460_CR15","doi-asserted-by":"crossref","unstructured":"Wang Z, Lyu J, Luo W, Tang X (2022) Superpixel inpainting for self-supervised skin lesion segmentation from dermoscopic images. In: Proceedings of the International Symposium on Biomedical Imaging (ISBI), pp. 1\u20134. IEEE","DOI":"10.1109\/ISBI52829.2022.9761620"},{"key":"5460_CR16","doi-asserted-by":"crossref","unstructured":"Shi L, Zhang Y, Cheng J, Lu H (2019) Two-stream adaptive graph convolutional networks for skeleton-based action recognition. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 12026\u201312035","DOI":"10.1109\/CVPR.2019.01230"},{"key":"5460_CR17","doi-asserted-by":"crossref","unstructured":"Pan T, Song Y, Yang T, Jiang W, Liu W (2021) Videomoco: Contrastive video representation learning with temporally adversarial examples. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 11205\u201311214","DOI":"10.1109\/CVPR46437.2021.01105"},{"issue":"6","key":"5460_CR18","doi-asserted-by":"publisher","first-page":"1227","DOI":"10.1007\/s11760-020-01644-0","volume":"14","author":"X Liu","year":"2020","unstructured":"Liu X, Li Y, Xia R (2020) Rotation-based spatial-temporal feature learning from skeleton sequences for action recognition. SIViP 14(6):1227\u20131234","journal-title":"SIViP"},{"key":"5460_CR19","doi-asserted-by":"publisher","first-page":"85284","DOI":"10.1109\/ACCESS.2020.2993227","volume":"8","author":"Y Wan","year":"2020","unstructured":"Wan Y, Yu Z, Wang Y, Li X (2020) Action recognition based on two-stream convolutional networks with long-short-term spatiotemporal features. IEEE Access 8:85284\u201385293","journal-title":"IEEE Access"},{"key":"5460_CR20","unstructured":"Duvenaud DK, Maclaurin D, Iparraguirre J, Bombarell R, Hirzel T, Aspuru-Guzik A, Adams RP (2015) Convolutional networks on graphs for learning molecular fingerprints. Advances in Neural Information Processing Systems 28"},{"issue":"5","key":"5460_CR21","doi-asserted-by":"publisher","first-page":"624","DOI":"10.1109\/LSP.2017.2678539","volume":"24","author":"C Li","year":"2017","unstructured":"Li C, Hou Y, Wang P, Li W (2017) Joint distance maps based action recognition with convolutional neural networks. IEEE Signal Process Lett 24(5):624\u2013628","journal-title":"IEEE Signal Process Lett"},{"key":"5460_CR22","doi-asserted-by":"crossref","unstructured":"Han Y, Hui L, Jiang H, Qian J, Xie J (2022) Generative subgraph contrast for self-supervised graph representation learning. In: Proceedings of the European Conference on Computer Vision, pp. 91\u2013107. Springer","DOI":"10.1007\/978-3-031-20056-4_6"},{"issue":"6","key":"5460_CR23","doi-asserted-by":"publisher","first-page":"1607","DOI":"10.1093\/bioinformatics\/btac007","volume":"38","author":"R Zhang","year":"2022","unstructured":"Zhang R, Luo Y, Ma J, Zhang M, Wang S (2022) scpretrain: multi-task self-supervised learning for cell-type classification. Bioinformatics 38(6):1607\u20131614","journal-title":"Bioinformatics"},{"key":"5460_CR24","doi-asserted-by":"crossref","unstructured":"Huang L, Liu Y, Wang B, Pan P, Xu Y, Jin R (2021) Self-supervised video representation learning by context and motion decoupling. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp.13886\u201313895","DOI":"10.1109\/CVPR46437.2021.01367"},{"key":"5460_CR25","doi-asserted-by":"crossref","unstructured":"Li Y, Chen J, Li F, Fu B, Wu H, Ji Y, Zhou Y, Niu Y, Shi G, Zheng W (2022) Gmss: Graph-based multi-task self-supervised learning for eeg emotion recognition. IEEE Transactions on Affective Computing","DOI":"10.1109\/TAFFC.2022.3170428"},{"key":"5460_CR26","doi-asserted-by":"publisher","DOI":"10.1016\/j.displa.2022.102160","volume":"72","author":"P Zhang","year":"2022","unstructured":"Zhang P, Zhou L, Bai X, Wang C, Zhou J, Zhang L, Zheng J (2022) Learning multi-view visual correspondences with self-supervision. Displays 72:102160","journal-title":"Displays"},{"key":"5460_CR27","doi-asserted-by":"crossref","unstructured":"Zhang R, Isola P, Efros AA (2016) Colorful image colorization. In: Proceedings of the European Conference on Computer Vision, pp. 649\u2013666. Springer","DOI":"10.1007\/978-3-319-46487-9_40"},{"key":"5460_CR28","doi-asserted-by":"crossref","unstructured":"Ma Z, Li K, Li Y (2023) Self-supervised method for 3d human pose estimation with consistent shape and viewpoint factorization. Applied Intelligence 3864\u20133876","DOI":"10.1007\/s10489-022-03714-x"},{"key":"5460_CR29","doi-asserted-by":"crossref","unstructured":"Wei D, Lim JJ, Zisserman A, Freeman WT (2018) Learning and using the arrow of time. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8052\u20138060","DOI":"10.1109\/CVPR.2018.00840"},{"key":"5460_CR30","doi-asserted-by":"crossref","unstructured":"Benaim S, Ephrat A, Lang O, Mosseri I, Freeman WT, Rubinstein M, Irani M, Dekel T (2020) Speednet: Learning the speediness in videos. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 9922\u20139931","DOI":"10.1109\/CVPR42600.2020.00994"},{"key":"5460_CR31","doi-asserted-by":"crossref","unstructured":"Yao Y, Liu C, Luo D, Zhou Y, Ye Q (2020) Video playback rate perception for self-supervised spatio-temporal representation learning. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 6548\u20136557","DOI":"10.1109\/CVPR42600.2020.00658"},{"key":"5460_CR32","unstructured":"Sohn K (2016) Improved deep metric learning with multi-class n-pair loss objective. Advances in Neural Information Processing Systems 29"},{"key":"5460_CR33","unstructured":"Lin S, Liu C, Zhou P, Hu ZY, Wang S, Zhao R, Zheng Y, Lin L, Xing E, Liang X (2022) Prototypical graph contrastive learning. IEEE Transactions on Neural Networks and Learning Systems"},{"key":"5460_CR34","doi-asserted-by":"publisher","first-page":"1978","DOI":"10.1109\/TIP.2022.3147032","volume":"31","author":"Y Liu","year":"2022","unstructured":"Liu Y, Wang K, Liu L, Lan H, Lin L (2022) Tcgl: Temporal contrastive graph for self-supervised video representation learning. IEEE Trans Image Process 31:1978\u20131993","journal-title":"IEEE Trans Image Process"},{"key":"5460_CR35","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1016\/j.neucom.2021.11.031","volume":"482","author":"Y Liu","year":"2022","unstructured":"Liu Y, Ma J, Xie Y, Yang X, Tao X, Peng L, Gao W (2022) Contrastive predictive coding with transformer for video representation learning. Neurocomputing 482:154\u2013162","journal-title":"Neurocomputing"},{"key":"5460_CR36","doi-asserted-by":"publisher","first-page":"32087","DOI":"10.1007\/s11042-019-08019-9","volume":"78","author":"Z Yu","year":"2019","unstructured":"Yu Z, Wang J, Lu G (2019) Optimized self-adapting contrast enhancement algorithm for wafer contour extraction. Multimedia Tools and Applications 78:32087\u201332108","journal-title":"Multimedia Tools and Applications"},{"key":"5460_CR37","doi-asserted-by":"crossref","unstructured":"Zhang W, Deng Z, Zhang T, Choi KS, Wang S (2023) Multi-view fuzzy representation learning with rules based model. IEEE Transactions on Knowledge and Data Engineering","DOI":"10.1109\/TKDE.2023.3295874"},{"key":"5460_CR38","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.118696","volume":"213","author":"M Farhat","year":"2023","unstructured":"Farhat M, Chaabouni-Chouayakh H, Ben-Hamadou A (2023) Self-supervised endoscopic image key-points matching. Expert Syst Appl 213:118696","journal-title":"Expert Syst Appl"},{"key":"5460_CR39","first-page":"21271","volume":"33","author":"JB Grill","year":"2020","unstructured":"Grill JB, Strub F, Altch\u00e9 F, Tallec C, Richemond P, Buchatskaya E, Doersch C, Avila Pires B, Guo Z, Gheshlaghi Azar M et al (2020) Bootstrap your own latent-a new approach to self-supervised learning. Adv Neural Inf Process Syst 33:21271\u201321284","journal-title":"Adv Neural Inf Process Syst"},{"key":"5460_CR40","doi-asserted-by":"crossref","unstructured":"Feichtenhofer C, Fan H, Xiong B, Girshick R, He K (2021) A large-scale study on unsupervised spatiotemporal representation learning. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 3299\u20133309","DOI":"10.1109\/CVPR46437.2021.00331"},{"key":"5460_CR41","doi-asserted-by":"publisher","first-page":"7704","DOI":"10.1109\/ACCESS.2022.3142758","volume":"10","author":"J Ju","year":"2022","unstructured":"Ju J, Jung H, Oh Y, Kim J (2022) Extending contrastive learning to unsupervised coreset selection. IEEE Access 10:7704\u20137715","journal-title":"IEEE Access"},{"key":"5460_CR42","doi-asserted-by":"publisher","first-page":"189","DOI":"10.1109\/LSP.2021.3130826","volume":"29","author":"J Ye","year":"2021","unstructured":"Ye J, Xiao Q, Wang J, Zhang H, Deng J, Lin Y (2021) Cosleep: A multi-view representation learning framework for self-supervised learning of sleep stage classification. IEEE Signal Process Lett 29:189\u2013193","journal-title":"IEEE Signal Process Lett"},{"issue":"12","key":"5460_CR43","doi-asserted-by":"publisher","first-page":"14491","DOI":"10.1007\/s10489-022-03341-6","volume":"52","author":"S Liu","year":"2022","unstructured":"Liu S, Sehgal N, Ostadabbas S (2022) Adapted human pose: monocular 3d human pose estimation with zero real 3d pose data. Appl Intell 52(12):14491\u201314506","journal-title":"Appl Intell"},{"key":"5460_CR44","doi-asserted-by":"crossref","unstructured":"Caron M, Bojanowski P, Joulin A, Douze M (2018) Deep clustering for unsupervised learning of visual features. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 132\u2013149","DOI":"10.1007\/978-3-030-01264-9_9"},{"key":"5460_CR45","first-page":"28864","volume":"34","author":"J Xie","year":"2021","unstructured":"Xie J, Zhan X, Liu Z, Ong YS, Loy CC (2021) Unsupervised object-level representation learning from scene images. Adv Neural Inf Process Syst 34:28864\u201328876","journal-title":"Adv Neural Inf Process Syst"},{"key":"5460_CR46","doi-asserted-by":"crossref","unstructured":"Yan X, Misra I, Gupta A, Ghadiyaram D, Mahajan D (2020) Clusterfit: Improving generalization of visual representations. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 6509\u20136518","DOI":"10.1109\/CVPR42600.2020.00654"},{"key":"5460_CR47","first-page":"9912","volume":"33","author":"M Caron","year":"2020","unstructured":"Caron M, Misra I, Mairal J, Goyal P, Bojanowski P, Joulin A (2020) Unsupervised learning of visual features by contrasting cluster assignments. Adv Neural Inf Process Syst 33:9912\u20139924","journal-title":"Adv Neural Inf Process Syst"},{"key":"5460_CR48","doi-asserted-by":"crossref","unstructured":"Ntelemis F, Jin Y, Thomas SA (2022) Information maximization clustering via multi-view self-labelling. Knowledge-Based Systems 109042","DOI":"10.1016\/j.knosys.2022.109042"},{"key":"5460_CR49","doi-asserted-by":"crossref","unstructured":"Zhu Y, Shuai H, Liu G, Liu Q (2022) Self-supervised video representation learning using improved instance-wise contrastive learning and deep clustering. IEEE Transactions on Circuits and Systems for Video Technology","DOI":"10.1109\/TCSVT.2022.3169469"},{"key":"5460_CR50","doi-asserted-by":"crossref","unstructured":"Ke Q, Bennamoun M, An S, Sohel F, Boussaid F (2017) A new representation of skeleton sequences for 3d action recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3288\u20133297","DOI":"10.1109\/CVPR.2017.486"},{"key":"5460_CR51","doi-asserted-by":"crossref","unstructured":"Khorasgani SH, Chen Y, Shkurti F (2022) Slic: Self-supervised learning with iterative clustering for human action videos. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 16091\u201316101","DOI":"10.1109\/CVPR52688.2022.01562"},{"key":"5460_CR52","doi-asserted-by":"crossref","unstructured":"Kumar V, Tripathi V, Pant B (2022) Learning unsupervised visual representations using 3d convolutional autoencoder with temporal contrastive modeling for video retrieval. International Journal of Mathematical, Engineering and Management Sciences","DOI":"10.33889\/IJMEMS.2022.7.2.018"},{"key":"5460_CR53","doi-asserted-by":"crossref","unstructured":"Zatsarynna O, Farha YA, Gall J (2022) Self-supervised learning for unintentional action prediction. In: Proceedings of the DAGM German Conference on Pattern Recognition, pp. 429\u2013444. Springer","DOI":"10.1007\/978-3-031-16788-1_26"},{"key":"5460_CR54","doi-asserted-by":"publisher","DOI":"10.1016\/j.asoc.2023.110125","volume":"136","author":"J Hua","year":"2023","unstructured":"Hua J, Cui X, Li X, Tang K, Zhu P (2023) Multimodal fake news detection through data augmentation-based contrastive learning. Appl Soft Comput 136:110125. https:\/\/doi.org\/10.1016\/j.asoc.2023.110125","journal-title":"Appl Soft Comput"},{"key":"5460_CR55","doi-asserted-by":"crossref","unstructured":"Wang J, Yan S, Xiong Y, Lin D (2020) Motion guided 3d pose estimation from videos. In: Proceedings of the European Conference on Computer Vision, pp. 764\u2013780. Springer","DOI":"10.1007\/978-3-030-58601-0_45"},{"key":"5460_CR56","doi-asserted-by":"publisher","DOI":"10.1016\/j.cviu.2022.103406","volume":"219","author":"I Dave","year":"2022","unstructured":"Dave I, Gupta R, Rizve MN, Shah M (2022) Tclr: Temporal contrastive learning for video representation. Comput Vis Image Underst 219:103406","journal-title":"Comput Vis Image Underst"},{"key":"5460_CR57","first-page":"5679","volume":"33","author":"T Han","year":"2020","unstructured":"Han T, Xie W, Zisserman A (2020) Self-supervised co-training for video representation learning. Adv Neural Inf Process Syst 33:5679\u20135690","journal-title":"Adv Neural Inf Process Syst"},{"key":"5460_CR58","doi-asserted-by":"crossref","unstructured":"Han T, Xie W, Zisserman A (2020) Memory-augmented dense predictive coding for video representation learning. In: Proceedings of the European Conference on Computer Vision, pp. 312\u2013329. Springer","DOI":"10.1007\/978-3-030-58580-8_19"},{"key":"5460_CR59","doi-asserted-by":"publisher","first-page":"11701","DOI":"10.1609\/aaai.v34i07.6840","volume":"34","author":"D Luo","year":"2020","unstructured":"Luo D, Liu C, Zhou Y, Yang D, Ma C, Ye Q, Wang W (2020) Video cloze procedure for self-supervised spatio-temporal learning. Proceedings of the AAAI Conference on Artificial Intelligence 34:11701\u201311708","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence"},{"key":"5460_CR60","doi-asserted-by":"crossref","unstructured":"Xu D, Xiao J, Zhao Z, Shao J, Xie D, Zhuang Y (2019) Self-supervised spatiotemporal learning via video clip order prediction. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp.10334\u201310343","DOI":"10.1109\/CVPR.2019.01058"},{"key":"5460_CR61","doi-asserted-by":"crossref","unstructured":"Cui M, Wang W, Zhang K, Sun Z, Wang L (2022) Pose-appearance relational modeling for video action recognition. IEEE Transactions on Image Processing","DOI":"10.1109\/TIP.2022.3228156"}],"container-title":["Applied Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-024-05460-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10489-024-05460-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10489-024-05460-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,15]],"date-time":"2024-06-15T12:12:51Z","timestamp":1718453571000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10489-024-05460-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4]]},"references-count":61,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2024,4]]}},"alternative-id":["5460"],"URL":"https:\/\/doi.org\/10.1007\/s10489-024-05460-8","relation":{},"ISSN":["0924-669X","1573-7497"],"issn-type":[{"type":"print","value":"0924-669X"},{"type":"electronic","value":"1573-7497"}],"subject":[],"published":{"date-parts":[[2024,4]]},"assertion":[{"value":"10 April 2024","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 May 2024","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declaration"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.The authors declare the following financial interests\/personal relationships which may be considered as potential competing interests:","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflicts of interest"}},{"value":"The manuscript is not be submitted to more than one journal for simultaneous consideration.The submitted work is original and not have been published elsewhere in any form or language (partially or in full).A single study is not be split up into several parts to increase the quantity of submissions and submitted to various journals or to one journal over time (i.e. ?salamislicing\/ publishing?).Concurrent or secondary publication is sometimes justifiable, provided certain conditions are met. Examples include: translations or a manuscript that is intended for a different group of readers.Results are presented clearly, honestly, and without fabrication, falsification or inappropriate data manipulation (including image based manipulation).No data, text, or theories by others are presented as if they were the author\u2019s own (\u201cplagiarism\u201d).","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical and Informed Consent for Data Used"}},{"value":"We make sure they have permissions for the use of software, questionnaires\/(web) surveys and scales in their studies (if appropriate).Research articles and non-research articles (e.g. Opinion, Review, and Commentary articles) have cite appropriate and relevant literature in support of the claims made. We don?t have excessive and inappropriate self-citation or coordinated efforts among several authors to collectively self-cite.We have avoid untrue statements about an entity (who can be an individual person or a company) or descriptions of their behavior or actions that could potentially be seen as personal attacks or allegations about that person.Research don?t contain anything that may be misapplied to pose a threat to public health or national security.We ensure that the author group, the Corresponding Author, and the order of authors are all correct at submission.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Informed consent for data used"}}]}}