{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,20]],"date-time":"2026-06-20T16:42:12Z","timestamp":1781973732310,"version":"3.54.5"},"reference-count":49,"publisher":"Springer Science and Business Media LLC","issue":"7","license":[{"start":{"date-parts":[[2024,8,27]],"date-time":"2024-08-27T00:00:00Z","timestamp":1724716800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,8,27]],"date-time":"2024-08-27T00:00:00Z","timestamp":1724716800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100010669","name":"H2020 LEIT Information and Communication Technologies","doi-asserted-by":"publisher","award":["951911"],"award-info":[{"award-number":["951911"]}],"id":[{"id":"10.13039\/100010669","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100018699","name":"HORIZON EUROPE Digital, Industry and Space","doi-asserted-by":"publisher","award":["101092612"],"award-info":[{"award-number":["101092612"]}],"id":[{"id":"10.13039\/100018699","id-type":"DOI","asserted-by":"publisher"}]},{"name":"ISTI - PISA"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["SN COMPUT. SCI."],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This work addresses the challenge of video violence detection in data-scarce scenarios, focusing on bridging the domain gap that often hinders the performance of deep learning models when applied to unseen domains. We present a novel unsupervised domain adaptation (UDA) scheme designed to effectively mitigate this gap by combining supervised learning in the train (source) domain with unlabeled test (target) data. We employ single-image classification and multiple instance learning (MIL) to select frames with the highest classification scores, and, upon this, we exploit UDA techniques to adapt the model to unlabeled target domains. We perform an extensive experimental evaluation, using general-context data as the source domain and target domain datasets collected in specific environments, such as violent\/non-violent actions in hockey matches and public transport. The results demonstrate that our UDA pipeline substantially enhances model performances, improving their generalization capabilities in novel scenarios without requiring additional labeled data.<\/jats:p>","DOI":"10.1007\/s42979-024-03126-3","type":"journal-article","created":{"date-parts":[[2024,8,27]],"date-time":"2024-08-27T17:03:00Z","timestamp":1724778180000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["In the Wild Video Violence Detection: An Unsupervised Domain Adaptation Approach"],"prefix":"10.1007","volume":"5","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6985-0439","authenticated-orcid":false,"given":"Luca","family":"Ciampi","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4737-0020","authenticated-orcid":false,"given":"Carlos","family":"Santiago","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6258-5313","authenticated-orcid":false,"given":"Fabrizio","family":"Falchi","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3715-149X","authenticated-orcid":false,"given":"Claudio","family":"Gennaro","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0171-4315","authenticated-orcid":false,"given":"Giuseppe","family":"Amato","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,8,27]]},"reference":[{"key":"3126_CR1","doi-asserted-by":"publisher","unstructured":"Erakin ME, Demir U, Ekenel HK. On recognizing occluded faces in the wild. In: 2021 IEEE International Conference of the Biometrics Special Interest Group (BIOSIG) 2021; https:\/\/doi.org\/10.1109\/biosig52210.2021.9548293.","DOI":"10.1109\/biosig52210.2021.9548293"},{"key":"3126_CR2","doi-asserted-by":"publisher","first-page":"139110","DOI":"10.1109\/ACCESS.2020.3011028","volume":"8","author":"L Li","year":"2020","unstructured":"Li L, Mu X, Li S, Peng H. A review of face recognition technology. IEEE Access. 2020;8:139110\u201320. https:\/\/doi.org\/10.1109\/ACCESS.2020.3011028.","journal-title":"IEEE Access"},{"key":"3126_CR3","doi-asserted-by":"publisher","unstructured":"Avvenuti M, Bongiovanni M, Ciampi L, Falchi F, Gennaro C, Messina N. A spatio- temporal attentive network for video-based crowd counting. In: 2022 IEEE Symposium on Computers and Communications (ISCC), 2022;1\u20136. https:\/\/doi.org\/10.1109\/ISCC55528.2022.9913019","DOI":"10.1109\/ISCC55528.2022.9913019"},{"key":"3126_CR4","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.117125","volume":"199","author":"M Di Benedetto","year":"2022","unstructured":"Di Benedetto M, Carrara F, Ciampi L, Falchi F, Gennaro C, Amato G. An embedded toolset for human activity monitoring in critical environments. Expert Syst Appl. 2022;199: 117125. https:\/\/doi.org\/10.1016\/j.eswa.2022.117125.","journal-title":"Expert Syst Appl"},{"key":"3126_CR5","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.117929","volume":"207","author":"L Ciampi","year":"2022","unstructured":"Ciampi L, Gennaro C, Carrara F, Falchi F, Vairo C, Amato G. Multi-camera vehicle counting using edge-ai. Expert Syst Appl. 2022;207: 117929. https:\/\/doi.org\/10.1016\/j.eswa.2022.117929.","journal-title":"Expert Syst Appl"},{"key":"3126_CR6","doi-asserted-by":"publisher","unstructured":"Amato G, Ciampi L, Falchi F, Gennaro C. Counting vehicles with deep learning in onboard uav imagery. In: 2019 IEEE Symposium on Computers and Communications (ISCC). 2019;1\u20136. https:\/\/doi.org\/10.1109\/ISCC47284.2019.8969620.","DOI":"10.1109\/ISCC47284.2019.8969620"},{"issue":"18","key":"3126_CR7","doi-asserted-by":"publisher","first-page":"5250","DOI":"10.3390\/s20185250","volume":"20","author":"L Ciampi","year":"2020","unstructured":"Ciampi L, Messina N, Falchi F, Gennaro C, Amato G. Virtual to real adaptation of pedestrian detectors. Sensors. 2020;20(18):5250. https:\/\/doi.org\/10.3390\/s20185250.","journal-title":"Sensors"},{"issue":"22","key":"3126_CR8","doi-asserted-by":"publisher","first-page":"17081","DOI":"10.1007\/s00500-020-04999-1","volume":"24","author":"B Kim","year":"2020","unstructured":"Kim B, Yuvaraj N, SriPreethaa KR, Santhosh R, Sabari A. Enhanced pedestrian detection using optimized deep convolution neural network for smart building surveillance. Soft Comput. 2020;24(22):17081\u201392. https:\/\/doi.org\/10.1007\/s00500-020-04999-1.","journal-title":"Soft Comput"},{"key":"3126_CR9","doi-asserted-by":"publisher","unstructured":"Huo X, Xie L, Hu H, Zhou W, Li H, Tian Q. Domain-agnostic prior for transfer semantic segmentation. In: 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2022;7065\u201375. https:\/\/doi.org\/10.1109\/CVPR52688.2022.00694.","DOI":"10.1109\/CVPR52688.2022.00694"},{"key":"3126_CR10","doi-asserted-by":"publisher","unstructured":"Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L. Imagenet: A large-scale hierarchical image database. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009;248\u2013255. https:\/\/doi.org\/10.1109\/CVPR.2009.5206848","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"3126_CR11","doi-asserted-by":"crossref","unstructured":"Lin T-Y, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Doll\u00e1r P, Zitnick CL. Microsoft coco: Common objects in context. In: Computer Vision \u2013 ECCV 2014, pp. 740\u2013755. Springer, Cham 2014;","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"3126_CR12","doi-asserted-by":"publisher","unstructured":"Torralba A, Efros AA. Unbiased look at dataset bias. In: CVPR 2011, 2011;1521\u20131528. https:\/\/doi.org\/10.1109\/CVPR.2011.5995347","DOI":"10.1109\/CVPR.2011.5995347"},{"issue":"21","key":"3126_CR13","doi-asserted-by":"publisher","first-page":"8345","DOI":"10.3390\/s22218345","volume":"22","author":"L Ciampi","year":"2022","unstructured":"Ciampi L, Foszner P, Messina N, Staniszewski M, Gennaro C, Falchi F, Serao G, Cogiel M, Golba D, Szczesna A, Amato G. Bus violence: An open benchmark for video violence detection on public transport. Sensors. 2022;22(21):8345. https:\/\/doi.org\/10.3390\/s22218345.","journal-title":"Sensors"},{"key":"3126_CR14","doi-asserted-by":"publisher","unstructured":"Akti S, Ofli F, Imran M, Ekenel HK. Fight detection from still images in the wild. In: IEEE\/CVF Winter Conference on Applications of Computer Vision Workshops, WACV - Workshops, Waikoloa, HI, USA, January 4-8, 2022, 2022;550\u2013559. https:\/\/doi.org\/10.1109\/WACVW54805.2022.00061 .","DOI":"10.1109\/WACVW54805.2022.00061"},{"key":"3126_CR15","doi-asserted-by":"publisher","unstructured":"Bermejo Nievas E, Deniz Suarez O, Bueno Garc\u00eda G, Sukthankar R. Violence detection in video using computer vision techniques. In: Computer Analysis of Images and Patterns, pp. 332\u2013339. Springer, Berlin, Heidelberg 2011\u2019. https:\/\/doi.org\/10.1007\/978-3-642-23678-5_39","DOI":"10.1007\/978-3-642-23678-5_39"},{"key":"3126_CR16","doi-asserted-by":"publisher","unstructured":"Ciampi L, Santiago C, Costeira J, Falchi F, Gennaro C, Amato G. Unsupervised Domain Adaptation for Video Violence Detection in the Wild. In: Proceedings of the 3rd International Conference on Image Processing and Vision Engineering - IMPROVE, pp. 37\u201346. SciTePress, 2023; https:\/\/doi.org\/10.5220\/0011965300003497 . INSTICC","DOI":"10.5220\/0011965300003497"},{"key":"3126_CR17","doi-asserted-by":"publisher","unstructured":"Soliman MM, Kamal MH, El-Massih Nashed MA, Mostafa YM, Chawky BS, Khattab D. Violence recognition from videos using deep learning techniques. In: 2019 Ninth International Conference on Intelligent Computing and Information Systems (ICICIS), 2019;80\u201385. https:\/\/doi.org\/10.1109\/ICICIS46948.2019.9014714","DOI":"10.1109\/ICICIS46948.2019.9014714"},{"key":"3126_CR18","doi-asserted-by":"crossref","unstructured":"Asad M, Yang Z, Khan Z, Yang J, He X. Feature fusion based deep spatiotemporal model for violence detection in videos. In: Neural Information Processing, pp. 405\u2013417. Springer, Cham 2019;","DOI":"10.1007\/978-3-030-36708-4_33"},{"key":"3126_CR19","doi-asserted-by":"publisher","unstructured":"Sudhakaran S, Lanz O. Learning to detect violent videos using convolutional long short-term memory. In: 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pp. 2017;1\u20136. https:\/\/doi.org\/10.1109\/AVSS.2017.8078468","DOI":"10.1109\/AVSS.2017.8078468"},{"key":"3126_CR20","doi-asserted-by":"publisher","unstructured":"Hanson A, PNVR K, Krishnagopal S, Davis L. Bidirectional convolutional lstm for the detection of violence in videos. In: Computer Vision \u2013 ECCV 2018 Workshops, pp. 280\u2013295. Springer, Cham 2019;. https:\/\/doi.org\/10.1007\/978-3-030-11012-3_24","DOI":"10.1007\/978-3-030-11012-3_24"},{"key":"3126_CR21","doi-asserted-by":"publisher","unstructured":"Tran D, Wang H, Torresani L, Ray J, LeCun Y, Paluri M. A closer look at spatiotemporal convolutions for action recognition. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 2018;6450\u20136459 https:\/\/doi.org\/10.1109\/CVPR.2018.00675","DOI":"10.1109\/CVPR.2018.00675"},{"key":"3126_CR22","doi-asserted-by":"publisher","unstructured":"Sharma M, Baghel R. Video surveillance for violence detection using deep learning. In: Advances in Data Science and Management, pp. 411\u2013420. Springer, Singapore 2020;. https:\/\/doi.org\/10.1007\/978-981-15-0978-0_40","DOI":"10.1007\/978-981-15-0978-0_40"},{"key":"3126_CR23","doi-asserted-by":"publisher","unstructured":"Mugunga I, Dong J, Rigall E, Guo S, Madessa AH, Nawaz HS. A frame-based feature model for violence detection from surveillance cameras using convlstm network. In: 2021 6th International Conference on Image, Vision and Computing (ICIVC), pp. 2021; https:\/\/doi.org\/10.1109\/ICIVC52351.2021.9526948","DOI":"10.1109\/ICIVC52351.2021.9526948"},{"key":"3126_CR24","doi-asserted-by":"publisher","unstructured":"Akti S, Tataroglu GA, Ekenel HK. Vision-based fight detection from surveillance cameras. In: IEEE Ninth International Conference on Image Processing Theory, Tools and Applications, IPTA 2019, Istanbul, Turkey, November 6-9, 2019, pp. 2019;1\u20136. https:\/\/doi.org\/10.1109\/IPTA.2019.8936070 .","DOI":"10.1109\/IPTA.2019.8936070"},{"key":"3126_CR25","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-023-15060-2","author":"M Gnouma","year":"2023","unstructured":"Gnouma M, Ejbali R, Zaied M. A two-stream abnormal detection using a cascade of extreme learning machines and stacked auto encoder. Multimedia Tools and Applications. 2023. https:\/\/doi.org\/10.1007\/s11042-023-15060-2.","journal-title":"Multimedia Tools and Applications"},{"key":"3126_CR26","doi-asserted-by":"publisher","DOI":"10.1016\/j.engappai.2023.106173","volume":"123","author":"W Ullah","year":"2023","unstructured":"Ullah W, Hussain T, Ullah FUM, Lee MY, Baik SW. Transcnn: Hybrid cnn and transformer mechanism for surveillance anomaly detection. Eng Appl Artif Intell. 2023;123: 106173. https:\/\/doi.org\/10.1016\/j.engappai.2023.106173.","journal-title":"Eng Appl Artif Intell"},{"key":"3126_CR27","doi-asserted-by":"publisher","unstructured":"Wu J-C, Hsieh H-Y, Chen D-J, Fuh C-S, Liu T-L. Self-supervised sparse representation for video anomaly detection. In: Computer Vision \u2013 ECCV 2022, pp. 729\u2013745. Springer, Cham 2022. https:\/\/doi.org\/10.1007\/978-3-031-19778-9_42","DOI":"10.1007\/978-3-031-19778-9_42"},{"key":"3126_CR28","doi-asserted-by":"publisher","unstructured":"Cheng M, Cai K, Li M. Rwf-2000: An open large scale video database for violence detection. In: 2020 25th International Conference on Pattern Recognition (ICPR), pp. 2021;4183\u20134190. https:\/\/doi.org\/10.1109\/ICPR48806.2021.9412502","DOI":"10.1109\/ICPR48806.2021.9412502"},{"issue":"8","key":"3126_CR29","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","volume":"9","author":"S Hochreiter","year":"1997","unstructured":"Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9(8):1735\u201380. https:\/\/doi.org\/10.1162\/neco.1997.9.8.1735.","journal-title":"Neural Comput"},{"key":"3126_CR30","unstructured":"Shi X, Chen Z, Wang H, Yeung D, Wong W, Woo W. Convolutional LSTM network: A machine learning approach for precipitation nowcasting. In: Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, December 7-12, 2015, Montreal, Quebec, Canada, pp. 2015;802\u2013810. https:\/\/proceedings.neurips.cc\/paper\/2015\/hash\/07563a3fe3bbe7e3ba84431ad9d055af-Abstract.html"},{"key":"3126_CR31","doi-asserted-by":"publisher","unstructured":"Li J, Jiang X, Sun T, Xu K. Efficient violence detection using 3d convolutional neural networks. In: 2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pp. 2019;1\u20138. https:\/\/doi.org\/10.1109\/AVSS.2019.8909883","DOI":"10.1109\/AVSS.2019.8909883"},{"key":"3126_CR32","doi-asserted-by":"publisher","unstructured":"Tran D, Bourdev L, Fergus R, Torresani L, Paluri M. Learning spatiotemporal features with 3d convolutional networks. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 2015;4489\u20134497. https:\/\/doi.org\/10.1109\/ICCV.2015.510","DOI":"10.1109\/ICCV.2015.510"},{"key":"3126_CR33","unstructured":"Feichtenhofer C, Pinz A, Wildes RP. Spatiotemporal residual networks for video action recognition. In: Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, December 5-10, 2016, Barcelona, Spain, pp. 2016;3468\u20133476. https:\/\/proceedings.neurips.cc\/paper\/2016\/hash\/3e7e0224018ab3cf51abb96464d518cd-Abstract.html"},{"key":"3126_CR34","doi-asserted-by":"publisher","unstructured":"Feichtenhofer C, Fan H, Malik J, He K. Slowfast networks for video recognition. In: 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), pp. 2019;6201\u20136210. https:\/\/doi.org\/10.1109\/ICCV.2019.00630","DOI":"10.1109\/ICCV.2019.00630"},{"key":"3126_CR35","doi-asserted-by":"publisher","unstructured":"Liu Z, Ning J, Cao Y, Wei Y, Zhang Z, Lin S, Hu H. Video swin transformer. In: 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2022;3192\u20133201. https:\/\/doi.org\/10.1109\/CVPR52688.2022.00320","DOI":"10.1109\/CVPR52688.2022.00320"},{"key":"3126_CR36","doi-asserted-by":"publisher","unstructured":"Liu Z, Lin Y, Cao Y, Hu H, Wei Y, Zhang Z, Lin S, Guo B. Swin transformer: Hierarchical vision transformer using shifted windows. In: 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), pp. 2021;9992\u201310002. https:\/\/doi.org\/10.1109\/ICCV48922.2021.00986","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"3126_CR37","doi-asserted-by":"publisher","unstructured":"Ganin Y, Ustinova E, Ajakan H, Germain P, Larochelle H, Laviolette F, Marchand M, Lempitsky V. Domain-Adversarial Training of Neural Networks, pp. 189\u2013209. Springer, Cham 2017; https:\/\/doi.org\/10.1007\/978-3-319-58347-1_10 .","DOI":"10.1007\/978-3-319-58347-1_10"},{"key":"3126_CR38","doi-asserted-by":"publisher","unstructured":"Jin Y, Wang X, Long M, Wang J. Minimum class confusion for versatile domain adaptation. In: Computer Vision \u2013 ECCV 2020, pp. 464\u2013480. Springer, Cham 2020;https:\/\/doi.org\/10.1007\/978-3-030-58589-1_28","DOI":"10.1007\/978-3-030-58589-1_28"},{"key":"3126_CR39","doi-asserted-by":"publisher","unstructured":"Zhang Y, David P, Gong B. Curriculum domain adaptation for semantic segmentation of urban scenes. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2017;2039\u20132049. https:\/\/doi.org\/10.1109\/ICCV.2017.223","DOI":"10.1109\/ICCV.2017.223"},{"key":"3126_CR40","doi-asserted-by":"publisher","unstructured":"Hong W, Wang Z, Yang M, Yuan J. Conditional generative adversarial network for structured domain adaptation. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 2018;1335\u20131344 . https:\/\/doi.org\/10.1109\/CVPR.2018.00145","DOI":"10.1109\/CVPR.2018.00145"},{"key":"3126_CR41","doi-asserted-by":"publisher","unstructured":"Chen Y, Li W, Chen X, Van Gool L. Learning semantic segmentation from synthetic data: A geometrically guided input-output adaptation approach. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2019;1841\u20131850. https:\/\/doi.org\/10.1109\/CVPR.2019.00194","DOI":"10.1109\/CVPR.2019.00194"},{"key":"3126_CR42","doi-asserted-by":"publisher","unstructured":"Ciampi L, Santiago C, Costeira JP, Gennaro, C, Amato G. Domain Adaptation for Traffic Density Estimation. In: Proceedings of the 16th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2021) - Volume 5: VISAPP, pp. 2021;185\u2013195. https:\/\/doi.org\/10.5220\/0010303401850195 . INSTICC","DOI":"10.5220\/0010303401850195"},{"key":"3126_CR43","unstructured":"Ciampi L, Santiago C, Costeira JP, Gennaro C, Amato G. Unsupervised vehicle counting via multiple camera domain adaptation. In: Proceedings of the First International Workshop on New Foundations for Human-Centered AI (NeHuAI) Co-located with 24th European Conference on Artificial Intelligence (ECAI 2020), Santiago de Compostella, Spain, September 4, 2020. CEUR Workshop Proceedings, vol. 2659, pp. 2020;82\u201385. https:\/\/ceur-ws.org\/Vol-2659\/ciampi.pdf"},{"issue":"10","key":"3126_CR44","doi-asserted-by":"publisher","first-page":"1345","DOI":"10.1109\/TKDE.2009.191","volume":"22","author":"SJ Pan","year":"2010","unstructured":"Pan SJ, Yang Q. A survey on transfer learning. IEEE Trans Knowl Data Eng. 2010;22(10):1345\u201359. https:\/\/doi.org\/10.1109\/TKDE.2009.191.","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"3126_CR45","doi-asserted-by":"publisher","unstructured":"Csurka G. In: Csurka, G. (ed.) A Comprehensive Survey on Domain Adaptation for Visual Applications, pp. 1\u201335. Springer, Cham 2017. https:\/\/doi.org\/10.1007\/978-3-319-58347-1_1 .","DOI":"10.1007\/978-3-319-58347-1_1"},{"key":"3126_CR46","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1016\/j.patcog.2017.10.009","volume":"77","author":"M-A Carbonneau","year":"2018","unstructured":"Carbonneau M-A, Cheplygina V, Granger E, Gagnon G. Multiple instance learning: A survey of problem characteristics and applications. Pattern Recogn. 2018;77:329\u201353. https:\/\/doi.org\/10.1016\/j.patcog.2017.10.009.","journal-title":"Pattern Recogn"},{"key":"3126_CR47","doi-asserted-by":"publisher","unstructured":"He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 2016. https:\/\/doi.org\/10.1109\/cvpr.2016.90","DOI":"10.1109\/cvpr.2016.90"},{"key":"3126_CR48","unstructured":"Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. In: Bengio, Y., LeCun, Y. (eds.) 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7-9, 2015, Conference Track Proceedings 2015."},{"key":"3126_CR49","unstructured":"Kay W, Carreira J, Simonyan K, Zhang B, Hillier C, Vijayanarasimhan S, Viola F, Green T, Back T, Natsev P, Suleyman M, Zisserman A. The kinetics human action video dataset. CoRR arXiv:abs\/1705.06950 2017."}],"container-title":["SN Computer Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s42979-024-03126-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s42979-024-03126-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s42979-024-03126-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,27]],"date-time":"2024-08-27T17:16:26Z","timestamp":1724778986000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s42979-024-03126-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8,27]]},"references-count":49,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2024,10]]}},"alternative-id":["3126"],"URL":"https:\/\/doi.org\/10.1007\/s42979-024-03126-3","relation":{},"ISSN":["2661-8907"],"issn-type":[{"value":"2661-8907","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,8,27]]},"assertion":[{"value":"11 October 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 July 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 August 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no Conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"Neither humans nor animals have been involved in this research.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Research involving Human Participants and\/or Animals"}},{"value":"Neither humans nor animals have been involved in this research.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Informed Consent"}}],"article-number":"834"}}