{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T09:20:39Z","timestamp":1775640039561,"version":"3.50.1"},"reference-count":32,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T00:00:00Z","timestamp":1769817600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T00:00:00Z","timestamp":1769817600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"NextGenerationEU and Italian Ministry of University and Research","award":["ECS00000035"],"award-info":[{"award-number":["ECS00000035"]}]},{"name":"NextGenerationEU and Italian Ministry of University and Research","award":["ECS00000035"],"award-info":[{"award-number":["ECS00000035"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Med Biol Eng Comput"],"published-print":{"date-parts":[[2026,3]]},"abstract":"<jats:sec>\n                    <jats:title>Abstract<\/jats:title>\n                    <jats:p>In this paper, we extend the SonoNet architecture to capture spatio-temporal information from ultra-sound (US) sequences. More specifically, we propose 3D-SonoNet32 \u2013 which lifts 2D convolutions to 3D \u2013 and to an efficient (2+1)D variant \u2013 to keep the computational cost under control while preserving the benefits of the spatio-temporal model. We investigate the potential of these architectures on a scan-plane detection problem and discuss how these methodologies can be beneficial for AI-driven online \u201cscan assistants\u201d, to enhance the quality and reproducibility of the evaluation and ultimately support the clinicians in the US examination. Our main contributions are (i) the design of novel Space-Time SonoNet architectures for analysing US video sequences, (ii) an in depth experimental analysis to show the benefit of using space-time models with respect to purely spatial ones, and to discuss the potential improvements gained by exploiting domain-specific properties like temporal coherence and prior knowledge of the ongoing scan. Overall, we show that the proposed models are specifically designed to be computationally lightweight, but also competitive in performance, making them suitable for real-time deployment on portable US devices.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Graphical abstract<\/jats:title>\n                  <\/jats:sec>","DOI":"10.1007\/s11517-025-03504-w","type":"journal-article","created":{"date-parts":[[2026,1,30]],"date-time":"2026-01-30T23:28:07Z","timestamp":1769815687000},"page":"1167-1178","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["SpaceTime-SonoNet: efficient classification of ultra-sound video sequences"],"prefix":"10.1007","volume":"64","author":[{"given":"Matteo","family":"Interlando","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Luca","family":"Zini","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nicola","family":"Guraschi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6482-4768","authenticated-orcid":false,"given":"Nicoletta","family":"Noceti","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Francesca","family":"Odone","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2026,1,31]]},"reference":[{"issue":"11","key":"3504_CR1","doi-asserted-by":"publisher","first-page":"7771","DOI":"10.1109\/TII.2021.3069470","volume":"17","author":"B Pu","year":"2021","unstructured":"Pu B, Li K, Li S, Zhu N (2021) Automatic fetal ultrasound standard plane recognition based on deep learning and IIOT. IEEE Trans Ind Inform 17(11):7771\u20137780","journal-title":"IEEE Trans Ind Inform"},{"key":"3504_CR2","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1016\/j.sopen.2024.02.005","volume":"18","author":"C Poggi","year":"2024","unstructured":"Poggi C, Palavecino M (2024) Ultrasound principles and instrumentation. Surg Open Sci 18:123\u2013128","journal-title":"Surg Open Sci"},{"issue":"9","key":"3504_CR3","doi-asserted-by":"publisher","first-page":"1578","DOI":"10.1007\/s003300101002","volume":"11","author":"CJ Harvey","year":"2001","unstructured":"Harvey CJ, Albrecht T (2001) Ultrasound of focal liver lesions. Eur Radiol 11(9):1578\u20131593","journal-title":"Eur Radiol"},{"key":"3504_CR4","unstructured":"Noble JA (2016) Reflections on ultrasound image analysis. Elsevier"},{"key":"3504_CR5","doi-asserted-by":"publisher","DOI":"10.3389\/fphy.2024.1398393","volume":"12","author":"K Song","year":"2024","unstructured":"Song K, Feng J, Chen D (2024) A survey on deep learning in medical ultrasound imaging. Front Phys 12:1398393","journal-title":"Front Phys"},{"issue":"6","key":"3504_CR6","doi-asserted-by":"publisher","DOI":"10.3390\/app13063693","volume":"13","author":"R Tenajas","year":"2023","unstructured":"Tenajas R et al (2023) Recent advances in artificial intelligence-assisted ultrasound scanning. Appl Sci 13(6):3693","journal-title":"Appl Sci"},{"issue":"6","key":"3504_CR7","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0252882","volume":"16","author":"T Tiyarattanachai","year":"2021","unstructured":"Tiyarattanachai T et al (2021) Development and validation of artificial intelligence to detect and diagnose liver lesions from ultrasound images. PLoS One 16(6):0252882","journal-title":"PLoS One"},{"issue":"4","key":"3504_CR8","doi-asserted-by":"publisher","first-page":"786","DOI":"10.1007\/s00261-018-1517-0","volume":"43","author":"L Brattain","year":"2018","unstructured":"Brattain L et al (2018) Machine learning for medical ultrasound: status, methods, and future opportunities. Abdom Radiol 43(4):786\u2013799","journal-title":"Abdom Radiol"},{"key":"3504_CR9","doi-asserted-by":"publisher","first-page":"182","DOI":"10.1016\/j.sopen.2022.09.002","volume":"10","author":"RB Adams","year":"2022","unstructured":"Adams RB (2022) Ultrasound scanning techniques. Surg Open Sci 10:182\u2013207","journal-title":"Surg Open Sci"},{"issue":"1","key":"3504_CR10","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1007\/s11517-023-02881-4","volume":"62","author":"J Dapueto","year":"2024","unstructured":"Dapueto J, Zini L, Odone F (2024) Knowledge distillation for efficient standard scanplane detection of fetal ultrasound. Med Biol Eng Comput 62(1):73\u201382","journal-title":"Med Biol Eng Comput"},{"issue":"11","key":"3504_CR11","doi-asserted-by":"publisher","first-page":"2204","DOI":"10.1109\/TMI.2017.2712367","volume":"36","author":"C Baumgartner","year":"2017","unstructured":"Baumgartner C et al (2017) Sononet: real-time detection and localisation of fetal standard scan planes in freehand ultrasound. IEEE Trans Med Imaging 36(11):2204\u20132215","journal-title":"IEEE Trans Med Imaging"},{"key":"3504_CR12","unstructured":"Schlemper J et al (2018) Attention-gated networks for improving ultrasound scan plane detection. arXiv:1804.05338"},{"issue":"5","key":"3504_CR13","doi-asserted-by":"publisher","first-page":"1627","DOI":"10.1109\/JBHI.2015.2425041","volume":"19","author":"H Chen","year":"2015","unstructured":"Chen H et al (2015) Standard plane localization in fetal ultrasound via domain transferred deep neural networks. IEEE J Biomed Health Inform 19(5):1627\u20131636","journal-title":"IEEE J Biomed Health Inform"},{"key":"3504_CR14","doi-asserted-by":"crossref","unstructured":"Cai Y et al (2018) Multi-task sonoeyenet: detection of fetal standardized planes assisted by generated sonographer attention maps. In: MICCAI, pp 871\u2013879","DOI":"10.1007\/978-3-030-00928-1_98"},{"key":"3504_CR15","doi-asserted-by":"publisher","DOI":"10.1016\/j.compbiomed.2024.108430","volume":"174","author":"G Migliorelli","year":"2024","unstructured":"Migliorelli G et al (2024) On the use of contrastive learning for standard-plane classification in fetal ultrasound imaging. Comput Biol Med 174:108430","journal-title":"Comput Biol Med"},{"key":"3504_CR16","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.122153","volume":"238","author":"TB Krishna","year":"2024","unstructured":"Krishna TB, Kokil P (2024) Standard fetal ultrasound plane classification based on stacked ensemble of deep learning models. Expert Syst Appl 238:122153","journal-title":"Expert Syst Appl"},{"key":"3504_CR17","doi-asserted-by":"crossref","unstructured":"Chen H et al (2015) Automatic fetal ultrasound standard plane detection using knowledge transferred recurrent neural networks. In: MICCAI, Springer, pp 507\u2013514","DOI":"10.1007\/978-3-319-24553-9_62"},{"key":"3504_CR18","doi-asserted-by":"publisher","DOI":"10.1016\/j.media.2020.101762","volume":"65","author":"Y Cai","year":"2020","unstructured":"Cai Y et al (2020) Spatio-temporal visual attention modelling of standard biometry plane-finding navigation. Med Image Anal 65:101762","journal-title":"Med Image Anal"},{"key":"3504_CR19","doi-asserted-by":"publisher","DOI":"10.21037\/jmai.2019.10.03","author":"J Howard","year":"2020","unstructured":"Howard J et al (2020) Improving ultrasound video classification: an evaluation of novel deep learning methods in echocardiography. J Med Artif Intell. https:\/\/doi.org\/10.21037\/jmai.2019.10.03","journal-title":"J Med Artif Intell"},{"key":"3504_CR20","unstructured":"Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556"},{"key":"3504_CR21","doi-asserted-by":"publisher","DOI":"10.1016\/j.ejrad.2023.110885","volume":"165","author":"G Zhao","year":"2023","unstructured":"Zhao G et al (2023) Deep learning-based classification of breast lesions using dynamic ultrasound video. Eur J Radiol 165:110885","journal-title":"Eur J Radiol"},{"key":"3504_CR22","doi-asserted-by":"publisher","DOI":"10.1016\/j.bspc.2023.105541","volume":"87","author":"D Kong","year":"2024","unstructured":"Kong D, Hu S, Zhao G (2024) Mv-stcnet: breast cancer diagnosis using spatial and temporal dual-attention guided classification network based on multi-view ultrasound videos. Biomed Signal Process Control 87:105541","journal-title":"Biomed Signal Process Control"},{"key":"3504_CR23","doi-asserted-by":"publisher","DOI":"10.3390\/s21165486","author":"B Barros","year":"2021","unstructured":"Barros B et al (2021) Pulmonary covid-19: learning spatiotemporal features combining cnn and lstm networks for lung ultrasound video classification. Sensors. https:\/\/doi.org\/10.3390\/s21165486","journal-title":"Sensors"},{"key":"3504_CR24","unstructured":"Dosovitskiy A et al (2020) An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929"},{"key":"3504_CR25","doi-asserted-by":"crossref","unstructured":"Tran D et al (2018) A closer look at spatiotemporal convolutions for action recognition. In: CVPR, pp 6450\u20136459","DOI":"10.1109\/CVPR.2018.00675"},{"key":"3504_CR26","unstructured":"Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. CoRR. abs\/1409.1556"},{"key":"3504_CR27","doi-asserted-by":"crossref","unstructured":"Qiu Z, Yao T, Mei T (2017) Learning spatio-temporal representation with pseudo-3d residual networks. In: ICCV, pp 5533\u20135541","DOI":"10.1109\/ICCV.2017.590"},{"issue":"1","key":"3504_CR28","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s40537-019-0197-0","volume":"6","author":"C Shorten","year":"2019","unstructured":"Shorten C, Khoshgoftaar TM (2019) A survey on image data augmentation for deep learning. J Big Data 6(1):1\u201348","journal-title":"J Big Data"},{"issue":"2017","key":"3504_CR29","first-page":"1","volume":"11","author":"J Wang","year":"2017","unstructured":"Wang J, Perez L et al (2017) The effectiveness of data augmentation in image classification using deep learning. Convolutional Neural Netw Vis Recognit 11(2017):1\u20138","journal-title":"Convolutional Neural Netw Vis Recognit"},{"key":"3504_CR30","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1016\/j.neunet.2018.07.011","volume":"106","author":"M Buda","year":"2018","unstructured":"Buda M, Maki A, Mazurowski MA (2018) A systematic study of the class imbalance problem in convolutional neural networks. Neural Netw 106:249\u2013259","journal-title":"Neural Netw"},{"key":"3504_CR31","unstructured":"Loshchilov I, Hutter F et al (2017) Fixing weight decay regularization in adam, 5. arXiv:1711.05101"},{"key":"3504_CR32","doi-asserted-by":"publisher","DOI":"10.3389\/fcomp.2022.867289","volume":"4","author":"E Nicora","year":"2022","unstructured":"Nicora E, Noceti N (2022) On the use of efficient projection kernels for motion-based visual saliency estimation. Front Comput Sci 4:867289","journal-title":"Front Comput Sci"}],"container-title":["Medical &amp; Biological Engineering &amp; Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11517-025-03504-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11517-025-03504-w","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11517-025-03504-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T08:41:33Z","timestamp":1775637693000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11517-025-03504-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,1,31]]},"references-count":32,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2026,3]]}},"alternative-id":["3504"],"URL":"https:\/\/doi.org\/10.1007\/s11517-025-03504-w","relation":{},"ISSN":["0140-0118","1741-0444"],"issn-type":[{"value":"0140-0118","type":"print"},{"value":"1741-0444","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,1,31]]},"assertion":[{"value":"22 September 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 December 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 January 2026","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare they have no known competing financial interests or personal relationships that could have influenced the work reported in this paper.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}]}}