{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,16]],"date-time":"2026-06-16T13:52:33Z","timestamp":1781617953191,"version":"3.54.5"},"reference-count":56,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T00:00:00Z","timestamp":1763337600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T00:00:00Z","timestamp":1763337600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Imaging"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>In biportal endoscopic spine surgery (BESS), accurately identifying neural structures, mainly the spinal nerve roots, dural sac, and the cauda equina, is crucial for preventing dural tears and achieving optimal clinical outcomes. Contrary to the growing popularity of deep learning in biomedical image processing, its application to BESS has not yet been well established. We propose a two-stage framework for real-time neural tissue segmentation from spinal endoscopy video, and compare various types of deep learning architectures.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>6410 intraoperative images from 28 patients were collected and split at the patient level into 4661 and 1749 images for training and testing, respectively. For each set, 2307 and 635 images contained neural tissue. First, a lightweight image classifier that determines the presence of neural tissue was developed. Then, six variants of the U-Net family and SegFormers for neural tissue segmentation were trained. Ground truth segmentation masks were generated by a spine specialist with more than four years of experience. AUROC and DSC on the test set were the primary outcome measures for the classification and segmentation models, respectively, and computational burden was also measured, followed by qualitative assessment of the output predictions.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>\n                      ResNet-18 achieved the highest test AUROC of 0.92 (95% CI: 0.91\u20130.93), running at a mean inference time of 1.2 ms per image, outperforming MobilenetV3-Large in all performance metrics (\n                      <jats:italic>p<\/jats:italic>\n                      \u2009&lt;\u20090.001) and computational efficiency. AR2U-Net exhibited the highest test DSC of 0.80 (95% CI: 0.78\u20130.81), IoU of 0.70 (95% CI: 0.68\u20130.72), and AUPRC of 0.88 (95% CI: 0.87\u20130.89). The test performance metrics of other U-Net variants did not differ significantly, except for AUPRC (\n                      <jats:italic>p<\/jats:italic>\n                      \u2009&lt;\u20090.001), while those of SegFormer-B0 and SegFormer-B1 were significantly inferior (\n                      <jats:italic>p<\/jats:italic>\n                      \u2009&lt;\u20090.001). U-Net variants were generally operable at a reasonable speed (25\u201340 FPS), in which U-Net and AU-Net exhibited inference times of less than 30 ms.\n                    <\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>We have developed and compared deep learning models for neural tissue segmentation in spinal endoscopy and explored their potential for clinical application to real-time surgical video streams.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Clinical trial number<\/jats:title>\n                    <jats:p>Not applicable.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12880-025-01918-4","type":"journal-article","created":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T20:46:44Z","timestamp":1763412404000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Comparison of deep learning models for real-time neural tissue segmentation in spinal endoscopy"],"prefix":"10.1186","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7770-5760","authenticated-orcid":false,"given":"Wounsuk","family":"Rhee","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hyung Rae","family":"Lee","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8992-2559","authenticated-orcid":false,"given":"Bong-Soon","family":"Chang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4152-687X","authenticated-orcid":false,"given":"Sam Yeol","family":"Chang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4500-9653","authenticated-orcid":false,"given":"Hyoungmin","family":"Kim","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,11,17]]},"reference":[{"issue":"2","key":"1918_CR1","doi-asserted-by":"publisher","first-page":"418","DOI":"10.31616\/asj.2021.0527","volume":"17","author":"L Junjie","year":"2023","unstructured":"Junjie L, Jiheng Y, Jun L, Haifeng Y. Comparison of unilateral biportal endoscopy decompression and microscopic decompression effectiveness in lumbar spinal stenosis treatment: a systematic review and meta-analysis. Asian Spine J. 2023;17(2):418\u201330. https:\/\/doi.org\/10.31616\/asj.2021.0527.","journal-title":"Asian Spine J"},{"issue":"1","key":"1918_CR2","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1016\/j.spinee.2021.07.004","volume":"22","author":"AK Simpson","year":"2022","unstructured":"Simpson AK, Lightsey HM 4th, Xiong GX, Crawford AM, Minamide A, Schoenfeld AJ. Spinal endoscopy: evidence, techniques, global trends, and future projections. Spine J. 2022;22(1):64\u201374. https:\/\/doi.org\/10.1016\/j.spinee.2021.07.004.","journal-title":"Spine J"},{"issue":"1","key":"1918_CR3","doi-asserted-by":"publisher","first-page":"22","DOI":"10.1186\/s13018-018-0725-1","volume":"13","author":"SK Kim","year":"2018","unstructured":"Kim SK, Kang SS, Hong YH, Park SW, Lee SC. Clinical comparison of unilateral biportal endoscopic technique versus open microdiscectomy for single-level lumbar discectomy: a multicenter, retrospective analysis. J Orthop Surg Res. 2018;13(1):22. https:\/\/doi.org\/10.1186\/s13018-018-0725-1.","journal-title":"J Orthop Surg Res"},{"issue":"4","key":"1918_CR4","doi-asserted-by":"publisher","first-page":"624","DOI":"10.4184\/asj.2016.10.4.624","volume":"10","author":"DJ Choi","year":"2016","unstructured":"Choi DJ, Choi CM, Jung JT, Lee SJ, Kim YS. Learning curve associated with complications in biportal endoscopic spinal surgery: challenges and strategies. Asian Spine J. 2016;10(4):624. https:\/\/doi.org\/10.4184\/asj.2016.10.4.624.","journal-title":"Asian Spine J"},{"issue":"1","key":"1918_CR5","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1111\/os.13437","volume":"15","author":"B Wang","year":"2023","unstructured":"Wang B, He P, Liu X, Wu Z, Xu B. Complications of unilateral biportal endoscopic spinal surgery for lumbar spinal stenosis: a systematic review of the literature and meta-analysis of single-arm studies. Orthop Surg. 2023;15(1):3\u201315. https:\/\/doi.org\/10.1111\/os.13437.","journal-title":"Orthop Surg"},{"issue":"6","key":"1918_CR6","doi-asserted-by":"publisher","first-page":"790","DOI":"10.31616\/asj.2019.0297","volume":"14","author":"JE Kim","year":"2020","unstructured":"Kim JE, Choi DJ, Park EJ. Risk factors and options of management for an incidental dural tear in biportal endoscopic spine surgery. Asian Spine J. 2020;14(6):790\u2013800. https:\/\/doi.org\/10.31616\/asj.2019.0297.","journal-title":"Asian Spine J"},{"issue":"5","key":"1918_CR7","doi-asserted-by":"publisher","first-page":"1445","DOI":"10.1021\/acs.molpharmaceut.5b00982","volume":"13","author":"P Mamoshina","year":"2016","unstructured":"Mamoshina P, Vieira A, Putin E, Zhavoronkov A. Applications of deep learning in biomedicine. Mol Pharm. 2016;13(5):1445\u201354. https:\/\/doi.org\/10.1021\/acs.molpharmaceut.5b00982.","journal-title":"Mol Pharm"},{"key":"1918_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12874-018-0482-1","volume":"18","author":"JL Katzman","year":"2018","unstructured":"Katzman JL, Shaham U, Cloninger A, Bates J, Jiang T, Kluger Y. DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network. BMC Med Res Methodol. 2018;18:1\u20132.","journal-title":"BMC Med Res Methodol"},{"key":"1918_CR9","doi-asserted-by":"crossref","unstructured":"Ko S, Pareek A, Ro DH, Lu Y, Camp CL, Martin RK, Krych AJ. Artificial intelligence in orthopedics: three strategies for deep learning with orthopedic specific imaging. Knee Surgery, Sports Traumatology, Arthroscopy. 2022 Mar;1\u20134.","DOI":"10.1007\/s00167-021-06838-8"},{"key":"1918_CR10","doi-asserted-by":"crossref","unstructured":"Rhee W, Park SC, Kim H, Chang BS, Chang SY. Deep learning-based prediction of cervical Canal stenosis from mid-sagittal T2-weighted MRI. Skeletal Radiol. 2025 Mar;28:1\u201310.","DOI":"10.1007\/s00256-025-04917-2"},{"key":"1918_CR11","doi-asserted-by":"crossref","unstructured":"Kang DH, Jeong YJ, Kim ST, Kim Y, Chang BS, Kim H, Chang SY, Ro DH. Automated measurement of pelvic parameters using convolutional neural network in complex spinal deformities: overcoming challenges in coronal deformity cases. Spine J. 2025 Jan 31.","DOI":"10.1016\/j.spinee.2025.01.020"},{"key":"1918_CR12","doi-asserted-by":"publisher","first-page":"40496","DOI":"10.1109\/access.2021.3063716","volume":"9","author":"D Jha","year":"2021","unstructured":"Jha D, Ali S, Tomar NK, Johansen HD, Johansen D, Rittscher J, Riegler MA, Halvorsen P. Real-time polyp detection, localization and segmentation in colonoscopy using deep learning. IEEE Access. 2021;9:40496\u2013510. https:\/\/doi.org\/10.1109\/access.2021.3063716.","journal-title":"IEEE Access"},{"issue":"10","key":"1918_CR13","doi-asserted-by":"publisher","first-page":"2962","DOI":"10.1097\/JS9.0000000000000595","volume":"109","author":"FR Kolbinger","year":"2023","unstructured":"Kolbinger FR, Rinner FM, Jenke AC, et al. Anatomy segmentation in laparoscopic surgery: comparison of machine learning and human expertise \u2013 an experimental study. Int J Surg. 2023;109(10):2962\u201374. https:\/\/doi.org\/10.1097\/JS9.0000000000000595.","journal-title":"Int J Surg"},{"key":"1918_CR14","doi-asserted-by":"publisher","unstructured":"Kletz S, Schoeffmann K, Benois-Pineau J, Husslein H. Identifying surgical instruments in laparoscopy using deep learning instance segmentation. In: 2019 International Conference on Content-Based Multimedia Indexing (CBMI); 2019; Dublin, Ireland. p. 1\u20136. https:\/\/doi.org\/10.1109\/CBMI.2019.8877379","DOI":"10.1109\/CBMI.2019.8877379"},{"issue":"11","key":"1918_CR15","doi-asserted-by":"publisher","first-page":"1082","DOI":"10.3390\/bioengineering11111082","volume":"11","author":"HR Lee","year":"2024","unstructured":"Lee HR, Rhee W, Chang SY, Chang BS, Kim H. Deep learning in spinal endoscopy: U-Net models for neural tissue detection. Bioeng. 2024;11(11):1082. https:\/\/doi.org\/10.3390\/bioengineering11111082.","journal-title":"Bioeng"},{"key":"1918_CR16","doi-asserted-by":"publisher","first-page":"104384","DOI":"10.1016\/j.compbiomed.2021.104384","volume":"133","author":"SM Cho","year":"2021","unstructured":"Cho SM, Kim YG, Jeong J, Kim I, Lee HJ, Kim N. Automatic tip detection of surgical instruments in biportal endoscopic spine surgery. Comput Biol Med. 2021;133:104384. https:\/\/doi.org\/10.1016\/j.compbiomed.2021.104384.","journal-title":"Comput Biol Med"},{"issue":"5","key":"1918_CR17","doi-asserted-by":"publisher","first-page":"587","DOI":"10.1007\/s43465-024-01134-2","volume":"58","author":"J Bu","year":"2024","unstructured":"Bu J, Lei Y, Wang Y, Zhao J, Huang S, Liang J, Wang Z, Xu L, He B, Dong M, Liu G. A Multi-Element identification system based on deep learning for the visual field of percutaneous endoscopic spine surgery. Indian J Orthop. 2024;58(5):587\u201397.","journal-title":"Indian J Orthop"},{"key":"1918_CR18","doi-asserted-by":"publisher","first-page":"234","DOI":"10.1007\/978-3-319-24574-4_28","volume-title":"Medical image computing and Computer-Assisted Intervention \u2013 MICCAI 2015. MICCAI 2015","author":"O Ronneberger","year":"2015","unstructured":"Ronneberger O, Fischer P, Brox T. U-Net: convolutional networks for biomedical image segmentation. In: Navab N, Hornegger J, Wells W, Frangi A, editors. Medical image computing and Computer-Assisted Intervention \u2013 MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science. Volume 9351. Cham: Springer; 2015. pp. 234\u201341. https:\/\/doi.org\/10.1007\/978-3-319-24574-4_28."},{"key":"1918_CR19","doi-asserted-by":"publisher","first-page":"82031","DOI":"10.1109\/ACCESS.2021.3086020","volume":"9","author":"N Siddique","year":"2021","unstructured":"Siddique N, Paheding S, Elkin CP, Devabhaktuni V. U-Net and its variants for medical image segmentation: a review of theory and applications. IEEE Access. 2021;9:82031\u201357. https:\/\/doi.org\/10.1109\/ACCESS.2021.3086020.","journal-title":"IEEE Access"},{"issue":"1","key":"1918_CR20","doi-asserted-by":"publisher","first-page":"014006","DOI":"10.1117\/1.jmi.6.1.014006","volume":"6","author":"MZ Alom","year":"2019","unstructured":"Alom MZ, Yakopcic C, Hasan M, Taha TM, Asari VK. Recurrent residual U-Net for medical image segmentation. J Med Imaging. 2019;6(1):014006. https:\/\/doi.org\/10.1117\/1.jmi.6.1.014006.","journal-title":"J Med Imaging"},{"issue":"5","key":"1918_CR21","doi-asserted-by":"publisher","first-page":"749","DOI":"10.1109\/LGRS.2018.2802944","volume":"15","author":"Z Zhang","year":"2018","unstructured":"Zhang Z, Liu Q, Wang Y. Road extraction by deep residual U-Net. IEEE Geosci Remote Sens Lett. 2018;15(5):749\u201353. https:\/\/doi.org\/10.1109\/LGRS.2018.2802944.","journal-title":"IEEE Geosci Remote Sens Lett"},{"key":"1918_CR22","unstructured":"Oktay O, Schlemper J, Folgoc LL, Lee M, Heinrich M, Misawa K, Mori K, McDonagh S, Hammerla NY, Kainz B, Glocker B, Rueckert D. (2018) Attention U-Net: Learning where to look for the pancreas. arXiv. 2018. https:\/\/arxiv.org\/abs\/1804.03999. Accessed 26 Dec 2024."},{"key":"1918_CR23","doi-asserted-by":"publisher","first-page":"6625688","DOI":"10.1155\/2021\/6625688","volume":"2021","author":"Q Zuo","year":"2021","unstructured":"Zuo Q, Chen S, Wang Z. R2AU-Net: attention recurrent residual convolutional neural network for multimodal medical image segmentation. Secur Commun Netw. 2021;2021:6625688. https:\/\/doi.org\/10.1155\/2021\/6625688.","journal-title":"Secur Commun Netw"},{"key":"1918_CR24","first-page":"12077","volume":"34","author":"E Xie","year":"2021","unstructured":"Xie E, Wang W, Yu Z, Anandkumar A, Alvarez JM, Luo P. SegFormer: simple and efficient design for semantic segmentation with Transformers. Adv Neural Inf Process Syst. 2021;34:12077\u201390.","journal-title":"Adv Neural Inf Process Syst"},{"key":"1918_CR25","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1007\/s11263-007-0090-8","volume":"77","author":"BC Russell","year":"2008","unstructured":"Russell BC, Torralba A, Murphy KP, Freeman WT. LabelMe: a database and web-based tool for image annotation. Int J Comput Vis. 2008;77:157\u201373. https:\/\/doi.org\/10.1007\/s11263-007-0090-8.","journal-title":"Int J Comput Vis"},{"key":"1918_CR26","doi-asserted-by":"publisher","unstructured":"He K, Zhang X, Ren S, Sun J, Vision, Recognition P, CVPR). Deep residual learning for image recognition. In: Proceedings of the 2016 Jun 27\u201330; Las Vegas, NV, USA. p. 770\u20138. https:\/\/doi.org\/10.1109\/CVPR.2016.90","DOI":"10.1109\/CVPR.2016.90"},{"key":"1918_CR27","doi-asserted-by":"crossref","unstructured":"Howard A, Sandler M, Chu G, Chen LC, Chen B, Tan M, Wang W, Zhu Y, Pang R, Vasudevan V, Le QV. Searching for mobilenetv3. InProceedings of the IEEE\/CVF international conference on computer vision 2019 (pp. 1314\u20131324).","DOI":"10.1109\/ICCV.2019.00140"},{"issue":"2","key":"1918_CR28","doi-asserted-by":"publisher","first-page":"386","DOI":"10.1109\/tpami.2018.2844175","volume":"42","author":"K He","year":"2020","unstructured":"He K, Gkioxari G, Doll\u00e1r P, Girshick R, Mask R-CNN. IEEE Trans Pattern Anal Mach Intell. 2020;42(2):386\u201397. https:\/\/doi.org\/10.1109\/tpami.2018.2844175.","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1918_CR29","doi-asserted-by":"crossref","unstructured":"Liu Z, Mao H, Wu CY, Feichtenhofer C, Darrell T, Xie S. A convnet for the 2020s. InProceedings of the IEEE\/CVF conference on computer vision and pattern recognition 2022 (pp. 11976\u201311986).","DOI":"10.1109\/CVPR52688.2022.01167"},{"key":"1918_CR30","doi-asserted-by":"crossref","unstructured":"Khan S, Naseer M, Hayat M, Zamir SW, Khan FS, Shah M. Transformers in vision: a survey. ACM computing surveys (CSUR). 2022;54(10s):1\u201341.","DOI":"10.1145\/3505244"},{"issue":"2","key":"1918_CR31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.5121\/ijdkp.2015.5201","volume":"5","author":"M Hossin","year":"2015","unstructured":"Hossin M, Sulaiman MN. A review on evaluation metrics for data classification evaluations. Int J Data Min Knowl Manage Process. 2015;5(2):1.","journal-title":"Int J Data Min Knowl Manage Process"},{"key":"1918_CR32","doi-asserted-by":"publisher","unstructured":"Jadon S. A survey of loss functions for semantic segmentation. In: Proceedings of the 2020 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB); 2020; Via del Mar, Chile. pp. 1\u20137. https:\/\/doi.org\/10.1109\/CIBCB48159.2020.9277638","DOI":"10.1109\/CIBCB48159.2020.9277638"},{"key":"1918_CR33","doi-asserted-by":"publisher","unstructured":"Zhao R, Qian B, Zhang X, Li Y, Wei R, Liu Y, Pan Y. Rethinking dice loss for medical image segmentation. In: Proceedings of the 2020 IEEE International Conference on Data Mining (ICDM); 2020; Sorrento, Italy. pp. 851\u2009\u2013\u200960. https:\/\/doi.org\/10.1109\/ICDM50108.2020.00094","DOI":"10.1109\/ICDM50108.2020.00094"},{"issue":"1","key":"1918_CR34","doi-asserted-by":"publisher","first-page":"210","DOI":"10.1186\/s13104-022-06096-y","volume":"15","author":"D M\u00fcller","year":"2022","unstructured":"M\u00fcller D, Soto-Rey I, Kramer F. Towards a guideline for evaluation metrics in medical image segmentation. BMC Res Notes. 2022;15(1):210.","journal-title":"BMC Res Notes"},{"issue":"1","key":"1918_CR35","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1038\/s41746-020-00376-2","volume":"4","author":"A Esteva","year":"2021","unstructured":"Esteva A, Chou K, Yeung S, Naik N, Madani A, Mottaghi A, Liu Y, Topol E, Dean J, Socher R. Deep learning-enabled medical computer vision. NPJ Digit Med. 2021;4(1):5.","journal-title":"NPJ Digit Med"},{"key":"1918_CR36","doi-asserted-by":"crossref","unstructured":"Azad R, Aghdam EK, Rauland A, Jia Y, Avval AH, Bozorgpour A, Karimijafarbigloo S, Cohen JP, Adeli E, Merhof D. Medical image segmentation review: the success of u-net. IEEE Trans Pattern Anal Mach Intell. 2024 Aug 21.","DOI":"10.1109\/TPAMI.2024.3435571"},{"key":"1918_CR37","doi-asserted-by":"publisher","first-page":"104791","DOI":"10.1016\/j.bspc.2023.104791","volume":"84","author":"H Xiao","year":"2023","unstructured":"Xiao H, Li L, Liu Q, Zhu X, Zhang Q. Transformers in medical image segmentation: A review. Biomed Signal Process Control. 2023;84:104791.","journal-title":"Biomed Signal Process Control"},{"key":"1918_CR38","doi-asserted-by":"crossref","first-page":"73104","DOI":"10.52202\/075280-3196","volume":"36","author":"L Pandey","year":"2023","unstructured":"Pandey L, Wood S, Wood J. Are vision Transformers more data hungry than newborn visual systems? Adv Neural Inf Process Syst. 2023;36:73104\u201321.","journal-title":"Adv Neural Inf Process Syst"},{"key":"1918_CR39","unstructured":"Ruan BK, Shuai HH, Cheng WH. Vision transformers: state of the art and research challenges. arXiv preprint arXiv:2207.03041. 2022 Jul 7."},{"key":"1918_CR40","unstructured":"Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. ArXiv Preprint ArXiv:1409.1556. 2014 Sep 4."},{"issue":"4","key":"1918_CR41","doi-asserted-by":"publisher","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","volume":"40","author":"LC Chen","year":"2017","unstructured":"Chen LC, Papandreou G, Kokkinos I, Murphy K, Yuille AL, Deeplab. Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected Crfs. IEEE Trans Pattern Anal Mach Intell. 2017;40(4):834\u201348.","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1918_CR42","unstructured":"Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S, Uszkoreit J. An image is worth 16x16 words: Transformers for image recognition at scale. ArXiv Preprint ArXiv:2010.11929. 2020 Oct 22."},{"issue":"1","key":"1918_CR43","first-page":"e220028","volume":"5","author":"F Maleki","year":"2022","unstructured":"Maleki F, Ovens K, Gupta R, Reinhold C, Spatz A, Forghani R. Generalizability of machine learning models: quantitative evaluation of three methodological pitfalls. Radiology: Artif Intell. 2022;5(1):e220028.","journal-title":"Radiology: Artif Intell"},{"key":"1918_CR44","doi-asserted-by":"crossref","unstructured":"Myllyaho L, Raatikainen M, M\u00e4nnist\u00f6 T, Mikkonen T, Nurminen JK. Systematic literature review of validation methods for AI systems. ArXiv Preprint ArXiv:2107.12190. 2021 Jul 26.","DOI":"10.1016\/j.jss.2021.111050"},{"issue":"3","key":"1918_CR45","first-page":"e210064","volume":"4","author":"AC Yu","year":"2022","unstructured":"Yu AC, Mohajer B, Eng J. External validation of deep learning algorithms for radiologic diagnosis: a systematic review. Radiology: Artif Intell. 2022;4(3):e210064.","journal-title":"Radiology: Artif Intell"},{"issue":"2","key":"1918_CR46","doi-asserted-by":"publisher","first-page":"252","DOI":"10.1093\/jamiaopen\/ooaa006","volume":"3","author":"AD Bedoya","year":"2020","unstructured":"Bedoya AD, Futoma J, Clement ME, Corey K, Brajer N, Lin A, Simons MG, Gao M, Nichols M, Balu S, Heller K. Machine learning for early detection of sepsis: an internal and Temporal validation study. JAMIA Open. 2020;3(2):252\u201360.","journal-title":"JAMIA Open"},{"issue":"1","key":"1918_CR47","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1016\/j.ijmedinf.2006.01.003","volume":"76","author":"N Bricon-Souf","year":"2007","unstructured":"Bricon-Souf N, Newman CR. Context awareness in health care: A review. Int J Med Informatics. 2007;76(1):2\u201312.","journal-title":"Int J Med Informatics"},{"key":"1918_CR48","doi-asserted-by":"publisher","unstructured":"Tran D, Bourdev L, Fergus R, Torresani L, Paluri M. Learning spatiotemporal features with 3D convolutional networks. In: Proceedings of the. 2015 IEEE International Conference on Computer Vision (ICCV); 2015; Santiago, Chile. pp. 4489-97. https:\/\/doi.org\/10.1109\/ICCV.2015.510","DOI":"10.1109\/ICCV.2015.510"},{"key":"1918_CR49","doi-asserted-by":"crossref","unstructured":"Neimark D, Bar O, Zohar M, Asselmann D. Video transformer network. InProceedings of the IEEE\/CVF international conference on computer vision 2021 (pp. 3163\u20133172).","DOI":"10.1109\/ICCVW54120.2021.00355"},{"issue":"11","key":"1918_CR50","doi-asserted-by":"publisher","first-page":"12922","DOI":"10.1109\/TPAMI.2023.3243465","volume":"45","author":"J Selva","year":"2023","unstructured":"Selva J, Johansen AS, Escalera S, Nasrollahi K, Moeslund TB, Clap\u00e9s A. Video transformers: A survey. IEEE Trans Pattern Anal Mach Intell. 2023;45(11):12922\u201343.","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1918_CR51","doi-asserted-by":"publisher","first-page":"102035","DOI":"10.1016\/j.media.2021.102035","volume":"71","author":"J Ma","year":"2021","unstructured":"Ma J, Chen J, Ng M, Huang R, Li Y, Li C, Yang X, Martel AL. Loss odyssey in medical image segmentation. Med Image Anal. 2021;71:102035.","journal-title":"Med Image Anal"},{"key":"1918_CR52","doi-asserted-by":"publisher","first-page":"6855","DOI":"10.1109\/TIP.2021.3099405","volume":"30","author":"J Li","year":"2021","unstructured":"Li J, Su J, Xia C, Ma M, Tian Y. Salient object detection with purificatory mechanism and structural similarity loss. IEEE Trans Image Process. 2021;30:6855\u201368.","journal-title":"IEEE Trans Image Process"},{"issue":"7","key":"1918_CR53","doi-asserted-by":"publisher","first-page":"3312","DOI":"10.1109\/TIP.2019.2895768","volume":"28","author":"A Lucas","year":"2019","unstructured":"Lucas A, Lopez-Tapia S, Molina R, Katsaggelos AK. Generative adversarial networks and perceptual losses for video super-resolution. IEEE Trans Image Process. 2019;28(7):3312\u201327.","journal-title":"IEEE Trans Image Process"},{"key":"1918_CR54","doi-asserted-by":"crossref","unstructured":"Li M, Ganni S, Ponten J, Albayrak A, Rutkowski AF, Jakimowicz J. Analysing usability and presence of a virtual reality operating room (VOR) simulator during laparoscopic surgery training. In 2020 IEEE Conference on Virtual Reality and 3D User Interfaces (VR) 2020 Mar 22 (pp. 566\u2013572). IEEE.","DOI":"10.1109\/VR46266.2020.1581301697128"},{"issue":"5","key":"1918_CR55","doi-asserted-by":"publisher","first-page":"1535","DOI":"10.1007\/s11517-024-03030-1","volume":"62","author":"YG Kim","year":"2024","unstructured":"Kim YG, Lee JH, Shim JW, Rhee W, Kim BS, Yoon D, Kim MJ, Park JW, Jeong CW, Yang HK, Cho M, Kim S. A multimodal virtual vision platform as a next-generation vision system for a surgical robot. Med Biol Eng Comput. 2024;62(5):1535\u201348. https:\/\/doi.org\/10.1007\/s11517-024-03030-1.","journal-title":"Med Biol Eng Comput"},{"key":"1918_CR56","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1007\/s10055-024-00986-1","volume":"28","author":"W Rhee","year":"2024","unstructured":"Rhee W, Kim YG, Lee JH, Shim JW, Kim BS, Yoon D, Cho M, Kim S. Unconstrained lightweight control interface for robot-assisted minimally invasive surgery using MediaPipe framework and head-mounted display. Virtual Reality. 2024;28:114. https:\/\/doi.org\/10.1007\/s10055-024-00986-1.","journal-title":"Virtual Reality"}],"container-title":["BMC Medical Imaging"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12880-025-01918-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12880-025-01918-4","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12880-025-01918-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,6,16]],"date-time":"2026-06-16T13:03:07Z","timestamp":1781614987000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1186\/s12880-025-01918-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,17]]},"references-count":56,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["1918"],"URL":"https:\/\/doi.org\/10.1186\/s12880-025-01918-4","relation":{},"ISSN":["1471-2342"],"issn-type":[{"value":"1471-2342","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,11,17]]},"assertion":[{"value":"14 March 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 September 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 November 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and\/or national research committee and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. The study was approved by the Public Institutional Review Board (IRB) of the National Bioethics Policy Institute through the public e-IRB system (Date: August 16th, 2024 \/ No. 2024-1010-001). Informed consent to participate was waived by the IRB due to the retrospective nature of this study.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not Applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"470"}}