{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T11:08:22Z","timestamp":1774868902450,"version":"3.50.1"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2025,8,27]],"date-time":"2025-08-27T00:00:00Z","timestamp":1756252800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,8,27]],"date-time":"2025-08-27T00:00:00Z","timestamp":1756252800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100010664","name":"H2020 Future and Emerging Technologies","doi-asserted-by":"publisher","award":["16726"],"award-info":[{"award-number":["16726"]}],"id":[{"id":"10.13039\/100010664","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100014852","name":"Departamento de Educaci\u00f3n, Cultura y Deporte, Gobierno de Arag\u00f3n","doi-asserted-by":"publisher","award":["16727"],"award-info":[{"award-number":["16727"]}],"id":[{"id":"10.13039\/501100014852","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Agencia Estatal de Investigacion","award":["16728"],"award-info":[{"award-number":["16728"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J CARS"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Purpose<\/jats:title>\n                    <jats:p>We aim to automate the initial analysis of complete endoscopy videos, identifying the sparse relevant content. This facilitates long procedure recording understanding, reduces the clinicians\u2019 review time, and facilitates downstream tasks such as video summarization, event detection, and 3D reconstruction.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>Our approach extracts endoscopic video frame representations with a learned embedding model. These descriptors are clustered to find visual patterns in the procedure, identifying key scene types (surgery, clear visibility frames, etc.) and enabling segmentation into informative and non-informative video parts.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Evaluation on complete colonoscopy videos presents good performance identifying surgery segments and different visibility conditions. The method produces structured overviews that separate useful segments from irrelevant ones. We illustrate its suitability and benefits as preprocessing for other downstream tasks, such as 3D reconstruction or video summarization.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>Our approach enables automated endoscopy overview generation, helping the clinicians focus on the relevant video content such as good visibility sections and surgery actions. The presented work facilitates faster recording reviewing for clinicians and effective video preprocessing for downstream tasks.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1007\/s11548-025-03502-1","type":"journal-article","created":{"date-parts":[[2025,8,27]],"date-time":"2025-08-27T12:12:26Z","timestamp":1756296746000},"page":"617-624","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Automated overview of complete endoscopies with unsupervised learned descriptors"],"prefix":"10.1007","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8191-6261","authenticated-orcid":false,"given":"O. Leon","family":"Barbed","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pablo","family":"Azagra","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Juan","family":"Plo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ana C.","family":"Murillo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,8,27]]},"reference":[{"issue":"1","key":"3502_CR1","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41597-020-00622-y","volume":"7","author":"H Borgli","year":"2020","unstructured":"Borgli H, Thambawita V, Smedsrud PH, Hicks S, Jha D, Eskeland SL, Randel KR, Pogorelov K, Lux M, Nguyen DTD, Johansen D, Griwoz C, Stensland HK, Garcia-Ceja E, Schmidt PT, Hammer HL, Riegler MA, Halvorsen P, Lange T (2020) HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy. Sci Data 7(1):1\u201314","journal-title":"Sci Data"},{"issue":"1","key":"3502_CR2","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1038\/s41597-023-02564-7","volume":"10","author":"P Azagra","year":"2023","unstructured":"Azagra P, Sostres C, Ferr\u00e1ndez \u00c1, Riazuelo L, Tomasini C, Barbed OL, Morlana J, Recasens D, Batlle VM, G\u00f3mez-Rodr\u00edguez JJ, Elvira R, L\u00f3pez J, Oriol C, Civera J, Tard\u00f3s JD, Murillo AC, Lanas \u00c1, Montiel JMM (2023) Endomapper dataset of complete calibrated endoscopy procedures. Sci Data 10(1):671","journal-title":"Sci Data"},{"key":"3502_CR3","doi-asserted-by":"publisher","first-page":"104003","DOI":"10.1016\/j.compbiomed.2020.104003","volume":"126","author":"I Pacal","year":"2020","unstructured":"Pacal I, Karaboga D, Basturk A, Akay B, Nalbantoglu U (2020) A comprehensive review of deep learning in colon cancer. Comput Biol Med 126:104003","journal-title":"Comput Biol Med"},{"key":"3502_CR4","doi-asserted-by":"crossref","unstructured":"G\u00f3mez-Rodr\u00edguez JJ, Lamarca J, Morlana J, Tard\u00f3s JD, Montiel JM (2021) SD-DefSLAM: semi-direct monocular SLAM for deformable and intracorporeal scenes. In: IEEE International Confence on Robotics and Automation, pp 5170\u20135177","DOI":"10.1109\/ICRA48506.2021.9561512"},{"key":"3502_CR5","doi-asserted-by":"publisher","first-page":"104519","DOI":"10.1016\/j.compbiomed.2021.104519","volume":"134","author":"I Pacal","year":"2021","unstructured":"Pacal I, Karaboga D (2021) A robust real-time deep learning based automatic polyp detection system. Comput Biol Med 134:104519","journal-title":"Comput Biol Med"},{"key":"3502_CR6","doi-asserted-by":"publisher","first-page":"101565","DOI":"10.1016\/j.bspc.2019.101565","volume":"53","author":"M Hajabdollahi","year":"2019","unstructured":"Hajabdollahi M, Esfandiarpoor R, Khadivi P, Soroushmehr SR, Karimi N, Najarian K, Samavi S (2019) Segmentation of bleeding regions in wireless capsule endoscopy for detection of informative frames. Biomed Sig Process Control 53:101565","journal-title":"Biomed Sig Process Control"},{"key":"3502_CR7","doi-asserted-by":"crossref","unstructured":"Hirsch R, Caron M, Cohen R, Livne A, Shapiro R, Golany T, Goldenberg R, Freedman D, Rivlin E (2023) Self-supervised learning for endoscopic video analysis. In: International conference on medical image Computing and computer-assisted intervention, pp 569\u2013578","DOI":"10.1007\/978-3-031-43904-9_55"},{"key":"3502_CR8","doi-asserted-by":"crossref","unstructured":"He K, Fan H, Wu Y, Xie S, Girshick R (2020) Momentum contrast for unsupervised visual representation learning. In: CVPR","DOI":"10.1109\/CVPR42600.2020.00975"},{"key":"3502_CR9","unstructured":"Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations. In: ICML"},{"key":"3502_CR10","first-page":"21271","volume":"33","author":"J-B Grill","year":"2020","unstructured":"Grill J-B, Strub F, Altch\u00e9 F, Tallec C, Richemond P, Buchatskaya E, Doersch C, Avila Pires B, Guo Z, Gheshlaghi Azar M, Piot B, Kavukcuoglu K, Munos R, Valko M (2020) Bootstrap your own latent-a new approach to self-supervised learning. Adv Neural Inf Process Syst 33:21271\u201321284","journal-title":"Adv Neural Inf Process Syst"},{"key":"3502_CR11","doi-asserted-by":"crossref","unstructured":"Chen X, He K (2021) Exploring simple siamese representation learning. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 15750\u201315758","DOI":"10.1109\/CVPR46437.2021.01549"},{"key":"3502_CR12","doi-asserted-by":"crossref","unstructured":"Wang Z, Liu C, Zhang S, Dou Q (2023) Foundation model for endoscopy video analysis via large-scale self-supervised pre-train. In: International conference on medical image computing and computer-assisted intervention, pp 101\u2013111","DOI":"10.1007\/978-3-031-43996-4_10"},{"issue":"3","key":"3502_CR13","doi-asserted-by":"publisher","first-page":"848","DOI":"10.1109\/TBME.2018.2859322","volume":"66","author":"PD Byrnes","year":"2019","unstructured":"Byrnes PD, Higgins WE (2019) Efficient bronchoscopic video summarization. IEEE Trans Biomed Eng 66(3):848\u2013863","journal-title":"IEEE Trans Biomed Eng"},{"issue":"6","key":"3502_CR14","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1007\/s11548-024-03098-y","volume":"19","author":"A Meyer","year":"2024","unstructured":"Meyer A, Mazellier J-P, Dana J, Padoy N (2024) On-the-fly point annotation for fast medical video labeling. Int J Comput Assist Radiol Surg 19(6):1093\u20131101","journal-title":"Int J Comput Assist Radiol Surg"},{"key":"3502_CR15","doi-asserted-by":"publisher","first-page":"231","DOI":"10.1007\/s11548-021-02311-6","volume":"16","author":"H Wang","year":"2021","unstructured":"Wang H, Pan X, Zhao H, Gao C, Liu N (2021) Hard frame detection for the automated clipping of surgical nasal endoscopic video. Int J Comput Assist Radiol Surg 16:231\u2013240","journal-title":"Int J Comput Assist Radiol Surg"},{"key":"3502_CR16","first-page":"2183","volume":"14","author":"V Raut","year":"2022","unstructured":"Raut V, Gunjan R (2022) Transfer learning based video summarization in wireless capsule endoscopy. Int J Inf Technol 14:2183\u20132190","journal-title":"Int J Inf Technol"},{"key":"3502_CR17","doi-asserted-by":"crossref","unstructured":"Ismail MMB, Bchir O, Emam AZ (2013) Endoscopy video summarization based on unsupervised learning and feature discrimination. In: 2013 Visual communications and image processing (VCIP), pp 1\u20136","DOI":"10.1109\/VCIP.2013.6706410"},{"key":"3502_CR18","first-page":"1","volume":"38","author":"I Mehmood","year":"2014","unstructured":"Mehmood I, Sajjad M, Baik SW (2014) Video summarization based tele-endoscopy: a service to efficiently manage visual data generated during wireless capsule endoscopy procedure. J Med Syst 38:1\u20139","journal-title":"J Med Syst"},{"issue":"10","key":"3502_CR19","doi-asserted-by":"publisher","first-page":"3407","DOI":"10.3390\/app10103407","volume":"10","author":"J Putten","year":"2020","unstructured":"Putten J, Struyvenberg M, Groof J, Curvers W, Schoon E, Baldaque-Silva F, Bergman J, Sommen F, With PH (2020) Endoscopy-driven pretraining for classification of dysplasia in Barrett\u2019s esophagus with endoscopic narrow-band imaging zoom videos. Appl Sci 10(10):3407","journal-title":"Appl Sci"},{"key":"3502_CR20","doi-asserted-by":"publisher","first-page":"101900","DOI":"10.1016\/j.media.2020.101900","volume":"68","author":"S Ali","year":"2021","unstructured":"Ali S, Zhou F, Bailey A, Braden B, East JE, Lu X, Rittscher J (2021) A deep learning framework for quality assessment and restoration in video endoscopy. Med Image Anal 68:101900","journal-title":"Med Image Anal"},{"issue":"1","key":"3502_CR21","doi-asserted-by":"publisher","first-page":"94","DOI":"10.1136\/gutjnl-2017-314547","volume":"68","author":"MF Byrne","year":"2019","unstructured":"Byrne MF, Chapados N, Soudan F, Oertel C, P\u00e9rez ML, Kelly R, Iqbal N, Chandelier F, Rex DK (2019) Real-time differentiation of adenomatous and hyperplastic diminutive colorectal polyps during analysis of unaltered videos of standard colonoscopy using a deep learning model. Gut 68(1):94\u2013100","journal-title":"Gut"},{"issue":"1","key":"3502_CR22","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1109\/JBHI.2016.2637004","volume":"21","author":"L Yu","year":"2016","unstructured":"Yu L, Chen H, Dou Q, Qin J, Heng PA (2016) Integrating online and offline three-dimensional deep learning for automated polyp detection in colonoscopy videos. IEEE J Biomed Health Inf 21(1):65\u201375","journal-title":"IEEE J Biomed Health Inf"},{"key":"3502_CR23","doi-asserted-by":"publisher","first-page":"575","DOI":"10.1007\/s11548-013-0814-x","volume":"8","author":"J Liu","year":"2013","unstructured":"Liu J, Subramanian KR, Yoo TS (2013) A robust method to track colonoscopy videos with non-informative images. Int J Comput Assist Radiol Surg 8:575\u2013592","journal-title":"Int J Comput Assist Radiol Surg"},{"key":"3502_CR24","doi-asserted-by":"crossref","unstructured":"Boers T, Putten J, Struyvenberg M, Fockens K, Jukema J, Schoon E, Sommen F, Bergman J, With P (2020) Improving temporal stability and accuracy for endoscopic video tissue classification using recurrent neural networks. Sensors 20(15):4133","DOI":"10.3390\/s20154133"},{"key":"3502_CR25","doi-asserted-by":"crossref","unstructured":"Harada S, Hayashi H, Bise R, Tanaka K, Meng Q, Uchida S (2019) Endoscopic image clustering with temporal ordering information based on dynamic programming. In: International conference of the IEEE engineering in medicine and biology society, pp 3681\u20133684","DOI":"10.1109\/EMBC.2019.8857011"},{"key":"3502_CR26","doi-asserted-by":"crossref","unstructured":"Kumar S, Haresh S, Ahmed A, Konin A, Zia MZ, Tran Q-H (2022) Unsupervised action segmentation by joint representation learning and online clustering. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 20174\u201320185","DOI":"10.1109\/CVPR52688.2022.01954"},{"key":"3502_CR27","doi-asserted-by":"crossref","unstructured":"Bueno-Benito E, Vecino BT, Dimiccoli M (2023) Leveraging triplet loss for unsupervised action segmentation. In: Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, pp 4922\u20134930","DOI":"10.1109\/CVPRW59228.2023.00520"},{"key":"3502_CR28","doi-asserted-by":"crossref","unstructured":"Barbed OL, Montiel JM, Fua P, Murillo AC (2023) Tracking adaptation to improve superpoint for 3d reconstruction in endoscopy. In: International conference on medical image computing and computer-assisted intervention, pp 583\u2013593","DOI":"10.1007\/978-3-031-43907-0_56"}],"container-title":["International Journal of Computer Assisted Radiology and Surgery"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11548-025-03502-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11548-025-03502-1","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11548-025-03502-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T10:21:17Z","timestamp":1774866077000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11548-025-03502-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,27]]},"references-count":28,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2026,3]]}},"alternative-id":["3502"],"URL":"https:\/\/doi.org\/10.1007\/s11548-025-03502-1","relation":{},"ISSN":["1861-6429"],"issn-type":[{"value":"1861-6429","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,8,27]]},"assertion":[{"value":"10 January 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 August 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 August 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}