{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T09:42:37Z","timestamp":1769852557116,"version":"3.49.0"},"reference-count":69,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2025,5,1]],"date-time":"2025-05-01T00:00:00Z","timestamp":1746057600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,5,12]],"date-time":"2025-05-12T00:00:00Z","timestamp":1747008000000},"content-version":"vor","delay-in-days":11,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J. King Saud Univ. Comput. Inf. Sci."],"published-print":{"date-parts":[[2025,5]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Anatomical landmark detection is crucial in medical image analysis, facilitating accurate diagnosis, surgical planning, and treatment evaluation. However, existing methods often struggle to simultaneously capture global context and local details while exhibiting limited generalization across diverse datasets and imaging modalities. To relieve this, we propose a hybrid model that leverages convolutional operations to capture local information and a Swin Transformer to enhance global context. Specifically, we introduce a novel U-shaped architecture, termed Convolutional Attention Swin Enhanced Landmark Detection Network (CASEMark). CASEMark integrates three key innovations: (1) a Convolutional Attention Swin Transformer module (CAST) that integrates transformer-based global context modeling with convolutional operations for local feature extraction, (2) an Enhanced Skip Attention Module (ESAM) enabling adaptive feature fusion between encoder and decoder pathways, and (3) a multi-resolution heatmap learning strategy that aggregates information across scales. This approach effectively balances global-local feature extraction with robust cross-modality generalization. Extensive experiments on four public datasets demonstrate the superiority of CASEMark. The code and datasets will be made publicly available.<\/jats:p>","DOI":"10.1007\/s44443-025-00031-4","type":"journal-article","created":{"date-parts":[[2025,5,12]],"date-time":"2025-05-12T09:50:02Z","timestamp":1747043402000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["CASEMark: A hybrid model for robust anatomical landmark detection in multi-structure X-rays"],"prefix":"10.1007","volume":"37","author":[{"given":"Zhen","family":"Huang","sequence":"first","affiliation":[]},{"given":"Xiaoqian","family":"Zhou","sequence":"additional","affiliation":[]},{"given":"Xu","family":"He","sequence":"additional","affiliation":[]},{"given":"Yangbo","family":"Wei","sequence":"additional","affiliation":[]},{"given":"Wenkai","family":"Yang","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3450-0826","authenticated-orcid":false,"given":"Suhua","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Xiaoxin","family":"Sun","sequence":"additional","affiliation":[]},{"given":"Han","family":"Li","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,5,12]]},"reference":[{"issue":"2","key":"31_CR1","doi-asserted-by":"publisher","first-page":"547","DOI":"10.1007\/s10278-022-00718-4","volume":"36","author":"Y Ao","year":"2023","unstructured":"Ao Y, Wu H (2023) Feature aggregation and refinement network for 2d anatomical landmark detection. J Digit Imaging 36(2):547\u2013561","journal-title":"J Digit Imaging"},{"key":"31_CR2","unstructured":"Beal J, Kim E, Tzeng E, Park DH, Zhai A, Kislyuk D (2020) Toward transformer-based object detection. arXiv:2012.09958"},{"key":"31_CR3","doi-asserted-by":"crossref","unstructured":"Cai Z, Vasconcelos N (2018) Cascade r-cnn: Delving into high quality object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp. 6154\u20136162","DOI":"10.1109\/CVPR.2018.00644"},{"key":"31_CR4","doi-asserted-by":"crossref","unstructured":"Cao H, Wang Y, Chen J, Jiang D, Zhang X, Tian Q, Wang M (2022) Swin-unet: Unet-like pure transformer for medical image segmentation. In: European conference on computer vision. pp. 205\u2013218. Springer","DOI":"10.1007\/978-3-031-25066-8_9"},{"issue":"2","key":"31_CR5","doi-asserted-by":"publisher","first-page":"275","DOI":"10.1109\/42.293920","volume":"13","author":"J Cardillo","year":"1994","unstructured":"Cardillo J, Sid-Ahmed MA (1994) An image processing system for locating craniofacial landmarks. IEEE Trans Med Imaging 13(2):275\u2013289","journal-title":"IEEE Trans Med Imaging"},{"key":"31_CR6","unstructured":"Chiras, J., Depriester, C., Weill, A., Sola-Martinez, M., Deramond, H.: Percutaneous vertebral surgery. technics and indications. Journal of neuroradiology= Journal de neuroradiologie 24(1), 45\u201359 (1997)"},{"key":"31_CR7","unstructured":"Dosovitskiy A (2020) An image is worth 16x16 words: Transformers for image recognition at scale. arXiv:2010.11929"},{"issue":"3","key":"31_CR8","doi-asserted-by":"publisher","first-page":"609","DOI":"10.1016\/j.patcog.2003.09.002","volume":"37","author":"I El-Feghi","year":"2004","unstructured":"El-Feghi I, Sid-Ahmed MA, Ahmadi M (2004) Automatic localization of craniofacial landmarks for assisted cephalometry. Pattern Recogn 37(3):609\u2013621","journal-title":"Pattern Recogn"},{"key":"31_CR9","doi-asserted-by":"crossref","unstructured":"Gallego P, Ambroa E, P\u00e9rezAlija J, Jornet N, Anson C, Tejedor N, Vivancos H, Ruiz A, Barcel\u00f3 M, Dominguez A et\u00a0al (2024) Breast radiotherapy planning: A decision-making framework using deep learning. Medical Physics","DOI":"10.1002\/mp.17527"},{"key":"31_CR10","doi-asserted-by":"crossref","unstructured":"Grau V, Alcaniz M, Juan M, Monserrat C, Knoll C (2001) Automatic localization of cephalometric landmarks. J Biomed Inform 34(3):146\u2013156","DOI":"10.1006\/jbin.2001.1014"},{"key":"31_CR11","doi-asserted-by":"crossref","unstructured":"Han D, Gao Y, Wu G, Yap PT, Shen D (2014) Robust anatomical landmark detection for mr brain image registration. In: Medical Image Computing and Computer-Assisted Intervention\u2013MICCAI 2014: 17th International Conference, Boston, MA, USA, September 14-18, 2014, Proceedings, Part I 17. pp. 186\u2013193. Springer","DOI":"10.1007\/978-3-319-10404-1_24"},{"issue":"8","key":"31_CR12","doi-asserted-by":"publisher","first-page":"101637","DOI":"10.1016\/j.jksuci.2023.101637","volume":"35","author":"W Hao","year":"2023","unstructured":"Hao W, Liu S, Lv C, Wang Y, Wang J (2023) Efficient and privacy-preserving multi-party skyline queries in online medical primary diagnosis. J King Saud University-Comput Inf Sci 35(8):101637","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"5","key":"31_CR13","doi-asserted-by":"publisher","first-page":"939","DOI":"10.1007\/s11548-024-03089-z","volume":"19","author":"Z Huang","year":"2024","unstructured":"Huang Z, Li H, Shao S, Zhu H, Hu H, Cheng Z, Wang J, Kevin Zhou S (2024) Pele scores: pelvic x-ray landmark detection with pelvis extraction and enhancement. Int J Comput Assist Radiol Surg 19(5):939\u2013950","journal-title":"Int J Comput Assist Radiol Surg"},{"key":"31_CR14","doi-asserted-by":"crossref","unstructured":"Huang Z, Wang S, Hu H, Xu Y (2024b) Retigan: A hybrid image enhancement method for medical images. In: 2024 5th International Conference on Computer Vision, Image and Deep Learning (CVIDL). pp. 25\u201329. IEEE","DOI":"10.1109\/CVIDL62147.2024.10603883"},{"issue":"7","key":"31_CR15","doi-asserted-by":"publisher","first-page":"102144","DOI":"10.1016\/j.jksuci.2024.102144","volume":"36","author":"H Jiang","year":"2024","unstructured":"Jiang H, Chen Q, Wang R, Du J, Chen T (2024) Swformer: a scale-wise hybrid cnn-transformer network for multi-classes weed segmentation. J King Saud University-Comput Inf Sci 36(7):102144","journal-title":"J King Saud University-Comput Inf Sci"},{"key":"31_CR16","doi-asserted-by":"crossref","unstructured":"Jiang Y, Li Y, Wang X, Tao Y, Lin J, Lin H (2022) Cephalformer: incorporating global structure constraint into visual features for general cephalometric landmark detection. In: International conference on medical image computing and computer-assisted intervention. pp. 227\u2013237. Springer","DOI":"10.1007\/978-3-031-16437-8_22"},{"key":"31_CR17","doi-asserted-by":"crossref","unstructured":"Kasturi A, Vosoughi A, Hadjiyski N, Stockmaster L, Sehnert WJ, Wism\u00fcller A (2024) Anatomical landmark detection in chest x-ray images using transformer-based networks. In: Medical imaging 2024: computer-aided diagnosis. vol. 12927, pp. 647\u2013660. SPIE","DOI":"10.1117\/12.3006881"},{"key":"31_CR18","doi-asserted-by":"crossref","unstructured":"Keustermans J, Mollemans W, Vandermeulen D, Suetens P (2010) Automated cephalometric landmark identification using shape and local appearance models. In: 2010 20th International conference on pattern recognition. pp. 2464\u20132467. IEEE","DOI":"10.1109\/ICPR.2010.603"},{"key":"31_CR19","unstructured":"Khalid MA, Zulfiqar K, Bashir U, Shaheen A, Iqbal R, Rizwan Z, Rizwan G, Fraz MM (2022) Cepha29: Automatic cephalometric landmark detection challenge 2023. arXiv:2212.04808"},{"issue":"7","key":"31_CR20","doi-asserted-by":"publisher","first-page":"4335","DOI":"10.1016\/j.jksuci.2022.05.006","volume":"34","author":"S Khan","year":"2022","unstructured":"Khan S, Fazil M, Sejwal VK, Alshara MA, Alotaibi RM, Kamal A, Baig AR (2022) Bichat: Bilstm with deep cnn and hierarchical attention for hate speech detection. J King Saud University-Comput Inf Sci 34(7):4335\u20134344","journal-title":"J King Saud University-Comput Inf Sci"},{"key":"31_CR21","doi-asserted-by":"crossref","unstructured":"Lee H, Park M, Kim J (2017) Cephalometric landmark detection in dental x-ray images using convolutional neural networks. In: Medical imaging 2017: Computer-aided diagnosis. vol. 10134, pp. 494\u2013499. SPIE","DOI":"10.1117\/12.2255870"},{"issue":"9","key":"31_CR22","doi-asserted-by":"publisher","first-page":"6176","DOI":"10.1002\/mp.17235","volume":"51","author":"Y Lei","year":"2024","unstructured":"Lei Y, Matkovic LA, Roper J, Wang T, Zhou J, Ghavidel B, McDonald M, Patel P, Yang X (2024) Diffeomorphic transformer-based abdomen mri-ct deformable image registration. Med Phys 51(9):6176\u20136184","journal-title":"Med Phys"},{"issue":"9","key":"31_CR23","doi-asserted-by":"publisher","first-page":"1862","DOI":"10.1109\/TPAMI.2014.2382106","volume":"37","author":"C Lindner","year":"2014","unstructured":"Lindner C, Bromiley PA, Ionita MC, Cootes TF (2014) Robust and accurate shape model matching using random forest regression-voting. IEEE Trans Pattern Anal Mach Intell 37(9):1862\u20131874","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"31_CR24","doi-asserted-by":"crossref","unstructured":"Liu D, Zhou KS, Bernhardt D, Comaniciu D (2010) Search strategies for multiple landmark detection by submodular maximization. In: 2010 IEEE computer society conference on computer vision and pattern recognition. pp. 2831\u20132838. IEEE","DOI":"10.1109\/CVPR.2010.5540016"},{"key":"31_CR25","doi-asserted-by":"crossref","unstructured":"Liu D, Zhou KS, Bernhardt D, Comaniciu D (2010) Search strategies for multiple landmark detection by submodular maximization. In: 2010 IEEE computer society conference on computer vision and pattern recognition. pp. 2831\u20132838. IEEE","DOI":"10.1109\/CVPR.2010.5540016"},{"key":"31_CR26","doi-asserted-by":"crossref","unstructured":"Lowe DG (1999a) Object recognition from local scale-invariant features. In: Proceedings of the seventh IEEE international conference on computer vision. vol.\u00a02, pp. 1150\u20131157. Ieee","DOI":"10.1109\/ICCV.1999.790410"},{"key":"31_CR27","doi-asserted-by":"crossref","unstructured":"Lowe DG (1999b) Object recognition from local scale-invariant features. In: Proceedings of the seventh IEEE international conference on computer vision. vol.\u00a02, pp. 1150\u20131157. Ieee","DOI":"10.1109\/ICCV.1999.790410"},{"key":"31_CR28","doi-asserted-by":"crossref","unstructured":"Lu X, Yang R, Zhou J, Jiao J, Liu F, Liu Y, Su B, Gu P (2022) A hybrid model of ghost-convolution enlightened transformer for effective diagnosis of grape leaf disease and pest. J King Saud University-Comput Inf Sci 34(5):1755\u20131767","DOI":"10.1016\/j.jksuci.2022.03.006"},{"issue":"9","key":"31_CR29","doi-asserted-by":"publisher","first-page":"102203","DOI":"10.1016\/j.jksuci.2024.102203","volume":"36","author":"Y Lu","year":"2024","unstructured":"Lu Y, Fan X, Wang J, Chen S, Meng J (2024) Parau-net: An improved unet parallel coding network for lung nodule segmentation. J King Saud University-Comput Inf Sci 36(9):102203","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"1","key":"31_CR30","doi-asserted-by":"publisher","first-page":"332","DOI":"10.1109\/TMI.2016.2597270","volume":"36","author":"O Oktay","year":"2016","unstructured":"Oktay O, Bai W, Guerrero R, Rajchl M, De Marvao A, O\u2019Regan DP, Cook SA, Heinrich MP, Glocker B, Rueckert D (2016) Stratified decision forests for accurate anatomical landmark localization in cardiac images. IEEE Trans Med Imaging 36(1):332\u2013342","journal-title":"IEEE Trans Med Imaging"},{"key":"31_CR31","doi-asserted-by":"crossref","unstructured":"O\u2019Neil AQ, Kascenas A, Henry J, Wyeth D, Shepherd M, Beveridge E, Clunie L, Sansom C, Seduikyte Keith\u00a0Muir E, Poole I (2018) Attaining human-level performance with atlas location autocontext for anatomical landmark detection in 3d ct data. In: Proceedings of the European Conference on Computer Vision (ECCV) Workshops","DOI":"10.1007\/978-3-030-11015-4_34"},{"issue":"10","key":"31_CR32","doi-asserted-by":"publisher","first-page":"102262","DOI":"10.1016\/j.jksuci.2024.102262","volume":"36","author":"Z Pan","year":"2024","unstructured":"Pan Z, Gu J, Wang W, Fang X, Xia Z, Wang Q, Wang M (2024) Picking point identification and localization method based on swin-transformer for high-quality tea. J King Saud University-Comput Inf Sci 36(10):102262","journal-title":"J King Saud University-Comput Inf Sci"},{"key":"31_CR33","doi-asserted-by":"crossref","unstructured":"Payer C, \u0160tern D, Bischof H, Urschler M (2016) Regressing heatmaps for multiple landmark localization using cnns. In: International conference on medical image computing and computer-assisted intervention. pp. 230\u2013238. Springer","DOI":"10.1007\/978-3-319-46723-8_27"},{"key":"31_CR34","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1016\/j.media.2019.03.007","volume":"54","author":"C Payer","year":"2019","unstructured":"Payer C, \u0160tern D, Bischof H, Urschler M (2019) Integrating spatial configuration into heatmap regression based cnns for landmark localization. Med Image Anal 54:207\u2013219","journal-title":"Med Image Anal"},{"issue":"6","key":"31_CR35","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Ren","year":"2016","unstructured":"Ren S, He K, Girshick R, Sun J (2016) Faster r-cnn: Towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137\u20131149","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"31_CR36","doi-asserted-by":"crossref","unstructured":"Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: Medical image computing and computer-assisted intervention\u2013MICCAI 2015: 18th international conference, Munich, Germany, October 5-9, 2015, proceedings, part III 18. pp. 234\u2013241. Springer","DOI":"10.1007\/978-3-319-24574-4_28"},{"issue":"2","key":"31_CR37","doi-asserted-by":"publisher","first-page":"173","DOI":"10.1016\/S0889-5406(98)70289-6","volume":"113","author":"D Rudolph","year":"1998","unstructured":"Rudolph D, Sinclair P, Coggins J (1998) Automatic computerized radiographic identification of cephalometric landmarks. Am J Orthod Dentofac Orthop 113(2):173\u2013179","journal-title":"Am J Orthod Dentofac Orthop"},{"issue":"6","key":"31_CR38","doi-asserted-by":"publisher","first-page":"102086","DOI":"10.1016\/j.jksuci.2024.102086","volume":"36","author":"EK Rutoh","year":"2024","unstructured":"Rutoh EK, Guang QZ, Bahadar N, Raza R, Hanif MS (2024) Gair-u-net: 3d guided attention inception residual u-net for brain tumor segmentation using multimodal mri images. J King Saud University-Comput Inf Sci 36(6):102086","journal-title":"J King Saud University-Comput Inf Sci"},{"key":"31_CR39","first-page":"51","volume":"6","author":"A Saad","year":"2006","unstructured":"Saad A, El-Bialy A, Kandil A, Sayed A (2006) Automatic cephalometric analysis using active appearance model and simulated annealing. ICGST Int J on Graphics, Vision and Image Processing, Special Issue on Image Retrieval and Representation 6:51\u201367","journal-title":"ICGST Int J on Graphics, Vision and Image Processing, Special Issue on Image Retrieval and Representation"},{"key":"31_CR40","unstructured":"Shao S, Yuan X, Huang Z, Qiu Z, Wang S, Zhou K (2023) Diffuseexpand: Expanding dataset for 2d medical image segmentation using diffusion models. arXiv:2304.13416"},{"issue":"8","key":"31_CR41","doi-asserted-by":"publisher","first-page":"102194","DOI":"10.1016\/j.jksuci.2024.102194","volume":"36","author":"Q Song","year":"2024","unstructured":"Song Q, Dai Y, Lu H, Jin G (2024) High-throughput systolic array-based accelerator for hybrid transformer-cnn networks. J King Saud University-Comput Inf Sci 36(8):102194","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"8","key":"31_CR42","doi-asserted-by":"publisher","first-page":"102194","DOI":"10.1016\/j.jksuci.2024.102194","volume":"36","author":"Q Song","year":"2024","unstructured":"Song Q, Dai Y, Lu H, Jin G (2024) High-throughput systolic array-based accelerator for hybrid transformer-cnn networks. J King Saud University-Comput Inf Sci 36(8):102194","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"23","key":"31_CR43","doi-asserted-by":"publisher","first-page":"7505","DOI":"10.1088\/0031-9155\/56\/23\/011","volume":"56","author":"D \u0160tern","year":"2011","unstructured":"\u0160tern D, Likar B, Pernu\u0161 F, Vrtovec T (2011) Parametric modelling and segmentation of vertebral bodies in 3d ct and mr spine images. Phys Med Biol 56(23):7505","journal-title":"Phys Med Biol"},{"issue":"5","key":"31_CR44","doi-asserted-by":"publisher","first-page":"101560","DOI":"10.1016\/j.jksuci.2023.101560","volume":"35","author":"H Sultan","year":"2023","unstructured":"Sultan H, Owais M, Nam SH, Haider A, Akram R, Usman M, Park KR (2023) Mdfu-net: multiscale dilated features up-sampling network for accurate segmentation of tumor from heterogeneous brain data. J King Saud University-Comput Inf Sci 35(5):101560","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"5","key":"31_CR45","doi-asserted-by":"publisher","first-page":"101553","DOI":"10.1016\/j.jksuci.2023.04.006","volume":"35","author":"J Sun","year":"2023","unstructured":"Sun J, Zhao J, Wu X, Tang C, Wang S, Zhang Y (2023) Dsga-net: Deeply separable gated transformer and attention strategy for medical image segmentation network. J King Saud University-Comput Inf Sci 35(5):101553","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"4","key":"31_CR46","doi-asserted-by":"publisher","first-page":"102012","DOI":"10.1016\/j.jksuci.2024.102012","volume":"36","author":"Z Tan","year":"2024","unstructured":"Tan Z, Madzin H, Norafida B, Rahmat RWO, Khalid F, Sulaiman PS (2024) Swinunelcst: Global-local spatial representation learning with hybrid cnn-transformer for efficient tuberculosis lung cavity weakly supervised semantic segmentation. J King Saud University-Comput Inf Sci 36(4):102012","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"4","key":"31_CR47","doi-asserted-by":"publisher","first-page":"102012","DOI":"10.1016\/j.jksuci.2024.102012","volume":"36","author":"Z Tan","year":"2024","unstructured":"Tan Z, Madzin H, Norafida B, Rahmat RWO, Khalid F, Sulaiman PS (2024) Swinunelcst: Global-local spatial representation learning with hybrid cnn-transformer for efficient tuberculosis lung cavity weakly supervised semantic segmentation. J King Saud University-Comput Inf Sci 36(4):102012","journal-title":"J King Saud University-Comput Inf Sci"},{"key":"31_CR48","doi-asserted-by":"crossref","unstructured":"Valanarasu JMJ, Oza P, Hacihaliloglu I, Patel VM (2021) Medical transformer: Gated axial-attention for medical image segmentation. In: Medical image computing and computer assisted intervention\u2013MICCAI 2021: 24th international conference, Strasbourg, France, September 27\u2013October 1, 2021, proceedings, part I 24. pp. 36\u201346. Springer","DOI":"10.1007\/978-3-030-87193-2_4"},{"key":"31_CR49","unstructured":"Vaswani A (2017) Attention is all you need. Advances in neural information processing systems"},{"key":"31_CR50","doi-asserted-by":"crossref","unstructured":"Viriyasaranon T, Ma S, Choi JH (2023) Anatomical landmark detection using a multiresolution learning approach with a hybrid transformer-cnn model. In: International conference on medical image computing and computer-assisted intervention. pp. 433\u2013443. Springer","DOI":"10.1007\/978-3-031-43987-2_42"},{"issue":"9","key":"31_CR51","doi-asserted-by":"publisher","first-page":"1890","DOI":"10.1109\/TMI.2015.2412951","volume":"34","author":"CW Wang","year":"2015","unstructured":"Wang CW, Huang CT, Hsieh MC, Li CH, Chang SW, Li WC, Vandaele R, Mar\u00e9e R, Jodogne S, Geurts P et al (2015) Evaluation and comparison of anatomical landmark detection methods for cephalometric x-ray images: a grand challenge. IEEE Trans Med Imaging 34(9):1890\u20131900","journal-title":"IEEE Trans Med Imaging"},{"key":"31_CR52","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1016\/j.media.2016.02.004","volume":"31","author":"CW Wang","year":"2016","unstructured":"Wang CW, Huang CT, Lee JH, Li CH, Chang SW, Siao MJ, Lai TM, Ibragimov B, Vrtovec T, Ronneberger O et al (2016) A benchmark for comparison of dental radiography analysis algorithms. Med Image Anal 31:63\u201376","journal-title":"Med Image Anal"},{"key":"31_CR53","doi-asserted-by":"publisher","first-page":"6","DOI":"10.3934\/era.2023160","volume":"31","author":"S Wang","year":"2023","unstructured":"Wang S, Huang Z, Zhang B, Heng X, Jiang Y, Sun X (2023) Plot-aware transformer for recommender systems. Electron Res Arch 31:6","journal-title":"Electron Res Arch"},{"issue":"10","key":"31_CR54","doi-asserted-by":"publisher","first-page":"102250","DOI":"10.1016\/j.jksuci.2024.102250","volume":"36","author":"X Wen","year":"2024","unstructured":"Wen X, Zhang A, Lin C, Pang X (2024) Crnet: Cascaded refinement network for polyp segmentation. J King Saud University-Comput Inf Sci 36(10):102250","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"9","key":"31_CR55","doi-asserted-by":"publisher","first-page":"5880","DOI":"10.1007\/s00330-022-08692-9","volume":"32","author":"C Wu","year":"2022","unstructured":"Wu C, Meng G, Lian J, Xu J, Gao M, Huang C, Zhang S, Zhang Y, Yu Y, Wang H et al (2022) A multi-stage ensemble network system to diagnose adolescent idiopathic scoliosis. Eur Radiol 32(9):5880\u20135889","journal-title":"Eur Radiol"},{"key":"31_CR56","doi-asserted-by":"crossref","unstructured":"Wu H, Wang C, Mei L, Yang T, Zhu M, Shen D, Cui Z (2024) Cephalometric landmark detection across ages with prototypical network. In: International conference on medical image computing and computer-assisted intervention. pp. 155\u2013165. Springer","DOI":"10.1007\/978-3-031-72086-4_15"},{"key":"31_CR57","unstructured":"Xu Y, Zhang J, Zhang Q, Tao D (2022) Vitpose: Simple vision transformer baselines for human pose estimation. Adv Neural Inf Process Syst 35:38571\u201338584"},{"key":"31_CR58","doi-asserted-by":"crossref","unstructured":"Yang S, Quan Z, Nie M, Yang W (2021) Transpose: Keypoint localization via transformer. In: Proceedings of the IEEE\/CVF international conference on computer vision. pp. 11802\u201311812","DOI":"10.1109\/ICCV48922.2021.01159"},{"key":"31_CR59","doi-asserted-by":"crossref","unstructured":"Yi S, Chen Z, Yi L, She F (2023) Cas: Breast cancer diagnosis framework based on lesion region recognition in ultrasound images. J King Saud University-Comput Inf Sci 35(8):101707","DOI":"10.1016\/j.jksuci.2023.101707"},{"key":"31_CR60","doi-asserted-by":"crossref","unstructured":"Yueyuan A, Hong W (2021) Swin transformer combined with convolutional encoder for cephalometric landmarks detection. In: 2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP). pp. 184\u2013187. IEEE","DOI":"10.1109\/ICCWAMTIP53232.2021.9674147"},{"issue":"1","key":"31_CR61","doi-asserted-by":"publisher","first-page":"101917","DOI":"10.1016\/j.jksuci.2024.101917","volume":"36","author":"M Zhang","year":"2024","unstructured":"Zhang M, Liu D, Sun Q, Han Y, Liu B, Zhang J, Zhang M (2024) Augmented transformer network for mri brain tumor segmentation. J King Saud University-Comput Inf Sci 36(1):101917","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"1","key":"31_CR62","doi-asserted-by":"publisher","first-page":"101873","DOI":"10.1016\/j.jksuci.2023.101873","volume":"36","author":"Z Zhang","year":"2024","unstructured":"Zhang Z, Lu Y, Ye M, Huang W, Jin L, Zhang G, Ge Y, Baghban A, Zhang Q, Wang H et al (2024) A novel evolutionary ensemble prediction model using harmony search and stacking for diabetes diagnosis. J King Saud University-Comput Inf Sci 36(1):101873","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"8","key":"31_CR63","doi-asserted-by":"publisher","first-page":"101670","DOI":"10.1016\/j.jksuci.2023.101670","volume":"35","author":"J Zhao","year":"2023","unstructured":"Zhao J, Zhu H, Niu L (2023) Bitnet: a lightweight object detection network for real-time classroom behavior recognition with transformer and bi-directional pyramid network. J King Saud University-Comput Inf Sci 35(8):101670","journal-title":"J King Saud University-Comput Inf Sci"},{"issue":"11","key":"31_CR64","doi-asserted-by":"publisher","first-page":"1333","DOI":"10.3390\/bioengineering10111333","volume":"10","author":"M Zhao","year":"2023","unstructured":"Zhao M, Meng N, Cheung JPY, Yu C, Lu P, Zhang T (2023) Spinehrformer: A transformer-based deep learning model for automatic spine deformity assessment with prospective validation. Bioengineering 10(11):1333","journal-title":"Bioengineering"},{"key":"31_CR65","volume-title":"Handbook of medical image computing and computer assisted intervention","author":"SK Zhou","year":"2019","unstructured":"Zhou SK, Rueckert D, Fichtinger G (2019) Handbook of medical image computing and computer assisted intervention. Academic Press, New York"},{"key":"31_CR66","doi-asserted-by":"crossref","unstructured":"Zhu H, Yao Q, Xiao L, Zhou SK (2021) You only learn once: Universal anatomical landmark detection. In: Medical Image Computing and Computer Assisted Intervention\u2013MICCAI 2021: 24th International Conference, Strasbourg, France, September 27\u2013October 1, 2021, Proceedings, Part V 24. pp. 85\u201395. Springer","DOI":"10.1007\/978-3-030-87240-3_9"},{"key":"31_CR67","unstructured":"Zhu H, Yao Q, Zhou SK (2022) Datr: Domain-adaptive transformer for multi-domain landmark detection. arXiv:2203.06433"},{"issue":"2","key":"31_CR68","doi-asserted-by":"publisher","first-page":"771","DOI":"10.1002\/mp.17500","volume":"52","author":"J Zhu","year":"2025","unstructured":"Zhu J, Zhang X, Su T, Cui H, Tan Y, Huang H, Guo J, Zheng H, Liang D, Wu G et al (2025) Mmd-net: Image domain multi-material decomposition network for dual-energy ct imaging. Med Phys 52(2):771\u201386","journal-title":"Med Phys"},{"key":"31_CR69","unstructured":"Zhu X, Su W, Lu L, Li B, Wang X, Dai J (2020) Deformable detr: Deformable transformers for end-to-end object detection. arXiv:2010.04159"}],"container-title":["Journal of King Saud University Computer and Information Sciences"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44443-025-00031-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s44443-025-00031-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s44443-025-00031-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,30]],"date-time":"2025-05-30T11:46:29Z","timestamp":1748605589000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s44443-025-00031-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,5]]},"references-count":69,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2025,5]]}},"alternative-id":["31"],"URL":"https:\/\/doi.org\/10.1007\/s44443-025-00031-4","relation":{},"ISSN":["1319-1578","2213-1248"],"issn-type":[{"value":"1319-1578","type":"print"},{"value":"2213-1248","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,5]]},"assertion":[{"value":"24 January 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 March 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 May 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. The IRB approval is obtained.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical Approval"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing Interests"}}],"article-number":"37"}}