{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,21]],"date-time":"2025-12-21T10:04:17Z","timestamp":1766311457745,"version":"build-2065373602"},"reference-count":29,"publisher":"MDPI AG","issue":"22","license":[{"start":{"date-parts":[[2022,11,9]],"date-time":"2022-11-09T00:00:00Z","timestamp":1667952000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100003718","name":"Korea Institute of Oriental Medicine","doi-asserted-by":"publisher","award":["KSN2022130"],"award-info":[{"award-number":["KSN2022130"]}],"id":[{"id":"10.13039\/501100003718","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>With the prevalence of degenerative diseases due to the increase in the aging population, we have encountered many spine-related disorders. Since the spine is a crucial part of the body, fast and accurate diagnosis is critically important. Generally, clinicians use X-ray images to diagnose the spine, but X-ray images are commonly occluded by the shadows of some bones, making it hard to identify the whole spine. Therefore, recently, various deep-learning-based spinal X-ray image analysis approaches have been proposed to help diagnose the spine. However, these approaches did not consider the characteristics of frequent occlusion in the X-ray image and the properties of the vertebra shape. Therefore, based on the X-ray image properties and vertebra shape, we present a novel landmark detection network specialized in lumbar X-ray images. The proposed network consists of two stages: The first step detects the centers of the lumbar vertebrae and the upper end plate of the first sacral vertebra (S1), and the second step detects the four corner points of each lumbar vertebra and two corner points of S1 from the image obtained in the first step. We used random spine cutout augmentation in the first step to robustify the network against the commonly obscured X-ray images. Furthermore, in the second step, we used CoordConv to make the network recognize the location distribution of landmarks and part affinity fields to understand the morphological features of the vertebrae, resulting in more accurate landmark detection. The proposed network was evaluated using 304 X-ray images, and it achieved 98.02% accuracy in center detection and 8.34% relative distance error in corner detection. This indicates that our network can detect spinal landmarks reliably enough to support radiologists in analyzing the lumbar X-ray images.<\/jats:p>","DOI":"10.3390\/s22228628","type":"journal-article","created":{"date-parts":[[2022,11,10]],"date-time":"2022-11-10T02:11:15Z","timestamp":1668046275000},"page":"8628","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Part Affinity Fields and CoordConv for Detecting Landmarks of Lumbar Vertebrae and Sacrum in X-ray Images"],"prefix":"10.3390","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2010-0186","authenticated-orcid":false,"given":"Chang-Hyeon","family":"An","sequence":"first","affiliation":[{"name":"Intelligent Computer Vision Software Laboratory (ICVSLab), Department of Electronic Engineering, Yeungnam University, 280 Daehak-Ro, Gyeongsan 38541, Gyeongbuk, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2530-6033","authenticated-orcid":false,"given":"Jeong-Sik","family":"Lee","sequence":"additional","affiliation":[{"name":"Intelligent Computer Vision Software Laboratory (ICVSLab), Department of Electronic Engineering, Yeungnam University, 280 Daehak-Ro, Gyeongsan 38541, Gyeongbuk, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7639-5884","authenticated-orcid":false,"given":"Jun-Su","family":"Jang","sequence":"additional","affiliation":[{"name":"Korea Institute of Oriental Medicine, 1672 Yuseong-daero, Yuseong-gu, Daejeon 34054, South Chungcheong, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3991-9466","authenticated-orcid":false,"given":"Hyun-Chul","family":"Choi","sequence":"additional","affiliation":[{"name":"Intelligent Computer Vision Software Laboratory (ICVSLab), Department of Electronic Engineering, Yeungnam University, 280 Daehak-Ro, Gyeongsan 38541, Gyeongbuk, Korea"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,11,9]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Yi, J., Wu, P., Huang, Q., Qu, H., and Metaxas, D.N. (2020, January 3\u20137). Vertebra-focused landmark detection for scoliosis assessment. Proceedings of the 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI), Iowa City, IA, USA.","DOI":"10.1109\/ISBI45749.2020.9098675"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"7618","DOI":"10.1038\/s41598-021-87141-x","article-title":"Deep learning approach for automatic landmark detection and alignment analysis in whole-spine lateral radiographs","volume":"11","author":"Yeh","year":"2021","journal-title":"Sci. Rep."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1177\/2192568219868190","article-title":"Automated measurement of lumbar lordosis on radiographs using machine learning and computer vision","volume":"10","author":"Cho","year":"2020","journal-title":"Glob. Spine J."},{"key":"ref_4","unstructured":"Tran, V.L., Lin, H.Y., and Liu, H.W. (December, January 30). MBNet: A multi-task deep neural network for semantic segmentation and lumbar vertebra inspection on X-ray images. Proceedings of the Asian Conference on Computer Vision, Kyoto, Japan."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"105833","DOI":"10.1016\/j.cmpb.2020.105833","article-title":"Automatic detection and segmentation of lumbar vertebrae from X-ray images for compression fracture evaluation","volume":"200","author":"Kim","year":"2021","journal-title":"Comput. Methods Programs Biomed."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"9482","DOI":"10.1038\/s41598-021-89102-w","article-title":"2-step deep learning model for landmarks localization in spine radiographs","volume":"11","author":"Cina","year":"2021","journal-title":"Sci. Rep."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Khanal, B., Dahal, L., Adhikari, P., and Khanal, B. (2019). Automatic cobb angle detection using vertebra detector and vertebra corners regression. International Workshop and Challenge on Computational Methods and Clinical Applications for Spine Imaging, Springer.","DOI":"10.1007\/978-3-030-39752-4_9"},{"key":"ref_8","unstructured":"DeVries, T., and Taylor, G.W. (2017). Improved regularization of convolutional neural networks with cutout. arXiv."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1597","DOI":"10.1109\/TMI.2018.2791488","article-title":"Joint optic disc and cup segmentation based on multi-label deep network and polar transformation","volume":"37","author":"Fu","year":"2018","journal-title":"IEEE Trans. Med Imaging"},{"key":"ref_10","first-page":"9628","article-title":"An intriguing failing of convolutional neural networks and the coordconv solution","volume":"31","author":"Liu","year":"2018","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Toshev, A., and Szegedy, C. (2014, January 23\u201328). Deeppose: Human pose estimation via deep neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.214"},{"key":"ref_12","first-page":"1799","article-title":"Joint training of a convolutional network and a graphical model for human pose estimation","volume":"27","author":"Tompson","year":"2014","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Cao, Z., Simon, T., Wei, S.E., and Sheikh, Y. (2017, January 21\u201326). Realtime multi-person 2d pose estimation using part affinity fields. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.143"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"3349","DOI":"10.1109\/TPAMI.2020.2983686","article-title":"Deep high-resolution representation learning for visual recognition","volume":"43","author":"Wang","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"178","DOI":"10.1016\/S1076-6332(03)00671-8","article-title":"Statistical validation of image segmentation quality based on a spatial overlap index1: Scientific reports","volume":"11","author":"Zou","year":"2004","journal-title":"Acad. Radiol."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., and Sang, N. (2018, January 8\u201314). Bisenet: Bilateral segmentation network for real-time semantic segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01261-8_20"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"280","DOI":"10.1016\/j.neucom.2020.05.120","article-title":"Automated vertebral landmarks and spinal curvature estimation using non-directional part affinity fields","volume":"438","author":"Zhang","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_19","first-page":"91","article-title":"Faster r-cnn: Towards real-time object detection with region proposal networks","volume":"28","author":"Ren","year":"2015","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"He, J., Liu, W., Wang, Y., Ma, X., and Hua, X.S. (2021, January 9\u201312). SpineOne: A One-Stage Detection Framework for Degenerative Discs and Vertebrae. Proceedings of the 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Houston, TX, USA.","DOI":"10.1109\/BIBM52615.2021.9669541"},{"key":"ref_21","unstructured":"Lu, J.T., Pedemonte, S., Bizzo, B., Doyle, S., Andriole, K.P., Michalski, M.H., Gonzalez, R.G., and Pomerantz, S.R. (2018, January 17\u201318). Deep Spine: Automated lumbar vertebral segmentation, disc-level designation, and spinal stenosis grading using deep learning. Proceedings of the Machine Learning for Healthcare Conference (PMLR), Palo Alto, CA, USA."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Payer, C., Stern, D., Bischof, H., and Urschler, M. (2020, January 27\u201329). Coarse to Fine Vertebrae Localization and Segmentation with SpatialConfiguration-Net and U-Net. Proceedings of the VISIGRAPP (5: VISAPP), Valletta, Malta.","DOI":"10.5220\/0008975201240133"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zuiderveld, K. (1994). Contrast Limited Adaptive Histogram Equalization, Graphics Gems IV, Academic Press Inc.","DOI":"10.1016\/B978-0-12-336156-1.50061-6"},{"key":"ref_24","unstructured":"Ulyanov, D., Vedaldi, A., and Lempitsky, V. (2016). Instance normalization: The missing ingredient for fast stylization. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Wang, X., Kong, T., Shen, C., Jiang, Y., and Li, L. (2020). Solo: Segmenting objects by locations. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-030-58523-5_38"},{"key":"ref_26","unstructured":"(2018, December 11). National Library of Medicine, Available online: https:\/\/www.nlm.nih.gov\/databases\/download\/nhanes.html."},{"key":"ref_27","unstructured":"(2022, July 21). Burapha University. Available online: https:\/\/services.informatics.buu.ac.th\/spine\/."},{"key":"ref_28","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1778","DOI":"10.3174\/ajnr.A2036","article-title":"Lumbosacral transitional vertebrae: Classification, imaging findings, and clinical relevance","volume":"31","author":"Konin","year":"2010","journal-title":"Am. J. Neuroradiol."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/22\/8628\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:12:58Z","timestamp":1760145178000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/22\/8628"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,11,9]]},"references-count":29,"journal-issue":{"issue":"22","published-online":{"date-parts":[[2022,11]]}},"alternative-id":["s22228628"],"URL":"https:\/\/doi.org\/10.3390\/s22228628","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2022,11,9]]}}}