{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,2]],"date-time":"2026-02-02T01:48:35Z","timestamp":1769996915269,"version":"3.49.0"},"reference-count":18,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2022,4,30]],"date-time":"2022-04-30T00:00:00Z","timestamp":1651276800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Funds through the Portuguese funding agency, FCT-Foundation for Science and Technology Portugal","award":["2021.05767.BD"],"award-info":[{"award-number":["2021.05767.BD"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Lung cancer is a highly prevalent pathology and a leading cause of cancer-related deaths. Most patients are diagnosed when the disease has manifested itself, which usually is a sign of lung cancer in an advanced stage and, as a consequence, the 5-year survival rates are low. To increase the chances of survival, improving the cancer early detection capacity is crucial, for which computed tomography (CT) scans represent a key role. The manual evaluation of the CTs is a time-consuming task and computer-aided diagnosis (CAD) systems can help relieve that burden. The segmentation of the lung is one of the first steps in these systems, yet it is very challenging given the heterogeneity of lung diseases usually present and associated with cancer development. In our previous work, a segmentation model based on a ResNet34 and U-Net combination was developed on a cross-cohort dataset that yielded good segmentation masks for multiple pathological conditions but misclassified some of the lung nodules. The multiple datasets used for the model development were originated from different annotation protocols, which generated inconsistencies for the learning process, and the annotations are usually not adequate for lung cancer studies since they did not comprise lung nodules. In addition, the initial datasets used for training presented a reduced number of nodules, which was showed not to be enough to allow the segmentation model to learn to include them as a lung part. In this work, an objective protocol for the lung mask\u2019s segmentation was defined and the previous annotations were carefully reviewed and corrected to create consistent and adequate ground-truth masks for the development of the segmentation model. Data augmentation with domain knowledge was used to create lung nodules in the cases used to train the model. The model developed achieved a Dice similarity coefficient (DSC) above 0.9350 for all test datasets and it showed an ability to cope, not only with a variety of lung patterns, but also with the presence of lung nodules as well. This study shows the importance of using consistent annotations for the supervised learning process, which is a very time-consuming task, but that has great importance to healthcare applications. Due to the lack of massive datasets in the medical field, which consequently brings a lack of wide representativity, data augmentation with domain knowledge could represent a promising help to overcome this limitation for learning models development.<\/jats:p>","DOI":"10.3390\/s22093443","type":"journal-article","created":{"date-parts":[[2022,5,3]],"date-time":"2022-05-03T08:26:35Z","timestamp":1651566395000},"page":"3443","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":10,"title":["The Influence of a Coherent Annotation and Synthetic Addition of Lung Nodules for Lung Segmentation in CT Scans"],"prefix":"10.3390","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1563-2978","authenticated-orcid":false,"given":"Joana","family":"Sousa","sequence":"first","affiliation":[{"name":"INESC TEC\u2014Institute for Systems and Computer Engineering, Technology and Science, 4200-465 Porto, Portugal"},{"name":"FEUP\u2014Faculty of Engineering, University of Porto, 4200-465 Porto, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1681-2436","authenticated-orcid":false,"given":"Tania","family":"Pereira","sequence":"additional","affiliation":[{"name":"INESC TEC\u2014Institute for Systems and Computer Engineering, Technology and Science, 4200-465 Porto, Portugal"}]},{"given":"In\u00eas","family":"Neves","sequence":"additional","affiliation":[{"name":"ICBAS\u2014Abel Salazar Biomedical Sciences Institute, University of Porto, 4050-313 Porto, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3069-2282","authenticated-orcid":false,"given":"Francisco","family":"Silva","sequence":"additional","affiliation":[{"name":"INESC TEC\u2014Institute for Systems and Computer Engineering, Technology and Science, 4200-465 Porto, Portugal"},{"name":"FCUP\u2014Faculty of Science, University of Porto, 4169-007 Porto, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6193-8540","authenticated-orcid":false,"given":"H\u00e9lder P.","family":"Oliveira","sequence":"additional","affiliation":[{"name":"INESC TEC\u2014Institute for Systems and Computer Engineering, Technology and Science, 4200-465 Porto, Portugal"},{"name":"FCUP\u2014Faculty of Science, University of Porto, 4169-007 Porto, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2022,4,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"209","DOI":"10.3322\/caac.21660","article-title":"Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries","volume":"71","author":"Sung","year":"2021","journal-title":"CA Cancer J. Clin."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1016\/j.lungcan.2015.08.017","article-title":"The relationship between COPD and lung cancer","volume":"90","author":"Durham","year":"2015","journal-title":"Lung Cancer"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Silva, F., Pereira, T., Neves, I., Morgado, J., Freitas, C., Malafaia, M., Sousa, J., Fonseca, J., Negr\u00e3o, E., and Flor de Lima, B. (2022). Towards Machine Learning-Aided Lung Cancer Clinical Routines: Approaches and Open Challenges. J. Pers. Med., 12.","DOI":"10.3390\/jpm12030480"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1186\/s12938-015-0120-7","article-title":"Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy","volume":"15","author":"Firmino","year":"2016","journal-title":"BioMed. Eng. OnLine"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1314","DOI":"10.1016\/j.bbe.2020.07.007","article-title":"A deep Residual U-Net convolutional neural network for automated lung segmentation in computed tomography images","volume":"40","author":"Khanna","year":"2020","journal-title":"Biocybern. Biomed. Eng."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. Medical Image Computing and Computer-Assisted Intervention\u2014MICCAI 2015, Springer.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"101817","DOI":"10.1016\/j.compmedimag.2020.101817","article-title":"LGAN: Lung segmentation in CT scans using generative adversarial network","volume":"87","author":"Tan","year":"2021","journal-title":"Comput. Med. Imaging Graph."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Sousa, J., Pereira, T., Silva, F., Silva, M., Vilares, A., Cunha, A., and Oliveira, H. (2022). Lung Segmentation in CT Images: A Residual U-Net Approach on a Cross-Cohort Dataset. Appl. Sci., 12.","DOI":"10.3390\/app12041959"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"101759","DOI":"10.1016\/j.media.2020.101759","article-title":"Deep learning with noisy labels: Exploring techniques and remedies in medical image analysis","volume":"65","author":"Karimi","year":"2020","journal-title":"Med. Image Anal."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"101693","DOI":"10.1016\/j.media.2020.101693","article-title":"Embracing Imperfect Datasets: A Review of Deep Learning Solutions for Medical Image Segmentation","volume":"63","author":"Tajbakhsh","year":"2020","journal-title":"Med. Image Anal."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1186\/s41747-020-00173-2","article-title":"Automatic lung segmentation in routine imaging is primarily a data diversity problem, not a methodology problem","volume":"4","author":"Hofmanninger","year":"2020","journal-title":"Eur. Radiol. Exp."},{"key":"ref_13","unstructured":"Yang, J., Sharp, G., Veeraraghavan, H., van Elmpt, W., Dekker, A., Lustberg, T., and Gooding, M. (2017). Data from Lung CT Segmentation Challenge, The Cancer Imaging Archive."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.media.2017.06.015","article-title":"Validation, comparison, and combination of algorithms for automatic detection of pulmonary nodules in computed tomography images: The LUNA16 challenge","volume":"42","author":"Setio","year":"2017","journal-title":"Med. Image Anal."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/j.compmedimag.2011.07.003","article-title":"Building a Reference Multimedia Database for Interstitial Lung Diseases","volume":"36","author":"Depeursinge","year":"2012","journal-title":"Comput. Med. Imaging Graph."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1217","DOI":"10.1016\/j.media.2014.07.003","article-title":"Comparing algorithms for automated vessel segmentation in computed tomography scans of the lung: The VESSEL12 study","volume":"18","author":"Rudyanto","year":"2014","journal-title":"Med. Image Anal."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"358","DOI":"10.1016\/j.radphyschem.2011.12.026","article-title":"CT number definition","volume":"81","author":"Bryant","year":"2012","journal-title":"Radiat. Phys. Chem."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"015006","DOI":"10.1117\/1.JMI.5.1.015006","article-title":"Family of boundary overlap metrics for the evaluation of medical image segmentation","volume":"5","author":"Yeghiazaryan","year":"2018","journal-title":"J. Med. Imaging"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/9\/3443\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T23:05:07Z","timestamp":1760137507000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/9\/3443"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,30]]},"references-count":18,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2022,5]]}},"alternative-id":["s22093443"],"URL":"https:\/\/doi.org\/10.3390\/s22093443","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,4,30]]}}}