{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,4]],"date-time":"2026-06-04T17:12:52Z","timestamp":1780593172380,"version":"3.54.1"},"reference-count":22,"publisher":"World Scientific Pub Co Pte Ltd","issue":"03n04","funder":[{"DOI":"10.13039\/100020595","name":"National Science and Technology Council","doi-asserted-by":"publisher","award":["MOST 109-2221-E-224-048-MY2"],"award-info":[{"award-number":["MOST 109-2221-E-224-048-MY2"]}],"id":[{"id":"10.13039\/100020595","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2025,3,30]]},"abstract":"<jats:p> Chest X-ray is one of the most common tests in radiology and plays a vital role in helping physicians spot different chest conditions. This paper proposes a new model called CAT-UNet to segment lung masses in chest X-ray images. The CAT-UNet uses TransUNet as the leading architecture and mixes CNN and transformer as an encoder. The CNN uses ResNet50 as the backbone, embedding the coordinate attention (CA) block and four skip connections of different scales to improve the accuracy of finding shallow features. Vision Transformer (ViT), which applied the transformer structure, was used in our method to enhance the feature representation ability of images, and Atrous Spatial Pyramid Pooling (ASPP) was used to adjust the filter\u2019s field-of-view and control the resolution of features. In the decoder, the Convolutional Block Attention Modules (CBAM) are embedded for upsampling so that the segmentation details can be better optimized. To evaluate the performance and generalizability of the proposed method, we conducted a 3-fold cross-validation experiment using 1914 chest X-ray images labeled by radiologists collected from the Department of Radiology at Dalin Tzu Chi Hospital, Taiwan. Experimental results show that the proposed CAT-UNet achieves 89.06% on Dice, 91.62% on sensitivity, 98.64% on specificity, and 95.15% on accuracy, outperforming U-Net, TransUNet, and Swin-UNet encoders. <\/jats:p>","DOI":"10.1142\/s0218001425520020","type":"journal-article","created":{"date-parts":[[2025,3,1]],"date-time":"2025-03-01T03:52:26Z","timestamp":1740801146000},"source":"Crossref","is-referenced-by-count":3,"title":["CAT-UNet: Integrating CNN Attention Mechanism and TransUNet for Lung Mass Segmentation"],"prefix":"10.1142","volume":"39","author":[{"given":"Ade Irma","family":"Suryani","sequence":"first","affiliation":[{"name":"Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin 64002, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5010-2767","authenticated-orcid":false,"given":"Chuan-Wang","family":"Chang","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Information Engineering, National Chin-Yi University of Technology, Taichung 41170, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hsin-Tien","family":"Cheng","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin 64002, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tin-Kwang","family":"Lin","sequence":"additional","affiliation":[{"name":"Department of Internal Medicine, Dalin Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Chiayi 622, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chin-Wen","family":"Lin","sequence":"additional","affiliation":[{"name":"Department of Radiology, Dalin Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Chiayi 622, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9476-8130","authenticated-orcid":false,"given":"Chuan-Yu","family":"Chang","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin 64002, Taiwan"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"219","published-online":{"date-parts":[[2025,4,22]]},"reference":[{"key":"S0218001425520020BIB001","volume-title":"Proc. ICLR","author":"Bahdanau D.","year":"2015"},{"key":"S0218001425520020BIB003","volume-title":"Proc. IPPR Conf. Computer Vision, Graphics, and Image Processing","author":"Chang C. Y.","year":"2021"},{"key":"S0218001425520020BIB004","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2017.2699184"},{"key":"S0218001425520020BIB006","first-page":"424","volume-title":"Int. Conference on Medical Image Computing and Computer-Assisted Intervention","author":"\u00c7i\u00e7ek \u00d6.","year":"2016"},{"key":"S0218001425520020BIB008","volume":"32","author":"He F.","year":"2019","journal-title":"Adv. Neural Inf. Process Syst."},{"key":"S0218001425520020BIB009","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"S0218001425520020BIB010","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR46437.2021.01350"},{"key":"S0218001425520020BIB011","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01228-1_15"},{"key":"S0218001425520020BIB013","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2019.00656"},{"key":"S0218001425520020BIB014","doi-asserted-by":"publisher","DOI":"10.1016\/j.compmedimag.2019.05.005"},{"key":"S0218001425520020BIB015","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"S0218001425520020BIB016","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"S0218001425520020BIB017","doi-asserted-by":"publisher","DOI":"10.1109\/3DV.2016.79"},{"key":"S0218001425520020BIB019","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"S0218001425520020BIB020","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2022.3224486"},{"key":"S0218001425520020BIB021","first-page":"6000","author":"Vaswani A.","year":"2017","journal-title":"Proc. of the 31st Int. Conf. on Neural Information Processing Systems (NIPS'17)"},{"key":"S0218001425520020BIB022","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.369"},{"key":"S0218001425520020BIB023","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"S0218001425520020BIB025","doi-asserted-by":"publisher","DOI":"10.1109\/ITME.2018.00080"},{"key":"S0218001425520020BIB026","doi-asserted-by":"publisher","DOI":"10.1093\/nsr\/nwx106"},{"key":"S0218001425520020BIB027","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.319"},{"key":"S0218001425520020BIB028","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-00889-5_1"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001425520020","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,23]],"date-time":"2025-04-23T03:31:21Z","timestamp":1745379081000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/10.1142\/S0218001425520020"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,30]]},"references-count":22,"journal-issue":{"issue":"03n04","published-print":{"date-parts":[[2025,3,30]]}},"alternative-id":["10.1142\/S0218001425520020"],"URL":"https:\/\/doi.org\/10.1142\/s0218001425520020","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"value":"0218-0014","type":"print"},{"value":"1793-6381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,3,30]]},"article-number":"2552002"}}