{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T15:29:16Z","timestamp":1781018956048,"version":"3.54.1"},"publisher-location":"New York, NY, USA","reference-count":33,"publisher":"ACM","license":[{"start":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T00:00:00Z","timestamp":1774224000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/legalcode"}],"funder":[{"name":"National Research Foundation (NRF) funded by the Korean government (MSIT)","award":["No. RS-2023-00229822"],"award-info":[{"award-number":["No. RS-2023-00229822"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2026,3,23]]},"DOI":"10.1145\/3748522.3779771","type":"proceedings-article","created":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T14:17:49Z","timestamp":1781014669000},"page":"1172-1179","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["SKIM: Semantic Knowledge Infused Modeling for Medical Report Generation"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2046-3091","authenticated-orcid":false,"given":"Hyojeong","family":"Lee","sequence":"first","affiliation":[{"name":"Yonsei University, Seoul, Republic of Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2773-7670","authenticated-orcid":false,"given":"Youngwan","family":"Jo","sequence":"additional","affiliation":[{"name":"Yonsei University, Seoul, Republic of Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1679-1268","authenticated-orcid":false,"given":"Inpyo","family":"Hong","sequence":"additional","affiliation":[{"name":"Department of Computer Science,, Yonsei University, Seoul, Republic of Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5196-6193","authenticated-orcid":false,"given":"Sanghyun","family":"Park","sequence":"additional","affiliation":[{"name":"Yonsei University, Seoul, Republic of Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2026,6,9]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","unstructured":"Zubair Ahmad Shabina Rahim Maha Zubair and Jamshid Abdul-Ghafar. 2021. Artificial intelligence (ai) in medicine current applications and future role with special emphasis on its potential and promise in pathology: present and future impact obstacles including costs and acceptance among pathologists practical and philosophical considerations. a comprehensive review. Diagnostic Pathology 16 (Mar. 2021). 10.1186\/s13000-021-01085-4","DOI":"10.1186\/s13000-021-01085-4"},{"key":"e_1_3_2_1_2_1","volume-title":"Proceedings of the acl workshop on intrinsic and extrinsic evaluation measures for machine translation and\/or summarization, 65\u201372","author":"Banerjee Satanjeev","year":"2005","unstructured":"Satanjeev Banerjee and Alon Lavie. 2005. Meteor: an automatic metric for mt evaluation with improved correlation with human judgments. In Proceedings of the acl workshop on intrinsic and extrinsic evaluation measures for machine translation and\/or summarization, 65\u201372."},{"key":"e_1_3_2_1_3_1","unstructured":"Hangbo Bao Li Dong Songhao Piao and Furu Wei. 2021. Beit: bert pre-training of image transformers. arXiv preprint arXiv:2106.08254."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","unstructured":"Zhihong Chen Yaling Shen Yan Song and Xiang Wan. 2021. Cross-modal memory networks for radiology report generation. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Chengqing Zong Fei Xia Wenjie Li and Roberto Navigli (Eds.) Association for Computational Linguistics Online (Aug. 2021) 5904\u20135914. 10.18653\/v1\/2021.acl-long.459","DOI":"10.18653\/v1\/2021.acl-long.459"},{"key":"e_1_3_2_1_5_1","volume-title":"Generating radiology reports via memory-driven transformer. (2022). https:\/\/arxiv.org\/abs\/2010.16056 arXiv","author":"Chen Zhihong","year":"2010","unstructured":"Zhihong Chen, Yan Song, Tsung-Hui Chang, and Xiang Wan. 2022. Generating radiology reports via memory-driven transformer. (2022). https:\/\/arxiv.org\/abs\/2010.16056 arXiv: 2010.16056 [cs.CL]."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocv080"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPRW63382.2024.00208"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/WACV61041.2025.00047"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v39i4.32393"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Xun Huang and Serge Belongie. 2017. Arbitrary style transfer in real-time with adaptive instance normalization. (2017). https:\/\/arxiv.org\/abs\/1703.06868 arXiv: 1703.06868 [cs.CV].","DOI":"10.1109\/ICCV.2017.167"},{"key":"e_1_3_2_1_11_1","unstructured":"Andrew Jaegle et al. 2022. Perceiver io: a general architecture for structured inputs & outputs. (2022). https:\/\/arxiv.org\/abs\/2107.14795 arXiv: 2107.14795 [cs.LG]."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v38i3.28038"},{"key":"e_1_3_2_1_13_1","volume-title":"Johnson et al","author":"Alistair E.","year":"2019","unstructured":"Alistair E. W. Johnson et al. 2019. Mimic-cxr-jpg, a large publicly available database of labeled chest radiographs. (2019). https:\/\/arxiv.org\/abs\/1901.07042 arXiv: 1901.07042 [cs.CV]."},{"key":"e_1_3_2_1_14_1","volume-title":"Proceedings of the 40th International Conference on Machine Learning (Proceedings of Machine Learning Research)","author":"Li Junnan","year":"2023","unstructured":"Junnan Li, Dongxu Li, Silvio Savarese, and Steven Hoi. 2023. BLIP-2: bootstrapping language-image pre-training with frozen image encoders and large language models. In Proceedings of the 40th International Conference on Machine Learning (Proceedings of Machine Learning Research). Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, and Jonathan Scarlett, (Eds.) Vol. 202. PMLR, (23\u201329 Jul 2023), 19730\u201319742. https:\/\/proceedings.mlr.press\/v202\/li23q.html."},{"key":"e_1_3_2_1_15_1","unstructured":"Kevin Y Li Sachin Goyal Joao D Semedo and J Zico Kolter. 2024. Inference optimal vlms need fewer visual tokens and more parameters. arXiv preprint arXiv:2411.03312."},{"key":"e_1_3_2_1_16_1","unstructured":"Chin-Yew Lin. 2004. Rouge: a package for automatic evaluation of summaries. In Text summarization branches out 74\u201381."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v38i17.29826"},{"key":"e_1_3_2_1_18_1","volume-title":"Proceedings of the 40th annual meeting of the Association for Computational Linguistics, 311\u2013318","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting of the Association for Computational Linguistics, 311\u2013318."},{"key":"e_1_3_2_1_19_1","unstructured":"Yongming Rao Wenliang Zhao Benlin Liu Jiwen Lu Jie Zhou and Cho-Jui Hsieh. 2021. Dynamicvit: efficient vision transformers with dynamic token sparsification. (2021). https:\/\/arxiv.org\/abs\/2106.02034 arXiv: 2106.02034 [cs.CV]."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v38i5.28279"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"crossref","unstructured":"Akshay Smit Saahil Jain Pranav Rajpurkar Anuj Pareek Andrew Y Ng and Matthew P Lungren. 2020. Chexbert: combining automatic labelers and expert annotations for accurate radiology report labeling using bert. arXiv preprint arXiv:2004.09167.","DOI":"10.18653\/v1\/2020.emnlp-main.117"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.healthpol.2025.105444"},{"key":"e_1_3_2_1_23_1","unstructured":"Hugo Touvron et al. 2023. Llama 2: open foundation and fine-tuned chat models. (2023). https:\/\/arxiv.org\/abs\/2307.09288 arXiv: 2307.09288 [cs.CL]."},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298935"},{"key":"e_1_3_2_1_25_1","unstructured":"Xiao Wang Fuling Wang Yuehang Li Qingchuan Ma Shiao Wang Bo Jiang Chuanfu Li and Jin Tang. 2024. Cxpmrg-bench: pre-training and benchmarking for x-ray medical report generation on chexpert plus dataset. (2024). https:\/\/arxiv.org\/abs\/2410.00379 arXiv: 2410.00379 [cs.CV]."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.01112"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR52729.2023.01112"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.metrad.2023.100033"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"crossref","unstructured":"Zifeng Wang Zhenbang Wu Dinesh Agarwal and Jimeng Sun. 2022. Medclip: contrastive learning from unpaired medical images and text. (2022). https:\/\/arxiv.org\/abs\/2210.10163 arXiv: 2210.10163 [cs.CV].","DOI":"10.18653\/v1\/2022.emnlp-main.256"},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Qilong Xing Zikai Song Youjia Zhang Na Feng Junqing Yu and Wei Yang. 2025. Mca-rg: enhancing llms with medical concept alignment for radiology report generation. arXiv preprint arXiv:2507.06992.","DOI":"10.1007\/978-3-032-04971-1_36"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2023.121260"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMI.2024.3412402"},{"key":"e_1_3_2_1_33_1","volume-title":"International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 152\u2013161","author":"Yun Hannah","year":"2025","unstructured":"Hannah Yun, Junyeong Maeng, Eunsong Kang, and Heung-Il Suk. 2025. Diff-rrg: longitudinal disease-wise patch difference as guidance for llm-based radiology report generation. In International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 152\u2013161."}],"event":{"name":"SAC '26: 41st ACM\/SIGAPP Symposium on Applied Computing","location":"Grand Hotel Palace Thessaloniki Greece","acronym":"SAC '26","sponsor":["SIGAPP ACM Special Interest Group on Applied Computing"]},"container-title":["Proceedings of the 41st ACM\/SIGAPP Symposium on Applied Computing"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3748522.3779771","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T14:38:33Z","timestamp":1781015913000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3748522.3779771"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,23]]},"references-count":33,"alternative-id":["10.1145\/3748522.3779771","10.1145\/3748522"],"URL":"https:\/\/doi.org\/10.1145\/3748522.3779771","relation":{},"subject":[],"published":{"date-parts":[[2026,3,23]]},"assertion":[{"value":"2026-06-09","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}