{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,28]],"date-time":"2026-04-28T22:49:00Z","timestamp":1777416540994,"version":"3.51.4"},"reference-count":22,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,3,13]],"date-time":"2025-03-13T00:00:00Z","timestamp":1741824000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,3,13]],"date-time":"2025-03-13T00:00:00Z","timestamp":1741824000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Med Syst"],"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Manually converting unstructured text pathology reports into structured pathology reports is very time-consuming and prone to errors. This study demonstrates the transformative potential of generative AI in automating the analysis of free-text pathology reports. Employing the ChatGPT Large Language Model within a Streamlit web application, we automated the extraction and structuring of information from 33 unstructured breast cancer pathology reports from Taipei Medical University Hospital. Achieving a 99.61% accuracy rate, the AI system notably reduced the processing time compared to traditional methods. This not only underscores the efficacy of AI in converting unstructured medical text into structured data but also highlights its potential to enhance the efficiency and reliability of medical text analysis. However, this study is limited to breast cancer pathology reports and was conducted using data obtained from hospitals associated with a single institution. In the future, we plan to expand the scope of this research to include pathology reports for other cancer types incrementally and conduct external validation to further substantiate the robustness and generalizability of the proposed system. Through this technological integration, we aimed to substantiate the capabilities of generative AI in improving both the speed and reliability of data processing. The outcomes of this study affirm that generative AI can significantly transform the handling of pathology reports, promising substantial advancements in biomedical research by facilitating the structured analysis of complex medical data.<\/jats:p>","DOI":"10.1007\/s10916-025-02167-2","type":"journal-article","created":{"date-parts":[[2025,3,13]],"date-time":"2025-03-13T14:53:22Z","timestamp":1741877602000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Using Generative AI to Extract Structured Information from Free Text Pathology Reports"],"prefix":"10.1007","volume":"49","author":[{"given":"Fahad","family":"Shahid","sequence":"first","affiliation":[]},{"given":"Min-Huei","family":"Hsu","sequence":"additional","affiliation":[]},{"given":"Yung-Chun","family":"Chang","sequence":"additional","affiliation":[]},{"given":"Wen-Shan","family":"Jian","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,3,13]]},"reference":[{"issue":"1","key":"2167_CR1","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1007\/s00428-015-1834-4","volume":"468","author":"DW Ellis","year":"2016","unstructured":"Ellis DW, Srigley J. Does standardised structured reporting contribute to quality in diagnostic pathology? The importance of evidence-based datasets. Virchows Arch. 2016;468(1):51-9. https:\/\/doi.org\/10.1007\/s00428-015-1834-4","journal-title":"Virchows Arch."},{"key":"2167_CR2","doi-asserted-by":"publisher","first-page":"100154","DOI":"10.1016\/j.jpi.2022.100154","volume":"13","author":"H Lam","year":"2022","unstructured":"Lam H, Nguyen F, Wang X, Stock A, Lenskaya V, Kooshesh M, et al. An accessible, efficient, and accurate natural language processing method for extracting diagnostic data from pathology reports. J Pathol Inform. 2022;13:100154. https:\/\/doi.org\/10.1016\/j.jpi.2022.100154","journal-title":"J Pathol Inform."},{"issue":"11","key":"2167_CR3","doi-asserted-by":"publisher","first-page":"1486","DOI":"10.1016\/j.ajpath.2022.07.012","volume":"192","author":"P Lopez-Ubeda","year":"2022","unstructured":"Lopez-Ubeda P, Martin-Noguerol T, Aneiros-Fernandez J, Luna A. Natural Language Processing in Pathology: Current Trends and Future Insights. Am J Pathol. 2022;192(11):1486-95. https:\/\/doi.org\/10.1016\/j.ajpath.2022.07.012","journal-title":"Am J Pathol."},{"issue":"1","key":"2167_CR4","doi-asserted-by":"publisher","first-page":"244","DOI":"10.1109\/JBHI.2017.2700722","volume":"22","author":"JX Qiu","year":"2018","unstructured":"Qiu JX, Yoon HJ, Fearn PA, Tourassi GD. Deep Learning for Automated Extraction of Primary Sites From Cancer Pathology Reports. IEEE J Biomed Health Inform. 2018;22(1):244-51. https:\/\/doi.org\/10.1109\/JBHI.2017.2700722","journal-title":"IEEE J Biomed Health Inform."},{"key":"2167_CR5","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1200\/CCI.18.00138","volume":"3","author":"JT Senders","year":"2019","unstructured":"Senders JT, Karhade AV, Cote DJ, Mehrtash A, Lamba N, DiRisio A, et al. Natural Language Processing for Automated Quantification of Brain Metastases Reported in Free-Text Radiology Reports. JCO Clin Cancer Inform. 2019;3:1-9. https:\/\/doi.org\/10.1200\/CCI.18.00138","journal-title":"JCO Clin Cancer Inform."},{"key":"2167_CR6","doi-asserted-by":"publisher","unstructured":"Shastry KA, Shastry A. An integrated deep learning and natural language processing approach for continuous remote monitoring in digital health. Decision Analytics Journal. 2023;8:100301. https:\/\/doi.org\/10.1016\/j.dajour.2023.100301","DOI":"10.1016\/j.dajour.2023.100301"},{"key":"2167_CR7","doi-asserted-by":"publisher","unstructured":"Marchisio A, Shafique M, Ozkan M, Benini L, Silvano C, Kim M, et al. Deep learning for edge computing: Current trends, cross-layer optimizations, and open research challenges. In: 2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI). IEEE; 2019. p. 553\u20139. https:\/\/doi.org\/10.1109\/ISVLSI.2019.00105","DOI":"10.1109\/ISVLSI.2019.00105"},{"key":"2167_CR8","doi-asserted-by":"publisher","unstructured":"Lu Y, Srinivasan G, Preum S, Pettus J, Davis M, Greenburg J, et al. Assessing the impact of pretraining domain relevance on large language models across various pathology reporting tasks. medRxiv [Preprint]. 2023. Available from: https:\/\/doi.org\/10.1101\/2023.09.10.23295318","DOI":"10.1101\/2023.09.10.23295318"},{"key":"2167_CR9","first-page":"e1031","volume":"201","author":"AY Odisho","year":"2019","unstructured":"Odisho AY, Park B, Altieri N, Murdoch WJ, Carroll P, Coopberberg M, et al. PD58-09\u2003EXTRACTING STRUCTURED INFORMATION FROM PATHOLOGY REPORTS USING NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING. The Journal of Urology. 2019;201:e1031\u2013e2.","journal-title":"The Journal of Urology."},{"key":"2167_CR10","unstructured":"Wang H. Knowledge efficient deep learning for natural language processing. arXiv [Preprint]. 2020. Available from: https:\/\/arxiv.org\/abs\/2008.12878"},{"key":"2167_CR11","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1200\/CCI.18.00080","volume":"2","author":"S-R Leyh-Bannurah","year":"2018","unstructured":"Leyh-Bannurah S-R, Tian Z, Karakiewicz PI, Wolffgang U, Sauter G, Fisch M, et al. Deep Learning for Natural Language Processing in Urology: State-of-the-Art Automated Extraction of Detailed Pathologic Prostate Cancer Data From Narratively Written Electronic Health Records. JCO Clinical Cancer Informatics. 2018(2):1-9. https:\/\/doi.org\/10.1200\/CCI.18.00080","journal-title":"JCO Clinical Cancer Informatics."},{"issue":"1","key":"2167_CR12","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1186\/s12911-021-01533-7","volume":"21","author":"A Casey","year":"2021","unstructured":"Casey A, Davidson E, Poon M, Dong H, Duma D, Grivas A, et al. A systematic review of natural language processing applied to radiology reports. BMC Medical Informatics and Decision Making. 2021;21(1):179. https:\/\/doi.org\/10.1186\/s12911-021-01533-7","journal-title":"BMC Medical Informatics and Decision Making."},{"key":"2167_CR13","doi-asserted-by":"crossref","unstructured":"Tan WM, Teoh KH, Ganggayah MD, Taib NA, Zaini HS, Dhillon SK. Automated generation of synoptic reports from narrative pathology reports in University Malaya Medical Centre using natural language processing. Diagnostics (Basel). 2022;12(879).","DOI":"10.3390\/diagnostics12040879"},{"issue":"4","key":"2167_CR14","doi-asserted-by":"publisher","first-page":"e230725","DOI":"10.1148\/radiol.230725","volume":"307","author":"LC Adams","year":"2023","unstructured":"Adams LC, Truhn D, Busch F, Kader A, Niehues SM, Makowski MR, et al. Leveraging GPT-4 for Post Hoc Transformation of Free-text Radiology Reports into Structured Reporting: A Multilingual Feasibility Study. Radiology. 2023;307(4):e230725. https:\/\/doi.org\/10.1148\/radiol.230725","journal-title":"Radiology."},{"issue":"3","key":"2167_CR15","doi-asserted-by":"publisher","first-page":"310","DOI":"10.1002\/path.6232","volume":"262","author":"D Truhn","year":"2024","unstructured":"Truhn D, Loeffler CM, Muller-Franzes G, Nebelung S, Hewitt KJ, Brandner S, et al. Extracting structured information from unstructured histopathology reports using generative pre-trained transformer 4 (GPT-4). J Pathol. 2024;262(3):310-9. https:\/\/doi.org\/10.1002\/path.6232","journal-title":"J Pathol."},{"issue":"1","key":"2167_CR16","doi-asserted-by":"publisher","first-page":"1418","DOI":"10.1038\/s41467-024-45563-x","volume":"15","author":"J Dagdelen","year":"2024","unstructured":"Dagdelen J, Dunn A, Lee S, Walker N, Rosen AS, Ceder G, et al. Structured information extraction from scientific text with large language models. Nat Commun. 2024;15(1):1418. https:\/\/doi.org\/10.1038\/s41467-024-45563-x","journal-title":"Nat Commun."},{"issue":"5","key":"2167_CR17","doi-asserted-by":"publisher","first-page":"701","DOI":"10.2967\/jnumed.123.265687","volume":"64","author":"J Kleesiek","year":"2023","unstructured":"Kleesiek J, Wu Y, Stiglic G, Egger J, Bian J. An Opinion on ChatGPT in Health Care-Written by Humans Only. J Nucl Med. 2023;64(5):701-3. https:\/\/doi.org\/10.2967\/jnumed.123.265687","journal-title":"J Nucl Med."},{"key":"2167_CR18","doi-asserted-by":"publisher","first-page":"e50945","DOI":"10.2196\/50945","volume":"9","author":"CW Safranek","year":"2023","unstructured":"Safranek CW, Sidamon-Eristoff AE, Gilson A, Chartash D. The Role of Large Language Models in Medical Education: Applications and Implications. JMIR Med Educ. 2023;9:e50945. https:\/\/doi.org\/10.2196\/50945","journal-title":"JMIR Med Educ."},{"issue":"4","key":"2167_CR19","doi-asserted-by":"publisher","first-page":"e37587","DOI":"10.7759\/cureus.37587","volume":"15","author":"HR Naik","year":"2023","unstructured":"Naik HR, Prather AD, Gurda GT. Synchronous Bilateral Breast Cancer: A Case Report Piloting and Evaluating the Implementation of the AI-Powered Large Language Model (LLM) ChatGPT. Cureus. 2023;15(4):e37587. https:\/\/doi.org\/10.7759\/cureus.37587","journal-title":"Cureus."},{"key":"2167_CR20","doi-asserted-by":"publisher","unstructured":"Wang J, Shi E, Yu S, Wu Z, Chong M, Dai H, et al. Prompt engineering for healthcare: Methodologies and applications. arXiv [Preprint]. 2023. Available from: https:\/\/doi.org\/10.48550\/arXiv.2304.14670","DOI":"10.48550\/arXiv.2304.14670"},{"issue":"3","key":"2167_CR21","doi-asserted-by":"publisher","first-page":"209","DOI":"10.3857\/roj.2023.00633","volume":"41","author":"HS Choi","year":"2023","unstructured":"Choi HS, Song JY, Shin KH, Chang JH, Jang BS. Developing prompts from large language model for extracting clinical information from pathology and ultrasound reports in breast cancer. Radiat Oncol J. 2023;41(3):209-16. https:\/\/doi.org\/10.3857\/roj.2023.00633","journal-title":"Radiat Oncol J."},{"key":"2167_CR22","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-024-19209-5","author":"\u00c1 Garc\u00eda-Barrag\u00e1n","year":"2024","unstructured":"Garc\u00eda-Barrag\u00e1n \u00c1, Gonz\u00e1lez Calatayud A, Solarte-Pab\u00f3n O, Provencio M, Menasalvas E, Robles V. GPT for medical entity recognition in Spanish. Multimedia Tools and Applications. 2024. https:\/\/doi.org\/10.1007\/s11042-024-19209-5","journal-title":"Multimedia Tools and Applications."}],"container-title":["Journal of Medical Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10916-025-02167-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10916-025-02167-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10916-025-02167-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,13]],"date-time":"2025-03-13T14:53:26Z","timestamp":1741877606000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10916-025-02167-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3,13]]},"references-count":22,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["2167"],"URL":"https:\/\/doi.org\/10.1007\/s10916-025-02167-2","relation":{},"ISSN":["1573-689X"],"issn-type":[{"value":"1573-689X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,3,13]]},"assertion":[{"value":"11 September 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 March 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 March 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"This study was approved by the TMU-Joint Institutional Review Board (TMU-JIRB No: N202103047).","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics Approval"}},{"value":"The authors declare no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing Interests"}}],"article-number":"36"}}