{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,23]],"date-time":"2026-06-23T20:48:39Z","timestamp":1782247719267,"version":"3.54.5"},"reference-count":56,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,11,16]],"date-time":"2023-11-16T00:00:00Z","timestamp":1700092800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,11,16]],"date-time":"2023-11-16T00:00:00Z","timestamp":1700092800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100006093","name":"Patient-Centered Outcomes Research Institute","doi-asserted-by":"publisher","award":["ME-2018C3-14754"],"award-info":[{"award-number":["ME-2018C3-14754"]}],"id":[{"id":"10.13039\/100006093","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"U.S. Department of Health & Human Services | NIH | National Cancer Institute","doi-asserted-by":"publisher","award":["R56AG069880"],"award-info":[{"award-number":["R56AG069880"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000054","name":"U.S. Department of Health & Human Services | NIH | National Cancer Institute","doi-asserted-by":"publisher","award":["1R01CA246418"],"award-info":[{"award-number":["1R01CA246418"]}],"id":[{"id":"10.13039\/100000054","id-type":"DOI","asserted-by":"publisher"}]},{"name":"U.S. Department of Health & Human Services | NIH | National Cancer Institute"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["npj Digit. Med."],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>There are enormous enthusiasm and concerns in applying large language models (LLMs) to healthcare. Yet current assumptions are based on general-purpose LLMs such as ChatGPT, which are not developed for medical use. This study develops a generative clinical LLM, GatorTronGPT, using 277 billion words of text including (1) 82 billion words of clinical text from 126 clinical departments and approximately 2 million patients at the University of Florida Health and (2) 195 billion words of diverse general English text. We train GatorTronGPT using a GPT-3 architecture with up to 20 billion parameters and evaluate its utility for biomedical natural language processing (NLP) and healthcare text generation. GatorTronGPT improves biomedical natural language processing. We apply GatorTronGPT to generate 20 billion words of synthetic text. Synthetic NLP models trained using synthetic text generated by GatorTronGPT outperform models trained using real-world clinical text. Physicians\u2019 Turing test using 1 (worst) to 9 (best) scale shows that there are no significant differences in linguistic readability (<jats:italic>p<\/jats:italic>\u2009=\u20090.22; 6.57 of GatorTronGPT compared with 6.93 of human) and clinical relevance (<jats:italic>p<\/jats:italic>\u2009=\u20090.91; 7.0 of GatorTronGPT compared with 6.97 of human) and that physicians cannot differentiate them (<jats:italic>p<\/jats:italic>\u2009&lt;\u20090.001). This study provides insights into the opportunities and challenges of LLMs for medical research and healthcare.<\/jats:p>","DOI":"10.1038\/s41746-023-00958-w","type":"journal-article","created":{"date-parts":[[2023,11,16]],"date-time":"2023-11-16T20:02:03Z","timestamp":1700164923000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":335,"title":["A study of generative large language model for medical research and healthcare"],"prefix":"10.1038","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1994-893X","authenticated-orcid":false,"given":"Cheng","family":"Peng","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xi","family":"Yang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Aokun","family":"Chen","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kaleb E.","family":"Smith","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nima","family":"PourNejatian","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Anthony B.","family":"Costa","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Cheryl","family":"Martin","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7362-3044","authenticated-orcid":false,"given":"Mona G.","family":"Flores","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4210-2104","authenticated-orcid":false,"given":"Ying","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tanja","family":"Magoc","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5616-2701","authenticated-orcid":false,"given":"Gloria","family":"Lipori","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6049-213X","authenticated-orcid":false,"given":"Duane A.","family":"Mitchell","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Naykky S.","family":"Ospina","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mustafa M.","family":"Ahmed","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9881-1017","authenticated-orcid":false,"given":"William R.","family":"Hogan","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4903-1804","authenticated-orcid":false,"given":"Elizabeth A.","family":"Shenkman","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0587-4105","authenticated-orcid":false,"given":"Yi","family":"Guo","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2238-5429","authenticated-orcid":false,"given":"Jiang","family":"Bian","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6780-6135","authenticated-orcid":false,"given":"Yonghui","family":"Wu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2023,11,16]]},"reference":[{"key":"958_CR1","unstructured":"Introducing ChatGPT. https:\/\/openai.com\/blog\/chatgpt."},{"key":"958_CR2","doi-asserted-by":"publisher","first-page":"1233","DOI":"10.1056\/NEJMsr2214184","volume":"388","author":"P Lee","year":"2023","unstructured":"Lee, P., Bubeck, S. & Petro, J. Benefits, limits, and risks of GPT-4 as an AI chatbot for medicine. N. Engl. J. Med. 388, 1233\u20131239 (2023).","journal-title":"N. Engl. J. Med."},{"key":"958_CR3","doi-asserted-by":"publisher","first-page":"e107","DOI":"10.1016\/S2589-7500(23)00021-3","volume":"5","author":"SB Patel","year":"2023","unstructured":"Patel, S. B. & Lam, K. ChatGPT: the future of discharge summaries? Lancet Digit Health 5, e107\u2013e108 (2023).","journal-title":"Lancet Digit Health"},{"key":"958_CR4","doi-asserted-by":"publisher","first-page":"e179","DOI":"10.1016\/S2589-7500(23)00048-1","volume":"5","author":"SR Ali","year":"2023","unstructured":"Ali, S. R., Dobbs, T. D., Hutchings, H. A. & Whitaker, I. S. Using ChatGPT to write patient clinic letters. Lancet Digit Health 5, e179\u2013e181 (2023).","journal-title":"Lancet Digit Health"},{"key":"958_CR5","doi-asserted-by":"crossref","unstructured":"Hirosawa, T. et al. Diagnostic accuracy of differential-diagnosis lists generated by generative pretrained transformer 3 chatbot for clinical vignettes with common chief complaints: a pilot study. Int. J. Environ. Res. Public Health 20, 3378 (2023).","DOI":"10.3390\/ijerph20043378"},{"key":"958_CR6","doi-asserted-by":"publisher","unstructured":"Gr\u00fcnebaum, A., Chervenak, J., Pollet, S. L., Katz, A. & Chervenak, F. A. The Exciting Potential for ChatGPT in Obstetrics and Gynecology. Am. J. Obstet. Gynecol. https:\/\/doi.org\/10.1016\/j.ajog.2023.03.009 (2023).","DOI":"10.1016\/j.ajog.2023.03.009"},{"key":"958_CR7","doi-asserted-by":"publisher","DOI":"10.1007\/s10916-023-01925-4","volume":"47","author":"M Cascella","year":"2023","unstructured":"Cascella, M., Montomoli, J., Bellini, V. & Bignami, E. Evaluating the feasibility of ChatGPT in healthcare: an analysis of multiple clinical and research scenarios. J. Med. Syst. 47, 33 (2023).","journal-title":"J. Med. Syst."},{"key":"958_CR8","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1186\/s13054-023-04393-x","volume":"27","author":"R Azamfirei","year":"2023","unstructured":"Azamfirei, R., Kudchadkar, S. R. & Fackler, J. Large language models and the perils of their hallucinations. Crit. Care 27, 120 (2023).","journal-title":"Crit. Care"},{"key":"958_CR9","doi-asserted-by":"publisher","first-page":"e0240376","DOI":"10.1371\/journal.pone.0240376","volume":"15","author":"I Straw","year":"2020","unstructured":"Straw, I. & Callison-Burch, C. Artificial Intelligence in mental health and the biases of language based models. PLoS One 15, e0240376 (2020).","journal-title":"PLoS One"},{"key":"958_CR10","doi-asserted-by":"publisher","unstructured":"Li, H. et al. Ethics of large language models in medicine and medical research. Lancet Digital Health https:\/\/doi.org\/10.1016\/S2589-7500(23)00083-3 (2023).","DOI":"10.1016\/S2589-7500(23)00083-3"},{"key":"958_CR11","unstructured":"Kojima, T., Gu, S. S., Reid, M., Matsuo, Y. & Iwasawa, Y. Large Language Models are Zero-Shot Reasoners. Adv. Neural Inf. Process. Syst. 35, 22199\u2013213 (2022)."},{"key":"958_CR12","unstructured":"Bommasani, R. et al. On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258 (2021)."},{"key":"958_CR13","first-page":"1877","volume":"33","author":"T Brown","year":"2020","unstructured":"Brown, T., Mann, B. & Ryder, N. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 33, 1877\u20131901 (2020).","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"958_CR14","first-page":"1","volume":"55","author":"P Liu","year":"2023","unstructured":"Liu, P. et al. Pre-train, prompt, and predict: a systematic survey of prompting methods in natural language processing. ACM Comput. Surv. 55, 1\u201335 (2023).","journal-title":"ACM Comput. Surv."},{"key":"958_CR15","doi-asserted-by":"publisher","first-page":"194","DOI":"10.1038\/s41746-022-00742-2","volume":"5","author":"X Yang","year":"2022","unstructured":"Yang, X. et al. A large language model for electronic health records. NPJ Digit. Med. 5, 194 (2022).","journal-title":"NPJ Digit. Med."},{"key":"958_CR16","unstructured":"Gao, L. et al. The Pile: an 800GB Dataset of Diverse Text for Language Modeling. arXiv:2101.00027 (2020)."},{"key":"958_CR17","doi-asserted-by":"publisher","first-page":"681","DOI":"10.1007\/s11023-020-09548-1","volume":"30","author":"L Floridi","year":"2020","unstructured":"Floridi, L. & Chiriatti, M. GPT-3: its nature, scope, limits, and consequences. Minds Mach. 30, 681\u2013694 (2020).","journal-title":"Minds Mach."},{"key":"958_CR18","doi-asserted-by":"crossref","unstructured":"Luo, R. et al. BioGPT: generative pre-trained transformer for biomedical text generation and mining. Brief. Bioinform. 23, bbac409 (2022).","DOI":"10.1093\/bib\/bbac409"},{"key":"958_CR19","doi-asserted-by":"publisher","unstructured":"Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: pre-training of deep bidirectional transformers for language understanding. in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) 4171\u20134186 (Association for Computational Linguistics, 2019). https:\/\/doi.org\/10.18653\/v1\/N19-1423.","DOI":"10.18653\/v1\/N19-1423"},{"key":"958_CR20","doi-asserted-by":"publisher","unstructured":"Mohammed, M., Khan, M. B. & Bashier, E. B. M. Machine Learning (CRC Press, 2016). https:\/\/doi.org\/10.1201\/9781315371658.","DOI":"10.1201\/9781315371658"},{"key":"958_CR21","doi-asserted-by":"publisher","first-page":"135","DOI":"10.1038\/s41746-023-00879-8","volume":"6","author":"M Wornow","year":"2023","unstructured":"Wornow, M. et al. The shaky foundations of large language models and foundation models for electronic health records. NPJ Digit Med. 6, 135 (2023).","journal-title":"NPJ Digit Med."},{"key":"958_CR22","doi-asserted-by":"publisher","DOI":"10.1038\/sdata.2016.35","volume":"3","author":"AEW Johnson","year":"2016","unstructured":"Johnson, A. E. W. et al. MIMIC-III, a freely accessible critical care database. Sci. Data 3, 160035 (2016).","journal-title":"Sci. Data"},{"key":"958_CR23","doi-asserted-by":"publisher","first-page":"103938","DOI":"10.1016\/j.jbi.2021.103938","volume":"124","author":"T Searle","year":"2021","unstructured":"Searle, T., Ibrahim, Z., Teo, J. & Dobson, R. Estimating redundancy in clinical text. J. Biomed. Inform. 124, 103938 (2021).","journal-title":"J. Biomed. Inform."},{"key":"958_CR24","doi-asserted-by":"publisher","first-page":"2193","DOI":"10.1093\/jamia\/ocab112","volume":"28","author":"J Li","year":"2021","unstructured":"Li, J. et al. Are synthetic clinical notes useful for real natural language processing tasks: a case study on clinical entity recognition. J. Am. Med. Inform. Assoc. 28, 2193\u20132201 (2021).","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"958_CR25","doi-asserted-by":"publisher","unstructured":"Huguet Cabot, P.-L. & Navigli, R. REBEL: relation extraction by end-to-end language generation. in Findings of the Association for Computational Linguistics: EMNLP 2021 2370\u20132381 (Association for Computational Linguistics, 2021). https:\/\/doi.org\/10.18653\/v1\/2021.findings-emnlp.204.","DOI":"10.18653\/v1\/2021.findings-emnlp.204"},{"key":"958_CR26","doi-asserted-by":"publisher","unstructured":"Peng, C. et al. Clinical concept and relation extraction using prompt-based machine reading comprehension. J. Am. Med. Inform. Assoc. https:\/\/doi.org\/10.1093\/jamia\/ocad107 (2023).","DOI":"10.1093\/jamia\/ocad107"},{"key":"958_CR27","doi-asserted-by":"publisher","first-page":"564","DOI":"10.1001\/jamainternmed.2022.0372","volume":"182","author":"A Gaffney","year":"2022","unstructured":"Gaffney, A. et al. Medical documentation burden among US office-based physicians in 2019: a national study. JAMA Intern. Med. 182, 564\u2013566 (2022).","journal-title":"JAMA Intern. Med."},{"key":"958_CR28","doi-asserted-by":"publisher","first-page":"50","DOI":"10.7326\/M18-0139","volume":"169","author":"NL Downing","year":"2018","unstructured":"Downing, N. L., Bates, D. W. & Longhurst, C. A. Physician burnout in the electronic health record era: are we ignoring the real cause? Ann. Intern. Med. 169, 50 (2018).","journal-title":"Ann. Intern. Med."},{"key":"958_CR29","doi-asserted-by":"publisher","first-page":"e199609","DOI":"10.1001\/jamanetworkopen.2019.9609","volume":"2","author":"PJ Kroth","year":"2019","unstructured":"Kroth, P. J. et al. Association of electronic health record design and use factors with clinician stress and burnout. JAMA Netw. Open 2, e199609 (2019).","journal-title":"JAMA Netw. Open"},{"key":"958_CR30","unstructured":"Diaz, N. Epic to use Microsoft\u2019s GPT-4 in EHRs. https:\/\/www.beckershospitalreview.com\/ehrs\/epic-to-use-microsofts-open-ai-in-ehrs.html."},{"key":"958_CR31","unstructured":"Trang, B. We\u2019re getting much more aggressive\u2019: Microsoft\u2019s Nuance adds GPT-4 AI to its medical note-taking tool. https:\/\/www.statnews.com\/2023\/03\/20\/microsoft-nuance-gpt4-dax-chatgpt\/."},{"key":"958_CR32","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1038\/s41586-023-06160-y","volume":"619","author":"LY Jiang","year":"2023","unstructured":"Jiang, L. Y. et al. Health system-scale language models are all-purpose prediction engines. Nature 619, 357\u2013362 (2023).","journal-title":"Nature"},{"key":"958_CR33","doi-asserted-by":"publisher","unstructured":"Kleesiek, J., Wu, Y., Stiglic, G., Egger, J. & Bian, J. An opinion on ChatGPT in health care-written by humans only. J. Nucl. Med. https:\/\/doi.org\/10.2967\/jnumed.123.265687 (2023).","DOI":"10.2967\/jnumed.123.265687"},{"key":"958_CR34","unstructured":"Ouyang, L. et al. Training language models to follow instructions with human feedback. arXiv [cs.CL] (2022)."},{"key":"958_CR35","unstructured":"Ray, S. Samsung bans ChatGPT among employees after sensitive code leak. Forbes Magazine (2023)."},{"key":"958_CR36","doi-asserted-by":"publisher","first-page":"183","DOI":"10.1126\/science.aal4230","volume":"356","author":"A Caliskan","year":"2017","unstructured":"Caliskan, A., Bryson, J. J. & Narayanan, A. Semantics derived automatically from language corpora contain human-like biases. Science 356, 183\u2013186 (2017).","journal-title":"Science"},{"key":"958_CR37","unstructured":"Center for Devices & Radiological Health. Artificial Intelligence and Machine Learning in Software as a Medical Device. U.S. Food and Drug Administration https:\/\/www.fda.gov\/medical-devices\/software-medical-device-samd\/artificial-intelligence-and-machine-learning-software-medical-device."},{"key":"958_CR38","doi-asserted-by":"publisher","DOI":"10.1186\/s12911-019-0935-4","volume":"19","author":"X Yang","year":"2019","unstructured":"Yang, X. et al. A study of deep learning methods for de-identification of clinical notes in cross-institute settings. BMC Med. Inform. Decis. Mak. 19, 232 (2019).","journal-title":"BMC Med. Inform. Decis. Mak."},{"key":"958_CR39","unstructured":"Levine, Y., Wies, N., Sharir, O., Bata, H. & Shashua, A. The depth-to-width interplay in self-attention. arXiv [cs.LG] (2020)."},{"key":"958_CR40","unstructured":"Shoeybi, M. et al. Megatron-LM: training multi-billion parameter language models using model parallelism. arXiv [cs.CL] (2019)."},{"key":"958_CR41","doi-asserted-by":"publisher","unstructured":"Li, X. L. & Liang, P. Prefix-tuning: optimizing continuous prompts for generation. in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers) 4582\u20134597 (Association for Computational Linguistics, 2021). https:\/\/doi.org\/10.18653\/v1\/2021.acl-long.353.","DOI":"10.18653\/v1\/2021.acl-long.353"},{"key":"958_CR42","first-page":"1","volume":"59","author":"P Liu","year":"2023","unstructured":"Liu, P., Yuan, W., Fu, J., Jiang, Z., Hayashi, H. & Neubig, G. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Computing Surveys. 59, 1\u201335 (2023).","journal-title":"ACM Computing Surveys."},{"key":"958_CR43","unstructured":"Radford A., Wu J., Child R., Luan D. & Amodei D. Language models are unsupervised multitask learners. OpenAI, 1, (2019)"},{"key":"958_CR44","doi-asserted-by":"crossref","unstructured":"The ddi corpus: An annotated corpus with pharmacological sub-stances and drug-drug interactions. J. Biomed. Inform. 46, 914\u2013920 (2013).","DOI":"10.1016\/j.jbi.2013.07.011"},{"key":"958_CR45","doi-asserted-by":"publisher","first-page":"baw068","DOI":"10.1093\/database\/baw068","volume":"2016","author":"J Li","year":"2016","unstructured":"Li, J. et al. BioCreative V CDR task corpus: a resource for chemical disease relation extraction. Database (Oxf.) 2016, baw068 (2016).","journal-title":"Database (Oxf.)"},{"key":"958_CR46","doi-asserted-by":"publisher","first-page":"5100","DOI":"10.1093\/bioinformatics\/btac648","volume":"38","author":"Y Hou","year":"2022","unstructured":"Hou, Y. et al. Discovering drug\u2013target interaction knowledge from biomedical literature. Bioinformatics 38, 5100\u20135107 (2022).","journal-title":"Bioinformatics"},{"key":"958_CR47","doi-asserted-by":"publisher","unstructured":"Jin, Q., Dhingra, B., Liu, Z., Cohen, W. & Lu, X. PubMedQA: a dataset for biomedical research question answering. in Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (Association for Computational Linguistics, 2019). https:\/\/doi.org\/10.18653\/v1\/d19-1259.","DOI":"10.18653\/v1\/d19-1259"},{"key":"958_CR48","unstructured":"Singhal, K. et al. Large language models encode clinical knowledge. arXiv [cs.CL] (2022)."},{"key":"958_CR49","first-page":"6421","volume":"11","author":"D Jin","year":"2021","unstructured":"Jin, D. et al. What disease does this patient have? A large-scale open domain question answering dataset from medical exams. NATO Adv. Sci. Inst. E Appl. Sci. 11, 6421 (2021).","journal-title":"NATO Adv. Sci. Inst. E Appl. Sci."},{"key":"958_CR50","unstructured":"NeMo: NeMo: a toolkit for conversational AI. (NVIDIA GitHub)."},{"key":"958_CR51","unstructured":"Holtzman A., Buys J., Forbes M. & Choi Y. The curious case of neural text degeneration. arXiv preprint arXiv:1904.09751 (2019)."},{"key":"958_CR52","doi-asserted-by":"publisher","unstructured":"Clark, E., Ji, Y. & Smith, N. A. Neural text generation in stories using entity representations as context. in Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers) 2250\u20132260 (Association for Computational Linguistics, 2018). https:\/\/doi.org\/10.18653\/v1\/N18-1204.","DOI":"10.18653\/v1\/N18-1204"},{"key":"958_CR53","unstructured":"Celikyilmaz, A., Clark, E. & Gao, J. Evaluation of text generation: a survey. arXiv preprint arXiv:2006.14799 (2020)."},{"key":"958_CR54","unstructured":"Holtzman, A., Buys, J., Du, L., Forbes, M. & Choi, Y. The curious case of neural text degeneration. arXiv preprint arXiv:1904.09751 (2019)."},{"key":"958_CR55","unstructured":"Huang, K., Altosaar, J. & Ranganath, R. ClinicalBERT: modeling clinical notes and predicting hospital readmission. arXiv preprint arXiv:1904.05342 (2019)."},{"key":"958_CR56","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2288-13-61","volume":"13","author":"N Wongpakaran","year":"2013","unstructured":"Wongpakaran, N., Wongpakaran, T., Wedding, D. & Gwet, K. L. A comparison of Cohen\u2019s Kappa and Gwet\u2019s AC1 when calculating inter-rater reliability coefficients: a study conducted with personality disorder samples. BMC Med. Res. Methodol. 13, 61 (2013).","journal-title":"BMC Med. Res. Methodol."}],"container-title":["npj Digital Medicine"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.nature.com\/articles\/s41746-023-00958-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41746-023-00958-w","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41746-023-00958-w.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,16]],"date-time":"2023-11-16T20:29:19Z","timestamp":1700166559000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.nature.com\/articles\/s41746-023-00958-w"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,11,16]]},"references-count":56,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["958"],"URL":"https:\/\/doi.org\/10.1038\/s41746-023-00958-w","relation":{},"ISSN":["2398-6352"],"issn-type":[{"value":"2398-6352","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,11,16]]},"assertion":[{"value":"5 June 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 November 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 November 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"K.E.S., N.P.N., A.B.C., C.M., and M.G.F. are employed by NVIDIA. There are no other competing financial or non-financial interests. The work presented in this study was conducted exclusively within the University of Florida Health.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"210"}}