{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:20:11Z","timestamp":1772166011504,"version":"3.50.1"},"reference-count":19,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2025,7,1]],"date-time":"2025-07-01T00:00:00Z","timestamp":1751328000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,7,1]],"date-time":"2025-07-01T00:00:00Z","timestamp":1751328000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Johann Wolfgang Goethe-Universit\u00e4t, Frankfurt am Main"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Inform Decis Mak"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>This paper presents a comparative study of two state-of-the-art language models, OpenAI\u2019s GPT and DeepL, in the context of terminology translation within the medical domain.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>This study was conducted on the human phenotype ontology (HPO), which is used in medical research and diagnosis. Medical experts assess the performance of both models on a set of 120 translated HPO terms and their 180 synonyms, employing a 4-point Likert scale (strongly agree\u2009=\u20091, agree\u2009=\u20092, disagree\u2009=\u20093, strongly disagree\u2009=\u20094). An independent reference translation from the HeTOP database was used to validate the quality of the translation.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>The average Likert rating for the selected HPO terms was 1.29 for GPT-3.5 and 1.37 for DeepL. The quality of the translations was also found to be satisfactory for multi-word terms with greater ontological depth. The comparison with HeTOP revealed a high degree of similarity between the models\u2019 translations and the reference translations.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>Statistical analysis revealed no significant differences in the mean ratings between the two models, indicating their comparable performance in terms of translation quality. The study not only illustrates the potential of machine translation but also shows incomplete coverage of translated medical terminology. This underscores the relevance of this study for cross-lingual medical research. However, the evaluation methods need to be further refined, specific translation issues need to be addressed, and the sample size needs to be increased to allow for more generalizable conclusions.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12911-025-03075-8","type":"journal-article","created":{"date-parts":[[2025,7,1]],"date-time":"2025-07-01T09:19:13Z","timestamp":1751361553000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Assessing GPT and DeepL for terminology translation in the medical domain: A comparative study on the human phenotype ontology"],"prefix":"10.1186","volume":"25","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5240-8484","authenticated-orcid":false,"given":"Richard","family":"Noll","sequence":"first","affiliation":[]},{"given":"Alexandra","family":"Berger","sequence":"additional","affiliation":[]},{"given":"Dominik","family":"Kieu","sequence":"additional","affiliation":[]},{"given":"Tobias","family":"Mueller","sequence":"additional","affiliation":[]},{"given":"Ferdinand","family":"O. Bohmann","sequence":"additional","affiliation":[]},{"given":"Angelina","family":"M\u00fcller","sequence":"additional","affiliation":[]},{"given":"Svea","family":"Holtz","sequence":"additional","affiliation":[]},{"given":"Philipp","family":"Stoffers","sequence":"additional","affiliation":[]},{"given":"Sebastian","family":"Hoehl","sequence":"additional","affiliation":[]},{"given":"Oya","family":"Guengoeze","sequence":"additional","affiliation":[]},{"given":"Jan-Niklas","family":"Eckardt","sequence":"additional","affiliation":[]},{"given":"Holger","family":"Storf","sequence":"additional","affiliation":[]},{"given":"Jannik","family":"Schaaf","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2025,7,1]]},"reference":[{"key":"3075_CR1","doi-asserted-by":"publisher","first-page":"5158","DOI":"10.1016\/j.eswa.2014.02.045","volume":"41","author":"M Ivanovi\u0107","year":"2014","unstructured":"Ivanovi\u0107 M, Budimac Z. An overview of ontologies and data resources in medical domains. Expert Syst Appl. 2014;41:5158\u201366.","journal-title":"Expert Syst Appl"},{"key":"3075_CR2","doi-asserted-by":"publisher","first-page":"120","DOI":"10.1016\/j.nbt.2023.08.004","volume":"77","author":"R Noll","year":"2023","unstructured":"Noll R, Frischen LS, Boeker M, Storf H, Schaaf J. Machine translation of standardized medical terminology using natural language processing: A scoping review. N Biotechnol. 2023;77:120\u201329. https:\/\/doi.org\/10.1016\/j.nbt.2023.08.004.","journal-title":"N Biotechnol"},{"key":"3075_CR3","unstructured":"DeepL. https:\/\/www.deepl.com\/en\/whydeepl\/. Accessed 12 October 2023."},{"key":"3075_CR4","doi-asserted-by":"publisher","first-page":"89","DOI":"10.3233\/SHTI230933","volume":"310","author":"R Noll","year":"2024","unstructured":"Noll R, Berger A, Facchinello C, et al. Translation of ontological concepts from English into German using commercial translation software and expert evaluation. Stud Health Technol Inform. 2024;310:89\u201393. https:\/\/doi.org\/10.3233\/SHTI230933.","journal-title":"Stud Health Technol Inform"},{"issue":"1","key":"3075_CR5","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1017\/S1351324920000601","volume":"27","author":"R Dale","year":"2021","unstructured":"Dale R. GPT-3: What\u2019s it good for? Nat Lang Eng. 2021;27(1):113\u201318. https:\/\/doi.org\/10.1017\/S1351324920000601.","journal-title":"Nat Lang Eng"},{"issue":"D1","key":"3075_CR6","doi-asserted-by":"publisher","first-page":"D1207","DOI":"10.1093\/nar\/gkaa1043","volume":"49","author":"S K\u00f6hler","year":"2021","unstructured":"K\u00f6hler S, Gargano M, Matentzoglu N, et al. The human phenotype ontology in 2021. Nucleic Acids Res. 2021;49(D1):D1207\u201317. https:\/\/doi.org\/10.1093\/nar\/gkaa1043.","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"3075_CR7","doi-asserted-by":"publisher","first-page":"D1333","DOI":"10.1093\/nar\/gkad1005","volume":"52","author":"MA Gargano","year":"2024","unstructured":"Gargano MA, Matentzoglu N, Coleman B, et al. The human phenotype ontology in 2024: Phenotypes around the world. Nucleic Acids Res. 2024;52(D1):D1333\u201346. https:\/\/doi.org\/10.1093\/nar\/gkad1005.","journal-title":"Nucleic Acids Res"},{"key":"3075_CR8","unstructured":"Robinson P, K\u00f6hler S, et al. https:\/\/github.com\/obophenotype\/human-phenotype-ontology. 26 July 2024."},{"key":"3075_CR9","doi-asserted-by":"publisher","unstructured":"Koubaa A. GPT-4 vs. GPT-3.5: A concise showdown. Prepr. 2023. https:\/\/doi.org\/10.20944\/preprints202303.0422.v1.","DOI":"10.20944\/preprints202303.0422.v1"},{"key":"3075_CR10","unstructured":"HPO. https:\/\/hpo.jax.org\/. Accessed 12 July 2024."},{"key":"3075_CR11","unstructured":"ChatGPT. https:\/\/chat.openai.com\/. Accessed 12 July 2024."},{"key":"3075_CR12","unstructured":"DeepL API. https:\/\/www.deepl.com\/en\/docs-api\/. Accessed 12 October 2023."},{"key":"3075_CR13","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1214\/aoms\/1177730491","volume":"18","author":"HB Mann","year":"1947","unstructured":"Mann HB, Whitney DR. On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat. 1947;18:50\u201360. https:\/\/doi.org\/10.1214\/aoms\/1177730491.","journal-title":"Ann Math Stat"},{"key":"3075_CR14","first-page":"212","volume-title":"Encyclopedia of statistical sciences","author":"GG Koch","year":"1982","unstructured":"Koch GG. Intraclass correlation coefficient. In: Kotz S, Johnson NL, editors. Encyclopedia of statistical sciences. New York: John Wiley & Sons, Inc.;1982. p. 212\u201317."},{"issue":"23","key":"3075_CR15","doi-asserted-by":"publisher","first-page":"3116","DOI":"10.1002\/sim.6546","volume":"34","author":"KP Nelson","year":"2015","unstructured":"Nelson KP, Edwards D. Measures of agreement between many raters for ordinal classifications. Stat Med. 2015;34(23):3116\u201332. https:\/\/doi.org\/10.1002\/sim.6546.","journal-title":"Stat Med"},{"key":"3075_CR16","unstructured":"HeTOP. https:\/\/www.hetop.eu\/hetop\/. Accessed 12 July 2024."},{"issue":"406","key":"3075_CR17","doi-asserted-by":"publisher","first-page":"414","DOI":"10.2307\/2289924","volume":"84","author":"MA Jaro","year":"1989","unstructured":"Jaro MA. Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida. J Am Stat Assoc. 1989;84(406):414\u201320. https:\/\/doi.org\/10.2307\/2289924.","journal-title":"J Am Stat Assoc"},{"key":"3075_CR18","doi-asserted-by":"publisher","first-page":"178","DOI":"10.3233\/SHTI210144","volume":"281","author":"A Prunotto","year":"2021","unstructured":"Prunotto A, Schulz S, Boeker M. Automatic generation of german translation candidates for snomed ct textual descriptions. Stud Health Technol Inform. 2021;281:178\u201382. https:\/\/doi.org\/10.3233\/SHTI210144.","journal-title":"Stud Health Technol Inform"},{"key":"3075_CR19","unstructured":"OpenAI. https:\/\/openai.com\/gpt-4. Accessed 12 July 2024."}],"container-title":["BMC Medical Informatics and Decision Making"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-025-03075-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12911-025-03075-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-025-03075-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,1]],"date-time":"2025-07-01T21:03:23Z","timestamp":1751403803000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcmedinformdecismak.biomedcentral.com\/articles\/10.1186\/s12911-025-03075-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,1]]},"references-count":19,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,12]]}},"alternative-id":["3075"],"URL":"https:\/\/doi.org\/10.1186\/s12911-025-03075-8","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-4836251\/v1","asserted-by":"object"}]},"ISSN":["1472-6947"],"issn-type":[{"value":"1472-6947","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,1]]},"assertion":[{"value":"31 July 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 June 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 July 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"237"}}