{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,26]],"date-time":"2026-06-26T22:06:14Z","timestamp":1782511574919,"version":"3.54.5"},"reference-count":46,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2026,2,18]],"date-time":"2026-02-18T00:00:00Z","timestamp":1771372800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T00:00:00Z","timestamp":1774569600000},"content-version":"vor","delay-in-days":37,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100005972","name":"German Cancer Aid","doi-asserted-by":"crossref","award":["DECADE, 70115166"],"award-info":[{"award-number":["DECADE, 70115166"]}],"id":[{"id":"10.13039\/501100005972","id-type":"DOI","asserted-by":"crossref"}]},{"name":"German Academic Exchange Service","award":["SECAI, 57616814"],"award-info":[{"award-number":["SECAI, 57616814"]}]},{"name":"German Federal Joint Committee","award":["TransplantKI, 01VSF21048"],"award-info":[{"award-number":["TransplantKI, 01VSF21048"]}]},{"name":"European Union\u2019s Horizon Europe research and innovation programme","award":["ODELIA, 101057091; GENIAL, 101096312"],"award-info":[{"award-number":["ODELIA, 101057091; GENIAL, 101096312"]}]},{"name":"European Research Council","award":["ERC; NADIR, 101114631"],"award-info":[{"award-number":["ERC; NADIR, 101114631"]}]},{"name":"National Institute for Health and Care Research (NIHR) Leeds Biomedical Research Centre","award":["NIHR203331"],"award-info":[{"award-number":["NIHR203331"]}]},{"name":"German Research Foundation DFG","award":["TRR 412\/1, 535081457; SFB 1709\/1 2025, 533056198"],"award-info":[{"award-number":["TRR 412\/1, 535081457; SFB 1709\/1 2025, 533056198"]}]},{"DOI":"10.13039\/100001006","name":"Breast Cancer Research 
Foundation","doi-asserted-by":"publisher","award":["BELLADONNA, BCRF-25-225"],"award-info":[{"award-number":["BELLADONNA, BCRF-25-225"]}],"id":[{"id":"10.13039\/100001006","id-type":"DOI","asserted-by":"publisher"}]},{"name":"German Federal Ministry of Research, Technology and Space BMFTR","award":["PEARL, 01KD2104C; CAMINO, 01EO2101; TRANSFORM LIVER, 031L0312A; TANGERINE, 01KT2302 through ERA-NET Transcan; Come2Data, 16DKZ2044A; DEEP-HCC, 031L0315A; DECIPHER-M, 01KD2420A; NextBIG, 01ZU2402A"],"award-info":[{"award-number":["PEARL, 01KD2104C; CAMINO, 01EO2101; TRANSFORM LIVER, 031L0312A; TANGERINE, 01KT2302 through ERA-NET Transcan; Come2Data, 16DKZ2044A; DEEP-HCC, 031L0315A; DECIPHER-M, 01KD2420A; NextBIG, 01ZU2402A"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["npj Digit. Med."],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    Agentic artificial intelligence (AI) systems, designed to autonomously reason, plan, and invoke tools, have shown promise in healthcare, yet systematic benchmarking of their real-world performance remains limited. In this study, we evaluate two such systems: the open-source OpenManus, built on Meta\u2019s Llama-4 and extended with medically customized agents; and Manus, a proprietary agent system employing a multistep planner-executor-verifier architecture. Both systems were assessed across three benchmark families:\n                    <jats:italic>AgentClinic<\/jats:italic>\n                    , a stepwise dialog-based diagnostic simulation;\n                    <jats:italic>MedAgentsBench<\/jats:italic>\n                    , a knowledge-intensive medical QA dataset; and\n                    <jats:italic>Humanity\u2019s Last Exam<\/jats:italic>\n                    (HLE), a suite of challenging text-only and multimodal questions. 
Despite access to advanced tools (e.g., web browsing, code development and execution, and text file editing) agent systems yielded only modest accuracy gains over baseline LLMs, reaching 60.3% and 28.0% in AgentClinic MedQA and MIMIC, 30.3% on MedAgentsBench, and 8.6% on HLE text. Multimodal accuracy remained low (15.5% on multimodal HLE, 29.2% on AgentClinic NEJM), while resource demands increased substantially, with &gt;10\u00d7 token usage and &gt;2\u00d7 latency. Although 89.9% of hallucinations were filtered by in-agent safeguards, hallucinations remained prevalent. These findings reveal that current agentic designs offer modest performance benefits at significant computational and workflow cost, underscoring the need for more accurate, efficient, and clinically viable agent systems.\n                  <\/jats:p>","DOI":"10.1038\/s41746-026-02443-6","type":"journal-article","created":{"date-parts":[[2026,2,18]],"date-time":"2026-02-18T05:56:20Z","timestamp":1771394180000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Benchmarking large language model-based agent systems for clinical decision tasks"],"prefix":"10.1038","volume":"9","author":[{"given":"Yunsong","family":"Liu","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zunamys 
I.","family":"Carrero","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaofeng","family":"Jiang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Dyke","family":"Ferber","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Georg","family":"W\u00f6lflein","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Li","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sanddhya","family":"Jayabalan","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Tim","family":"Lenz","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Zhouguang","family":"Hui","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jakob Nikolas","family":"Kather","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2026,2,18]]},"reference":[{"key":"2443_CR1","doi-asserted-by":"publisher","first-page":"2199","DOI":"10.1001\/jama.2018.17163","volume":"320","author":"E Shortliffe","year":"2018","unstructured":"Shortliffe, E. & Sep\u00falveda, M. Clinical decision support in the era of artificial intelligence. JAMA 320, 2199\u20132200 (2018).","journal-title":"JAMA"},{"key":"2443_CR2","first-page":"e57728","volume":"16","author":"M Elhaddad","year":"2024","unstructured":"Elhaddad, M. & Hamam, S. AI-driven clinical decision support systems: an ongoing pursuit of potential. Cureus 16, e57728 (2024).","journal-title":"Cureus"},{"key":"2443_CR3","doi-asserted-by":"publisher","first-page":"255","DOI":"10.1002\/hcs2.61","volume":"2","author":"R Yang","year":"2023","unstructured":"Yang, R. et al. 
Large language models in health care: development, applications, and challenges. Health Care Sci. 2, 255\u2013263 (2023).","journal-title":"Health Care Sci."},{"key":"2443_CR4","doi-asserted-by":"publisher","first-page":"319","DOI":"10.1001\/jama.2024.21700","volume":"333","author":"S Bedi","year":"2025","unstructured":"Bedi, S. et al. Testing and evaluation of health care applications of large language models: a systematic review. JAMA 333, 319\u2013328 (2025).","journal-title":"JAMA"},{"key":"2443_CR5","doi-asserted-by":"publisher","first-page":"1134","DOI":"10.1038\/s41591-024-02855-5","volume":"30","author":"D Van Veen","year":"2024","unstructured":"Van Veen, D. et al. Adapted large language models can outperform medical experts in clinical text summarization. Nat. Med. 30, 1134\u20131142 (2024).","journal-title":"Nat. Med."},{"key":"2443_CR6","doi-asserted-by":"publisher","unstructured":"Nori, H., King, N., McKinney, S. M., Carignan, D. & Horvitz, E. Capabilities of GPT-4 on medical challenge problems. Preprint at arXiv https:\/\/doi.org\/10.48550\/arXiv.2303.13375 (2023).","DOI":"10.48550\/arXiv.2303.13375"},{"key":"2443_CR7","doi-asserted-by":"publisher","first-page":"943","DOI":"10.1038\/s41591-024-03423-7","volume":"31","author":"K Singhal","year":"2025","unstructured":"Singhal, K. et al. Toward expert-level medical question answering with large language models. Nat. Med. 31, 943\u2013950 (2025).","journal-title":"Nat. Med."},{"key":"2443_CR8","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1038\/s41746-025-01543-z","volume":"8","author":"H Takita","year":"2025","unstructured":"Takita, H. et al. A systematic review and meta-analysis of diagnostic performance comparison between generative AI and physicians. NPJ Digit. Med. 8, 175 (2025).","journal-title":"NPJ Digit. Med."},{"key":"2443_CR9","doi-asserted-by":"publisher","unstructured":"Jiang, Y. et al. MedAgentBench: A Virtual EHR Environment to Benchmark Medical LLM Agents. 
NEJM AI 2 https:\/\/doi.org\/10.1056\/AIdbp2500144 (2025).","DOI":"10.1056\/AIdbp2500144"},{"key":"2443_CR10","doi-asserted-by":"publisher","unstructured":"Schmidgall, S. et al. AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments. arXiv https:\/\/doi.org\/10.48550\/arXiv.2405.07960 (2024).","DOI":"10.48550\/arXiv.2405.07960"},{"key":"2443_CR11","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1038\/s41591-024-03328-5","volume":"31","author":"S Johri","year":"2025","unstructured":"Johri, S. et al. An evaluation framework for clinical use of large language models in patient interaction tasks. Nat. Med. 31, 77\u201386 (2025).","journal-title":"Nat. Med."},{"key":"2443_CR12","doi-asserted-by":"publisher","unstructured":"Tang, X. et al. MedAgentsBench: benchmarking thinking models and agent frameworks for complex medical reasoning. Preprint at arXiv https:\/\/doi.org\/10.48550\/arXiv.2503.07459 (2025).","DOI":"10.48550\/arXiv.2503.07459"},{"key":"2443_CR13","doi-asserted-by":"publisher","first-page":"103599","DOI":"10.1016\/j.inffus.2025.103599","volume":"126","author":"R Sapkota","year":"2026","unstructured":"Sapkota, R., Roumeliotis, K. I. & Karkee, M. AI agents vs. agentic AI: a conceptual taxonomy, applications and challenges. Inf. Fusion 126, 103599 (2026).","journal-title":"Inf. Fusion"},{"key":"2443_CR14","doi-asserted-by":"crossref","unstructured":"Xi, Z. et al. The rise and potential of large language model based agents: a survey. Sci. China Inf. Sci. 68, 121101 (2025).","DOI":"10.1007\/s11432-024-4222-0"},{"key":"2443_CR15","unstructured":"Hong, S. et al. MetaGPT: Meta programming for a multi-agent collaborative framework. In Proc. International Conference Learning Representations Vol. 2024, 23247\u201323275 (2024)."},{"key":"2443_CR16","doi-asserted-by":"publisher","first-page":"e2502649","DOI":"10.1002\/adma.202502649","volume":"37","author":"MJ Robson","year":"2025","unstructured":"Robson, M. 
J., Xu, S., Wang, Z., Chen, Q. & Ciucci, F. Multi-agent-network-based idea generator for zinc-ion battery electrolyte discovery: a case study on zinc tetrafluoroborate hydrate-based deep eutectic electrolytes. Adv. Mater. 37, e2502649 (2025).","journal-title":"Adv. Mater."},{"key":"2443_CR17","doi-asserted-by":"crossref","unstructured":"Su, H. et al. Many heads are better than one: improved scientific idea generation by A LLM-based multi-agent system. In (eds. Che, W., Nabende, J., Shutova, E. & Pilehvar, M. T.) Proc. 63rd Annual Meeting of the Association for Computational Linguistics, Vol. 1: Long Papers, 28201\u201328240 (Association for Computational Linguistics, 2025).","DOI":"10.18653\/v1\/2025.acl-long.1368"},{"key":"2443_CR18","doi-asserted-by":"publisher","first-page":"541","DOI":"10.1038\/s41746-025-01940-4","volume":"8","author":"R Li","year":"2025","unstructured":"Li, R. et al. CARE-AD: a multi-agent large language model framework for Alzheimer\u2019s disease prediction using longitudinal clinical notes. NPJ Digit. Med. 8, 541 (2025).","journal-title":"NPJ Digit. Med."},{"key":"2443_CR19","doi-asserted-by":"publisher","unstructured":"Shen, M. & Yang, Q. From mind to machine: the rise of Manus AI as a fully autonomous digital agent. Preprint at arXiv https:\/\/doi.org\/10.48550\/arXiv.2505.02024 (2025).","DOI":"10.48550\/arXiv.2505.02024"},{"key":"2443_CR20","unstructured":"LLMHacker. Manus AI: the best autonomous AI agent redefining automation and productivity. https:\/\/huggingface.co\/blog\/LLMhacker\/manus-ai-best-ai-agent."},{"key":"2443_CR21","doi-asserted-by":"publisher","first-page":"432","DOI":"10.1038\/s41551-025-01363-2","volume":"9","author":"M Moritz","year":"2025","unstructured":"Moritz, M., Topol, E. & Rajpurkar, P. Coordinated AI agents for advancing healthcare. Nat. Biomed. Eng. 9, 432\u2013438 (2025).","journal-title":"Nat. Biomed. Eng."},{"key":"2443_CR22","unstructured":"Wang, Q. et al. 
AgentTaxo: Dissecting and Benchmarking Token Distribution of LLM Multi-Agent Systems. ICLR 2025 Workshop on Foundation Models in the Wild. https:\/\/openreview.net\/forum?id=0iLbiYYIpC (2025)."},{"key":"2443_CR23","doi-asserted-by":"publisher","first-page":"153","DOI":"10.1038\/s41746-025-01546-w","volume":"8","author":"M Fern\u00e1ndez-Pichel","year":"2025","unstructured":"Fern\u00e1ndez-Pichel, M., Pichel, J. C. & Losada, D. E. Evaluating search engines and large language models for answering health questions. NPJ Digit. Med. 8, 153 (2025).","journal-title":"NPJ Digit. Med."},{"key":"2443_CR24","doi-asserted-by":"publisher","first-page":"543","DOI":"10.1038\/s41746-025-01955-x","volume":"8","author":"J Song","year":"2025","unstructured":"Song, J., Xu, Z., He, M., Feng, J. & Shen, B. Graph retrieval augmented large language models for facial phenotype associated rare genetic disease. NPJ Digit. Med. 8, 543 (2025).","journal-title":"NPJ Digit. Med."},{"key":"2443_CR25","doi-asserted-by":"publisher","first-page":"e106187","DOI":"10.7554\/eLife.106187","volume":"14","author":"SZY Sim","year":"2025","unstructured":"Sim, S. Z. Y. & Chen, T. Critique of impure reason: unveiling the reasoning behaviour of medical large language models. Elife 14, e106187 (2025).","journal-title":"Elife"},{"key":"2443_CR26","doi-asserted-by":"publisher","unstructured":"Nori, H. et al. Sequential diagnosis with language models. Preprint at arXiv https:\/\/doi.org\/10.48550\/arXiv.2506.22405 (2025).","DOI":"10.48550\/arXiv.2506.22405"},{"key":"2443_CR27","doi-asserted-by":"publisher","first-page":"159","DOI":"10.1038\/s41746-025-01550-0","volume":"8","author":"X Chen","year":"2025","unstructured":"Chen, X. et al. Enhancing diagnostic capability with multi-agents conversational large language models. NPJ Digit. Med. 8, 159 (2025).","journal-title":"NPJ Digit. Med."},{"key":"2443_CR28","doi-asserted-by":"publisher","unstructured":"Ji, Z. et al. 
Survey of hallucination in natural language generation. ACM Comput. Surv. https:\/\/doi.org\/10.1145\/3571730 (2022).","DOI":"10.1145\/3571730"},{"key":"2443_CR29","doi-asserted-by":"publisher","first-page":"499","DOI":"10.3390\/a18080499","volume":"18","author":"S Brohi","year":"2025","unstructured":"Brohi, S., Mastoi, Q.-U. -A., Jhanjhi, N. Z. & Pillai, T. R. A research landscape of agentic AI and large language models: applications, challenges and future directions. Algorithms 18, 499 (2025).","journal-title":"Algorithms"},{"key":"2443_CR30","doi-asserted-by":"publisher","unstructured":"Xu, G. et al. A comprehensive survey of AI Agents in Healthcare. Preprint at TechRxiv https:\/\/doi.org\/10.36227\/techrxiv.176240542.22279040\/v2 (2025).","DOI":"10.36227\/techrxiv.176240542.22279040\/v2"},{"key":"2443_CR31","doi-asserted-by":"publisher","first-page":"6421","DOI":"10.3390\/app11146421","volume":"11","author":"D Jin","year":"2021","unstructured":"Jin, D. et al. What disease does this patient have? A large-scale open domain question answering dataset from medical exams. Appl. Sci. 11, 6421 (2021).","journal-title":"Appl. Sci."},{"key":"2443_CR32","doi-asserted-by":"publisher","unstructured":"Johnson, A. et al. MIMIC-IV (version 3.1). PhysioNet https:\/\/doi.org\/10.13026\/kpb9-mt58 (2024).","DOI":"10.13026\/kpb9-mt58"},{"key":"2443_CR33","doi-asserted-by":"publisher","DOI":"10.1038\/s41597-022-01899-x","volume":"10","author":"AEW Johnson","year":"2023","unstructured":"Johnson, A. E. W. et al. MIMIC-IV, a freely accessible electronic health record dataset. Sci. Data 10, 1 (2023).","journal-title":"Sci. Data"},{"key":"2443_CR34","doi-asserted-by":"crossref","unstructured":"Jin, Q., Dhingra, B., Liu, Z., Cohen, W. & Lu, X. PubMedQA: A Dataset for Biomedical Research Question Answering. In Proc. 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). (eds. 
Inui, K., Jiang, J., Ng, V. & Wan, X.) 2567\u20132577 (Association for Computational Linguistics, Hong Kong, 2019).","DOI":"10.18653\/v1\/D19-1259"},{"key":"2443_CR35","unstructured":"Pal, A., Umapathi, L. K. & Sankarasubbu, M. MedMCQA: A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering. In Proc. Conference on Health, Inference, and Learning. (eds. Flores, G., Chen, G. H., Pollard, T., Ho, J. C. & Naumann, T.) Vol. 174, 248\u2013260 (PMLR, 2022)."},{"key":"2443_CR36","doi-asserted-by":"publisher","unstructured":"Kim, Y., Wu, J., Abdulle, Y. & Wu, H. MedExQA: medical question answering benchmark with multiple explanations. Preprint at arXiv https:\/\/doi.org\/10.48550\/arXiv.2406.06331 (2024).","DOI":"10.48550\/arXiv.2406.06331"},{"key":"2443_CR37","doi-asserted-by":"crossref","unstructured":"Wang, Y. et al. MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark. In Advances In Neural Information Processing Systems. (eds. Globerson, A. et al.) Vol. 37, 95266\u201395290 (Curran Associates, Inc., 2024).","DOI":"10.52202\/079017-3018"},{"key":"2443_CR38","doi-asserted-by":"crossref","unstructured":"Chen, H., Fang, Z., Singla, Y. & Dredze, M. Benchmarking Large Language Models on Answering and Explaining Challenging Medical Questions. In Proc. 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies. (eds. Chiruzzo, L., Ritter, A. & Wang, L.) Vol. 1, Long Paper, 3563\u20133599 (Association for Computational Linguistics, Albuquerque, New Mexico, 2025).","DOI":"10.18653\/v1\/2025.naacl-long.182"},{"key":"2443_CR39","doi-asserted-by":"publisher","unstructured":"Hendrycks, D. et al. Measuring massive multitask language understanding. Preprint at arXiv https:\/\/doi.org\/10.48550\/arXiv.2009.03300 (2020).","DOI":"10.48550\/arXiv.2009.03300"},{"key":"2443_CR40","doi-asserted-by":"publisher","unstructured":"Zuo, Y. et al. 
MedXpertQA: benchmarking expert-level medical reasoning and understanding. Preprint at arXiv https:\/\/doi.org\/10.48550\/arXiv.2501.18362 (2025).","DOI":"10.48550\/arXiv.2501.18362"},{"key":"2443_CR41","doi-asserted-by":"crossref","unstructured":"Center for AI Safety, Scale AI & HLE Contributors Consortium. A benchmark of expert-level academic questions to assess AI capabilities. Nature 649, 1139\u20131146 (2026).","DOI":"10.1038\/s41586-025-09962-4"},{"key":"2443_CR42","doi-asserted-by":"crossref","unstructured":"Kwon, W. et al. Efficient Memory Management for Large Language Model Serving with PagedAttention. In Proc. 29th Symposium on Operating Systems Principles. 611\u2013626 (Association for Computing Machinery, New York, NY, 2023).","DOI":"10.1145\/3600006.3613165"},{"key":"2443_CR43","unstructured":"Georgi Gerganov. llama.cpp: LLM inference in C\/C++. https:\/\/github.com\/ggerganov\/llama.cpp (Github, 2023)."},{"key":"2443_CR44","unstructured":"OpenAI. OpenAI API. https:\/\/openai.com\/api (2023)."},{"key":"2443_CR45","doi-asserted-by":"publisher","unstructured":"Liang, X. et al. OpenManus: an open-source framework for building general AI agents. Preprint at https:\/\/doi.org\/10.5281\/zenodo.15186407 (2025).","DOI":"10.5281\/zenodo.15186407"},{"key":"2443_CR46","doi-asserted-by":"publisher","unstructured":"Zhu, Y. et al. MedAgentBoard: benchmarking multi-agent collaboration with conventional methods for diverse medical tasks. 
Preprint at arXiv https:\/\/doi.org\/10.48550\/arXiv.2505.12371 (2025).","DOI":"10.48550\/arXiv.2505.12371"}],"container-title":["npj Digital Medicine"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.nature.com\/articles\/s41746-026-02443-6","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41746-026-02443-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41746-026-02443-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T14:55:08Z","timestamp":1774623308000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.nature.com\/articles\/s41746-026-02443-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,2,18]]},"references-count":46,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2026,12]]}},"alternative-id":["2443"],"URL":"https:\/\/doi.org\/10.1038\/s41746-026-02443-6","relation":{},"ISSN":["2398-6352"],"issn-type":[{"value":"2398-6352","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,2,18]]},"assertion":[{"value":"9 September 2025","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 February 2026","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 February 2026","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"J.N.K. declares ongoing consulting services for AstraZeneca and Bioptimus. 
Furthermore, he holds shares in StratifAI, Synagen, and Spira Labs, has received an institutional research grant from GSK and AstraZeneca, as well as honoraria from AstraZeneca, Bayer, Daiichi Sankyo, Eisai, Janssen, Merck, MSD, BMS, Roche, Pfizer, and Fresenius. Author J.N.K. is the Deputy Editor of the npj Precision Oncology. J.N.K. was not involved in the journal\u2019s review of, or decisions related to, this manuscript.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"259"}}