{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T18:12:40Z","timestamp":1775585560392,"version":"3.50.1"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,3,26]],"date-time":"2020-03-26T00:00:00Z","timestamp":1585180800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,3,26]],"date-time":"2020-03-26T00:00:00Z","timestamp":1585180800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000025","name":"U.S. Department of Health & Human Services | NIH | National Institute of Mental Health","doi-asserted-by":"publisher","award":["K01MH109836"],"award-info":[{"award-number":["K01MH109836"]}],"id":[{"id":"10.13039\/100000025","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003725","name":"National Research Foundation of Korea","doi-asserted-by":"publisher","award":["200-20190251"],"award-info":[{"award-number":["200-20190251"]}],"id":[{"id":"10.13039\/501100003725","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["npj Digit. Med."],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>\n                    Nationwide population-based cohort provides a new opportunity to build an automated risk prediction model based on individuals\u2019 history of health and healthcare beyond existing risk prediction models. We tested the possibility of machine learning models to predict future incidence of Alzheimer\u2019s disease (AD) using large-scale administrative health data. From the Korean National Health Insurance Service database between 2002 and 2010, we obtained de-identified health data in elders above 65 years (\n                    <jats:italic>N<\/jats:italic>\n                    \u2009=\u200940,736) containing 4,894 unique clinical features including ICD-10 codes, medication codes, laboratory values, history of personal and family illness and socio-demographics. To define incident AD we considered two operational definitions: \u201cdefinite AD\u201d with diagnostic codes and dementia medication (\n                    <jats:italic>n<\/jats:italic>\n                    \u2009=\u2009614) and \u201cprobable AD\u201d with only diagnosis (\n                    <jats:italic>n<\/jats:italic>\n                    \u2009=\u20092026). We trained and validated random forest, support vector machine and logistic regression to predict incident AD in 1, 2, 3, and 4 subsequent years. For predicting future incidence of AD in balanced samples (bootstrapping), the machine learning models showed reasonable performance in 1-year prediction with AUC of 0.775 and 0.759, based on \u201cdefinite AD\u201d and \u201cprobable AD\u201d outcomes, respectively; in 2-year, 0.730 and 0.693; in 3-year, 0.677 and 0.644; in 4-year, 0.725 and 0.683. The results were similar when the entire (unbalanced) samples were used. Important clinical features selected in logistic regression included hemoglobin level, age and urine protein level. This study may shed a light on the utility of the data-driven machine learning model based on large-scale administrative health data in AD risk prediction, which may enable better selection of individuals at risk for AD in clinical trials or early detection in clinical settings.\n                  <\/jats:p>","DOI":"10.1038\/s41746-020-0256-0","type":"journal-article","created":{"date-parts":[[2020,3,26]],"date-time":"2020-03-26T07:03:00Z","timestamp":1585206180000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":124,"title":["Machine learning prediction of incidence of Alzheimer\u2019s disease using large-scale administrative health data"],"prefix":"10.1038","volume":"3","author":[{"given":"Ji Hwan","family":"Park","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5625-3013","authenticated-orcid":false,"given":"Han Eol","family":"Cho","sequence":"additional","affiliation":[]},{"given":"Jong Hun","family":"Kim","sequence":"additional","affiliation":[]},{"given":"Melanie M.","family":"Wall","sequence":"additional","affiliation":[]},{"given":"Yaakov","family":"Stern","sequence":"additional","affiliation":[]},{"given":"Hyunsun","family":"Lim","sequence":"additional","affiliation":[]},{"given":"Shinjae","family":"Yoo","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5310-4802","authenticated-orcid":false,"given":"Hyoung Seop","family":"Kim","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5314-7992","authenticated-orcid":false,"given":"Jiook","family":"Cha","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,3,26]]},"reference":[{"key":"256_CR1","doi-asserted-by":"publisher","first-page":"1337","DOI":"10.2105\/AJPH.88.9.1337","volume":"88","author":"R Brookmeyer","year":"1998","unstructured":"Brookmeyer, R., Gray, S. & Kawas, C. Projections of Alzheimer\u2019s disease in the United States and the public health impact of delaying disease onset. Am. J. Public Health 88, 1337\u20131342 (1998).","journal-title":"Am. J. Public Health"},{"key":"256_CR2","doi-asserted-by":"publisher","first-page":"1326","DOI":"10.1056\/NEJMsa1204629","volume":"368","author":"MD Hurd","year":"2013","unstructured":"Hurd, M. D., Martorell, P., Delavande, A., Mullen, K. J. & Langa, K. M. Monetary costs of dementia in the United States. N. Engl. J. Med. 368, 1326\u20131334 (2013).","journal-title":"N. Engl. J. Med."},{"key":"256_CR3","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1515\/fhep-2014-0013","volume":"18","author":"J Zissimopoulos","year":"2014","unstructured":"Zissimopoulos, J., Crimmins, E. & St Clair, P. The value of delaying Alzheimer\u2019s disease onset. Forum Health Econ. Policy 18, 25\u201339 (2014).","journal-title":"Forum Health Econ. Policy"},{"key":"256_CR4","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1186\/2047-2501-2-3","volume":"2","author":"W Raghupathi","year":"2014","unstructured":"Raghupathi, W. & Raghupathi, V. Big data analytics in healthcare: promise and potential. Health Inf. Sci. Syst. 2, 3 (2014).","journal-title":"Health Inf. Sci. Syst."},{"key":"256_CR5","doi-asserted-by":"publisher","first-page":"735","DOI":"10.1016\/S1474-4422(06)70537-3","volume":"5","author":"M Kivipelto","year":"2006","unstructured":"Kivipelto, M. et al. Risk score for the prediction of dementia risk in 20 years among middle aged people: a longitudinal, population-based study. Lancet Neurol. 5, 735\u2013741 (2006).","journal-title":"Lancet Neurol."},{"key":"256_CR6","doi-asserted-by":"publisher","first-page":"318","DOI":"10.1038\/nrneurol.2010.54","volume":"6","author":"BC Stephan","year":"2010","unstructured":"Stephan, B. C., Kurth, T., Matthews, F. E., Brayne, C. & Dufouil, C. Dementia risk prediction in the population: are screening models accurate? Nat. Rev. Neurol. 6, 318\u2013326 (2010).","journal-title":"Nat. Rev. Neurol."},{"key":"256_CR7","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1111\/j.1365-2796.2004.01386.x","volume":"256","author":"L Backman","year":"2004","unstructured":"Backman, L., Jones, S., Berger, A. K., Laukka, E. J. & Small, B. J. Multiple cognitive deficits during the transition to Alzheimer\u2019s disease. J. Intern Med. 256, 195\u2013204 (2004).","journal-title":"J. Intern Med."},{"key":"256_CR8","doi-asserted-by":"publisher","first-page":"452","DOI":"10.1111\/j.1532-5415.2005.53163.x","volume":"53","author":"AF Jorm","year":"2005","unstructured":"Jorm, A. F., Masaki, K. H., Petrovitch, H., Ross, G. W. & White, L. R. Cognitive deficits 3 to 6 years before dementia onset in a population sample: the Honolulu-Asia aging study. J. Am. Geriatr. Soc. 53, 452\u2013455 (2005).","journal-title":"J. Am. Geriatr. Soc."},{"key":"256_CR9","doi-asserted-by":"publisher","unstructured":"Farran, B., Channanath, A. M., Behbehani, K. & Thanaraj, T. A. Predictive models to assess risk of type 2 diabetes, hypertension and comorbidity: machine-learning algorithms and validation using national health data from Kuwait\u2013a cohort study. BMJ Open 3, https:\/\/doi.org\/10.1136\/bmjopen-2012-002457 (2013).","DOI":"10.1136\/bmjopen-2012-002457"},{"key":"256_CR10","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1016\/j.ijmedinf.2017.12.009","volume":"111","author":"A Shimoda","year":"2018","unstructured":"Shimoda, A., Ichikawa, D. & Oyama, H. Prediction models to identify individuals at risk of metabolic syndrome who are unlikely to participate in a health intervention program. Int. J. Med. Inform. 111, 90\u201399 (2018).","journal-title":"Int. J. Med. Inform."},{"key":"256_CR11","doi-asserted-by":"publisher","first-page":"8","DOI":"10.1016\/j.jad.2018.01.019","volume":"231","author":"SB Choi","year":"2018","unstructured":"Choi, S. B., Lee, W., Yoon, J. H., Won, J. U. & Kim, D. W. Ten-year prediction of suicide death using Cox regression and machine learning in a nationwide retrospective cohort study in South Korea. J. Affect. Disord. 231, 8\u201314 (2018).","journal-title":"J. Affect. Disord."},{"key":"256_CR12","doi-asserted-by":"publisher","DOI":"10.1001\/jamanetworkopen.2019.0968","volume":"2","author":"WH Lo-Ciganic","year":"2019","unstructured":"Lo-Ciganic, W. H. et al. Evaluation of machine-learning algorithms for predicting opioid overdose risk among medicare beneficiaries with opioid prescriptions. JAMA Netw. Open 2, e190968 (2019).","journal-title":"JAMA Netw. Open"},{"key":"256_CR13","doi-asserted-by":"publisher","first-page":"118","DOI":"10.1016\/j.yebeh.2018.10.013","volume":"89","author":"S An","year":"2018","unstructured":"An, S. et al. Predicting drug-resistant epilepsy-A machine learning approach based on administrative claims data. Epilepsy Behav. 89, 118\u2013125 (2018).","journal-title":"Epilepsy Behav."},{"key":"256_CR14","doi-asserted-by":"publisher","first-page":"1216","DOI":"10.1056\/NEJMp1606181","volume":"375","author":"Z Obermeyer","year":"2016","unstructured":"Obermeyer, Z. & Emanuel, E. J. Predicting the future-big data, machine learning, and clinical medicine. N. Engl. J. Med. 375, 1216\u20131219 (2016).","journal-title":"N. Engl. J. Med."},{"key":"256_CR15","doi-asserted-by":"publisher","first-page":"1099","DOI":"10.1001\/jama.2018.11103","volume":"320","author":"CD Naylor","year":"2018","unstructured":"Naylor, C. D. On the prospects for a (deep) learning health care system. JAMA 320, 1099\u20131100 (2018).","journal-title":"JAMA"},{"key":"256_CR16","doi-asserted-by":"publisher","first-page":"1101","DOI":"10.1001\/jama.2018.11100","volume":"320","author":"G Hinton","year":"2018","unstructured":"Hinton, G. Deep learning-a technology with the potential to transform health care. JAMA 320, 1101\u20131102 (2018).","journal-title":"JAMA"},{"key":"256_CR17","volume":"46","author":"J Lee","year":"2017","unstructured":"Lee, J., Lee, J. S., Park, S. H., Shin, S. A. & Kim, K. Cohort profile: the National Health Insurance Service-National Sample Cohort (NHIS-NSC), South Korea. Int. J. Epidemiol. 46, e15 (2017).","journal-title":"Int. J. Epidemiol."},{"key":"256_CR18","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0136181","volume":"10","author":"EY Tang","year":"2015","unstructured":"Tang, E. Y. et al. Current developments in dementia risk prediction modelling: an updated systematic review. PLoS ONE 10, e0136181 (2015).","journal-title":"PLoS ONE"},{"key":"256_CR19","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1038\/s41746-018-0029-1","volume":"1","author":"A Rajkomar","year":"2018","unstructured":"Rajkomar, A. et al. Scalable and accurate deep learning with electronic health records. NPJ Digit Med. 1, 18 (2018).","journal-title":"NPJ Digit Med."},{"key":"256_CR20","doi-asserted-by":"publisher","first-page":"278","DOI":"10.1016\/j.neurobiolaging.2005.02.007","volume":"27","author":"AR Atti","year":"2006","unstructured":"Atti, A. R. et al. Anaemia increases the risk of dementia in cognitively intact elderly. Neurobiol. Aging 27, 278\u2013284 (2006).","journal-title":"Neurobiol. Aging"},{"key":"256_CR21","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1212\/WNL.0b013e318225aaa9","volume":"77","author":"RC Shah","year":"2011","unstructured":"Shah, R. C., Buchman, A. S., Wilson, R. S., Leurgans, S. E. & Bennett, D. A. Hemoglobin level in older persons and incident Alzheimer disease: prospective cohort analysis. Neurology 77, 219\u2013226 (2011).","journal-title":"Neurology"},{"key":"256_CR22","doi-asserted-by":"publisher","first-page":"528","DOI":"10.1212\/WNL.0b013e31829e701d","volume":"81","author":"CH Hong","year":"2013","unstructured":"Hong, C. H. et al. Anemia and risk of dementia in older adults: findings from the Health ABC study. Neurology 81, 528\u2013533 (2013).","journal-title":"Neurology"},{"key":"256_CR23","doi-asserted-by":"publisher","first-page":"94","DOI":"10.1186\/s13195-017-0322-2","volume":"9","author":"SM Jeong","year":"2017","unstructured":"Jeong, S. M. et al. Anemia is associated with incidence of dementia: a national health screening study in Korea involving 37,900 persons. Alzheimer\u2019s Res. Ther. 9, 94 (2017).","journal-title":"Alzheimer\u2019s Res. Ther."},{"key":"256_CR24","doi-asserted-by":"publisher","first-page":"124","DOI":"10.1097\/RHU.0b013e318214bd18","volume":"17","author":"T Chotayaporn","year":"2011","unstructured":"Chotayaporn, T., Kasitanon, N., Sukitawut, W. & Louthrenoo, W. Comparison of proteinuria determination by urine dipstick, spot urine protein creatinine index, and urine protein 24\u2009h in lupus patients. J. Clin. Rheumatol. 17, 124\u2013129 (2011).","journal-title":"J. Clin. Rheumatol."},{"key":"256_CR25","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1053\/j.ajkd.2010.12.026","volume":"58","author":"SL White","year":"2011","unstructured":"White, S. L. et al. Diagnostic accuracy of urine dipsticks for detection of albuminuria in the general community. Am. J. Kidney Dis. 58, 19\u201328 (2011).","journal-title":"Am. J. Kidney Dis."},{"key":"256_CR26","doi-asserted-by":"publisher","first-page":"198","DOI":"10.1212\/WNL.0000000000003482","volume":"88","author":"K Deckers","year":"2017","unstructured":"Deckers, K. et al. Dementia risk in renal dysfunction: a systematic review and meta-analysis of prospective studies. Neurology 88, 198\u2013208 (2017).","journal-title":"Neurology"},{"key":"256_CR27","doi-asserted-by":"publisher","first-page":"2421","DOI":"10.1016\/j.neurobiolaging.2013.04.002","volume":"34","author":"GM Subaiea","year":"2013","unstructured":"Subaiea, G. M., Adwan, L. I., Ahmed, A. H., Stevens, K. E. & Zawia, N. H. Short-term treatment with tolfenamic acid improves cognitive functions in Alzheimer\u2019s disease mice. Neurobiol. Aging 34, 2421\u20132430 (2013).","journal-title":"Neurobiol. Aging"},{"key":"256_CR28","doi-asserted-by":"publisher","first-page":"266","DOI":"10.1111\/jnc.12960","volume":"133","author":"L Adwan","year":"2015","unstructured":"Adwan, L., Subaiea, G. M., Basha, R. & Zawia, N. H. Tolfenamic acid reduces tau and CDK5 levels: implications for dementia and tauopathies. J. Neurochem. 133, 266\u2013272 (2015).","journal-title":"J. Neurochem."},{"key":"256_CR29","doi-asserted-by":"publisher","first-page":"596","DOI":"10.1016\/j.neuropharm.2014.01.009","volume":"79","author":"L Adwan","year":"2014","unstructured":"Adwan, L., Subaiea, G. M. & Zawia, N. H. Tolfenamic acid downregulates BACE1 and protects against lead-induced upregulation of Alzheimer\u2019s disease related biomarkers. Neuropharmacology 79, 596\u2013602 (2014).","journal-title":"Neuropharmacology"},{"key":"256_CR30","doi-asserted-by":"publisher","first-page":"655","DOI":"10.2174\/1567205015666180119104036","volume":"15","author":"JK Chang","year":"2018","unstructured":"Chang, J. K. et al. Tolfenamic acid: a modifier of the Tau protein and its role in cognition and tauopathy. Curr. Alzheimer Res. 15, 655\u2013663 (2018).","journal-title":"Curr. Alzheimer Res."},{"key":"256_CR31","doi-asserted-by":"publisher","first-page":"2100","DOI":"10.1111\/j.1532-5415.2011.03660.x","volume":"59","author":"Y Rhee","year":"2011","unstructured":"Rhee, Y., Csernansky, J. G., Emanuel, L. L., Chang, C. G. & Shega, J. W. Psychotropic medication burden and factors associated with antipsychotic use: an analysis of a population-based sample of community-dwelling older persons with dementia. J. Am. Geriatr. Soc. 59, 2100\u20132107 (2011).","journal-title":"J. Am. Geriatr. Soc."},{"key":"256_CR32","doi-asserted-by":"publisher","first-page":"2047","DOI":"10.2147\/NDT.S172933","volume":"14","author":"L Cai","year":"2018","unstructured":"Cai, L. & Huang, J. Schizophrenia and risk of dementia: a meta-analysis study. Neuropsychiatr. Dis. Treat. 14, 2047\u20132055 (2018).","journal-title":"Neuropsychiatr. Dis. Treat."},{"key":"256_CR33","doi-asserted-by":"publisher","first-page":"1571","DOI":"10.1007\/s00213-018-4867-y","volume":"235","author":"CH Perng","year":"2018","unstructured":"Perng, C. H., Chang, Y. C. & Tzang, R. F. The treatment of cognitive dysfunction in dementia: a multiple treatments meta-analysis. Psychopharmacology 235, 1571\u20131580 (2018).","journal-title":"Psychopharmacology"},{"key":"256_CR34","doi-asserted-by":"publisher","first-page":"280","DOI":"10.1111\/j.1747-4949.2011.00601.x","volume":"6","author":"SN McLennan","year":"2011","unstructured":"McLennan, S. N. et al. Role of vasodilation in cognitive impairment. Int. J. Stroke. 6, 280 (2011).","journal-title":"Int. J. Stroke."},{"key":"256_CR35","doi-asserted-by":"publisher","first-page":"337","DOI":"10.3233\/JAD-160105","volume":"54","author":"RL Jaakkimainen","year":"2016","unstructured":"Jaakkimainen, R. L. et al. Identification of physician-diagnosed Alzheimer\u2019s disease and related dementias in population-based administrative data: a validation atudy using family physicians\u2019 electronic medical records. J. Alzheimers Dis. 54, 337\u2013349 (2016).","journal-title":"J. Alzheimers Dis."},{"key":"256_CR36","unstructured":"WHO. International Statistical Classification of Diseases and Related Health Problems 10th Revision (ICD-10)-WHO Version for 2016, http:\/\/apps.who.int\/classifications\/icd10\/browse\/2016\/en#\/F00 (2016)."},{"key":"256_CR37","doi-asserted-by":"publisher","first-page":"281","DOI":"10.3233\/JAD-2010-101221","volume":"23","author":"KW Kim","year":"2011","unstructured":"Kim, K. W. et al. A nationwide survey on the prevalence of dementia and mild cognitive impairment in South Korea. J. Alzheimers Dis. 23, 281\u2013291 (2011).","journal-title":"J. Alzheimers Dis."},{"key":"256_CR38","first-page":"1157","volume":"3","author":"I Guyon","year":"2003","unstructured":"Guyon, I. & Elisseeff, A. An introduction to variable and feature selection. J. Mach. Learn. Res. 3, 1157\u20131182 (2003).","journal-title":"J. Mach. Learn. Res."}],"container-title":["npj Digital Medicine"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.nature.com\/articles\/s41746-020-0256-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41746-020-0256-0","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41746-020-0256-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,6]],"date-time":"2022-12-06T21:07:06Z","timestamp":1670360826000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.nature.com\/articles\/s41746-020-0256-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,3,26]]},"references-count":38,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2020,12]]}},"alternative-id":["256"],"URL":"https:\/\/doi.org\/10.1038\/s41746-020-0256-0","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/625582","asserted-by":"object"}]},"ISSN":["2398-6352"],"issn-type":[{"value":"2398-6352","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,3,26]]},"assertion":[{"value":"3 September 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 March 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 March 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"46"}}