{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T04:10:04Z","timestamp":1774584604896,"version":"3.50.1"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,10,30]],"date-time":"2020-10-30T00:00:00Z","timestamp":1604016000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,10,30]],"date-time":"2020-10-30T00:00:00Z","timestamp":1604016000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"National Institute for Health Research (NIHR) School for Primary Care Research"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["npj Digit. Med."],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Familial hypercholesterolaemia (FH) is a common inherited disorder, causing lifelong elevated low-density lipoprotein cholesterol (LDL-C). Most individuals with FH remain undiagnosed, precluding opportunities to prevent premature heart disease and death. Some machine-learning approaches improve detection of FH in electronic health records, though clinical impact is under-explored. We assessed performance of an array of machine-learning approaches for enhancing detection of FH, and their clinical utility, within a large primary care population. A retrospective cohort study was done using routine primary care clinical records of 4,027,775 individuals from the United Kingdom with total cholesterol measured from 1 January 1999 to 25 June 2019. Predictive accuracy of five common machine-learning algorithms (logistic regression, random forest, gradient boosting machines, neural networks and ensemble learning) were assessed for detecting FH. Predictive accuracy was assessed by area under the receiver operating curves (AUC) and expected vs observed calibration slope; with clinical utility assessed by expected case-review workload and likelihood ratios. There were 7928 incident diagnoses of FH. In addition to known clinical features of FH (raised total cholesterol or LDL-C and family history of premature coronary heart disease), machine-learning (ML) algorithms identified features such as raised triglycerides which reduced the likelihood of FH. Apart from logistic regression (AUC, 0.81), all four other ML approaches had similarly high predictive accuracy (AUC\u2009&gt;\u20090.89). Calibration slope ranged from 0.997 for gradient boosting machines to 1.857 for logistic regression. Among those screened, high probability cases requiring clinical review varied from 0.73% using ensemble learning to 10.16% using deep learning, but with positive predictive values of 15.5% and 2.8% respectively. Ensemble learning exhibited a dominant positive likelihood ratio (45.5) compared to all other ML models (7.0\u201314.4). Machine-learning models show similar high accuracy in detecting FH, offering opportunities to increase diagnosis. However, the clinical case-finding workload required for yield of cases will differ substantially between models.<\/jats:p>","DOI":"10.1038\/s41746-020-00349-5","type":"journal-article","created":{"date-parts":[[2020,10,30]],"date-time":"2020-10-30T11:03:14Z","timestamp":1604055794000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":42,"title":["Performance and clinical utility of supervised machine-learning approaches in detecting familial hypercholesterolaemia in primary care"],"prefix":"10.1038","volume":"3","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4529-8237","authenticated-orcid":false,"given":"Ralph K.","family":"Akyea","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4909-0644","authenticated-orcid":false,"given":"Nadeem","family":"Qureshi","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9040-9384","authenticated-orcid":false,"given":"Joe","family":"Kai","sequence":"additional","affiliation":[]},{"given":"Stephen F.","family":"Weng","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,10,30]]},"reference":[{"key":"349_CR1","doi-asserted-by":"publisher","first-page":"407","DOI":"10.1093\/aje\/kwh236","volume":"160","author":"MA Austin","year":"2004","unstructured":"Austin, M. A., Hutter, C. M., Zimmern, R. L. & Humphries, S. E. Genetic causes of monogenic heterozygous familial hypercholesterolemia: A HuGE prevalence review. Am. J. Epidemiol. 160, 407\u2013420 (2004).","journal-title":"Am. J. Epidemiol."},{"key":"349_CR2","doi-asserted-by":"publisher","first-page":"893","DOI":"10.1136\/bmj.303.6807.893","volume":"303","author":"Scientific Steering Committee on behalf of the Simon Broome Register Group.","year":"1991","unstructured":"Scientific Steering Committee on behalf of the Simon Broome Register Group. Risk of fatal coronary heart disease in familial hypercholesterolaemia. BMJ 303, 893\u2013896 (1991).","journal-title":"BMJ"},{"key":"349_CR3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/S0021-9150(02)00330-1","volume":"168","author":"D Marks","year":"2003","unstructured":"Marks, D., Thorogood, M., Neil, H. A. W. & Humphries, S. E. A review on the diagnosis, natural history, and treatment of familial hypercholesterolaemia. Atherosclerosis 168, 1\u201314 (2003).","journal-title":"Atherosclerosis"},{"key":"349_CR4","doi-asserted-by":"publisher","first-page":"3478","DOI":"10.1093\/eurheartj\/eht273","volume":"34","author":"BG Nordestgaard","year":"2013","unstructured":"Nordestgaard, B. G. et al. Familial hypercholesterolaemia is underdiagnosed and undertreated in the general population: guidance for clinicians to prevent coronary heart disease: Consensus Statement of the European Atherosclerosis Society. Eur. Heart J. 34, 3478\u20133490 (2013).","journal-title":"Eur. Heart J."},{"key":"349_CR5","doi-asserted-by":"publisher","first-page":"e016461","DOI":"10.1136\/bmjopen-2017-016461","volume":"7","author":"LE Akioyamen","year":"2017","unstructured":"Akioyamen, L. E. et al. Estimating the prevalence of heterozygous familial hypercholesterolaemia: a systematic review and meta-analysis. BMJ Open 7, e016461 (2017).","journal-title":"BMJ Open"},{"key":"349_CR6","doi-asserted-by":"publisher","first-page":"2408","DOI":"10.1161\/CIRCULATIONAHA.112.144055","volume":"126","author":"F Raal","year":"2012","unstructured":"Raal, F. et al. Low-density lipoprotein cholesterol-lowering effects of AMG 145, a monoclonal antibody to proprotein convertase subtilisin\/kexin type 9 serine protease in patients with heterozygous familial hypercholesterolemia: the Reduction of LDL-C with PCSK9 Inhibiti. Circulation 126, 2408\u20132417 (2012).","journal-title":"Circulation"},{"key":"349_CR7","doi-asserted-by":"publisher","first-page":"2625","DOI":"10.1093\/eurheartj\/ehn422","volume":"29","author":"A Neil","year":"2008","unstructured":"Neil, A. et al. Reductions in all-cause, cancer, and coronary mortality in statin-treated patients with heterozygous familial hypercholesterolaemia: a prospective registry study. Eur. Heart J. 29, 2625\u20132633 (2008).","journal-title":"Eur. Heart J."},{"key":"349_CR8","doi-asserted-by":"publisher","first-page":"252","DOI":"10.1016\/j.jacc.2016.04.054","volume":"68","author":"J Besseling","year":"2016","unstructured":"Besseling, J., Hovingh, G. K., Huijgen, R., Kastelein, J. J. P. & Hutten, B. A. Statins in familial hypercholesterolemia: consequences for coronary artery disease and all-cause mortality. J. Am. Coll. Cardiol. 68, 252\u2013260 (2016).","journal-title":"J. Am. Coll. Cardiol."},{"key":"349_CR9","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1016\/j.atherosclerosis.2003.11.010","volume":"173","author":"F Civeira","year":"2004","unstructured":"Civeira, F. et al. Guidelines for the diagnosis and management of heterozygous familial hypercholesterolemia. Atherosclerosis 173, 55\u201368 (2004).","journal-title":"Atherosclerosis"},{"key":"349_CR10","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1016\/0002-9149(93)90155-6","volume":"72","author":"RR Williams","year":"1993","unstructured":"Williams, R. R. et al. Diagnosing heterozygous familial hypercholesterolemia using new practical criteria validated by molecular genetics. Am. J. Cardiol. 72, 171\u2013176 (1993).","journal-title":"Am. J. Cardiol."},{"key":"349_CR11","doi-asserted-by":"publisher","first-page":"1043","DOI":"10.5551\/jat.14621","volume":"19","author":"M Harada-Shiba","year":"2012","unstructured":"Harada-Shiba, M. et al. Guidelines for the management of familial hypercholesterolemia. J. Atheroscler. Thromb. 19, 1043\u20131060 (2012).","journal-title":"J. Atheroscler. Thromb."},{"key":"349_CR12","doi-asserted-by":"publisher","first-page":"399","DOI":"10.1016\/j.atherosclerosis.2018.08.019","volume":"277","author":"T Brett","year":"2018","unstructured":"Brett, T., Qureshi, N., Gidding, S. & Watts, G. F. Screening for familial hypercholesterolaemia in primary care: time for general practice to play its part. Atherosclerosis 277, 399\u2013406 (2018).","journal-title":"Atherosclerosis"},{"key":"349_CR13","doi-asserted-by":"publisher","first-page":"1230","DOI":"10.1016\/j.jacl.2016.08.001","volume":"10","author":"MS Safarova","year":"2016","unstructured":"Safarova, M. S., Liu, H. & Kullo, I. J. Rapid identification of familial hypercholesterolemia from electronic health records: The SEARCH study. J. Clin. Lipidol. 10, 1230\u20131239 (2016).","journal-title":"J. Clin. Lipidol."},{"key":"349_CR14","doi-asserted-by":"publisher","first-page":"e256","DOI":"10.1016\/S2468-2667(19)30061-1","volume":"4","author":"S Weng","year":"2019","unstructured":"Weng, S., Kai, J., Akyea, R. & Qureshi, N. Detection of familial hypercholesterolaemia: external validation of the FAMCAT clinical case-finding algorithm to identify patients in primary care. Lancet Public Health 4, e256\u2013e264 (2019).","journal-title":"Lancet Public Health"},{"key":"349_CR15","unstructured":"Akyea, R. et al. Identifying familial hypercholesterolaemia in primary care: validation and optimisation of a clinical tool (FAMCAT). BJGP Open (2020)."},{"key":"349_CR16","doi-asserted-by":"publisher","first-page":"54","DOI":"10.1016\/j.atherosclerosis.2018.04.037","volume":"274","author":"S Weng","year":"2018","unstructured":"Weng, S., Kai, J., Tranter, J., Leonardi-Bee, J. & Qureshi, N. Improving identification and management of familial hypercholesterolaemia in primary care: Pre- and post-intervention study. Atherosclerosis 274, 54\u201360 (2018).","journal-title":"Atherosclerosis"},{"key":"349_CR17","doi-asserted-by":"publisher","first-page":"e393","DOI":"10.1016\/S2589-7500(19)30150-5","volume":"1","author":"KD Myers","year":"2019","unstructured":"Myers, K. D. et al. Precision screening for familial hypercholesterolaemia: a machine learning study applied to electronic health encounter data. Lancet Digit. Health 1, e393\u2013e402 (2019).","journal-title":"Lancet Digit. Health"},{"key":"349_CR18","doi-asserted-by":"publisher","first-page":"336","DOI":"10.1016\/j.atherosclerosis.2014.12.034","volume":"238","author":"SF Weng","year":"2015","unstructured":"Weng, S. F., Kai, J., Andrew Neil, H., Humphries, S. E. & Qureshi, N. Improving identification of familial hypercholesterolaemia in primary care: Derivation and validation of the familial hypercholesterolaemia case ascertainment tool (FAMCAT). Atherosclerosis 238, 336\u2013343 (2015).","journal-title":"Atherosclerosis"},{"key":"349_CR19","first-page":"170","volume":"8","author":"D Yao","year":"2013","unstructured":"Yao, D., Yang, J. & Zhan, X. A novel method for disease prediction: hybrid of random forest and multivariate adaptive regression splines. J. Comput. 8, 170\u2013177 (2013).","journal-title":"J. Comput."},{"key":"349_CR20","doi-asserted-by":"publisher","first-page":"e0174944","DOI":"10.1371\/journal.pone.0174944","volume":"12","author":"SF Weng","year":"2017","unstructured":"Weng, S. F., Reps, J., Kai, J., Garibaldi, J. M. & Qureshi, N. Can machine-learning improve cardiovascular risk prediction using routine clinical data? PLoS ONE 12, e0174944\u2013e0174944 (2017).","journal-title":"PLoS ONE"},{"key":"349_CR21","unstructured":"NHS Digital. Patients Registered at a GP Practice March 2020. https:\/\/digital.nhs.uk\/data-and-information\/publications\/statistical\/patients-registered-at-a-gp-practice\/march-2020#summary (2020). Accessed 26 March 2020."},{"key":"349_CR22","unstructured":"National Institute of Health and Care Excellence. Familial hypercholesterolaemia: identification and management (2017)."},{"key":"349_CR23","doi-asserted-by":"publisher","first-page":"349","DOI":"10.1016\/j.ijcard.2010.08.009","volume":"147","author":"GD Kolovou","year":"2011","unstructured":"Kolovou, G. D., Kostakou, P. M. & Anagnostopoulou, K. K. Familial hypercholesterolemia and triglyceride metabolism. Int. J. Cardiol. 147, 349\u2013358 (2011).","journal-title":"Int. J. Cardiol."},{"key":"349_CR24","doi-asserted-by":"publisher","first-page":"l6927","DOI":"10.1136\/bmj.l6927","volume":"368","author":"S Vollmer","year":"2020","unstructured":"Vollmer, S. et al. Machine learning and artificial intelligence research for patient benefit: 20 critical questions on transparency, replicability, ethics, and effectiveness. BMJ 368, l6927 (2020).","journal-title":"BMJ"},{"key":"349_CR25","doi-asserted-by":"publisher","first-page":"601","DOI":"10.1007\/s10654-018-0389-5","volume":"33","author":"L McDonald","year":"2018","unstructured":"McDonald, L., Schultze, A., Carroll, R. & Ramagopalan, S. V. Performing studies using the UK clinical practice research datalink: to link or not to link? Eur. J. Epidemiol. 33, 601\u2013605 (2018).","journal-title":"Eur. J. Epidemiol."},{"key":"349_CR26","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1111\/j.1365-2125.2009.03537.x","volume":"69","author":"E Herrett","year":"2010","unstructured":"Herrett, E., Thomas, S. L., Schoonen, W. M., Smeeth, L. & Hall, A. J. Validation and validity of diagnoses in the General Practice Research Database: a systematic review. Br. J. Clin. Pharmacol. 69, 4\u201314 (2010).","journal-title":"Br. J. Clin. Pharmacol."},{"key":"349_CR27","doi-asserted-by":"publisher","first-page":"1769","DOI":"10.1093\/eurheartj\/ehr158","volume":"32","author":"Z Reiner","year":"2011","unstructured":"Reiner, Z. et al. ESC\/EAS Guidelines for the management of dyslipidaemias: The Task Force for the management of dyslipidaemias of the European Society of Cardiology (ESC) and the European Atherosclerosis Society (EAS). Eur. Heart J. 32, 1769\u20131818 (2011).","journal-title":"Eur. Heart J."},{"key":"349_CR28","doi-asserted-by":"publisher","first-page":"e81998","DOI":"10.1371\/journal.pone.0081998","volume":"9","author":"P Dhiman","year":"2014","unstructured":"Dhiman, P., Kai, J., Horsfall, L., Walters, K. & Qureshi, N. Availability and quality of coronary heart disease family history in primary care medical records: Implications for cardiovascular risk assessment. PLoS ONE 9, e81998 (2014).","journal-title":"PLoS ONE"},{"key":"349_CR29","doi-asserted-by":"publisher","first-page":"2280","DOI":"10.1161\/01.CIR.0000145140.06171.3D","volume":"110","author":"NJ Stone","year":"2004","unstructured":"Stone, N. J. Stopping statins. Circulation 110, 2280\u20132282 (2004).","journal-title":"Circulation"},{"key":"349_CR30","doi-asserted-by":"publisher","first-page":"1423","DOI":"10.1136\/bmj.326.7404.1423","volume":"326","author":"MR Law","year":"2003","unstructured":"Law, M. R., Wald, N. J. & Rudnicka, A. R. Quantifying effect of statins on low density lipoprotein cholesterol, ischaemic heart disease, and stroke: systematic review and meta-analysis. BMJ 326, 1423 (2003).","journal-title":"BMJ"},{"key":"349_CR31","doi-asserted-by":"publisher","first-page":"111","DOI":"10.21037\/atm.2016.02.15","volume":"4","author":"Z Zhang","year":"2016","unstructured":"Zhang, Z. Model building strategy for logistic regression: purposeful selection. Ann. Transl. Med. 4, 111 (2016).","journal-title":"Ann. Transl. Med."},{"key":"349_CR32","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman, L. Random forests. Mach. Learn. 45, 5\u201332 (2001).","journal-title":"Mach. Learn."},{"key":"349_CR33","doi-asserted-by":"publisher","first-page":"1189","DOI":"10.1214\/aos\/1013203451","volume":"29","author":"JH Friedman","year":"2001","unstructured":"Friedman, J. H. Greedy function approximation: a gradient boosting machine. Ann. Stat. 29, 1189\u20131232 (2001).","journal-title":"Ann. Stat."},{"key":"349_CR34","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1016\/j.gpb.2017.07.003","volume":"16","author":"C Cao","year":"2018","unstructured":"Cao, C. et al. Deep learning and its applications in biomedicine. Genom. Proteom. Bioinform. 16, 17\u201332 (2018).","journal-title":"Genom. Proteom. Bioinform."},{"key":"349_CR35","doi-asserted-by":"crossref","unstructured":"Dietterich, T. G. Ensemble methods in machine learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) 1857 LNCS, 1\u201315 (2000).","DOI":"10.1007\/3-540-45014-9_1"},{"key":"349_CR36","doi-asserted-by":"publisher","first-page":"527","DOI":"10.1177\/1536867X0500500404","volume":"5","author":"P Royston","year":"2005","unstructured":"Royston, P. Multiple imputation of missing values: update of ice. Stata J. 5, 527\u2013536 (2005).","journal-title":"Stata J."},{"key":"349_CR37","doi-asserted-by":"crossref","unstructured":"Rubin, D. B. Multiple imputation for nonresponse in surveys (Wiley, 1987).","DOI":"10.1002\/9780470316696"},{"key":"349_CR38","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1177\/1536867X0600600302","volume":"6","author":"R Newson","year":"2006","unstructured":"Newson, R. Confidence intervals for rank statistics: Somers\u2019 D and extensions. Stata J. 6, 309\u2013334 (2006).","journal-title":"Stata J."},{"key":"349_CR39","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1136\/emermed-2017-206735","volume":"34","author":"ZH Hoo","year":"2017","unstructured":"Hoo, Z. H., Candlish, J. & Teare, D. What is an ROC curve? Emerg. Med. J. 34, 357\u2013359 (2017).","journal-title":"Emerg. Med. J."},{"key":"349_CR40","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1161\/CIRCULATIONAHA.114.014508","volume":"131","author":"GS Collins","year":"2015","unstructured":"Collins, G. S., Reitsma, J. B., Altman, D. G. & Moons, K. G. M. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD). Circulation 131, 211\u2013219 (2015).","journal-title":"Circulation"}],"container-title":["npj Digital Medicine"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.nature.com\/articles\/s41746-020-00349-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41746-020-00349-5","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/www.nature.com\/articles\/s41746-020-00349-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,7]],"date-time":"2022-12-07T02:41:49Z","timestamp":1670380909000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.nature.com\/articles\/s41746-020-00349-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,30]]},"references-count":40,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2020,12]]}},"alternative-id":["349"],"URL":"https:\/\/doi.org\/10.1038\/s41746-020-00349-5","relation":{},"ISSN":["2398-6352"],"issn-type":[{"value":"2398-6352","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,10,30]]},"assertion":[{"value":"14 May 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 September 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 October 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"N.Q. is a member of the most recent NICE Familial Hypercholesterolaemia and Lipid Modification Guideline Development Groups (CG71 and CG181). S.F.W. is a member of the Clinical Practice Research Datalink (CPRD) Independent Scientific Advisory Committee (ISAC), academic advisor to Quealth Ltd., and has received independent research grant funding from AMGEN. N.Q. and S.F.W. have previously received honorarium from AMGEN. R.K.A. currently holds an NIHR-SPCR funded studentship (2018\u20132021). J.K. has no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"142"}}