{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,21]],"date-time":"2026-04-21T14:57:39Z","timestamp":1776783459331,"version":"3.51.2"},"reference-count":40,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,6,23]],"date-time":"2020-06-23T00:00:00Z","timestamp":1592870400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,6,23]],"date-time":"2020-06-23T00:00:00Z","timestamp":1592870400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Big Data"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Data-driven healthcare policy discussions are gaining traction after the Covid-19 outbreak and ahead of the 2020 US presidential elections. The US has a hybrid healthcare structure; it is a system that does not provide universal coverage, albeit few years ago enacted a mandate (Affordable Care Act-ACA) that provides coverage for the majority of Americans. The US has the highest health expenditure per capita of all western and developed countries; however, most Americans don\u2019t tap into the benefits of preventive healthcare. It is estimated that only 8% of Americans undergo routine preventive screenings. On a national level, very few states (15 out of the 50) have above-average preventive healthcare metrics. In literature, many studies focus on the cure of diseases (research areas such as drug discovery and disease prediction); whilst a minority have examined data-driven preventive measures\u2014a matter that Americans and policy makers ought to place at the forefront of national issues. In this work, we present solutions for preventive practices and policies through Machine Learning (ML) methods. ML is morally neutral, it depends on the data that train the models; in this work, we make the case that Big Data is an imperative paradigm for healthcare. We examine disparities in clinical data for US patients by developing correlation and imputation methods for data completeness. Non-conventional patterns are identified. The data lifecycle followed is methodical and deliberate; 1000+\u2009clinical, demographical, and laboratory variables are collected from the Centers for Disease Control and Prevention (CDC). Multiple statistical models are deployed (Pearson correlations, Cramer\u2019s\u00a0V, MICE, and ANOVA). Other unsupervised ML models are also examined (K-modes and K-prototypes for clustering). Through the results presented in the paper, pointers to preventive chronic disease tests are presented, and the models are tested and evaluated.<\/jats:p>","DOI":"10.1186\/s40537-020-00315-8","type":"journal-article","created":{"date-parts":[[2020,6,23]],"date-time":"2020-06-23T11:03:07Z","timestamp":1592910187000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":36,"title":["Preventive healthcare policies in the US: solutions for disease management using Big Data Analytics"],"prefix":"10.1186","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6062-2747","authenticated-orcid":false,"given":"Feras A.","family":"Batarseh","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0067-377X","authenticated-orcid":false,"given":"Iya","family":"Ghassib","sequence":"additional","affiliation":[]},{"given":"Deri","family":"Chong","sequence":"additional","affiliation":[]},{"given":"Po-Hsuan","family":"Su","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,6,23]]},"reference":[{"key":"315_CR1","unstructured":"Institute of Medicine (IOM) of the National Academies. 2012. Best Care at Lower Cost: The Path to Continuously Learning Health Care in America. Available at: http:\/\/www.nationalacademies.org\/hmd\/~\/media\/Files\/Report%20Files\/2012\/Best-Care\/BestCareReportBrief.pdf."},{"key":"315_CR2","unstructured":"United Healthcare. America\u2019s Health Rankings. 2016. Spotlight: Prevention. Available at: https:\/\/assets.americashealthrankings.org\/app\/uploads\/spotlight-report_web.pdf."},{"key":"315_CR3","unstructured":"Malone K, Hinman A. Vaccination Mandates: The Public Health Imperative and Individual Rights. Chapter 13 from a report by the Centers for Disease Control and Prevention (CDC). 2019, https:\/\/www.cdc.gov\/vaccines\/imz-managers\/guides-pubs\/downloads\/vacc_mandates_chptr13.pdf."},{"key":"315_CR4","doi-asserted-by":"publisher","first-page":"1023","DOI":"10.1097\/MLR.0b013e318185c913","volume":"46","author":"JD Freeman","year":"2008","unstructured":"Freeman JD, Kadiyala S, Bell JF, Martin DP. The causal effect of health insurance on utilization and outcomes in adults: a systematic review of US studies. Med Care. 2008;46:1023\u201332.","journal-title":"Med Care"},{"key":"315_CR5","unstructured":"Centers for Disease Control and Prevention (CDC). 2015. Health insurance and access to care. NCHS Fact Sheet. http:\/\/www.cdc.gov\/nchs\/data\/factsheets\/factsheet_hiac.pdf."},{"key":"315_CR6","unstructured":"Department of Professional Employees (DPE). 2019. The U.S. Health Care System: An International Perspective, Fact Sheet https:\/\/dpeaflcio.org\/wp-content\/uploads\/US-Health-Care-in-Intl-Perspective-2016.pdf."},{"key":"315_CR7","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pmed.1002578","author":"J Olive","year":"2018","unstructured":"Olive J, Hotez P, Damania A, Nolan MS. The state of the Antivaccine Movement in the United States: a focused examination of nonmedical exemptions in States and Counties. PLOS Med. 2018. https:\/\/doi.org\/10.1371\/journal.pmed.1002578.","journal-title":"PLOS Med."},{"key":"315_CR8","unstructured":"OECD, Health at a Glance. 2015. OECD Publishing. http:\/\/www.oecd-ilibrary.org\/social-issues-migration-health\/health-at-a-glance_19991312."},{"issue":"1","key":"315_CR9","doi-asserted-by":"publisher","first-page":"69","DOI":"10.1111\/j.1600-0757.2011.00416.x","volume":"58","author":"L Borrell","year":"2000","unstructured":"Borrell L, Crawford N. Socioeconomic position indicators and periodontitis: examining the evidence. Periodontol. 2000;58(1):69\u201383.","journal-title":"Periodontol."},{"issue":"6","key":"315_CR10","doi-asserted-by":"publisher","first-page":"503","DOI":"10.1370\/afm.1431","volume":"10","author":"S Petterson","year":"2012","unstructured":"Petterson S, Liaw W, Phillips R, Rabin D, Meyers D, Bazemore A. Projecting US primary care physician workforce needs: 2010\u20132025. Ann Fam Med. 2012;10(6):503\u20139.","journal-title":"Ann Fam Med."},{"key":"315_CR11","unstructured":"Association of American Medical Colleges, State Physician Workforce Data Book. 2015. https:\/\/www.aamc.org\/data\/workforce\/reports\/442830\/statedataandreports.html."},{"key":"315_CR12","unstructured":"Tinker, A. 2014. How to Improve Patient Outcomes for Chronic Diseases and Comorbidities. A report by the Health Catalyst. http:\/\/www.healthcatalyst.com\/wp-content\/uploads\/2014\/04\/How-to-Improve-Patient-Outcomes.pdf."},{"key":"315_CR13","unstructured":"Stead, W., Lin, H. 2009. Computational Technology for Effective Health Care. The National Academic Press (NAP). https:\/\/www.nap.edu\/catalog\/12572\/computational-technology-for-effective-health-care-immediate-steps-and-strategic."},{"key":"315_CR14","unstructured":"Healthcare.gov (ACA website). https:\/\/www.healthcare.gov\/preventive-care-adults\/. Accessed June 2020."},{"key":"315_CR15","unstructured":"PNHP, Healthcare Systems, Four Basic Models. 2015. Physicians for a National Health Program. www.pnhp.org\/single_payer_resources\/health_care_systems_four_basic_models.php."},{"key":"315_CR16","unstructured":"CDC Data Surveys: National Health and Nutrition Examination Survey (NHANES). https:\/\/www.cdc.gov\/nchs\/nhanes\/index.htm. Accessed June 2020."},{"issue":"8","key":"315_CR17","first-page":"114f","volume":"38","author":"G Moore","year":"1965","unstructured":"Moore G. Cramming more components onto integrated circuits. Electronics. 1965;38(8):114f.","journal-title":"Electronics."},{"key":"315_CR18","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1038\/nrd3681","volume":"11","author":"J Scanell","year":"2012","unstructured":"Scanell J, Blanckley A, Boldon H, Warrington B. Diagnosing the decline in pharmaceutical R&D efficiency. Nat Rev Drug Discov. 2012;11:191\u2013200.","journal-title":"Nat Rev Drug Discov"},{"key":"315_CR19","unstructured":"Food and Drug Administration (FDA). FDA Amendments Act. 2007. H.R. 3580."},{"key":"315_CR20","unstructured":"White House. Open Government Initiative. Open Data. 2013. https:\/\/obamawhitehouse.archives.gov\/open. Accessed June 2020."},{"key":"315_CR21","first-page":"121","volume":"7","author":"N Ahluwalia","year":"2016","unstructured":"Ahluwalia N, Dwyer J, Terry A, Moshfegh A, Johnson C. Update on NHANES Dietary Data: focus on collection, release, analytical considerations, and uses to Inform Public Policy. American Society for Nutrition. Section.\u00a02016;7:121\u201334.","journal-title":"American Society for Nutrition. Section"},{"key":"315_CR22","unstructured":"Batarseh, F., Incremental Lifecycle Validation of Knowledge-Based Systems through CommonKADS. 2011. A doctoral dissertation published at the Florida State University Library Services. https:\/\/stars.library.ucf.edu\/etd\/2006\/. Accessed June 2020."},{"key":"315_CR23","unstructured":"Gonzalez AJ, Dankel D. The Engineering of Knowledge-Based Systems, Theory and Practice. Prentice Hall. 2013. ISBN: 978-0132769402 ."},{"key":"315_CR24","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1016\/j.bdr.2015.10.001","volume":"4","author":"F Batarseh","year":"2015","unstructured":"Batarseh F, Abdul-Latif E. Assessing the quality of service using Big Data Analytics\u2014with application to healthcare. J Big Data Res. 2015;4:13\u201324.","journal-title":"J Big Data Res."},{"key":"315_CR25","unstructured":"Sumeet D, Acharya U, Dua P. Machine Learning in Healthcare Informatics. Vol. 56. Berlin: Springer. 2014. ISBN: 978-3-642-40016-2."},{"key":"315_CR26","unstructured":"Stanford Med. The democratization of healthcare. 2018. Stanford medicine health trends report. med.stanford.edu. Accessed June 2020."},{"key":"315_CR27","doi-asserted-by":"crossref","unstructured":"Strome T. Healthcare analytics for quality and performance improvement. Hoboken: Wiley. 2013. ISBN: 978-1-118-51969-1 .","DOI":"10.1002\/9781118761946"},{"issue":"5","key":"315_CR28","doi-asserted-by":"publisher","first-page":"524","DOI":"10.1177\/0956797611430953","volume":"23","author":"L John","year":"2012","unstructured":"John L, Loewenstein G, Drazen P. Measuring the prevalence of questionable research practices with Incentives for truth telling. Psychol Sci. 2012;23(5):524\u201332.","journal-title":"Psychol Sci"},{"issue":"6","key":"315_CR29","doi-asserted-by":"publisher","first-page":"304","DOI":"10.3102\/0013189X14545513","volume":"43","author":"M Makel","year":"2014","unstructured":"Makel M, Plucker J. Facts are more important than novelty: replication in the education sciences. Educ Res. 2014;43(6):304\u201316.","journal-title":"Educ Res."},{"key":"315_CR30","unstructured":"Kogan, S., Zeng, Q., Ash, N., Greenes. R. A. 2001. Problems and Challenges in Patient Information Retrieval: a Descriptive Study. Proceedings of AMIA Symposium. p. 329\u2013333."},{"key":"315_CR31","unstructured":"American Diabetes Association (ADA). 2019. https:\/\/www.diabetes.org\/. Accessed June 2020."},{"key":"315_CR32","unstructured":"R-CRAN Libraries, MICE, Accessed on June 2020. Available at: https:\/\/cran.r-project.org\/web\/packages\/mice\/mice.pdf."},{"issue":"7","key":"315_CR33","doi-asserted-by":"publisher","first-page":"576","DOI":"10.1016\/j.adaj.2018.04.023","volume":"149","author":"P Eke","year":"2018","unstructured":"Eke P, Thornton-Evans G, Wei L, Borgnakke W, Dye B, Genco R. Periodontitis in US adults: National Health and Nutrition Examination Survey 2009\u20132014. J Am Dent Assoc. 2018;149(7):576\u201388.","journal-title":"J Am Dent Assoc."},{"key":"315_CR34","unstructured":"R-CRAN Libraries, ClustMixType. https:\/\/cran.r-project.org\/web\/packages\/clustMixType\/clustMixType.pdf. Accessed June 2020."},{"issue":"1","key":"315_CR35","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1007\/s00125-011-2342-y","volume":"55","author":"P Preshaw","year":"2012","unstructured":"Preshaw P, Alba A, Herrera D, Jepsen S, Konstantinidis A, Makrilakis K, Taylor R. Periodontitis and diabetes: a two-way relationship. Diabetologia. 2012;55(1):21\u201331.","journal-title":"Diabetologia"},{"key":"315_CR36","doi-asserted-by":"crossref","unstructured":"Batarseh F, Yang R. Federal Data Science: Transforming Government and Agricultural Policy Using Artificial Intelligence. Chapter 4: Making the Case for Artificial Intelligence at Government: Guidelines to Transforming Federal Software Systems. 2018. Cambridge: Academic Press. ISBN: 9780128124437 .","DOI":"10.1016\/B978-0-12-812443-7.00004-1"},{"key":"315_CR37","unstructured":"Batarseh F, Yang R. Data Democracy: At the Nexus of Artificial Intelligence, Software Development, and Knowledge Engineering. 1st Edition. Cambridge: Academic Press. 2020. ISBN: 9780128183663."},{"issue":"5","key":"315_CR38","first-page":"81","volume":"39","author":"J Loversidge","year":"2020","unstructured":"Loversidge J, Zurmehly J. Evidence-informed health policy: using EBP to transform policy in nursing and healthcare. Sigma Theta Tau Critical Care Nurse. 2020;39(5):81.","journal-title":"Sigma Theta Tau Critical Care Nurse."},{"key":"315_CR39","doi-asserted-by":"publisher","first-page":"420","DOI":"10.1111\/jcpe.13098","volume":"46","author":"E Montero","year":"2019","unstructured":"Montero E, Herrera D, Sanz M, Dhir S, Dyke T, Sima C. Development and validation of a predictive model for periodontitis using NHANES 2011\u20132012 data. J Clin Periodontol. 2019;46:420\u20139.","journal-title":"J Clin Periodontol"},{"key":"315_CR40","doi-asserted-by":"publisher","first-page":"1","DOI":"10.2307\/2137284","volume":"36","author":"R Andersen","year":"1995","unstructured":"Andersen R. Revisiting the behavioral model and access to medical care: does it matter. J Health Soc Behav. 1995;36:1\u201310.","journal-title":"J Health Soc Behav"}],"container-title":["Journal of Big Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00315-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s40537-020-00315-8\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s40537-020-00315-8.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,6,22]],"date-time":"2021-06-22T23:53:27Z","timestamp":1624406007000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofbigdata.springeropen.com\/articles\/10.1186\/s40537-020-00315-8"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,6,23]]},"references-count":40,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["315"],"URL":"https:\/\/doi.org\/10.1186\/s40537-020-00315-8","relation":{},"ISSN":["2196-1115"],"issn-type":[{"value":"2196-1115","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,6,23]]},"assertion":[{"value":"11 February 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 June 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 June 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare that they have no competing interests.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"38"}}