{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T06:16:20Z","timestamp":1761718580518},"reference-count":29,"publisher":"Georg Thieme Verlag KG","issue":"02","funder":[{"name":"Department of Health and Human Services","award":["1C1CMS331001\u201301\u201300"],"award-info":[{"award-number":["1C1CMS331001\u201301\u201300"]}]},{"name":"US Department of Education Graduate Assistance in Areas of National Need (GAANN) Fellowship","award":["P200A100053"],"award-info":[{"award-number":["P200A100053"]}]},{"name":"National Science Foundation","award":["CNS-1429294"],"award-info":[{"award-number":["CNS-1429294"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Appl Clin Inform"],"published-print":{"date-parts":[[2017,4]]},"abstract":"<jats:title>Summary<\/jats:title><jats:p>Background: Because 5% of patients incur 50% of healthcare expenses, population health managers need to be able to focus preventive and longitudinal care on those patients who are at highest risk of increased utilization. Predictive analytics can be used to identify these patients and to better manage their care. Data mining permits the development of models that surpass the size restrictions of traditional statistical methods and take advantage of the rich data available in the electronic health record (EHR), without limiting predictions to specific chronic conditions.<\/jats:p><jats:p>Objective: The objective was to demonstrate the usefulness of unrestricted EHR data for predictive analytics in managed healthcare.<\/jats:p><jats:p>Methods: In a population of 9,568 Medicare and Medicaid beneficiaries, patients in the highest 5% of charges were compared to equal numbers of patients with the lowest charges. Contrast mining was used to discover the combinations of clinical attributes frequently associated with high utilization and infrequently associated with low utilization. The attributes found in these combinations were then tested by multiple logistic regression, and the discrimination of the model was evaluated by the c-statistic.<\/jats:p><jats:p>Results: Of 19,014 potential EHR patient attributes, 67 were found in combinations frequently associated with high utilization, but not with low utilization (support&gt;20%). Eleven of these attributes were significantly associated with high utilization (p&lt;0.05). A prediction model composed of these eleven attributes had a discrimination of 84%.<\/jats:p><jats:p>Conclusions: EHR mining reduced an unusably high number of patient attributes to a manageable set of potential healthcare utilization predictors, without conjecturing on which attributes would be useful. Treating these results as hypotheses to be tested by conventional methods yielded a highly accurate predictive model. This novel, two-step methodology can assist population health managers to focus preventive and longitudinal care on those patients who are at highest risk for increased utilization.<\/jats:p><jats:p>Citation: Sheets L, Petroski GF, Zhuang Y, Phinney MA, Ge B, Parker JC, Shyu C-R. Combining contrast mining with logistic regression to predict healthcare Appl Clin Inform 2017; 8: 430\u2013446 https:\/\/doi.org\/10.4338\/ACI-2016-05-RA-0078<\/jats:p>","DOI":"10.4338\/aci-2016-05-ra-0078","type":"journal-article","created":{"date-parts":[[2017,5,3]],"date-time":"2017-05-03T05:55:27Z","timestamp":1493790927000},"page":"430-446","source":"Crossref","is-referenced-by-count":5,"title":["Combining Contrast Mining with Logistic Regression To Predict Healthcare Utilization in a Managed Care Population"],"prefix":"10.4338","volume":"08","author":[{"given":"Lincoln","family":"Sheets","sequence":"additional","affiliation":[]},{"given":"Gregory","family":"Petroski","sequence":"additional","affiliation":[]},{"given":"Yan","family":"Zhuang","sequence":"additional","affiliation":[]},{"given":"Michael","family":"Phinney","sequence":"additional","affiliation":[]},{"given":"Bin","family":"Ge","sequence":"additional","affiliation":[]},{"given":"Jerry","family":"Parker","sequence":"additional","affiliation":[]},{"given":"Chi-Ren","family":"Shyu","sequence":"additional","affiliation":[]}],"member":"194","published-online":{"date-parts":[[2017,12,21]]},"reference":[{"key":"10.4338\/ACI-2016-05-RA-0078-1","doi-asserted-by":"publisher","DOI":"10.1377\/hlthaff.27.3.759"},{"issue":"2","key":"10.4338\/ACI-2016-05-RA-0078-2","first-page":"63","volume":"27","author":"Wagner","year":"2001","journal-title":"Jt Comm J Qual Improv"},{"key":"10.4338\/ACI-2016-05-RA-0078-3","doi-asserted-by":"publisher","unstructured":"Glasgow RE, Orleans CT, Wagner EH. Does the chronic care model serve also as a template for improving prevention? Milbank Q 2001; 79(4): 579-612, iv-v","DOI":"10.1111\/1468-0009.00222"},{"key":"10.4338\/ACI-2016-05-RA-0078-4","doi-asserted-by":"publisher","DOI":"10.1001\/jama.288.15.1909"},{"key":"10.4338\/ACI-2016-05-RA-0078-5","doi-asserted-by":"publisher","DOI":"10.1377\/hlthaff.28.1.75"},{"key":"10.4338\/ACI-2016-05-RA-0078-6","doi-asserted-by":"publisher","DOI":"10.1097\/00001888-200311000-00002"},{"issue":"4","key":"10.4338\/ACI-2016-05-RA-0078-7","first-page":"102","volume":"66","author":"Bradley","year":"2012","journal-title":"Healthc Financ Manage"},{"key":"10.4338\/ACI-2016-05-RA-0078-8","unstructured":"Cohen SB, Uberoi N, United States Agency for Healthcare Research and Quality. Differentials in the concentration in the level of health expenditures across population subgroups in the US, 2010. Rockville: Agency for Healthcare Research and Quality; 2013"},{"key":"10.4338\/ACI-2016-05-RA-0078-9","doi-asserted-by":"publisher","DOI":"10.1377\/hlthaff.2014.0352"},{"key":"10.4338\/ACI-2016-05-RA-0078-10","doi-asserted-by":"publisher","DOI":"10.1016\/j.maturitas.2015.03.009"},{"key":"10.4338\/ACI-2016-05-RA-0078-11","doi-asserted-by":"publisher","unstructured":"Kantardzic M. Data mining: Concepts, models, methods, and algorithms. 2nd ed. Hoboken: John Wiley & Sons; 2011","DOI":"10.1002\/9781118029145"},{"key":"10.4338\/ACI-2016-05-RA-0078-12","doi-asserted-by":"publisher","DOI":"10.1111\/ijcp.12265"},{"key":"10.4338\/ACI-2016-05-RA-0078-13","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2013-002033"},{"key":"10.4338\/ACI-2016-05-RA-0078-14","doi-asserted-by":"publisher","DOI":"10.1016\/j.acra.2015.09.014"},{"issue":"3","key":"10.4338\/ACI-2016-05-RA-0078-15","first-page":"68","volume":"9","author":"Chechulin","year":"2014","journal-title":"Healthc Policy"},{"issue":"5","key":"10.4338\/ACI-2016-05-RA-0078-16","first-page":"381","volume":"9","author":"Dove","year":"2003","journal-title":"Am J Manag Care"},{"key":"10.4338\/ACI-2016-05-RA-0078-17","doi-asserted-by":"publisher","DOI":"10.4338\/ACI-2012-12-RA-0058"},{"key":"10.4338\/ACI-2016-05-RA-0078-18","doi-asserted-by":"publisher","DOI":"10.4338\/ACI-2012-12-RA-0051"},{"key":"10.4338\/ACI-2016-05-RA-0078-19","unstructured":"Witten IH, Frank E.\u00a0Data Mining: Practical machine learning tools and techniques, 2nd Ed. San Francisco: Morgan Kaufmann; 2005"},{"key":"10.4338\/ACI-2016-05-RA-0078-20","doi-asserted-by":"crossref","unstructured":"Dong G. Preliminaries. In: Dong G, Bailey J, editors. Contrast data mining: concepts, algorithms, and applications. Boca Raton: CRC Press; 2013. p. 8","DOI":"10.1201\/b12986-3"},{"key":"10.4338\/ACI-2016-05-RA-0078-21","doi-asserted-by":"publisher","unstructured":"Shvachko K, Kuang H, Radia S, Chansler R. The Hadoop distributed file system. Mass Storage Systems and Technologies (MSST), 2010 IEEE 26th Symposium on. New York: Institute of Electrical and Electronics Engineers; 2010","DOI":"10.1109\/MSST.2010.5496972"},{"key":"10.4338\/ACI-2016-05-RA-0078-22","unstructured":"Health Data Interactive. Atlanta: Centers for Disease Control and Prevention; c2016 [updated 2016 May 16, cited 2016 May 26]. Available from: http:\/\/www.cdc.gov\/nchs\/hdi.htm"},{"key":"10.4338\/ACI-2016-05-RA-0078-23","unstructured":"Agrawal R, Srikant R. Fast algorithms for mining association rules in large databases. Proceedings of the 20th International Conference on Very Large Data Bases 1994; 487-499"},{"key":"10.4338\/ACI-2016-05-RA-0078-24","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1111\/j.2517-6161.1958.tb00292.x","volume":"20","author":"Cox,","year":"1958","journal-title":"J Roy Stat Soc B"},{"key":"10.4338\/ACI-2016-05-RA-0078-25","unstructured":"Myers RH. Classical and Modern Regression with Applications, Second Edition. Boston: PWK Kent; 1990"},{"key":"10.4338\/ACI-2016-05-RA-0078-26","doi-asserted-by":"publisher","DOI":"10.1148\/radiology.143.1.7063747"},{"key":"10.4338\/ACI-2016-05-RA-0078-27","doi-asserted-by":"publisher","unstructured":"Zhou Xh, Obuchowski NA, McClish DK. Statistical Methods in Diagnostic Medicine. New York: John Wiley and Sons; 2002","DOI":"10.1002\/9780470317082"},{"key":"10.4338\/ACI-2016-05-RA-0078-28","unstructured":"Total Expenses and Percent Distribution for Selected Conditions by Type of Service: United States, 2013. Rockville: Agency for Healthcare Research and Quality; c2016 [updated 2016 May 26, cited 2016 May 26]. Available from: http:\/\/meps.ahrq.gov\/mepsweb\/data_stats\/tables_compendia_hh_interactive.jsp?_SERVICE=MEPSSocket0&_PROGRAM=MEPSPGM.TC.SAS&File=HCFY2013&Table=HCFY2013_CNDXP_C&_Debug="},{"key":"10.4338\/ACI-2016-05-RA-0078-29","doi-asserted-by":"crossref","unstructured":"Gelman A, Hill J. Data Analysis Using Regression and Multilevel\/Hierarchical Models. New York: Cambridge University Press; 2007","DOI":"10.1017\/CBO9780511790942"}],"container-title":["Applied Clinical Informatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.thieme-connect.de\/products\/ejournals\/pdf\/10.4338\/ACI-2016-05-RA-0078.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,23]],"date-time":"2024-06-23T22:25:33Z","timestamp":1719181533000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.thieme-connect.de\/DOI\/DOI?10.4338\/ACI-2016-05-RA-0078"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,4]]},"references-count":29,"journal-issue":{"issue":"02","published-online":{"date-parts":[[2017,12,21]]},"published-print":{"date-parts":[[2017,4]]}},"URL":"https:\/\/doi.org\/10.4338\/aci-2016-05-ra-0078","relation":{},"ISSN":["1869-0327"],"issn-type":[{"value":"1869-0327","type":"electronic"}],"subject":[],"published":{"date-parts":[[2017,4]]}}}