{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,17]],"date-time":"2026-06-17T04:40:56Z","timestamp":1781671256283,"version":"3.54.5"},"reference-count":75,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2023,4,17]],"date-time":"2023-04-17T00:00:00Z","timestamp":1681689600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/pages\/standard-publication-reuse-rights"}],"funder":[{"name":"Patient-Centered Outcomes Research Institute (PCORI) Project Program Awards","award":["ME-2018C3-14899"],"award-info":[{"award-number":["ME-2018C3-14899"]}]},{"name":"Eunice Kennedy Shriver National Institute of Child Health","award":["R01HD099348"],"award-info":[{"award-number":["R01HD099348"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,6,20]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Objectives<\/jats:title>\n                  <jats:p>The impacts of missing data in comparative effectiveness research (CER) using electronic health records (EHRs) may vary depending on the type and pattern of missing data. In this study, we aimed to quantify these impacts and compare the performance of different imputation methods.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Materials and Methods<\/jats:title>\n                  <jats:p>We conducted an empirical (simulation) study to quantify the bias and power loss in estimating treatment effects in CER using EHR data. We considered various missing scenarios and used the propensity scores to control for confounding. We compared the performance of the multiple imputation and spline smoothing methods to handle missing data.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>When missing data depended on the stochastic progression of disease and medical practice patterns, the spline smoothing method produced results that were close to those obtained when there were no missing data. Compared to multiple imputation, the spline smoothing generally performed similarly or better, with smaller estimation bias and less power loss. The multiple imputation can still reduce study bias and power loss in some restrictive scenarios, eg, when missing data did not depend on the stochastic process of disease progression.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Discussion and Conclusion<\/jats:title>\n                  <jats:p>Missing data in EHRs could lead to biased estimates of treatment effects and false negative findings in CER even after missing data were imputed. It is important to leverage the temporal information of disease trajectory to impute missing values when using EHRs as a data resource for CER and to consider the missing rate and the effect size when choosing an imputation method.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/jamia\/ocad066","type":"journal-article","created":{"date-parts":[[2023,4,17]],"date-time":"2023-04-17T19:42:40Z","timestamp":1681760560000},"page":"1246-1256","source":"Crossref","is-referenced-by-count":28,"title":["Missing data matter: an empirical evaluation of the impacts of missing EHR data in comparative effectiveness research"],"prefix":"10.1093","volume":"30","author":[{"given":"Yizhao","family":"Zhou","sequence":"first","affiliation":[{"name":"Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine at the University of Pennsylvania , Philadelphia, Pennsylvania, USA"},{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia , Philadelphia, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jiasheng","family":"Shi","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine at the University of Pennsylvania , Philadelphia, Pennsylvania, USA"},{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia , Philadelphia, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ronen","family":"Stein","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia , Philadelphia, Pennsylvania, USA"},{"name":"Department of Pediatrics, Perelman School of Medicine at the University of Pennsylvania , Philadelphia, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6920-5598","authenticated-orcid":false,"given":"Xiaokang","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine at the University of Pennsylvania , Philadelphia, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Robert N","family":"Baldassano","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia , Philadelphia, Pennsylvania, USA"},{"name":"Department of Pediatrics, Perelman School of Medicine at the University of Pennsylvania , Philadelphia, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Christopher B","family":"Forrest","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia , Philadelphia, Pennsylvania, USA"},{"name":"Department of Pediatrics, Perelman School of Medicine at the University of Pennsylvania , Philadelphia, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0835-0788","authenticated-orcid":false,"given":"Yong","family":"Chen","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine at the University of Pennsylvania , Philadelphia, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jing","family":"Huang","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine at the University of Pennsylvania , Philadelphia, Pennsylvania, USA"},{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia , Philadelphia, Pennsylvania, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2023,4,17]]},"reference":[{"issue":"6","key":"2023062008373731800_ocad066-B1","doi-asserted-by":"crossref","first-page":"764","DOI":"10.2310\/JIM.0b013e3181e3d2af","article-title":"Comparative effectiveness research: what kind of studies do we need?","volume":"58","author":"Concato","year":"2010","journal-title":"J Investig Med"},{"issue":"5","key":"2023062008373731800_ocad066-B2","doi-asserted-by":"crossref","first-page":"367","DOI":"10.7326\/0003-4819-156-5-201203060-00009","article-title":"What comparative effectiveness research is needed? A framework for using guidelines and systematic reviews to identify evidence gaps and research priorities","volume":"156","author":"Li","year":"2012","journal-title":"Ann Intern Med"},{"issue":"3","key":"2023062008373731800_ocad066-B3","doi-asserted-by":"crossref","first-page":"203","DOI":"10.7326\/0003-4819-151-3-200908040-00125","article-title":"Comparative effectiveness research: a report from the Institute of Medicine","volume":"151","author":"Sox","year":"2009","journal-title":"Ann Intern Med"},{"issue":"14","key":"2023062008373731800_ocad066-B4","doi-asserted-by":"crossref","first-page":"1359","DOI":"10.1001\/jama.2019.4064","article-title":"The evolving uses of \u201creal-world\u201d data","volume":"321","author":"Basch","year":"2019","journal-title":"JAMA"},{"issue":"34","key":"2023062008373731800_ocad066-B5","doi-asserted-by":"crossref","first-page":"4243","DOI":"10.1200\/JCO.2012.42.8011","article-title":"Importance of health information technology, electronic health records, and continuously aggregating data to comparative effectiveness research and learning health care","volume":"30","author":"Miriovsky","year":"2012","journal-title":"J Clin Oncol"},{"issue":"8 Suppl 3","key":"2023062008373731800_ocad066-B6","doi-asserted-by":"crossref","first-page":"S30","DOI":"10.1097\/MLR.0b013e31829b1dbd","article-title":"Caveats for the use of operational electronic health record data in comparative effectiveness research","volume":"51","author":"Hersh","year":"2013","journal-title":"Med Care"},{"issue":"6","key":"2023062008373731800_ocad066-B7","doi-asserted-by":"crossref","first-page":"529","DOI":"10.2217\/cer.13.65","article-title":"Role of electronic health records in comparative effectiveness research","volume":"2","author":"Gallego","year":"2013","journal-title":"J Comp Eff Res"},{"issue":"3","key":"2023062008373731800_ocad066-B8","doi-asserted-by":"crossref","first-page":"7","DOI":"10.13063\/2327-9214.1035","article-title":"Strategies for handling missing data in electronic health record derived data","volume":"1","author":"Wells","year":"2013","journal-title":"eGEMs"},{"issue":"2","key":"2023062008373731800_ocad066-B9","doi-asserted-by":"crossref","first-page":"e210184","DOI":"10.1001\/jamanetworkopen.2021.0184","article-title":"Assessing missing data assumptions in EHR-based studies: a complex and underappreciated task","volume":"4","author":"Haneuse","year":"2021","journal-title":"JAMA Netw Open"},{"issue":"1","key":"2023062008373731800_ocad066-B10","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1097\/EDE.0000000000000393","article-title":"Learning about missing data mechanisms in electronic health records-based research: a survey-based approach","volume":"27","author":"Haneuse","year":"2016","journal-title":"Epidemiology"},{"issue":"7","key":"2023062008373731800_ocad066-B11","doi-asserted-by":"crossref","first-page":"976","DOI":"10.1007\/s11606-014-2883-0","article-title":"Prospective EHR-based clinical trials: the challenge of missing data","volume":"29","author":"Kharrazi","year":"2014","journal-title":"J Gen Intern Med"},{"issue":"3","key":"2023062008373731800_ocad066-B12","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1093\/biomet\/63.3.581","article-title":"Inference and missing data","volume":"63","author":"Rubin","year":"1976","journal-title":"Biometrika"},{"key":"2023062008373731800_ocad066-B13","volume-title":"Statistical Analysis with Missing Data","author":"Little","year":"2019"},{"issue":"4","key":"2023062008373731800_ocad066-B14","doi-asserted-by":"crossref","first-page":"602","DOI":"10.1136\/amiajnl-2014-002743","article-title":"PEDSnet: a national pediatric learning health system","volume":"21","author":"Forrest","year":"2014","journal-title":"J Am Med Inform Assoc"},{"issue":"Suppl 1","key":"2023062008373731800_ocad066-B15","doi-asserted-by":"crossref","first-page":"i109","DOI":"10.1136\/amiajnl-2011-000463","article-title":"Exploiting time in electronic health record correlations","volume":"18","author":"Hripcsak","year":"2011","journal-title":"J Am Med Inform Assoc"},{"issue":"1","key":"2023062008373731800_ocad066-B16","doi-asserted-by":"crossref","first-page":"013111","DOI":"10.1063\/1.3675621","article-title":"Using time-delayed mutual information to discover and interpret temporal correlation structure in complex populations","volume":"22","author":"Albers","year":"2012","journal-title":"Chaos"},{"issue":"9","key":"2023062008373731800_ocad066-B17","doi-asserted-by":"crossref","first-page":"1159","DOI":"10.1016\/j.physleta.2009.12.067","article-title":"A statistical dynamics approach to the study of human health data: resolving population scale diurnal variation in laboratory data","volume":"374","author":"Albers","year":"2010","journal-title":"Phys Lett A"},{"key":"2023062008373731800_ocad066-B18","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1016\/j.jbi.2018.08.014","article-title":"Methodological variations in lagged regression for detecting physiologic drug effects in EHR data","volume":"86","author":"Levine","year":"2018","journal-title":"J Biomed Inform"},{"issue":"e2","key":"2023062008373731800_ocad066-B19","doi-asserted-by":"crossref","first-page":"e311","DOI":"10.1136\/amiajnl-2013-001922","article-title":"Correlating electronic health record concepts with healthcare process events","volume":"20","author":"Hripcsak","year":"2013","journal-title":"J Am Med Inform Assoc"},{"issue":"1887","key":"2023062008373731800_ocad066-B20","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1098\/rsta.2008.0157","article-title":"Robust parameter extraction for decision support using multimodal intensive care data","volume":"367","author":"Clifford","year":"2009","journal-title":"Phil Trans R Soc A"},{"issue":"1","key":"2023062008373731800_ocad066-B21","first-page":"446","article-title":"A multivariate timeseries modeling approach to severity of illness assessment and forecasting in ICU with sparse, heterogeneous clinical data","volume":"29","author":"Ghassemi","year":"2015","journal-title":"Proc AAAI Conf Artif Intell"},{"key":"2023062008373731800_ocad066-B22","doi-asserted-by":"crossref","first-page":"k1479","DOI":"10.1136\/bmj.k1479","article-title":"Biases in electronic health record data due to processes within the healthcare system: retrospective observational study","volume":"361","author":"Agniel","year":"2018","journal-title":"BMJ"},{"issue":"6","key":"2023062008373731800_ocad066-B23","doi-asserted-by":"crossref","first-page":"e66341","DOI":"10.1371\/journal.pone.0066341","article-title":"Computational phenotype discovery using unsupervised feature learning over noisy, sparse, and irregular clinical data","volume":"8","author":"Lasko","year":"2013","journal-title":"PLoS One"},{"key":"2023062008373731800_ocad066-B24","author":"Saria","year":"2011"},{"key":"2023062008373731800_ocad066-B25","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1016\/j.jbi.2014.03.016","article-title":"Identifying and mitigating biases in EHR laboratory tests","volume":"51","author":"Pivovarov","year":"2014","journal-title":"J Biomed Inform"},{"issue":"6","key":"2023062008373731800_ocad066-B26","doi-asserted-by":"crossref","first-page":"1038","DOI":"10.1136\/amiajnl-2013-002592","article-title":"Temporal trends of hemoglobin A1c testing","volume":"21","author":"Pivovarov","year":"2014","journal-title":"J Am Med Inform Assoc"},{"key":"2023062008373731800_ocad066-B27","author":"Levine","year":"2016"},{"issue":"10","key":"2023062008373731800_ocad066-B28","doi-asserted-by":"crossref","first-page":"1392","DOI":"10.1093\/jamia\/ocy106","article-title":"Mechanistic machine learning: how data assimilation leverages physiologic knowledge using Bayesian inference to forecast the future, infer the present, and phenotype","volume":"25","author":"Albers","year":"2018","journal-title":"J Am Med Inform Assoc"},{"issue":"1","key":"2023062008373731800_ocad066-B29","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1016\/j.artmed.2013.01.003","article-title":"Missing data in medical databases: impute, delete or classify?","volume":"58","author":"Cismondi","year":"2013","journal-title":"Artif Intell Med"},{"issue":"1","key":"2023062008373731800_ocad066-B30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.jbi.2007.06.001","article-title":"Exploiting missing clinical data in Bayesian network modeling for predicting medical problems","volume":"41","author":"Lin","year":"2008","journal-title":"J Biomed Inform"},{"issue":"1","key":"2023062008373731800_ocad066-B31","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/srep46226","article-title":"Analysis of free text in electronic health records for identification of cancer patient trajectories","volume":"7","author":"Jensen","year":"2017","journal-title":"Sci Rep"},{"issue":"7","key":"2023062008373731800_ocad066-B32","doi-asserted-by":"crossref","first-page":"e542","DOI":"10.1016\/S2589-7500(22)00091-7","article-title":"COVID-19 trajectories among 57 million adults in England: a cohort study using electronic health records","volume":"4","author":"Thygesen","year":"2022","journal-title":"Lancet Digit Health"},{"issue":"7","key":"2023062008373731800_ocad066-B33","doi-asserted-by":"crossref","first-page":"2476","DOI":"10.1109\/JBHI.2021.3089441","article-title":"A computational method for learning disease trajectories from partially observable EHR data","volume":"25","author":"Oh","year":"2021","journal-title":"IEEE J Biomed Health Inform"},{"issue":"9","key":"2023062008373731800_ocad066-B34","doi-asserted-by":"crossref","first-page":"e0237724","DOI":"10.1371\/journal.pone.0237724","article-title":"A new analytical framework for missing data imputation and classification with uncertainty: missing data imputation and heart failure readmission prediction","volume":"15","author":"Hu","year":"2020","journal-title":"PLoS One"},{"issue":"4","key":"2023062008373731800_ocad066-B35","doi-asserted-by":"crossref","first-page":"772","DOI":"10.1093\/jamia\/ocaa288","article-title":"High-throughput phenotyping with temporal sequences","volume":"28","author":"Estiri","year":"2021","journal-title":"J Am Med Inform Assoc"},{"key":"2023062008373731800_ocad066-B36","author":"Liu","year":"2015"},{"key":"2023062008373731800_ocad066-B37","doi-asserted-by":"crossref","first-page":"103314","DOI":"10.1016\/j.jbi.2019.103314","article-title":"A method for the graphical modeling of relative temporal constraints","volume":"100","author":"Mate","year":"2019","journal-title":"J Biomed Inform"},{"key":"2023062008373731800_ocad066-B38","doi-asserted-by":"crossref","first-page":"103335","DOI":"10.1016\/j.jbi.2019.103335","article-title":"Temporal phenotyping by mining healthcare data to derive lines of therapy for cancer","volume":"100","author":"Meng","year":"2019","journal-title":"J Biomed Inform"},{"key":"2023062008373731800_ocad066-B39","doi-asserted-by":"crossref","first-page":"103361","DOI":"10.1016\/j.jbi.2019.103361","article-title":"Identifying sub-phenotypes of acute kidney injury using structured and unstructured electronic health record data with memory networks","volume":"102","author":"Xu","year":"2020","journal-title":"J Biomed Inform"},{"key":"2023062008373731800_ocad066-B40","author":"Cheng","year":"2016"},{"key":"2023062008373731800_ocad066-B41","doi-asserted-by":"crossref","first-page":"260","DOI":"10.1016\/j.jbi.2016.01.009","article-title":"Developing EHR-driven heart failure risk prediction models using CPXR (Log) with the probabilistic loss function","volume":"60","author":"Taslimitehrani","year":"2016","journal-title":"J Biomed Inform"},{"issue":"11","key":"2023062008373731800_ocad066-B42","doi-asserted-by":"crossref","first-page":"1764","DOI":"10.1093\/jamia\/ocaa143","article-title":"Social determinants of health in electronic health records and their impact on analysis and risk prediction: a systematic review","volume":"27","author":"Chen","year":"2020","journal-title":"J Am Med Inform Assoc"},{"key":"2023062008373731800_ocad066-B43","author":"Che","year":"2017"},{"key":"2023062008373731800_ocad066-B44","doi-asserted-by":"crossref","first-page":"815674","DOI":"10.3389\/fpubh.2022.815674","article-title":"A process mining pipeline to characterise COVID-19 patients\u2019 trajectories and identify relevant temporal phenotypes from EHR data","volume":"10","author":"Dagliati","year":"2022","journal-title":"Front Public Health"},{"issue":"6","key":"2023062008373731800_ocad066-B45","doi-asserted-by":"crossref","first-page":"1134","DOI":"10.1093\/jamia\/ocx071","article-title":"Biases introduced by filtering electronic health records for patients with \u201ccomplete data\u201d","volume":"24","author":"Weber","year":"2017","journal-title":"J Am Med Inform Assoc"},{"issue":"5","key":"2023062008373731800_ocad066-B46","doi-asserted-by":"crossref","first-page":"2125","DOI":"10.1007\/s11695-021-05226-y","article-title":"Investigating bias from missing data in an electronic health records-based study of weight loss after bariatric surgery","volume":"31","author":"Koffman","year":"2021","journal-title":"Obes Surg"},{"key":"2023062008373731800_ocad066-B47","author":"Beaulieu-Jones","year":"2017"},{"issue":"11","key":"2023062008373731800_ocad066-B48","doi-asserted-by":"crossref","first-page":"1544","DOI":"10.1001\/jamainternmed.2018.3763","article-title":"Potential biases in machine learning algorithms using electronic health record data","volume":"178","author":"Gianfrancesco","year":"2018","journal-title":"JAMA Intern Med"},{"issue":"4","key":"2023062008373731800_ocad066-B49","doi-asserted-by":"crossref","first-page":"946","DOI":"10.1111\/1475-6773.12295","article-title":"Imputing missing race\/ethnicity in pediatric electronic health records: reducing bias with use of US census location and surname data","volume":"50","author":"Grundmeier","year":"2015","journal-title":"Health Serv Res"},{"key":"2023062008373731800_ocad066-B50","author":"Cismondi","year":"2011"},{"issue":"1","key":"2023062008373731800_ocad066-B51","doi-asserted-by":"crossref","first-page":"9","DOI":"10.21037\/atm-20-3623","article-title":"Missing data imputation: focusing on single imputation","volume":"4","author":"Zhang","year":"2016","journal-title":"Ann Transl Med"},{"issue":"10","key":"2023062008373731800_ocad066-B52","doi-asserted-by":"crossref","first-page":"1087","DOI":"10.1016\/j.jclinepi.2006.01.014","article-title":"A gentle introduction to imputation of missing values","volume":"59","author":"Donders","year":"2006","journal-title":"J Clin Epidemiol"},{"issue":"3","key":"2023062008373731800_ocad066-B53","doi-asserted-by":"crossref","first-page":"341","DOI":"10.2306\/scienceasia1513-1874.2008.34.341","article-title":"Estimation of missing values in air pollution data using single imputation techniques","volume":"34","author":"Norazian","year":"2008","journal-title":"ScienceAsia"},{"key":"2023062008373731800_ocad066-B54","doi-asserted-by":"crossref","first-page":"103270","DOI":"10.1016\/j.jbi.2019.103270","article-title":"Detecting time-evolving phenotypic topics via tensor factorization on electronic health records: cardiovascular disease case study","volume":"98","author":"Zhao","year":"2019","journal-title":"J Biomed Inform"},{"issue":"1","key":"2023062008373731800_ocad066-B55","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1177\/096228029900800102","article-title":"Multiple imputation: a primer","volume":"8","author":"Schafer","year":"1999","journal-title":"Stat Methods Med Res"},{"issue":"434","key":"2023062008373731800_ocad066-B56","doi-asserted-by":"crossref","first-page":"473","DOI":"10.1080\/01621459.1996.10476908","article-title":"Multiple imputation after 18+ years","volume":"91","author":"Rubin","year":"1996","journal-title":"J Am Stat Assoc"},{"issue":"1","key":"2023062008373731800_ocad066-B57","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1136\/amiajnl-2012-001145","article-title":"Next-generation phenotyping of electronic health records","volume":"20","author":"Hripcsak","year":"2013","journal-title":"J Am Med Inform Assoc"},{"issue":"4","key":"2023062008373731800_ocad066-B58","doi-asserted-by":"crossref","first-page":"794","DOI":"10.1093\/jamia\/ocu051","article-title":"Parameterizing time in electronic health record studies","volume":"22","author":"Hripcsak","year":"2015","journal-title":"J Am Med Inform Assoc"},{"issue":"1","key":"2023062008373731800_ocad066-B59","doi-asserted-by":"crossref","first-page":"31","DOI":"10.3748\/wjg.v20.i1.31","article-title":"Natural history and long-term clinical course of Crohn\u2019s disease","volume":"20","author":"Freeman","year":"2014","journal-title":"World J Gastroenterol"},{"key":"2023062008373731800_ocad066-B60","first-page":"574","author":"Hripcsak","year":"2015"},{"key":"2023062008373731800_ocad066-B61","author":"Lu"},{"issue":"1\/2","key":"2023062008373731800_ocad066-B62","article-title":"Comparatives outcomes study of patients hospitalized with diabetes and myocardial infarction: EHR data interrogation among hospital categories","volume":"14","author":"Okunji","year":"2019","journal-title":"Can J Nurs Inform"},{"key":"2023062008373731800_ocad066-B63","author":"Zhou","year":"2022"},{"issue":"260","key":"2023062008373731800_ocad066-B64","doi-asserted-by":"crossref","first-page":"663","DOI":"10.1080\/01621459.1952.10483446","article-title":"A generalization of sampling without replacement from a finite universe","volume":"47","author":"Horvitz","year":"1952","journal-title":"J Am Stat Assoc"},{"issue":"521","key":"2023062008373731800_ocad066-B65","doi-asserted-by":"crossref","first-page":"390","DOI":"10.1080\/01621459.2016.1260466","article-title":"Balancing covariates via propensity score weighting","volume":"113","author":"Li","year":"2018","journal-title":"J Am Stat Assoc"},{"key":"2023062008373731800_ocad066-B66","doi-asserted-by":"crossref","first-page":"104204","DOI":"10.1016\/j.jbi.2022.104204","article-title":"Adjusting for indirectly measured confounding using large-scale propensity score","volume":"134","author":"Zhang","year":"2022","journal-title":"J Biomed Inform"},{"issue":"17","key":"2023062008373731800_ocad066-B67","doi-asserted-by":"crossref","first-page":"2308","DOI":"10.1002\/sim.8540","article-title":"A novel approach for propensity score matching and stratification for multiple treatments: application to an electronic health record\u2013derived study","volume":"39","author":"Brown","year":"2020","journal-title":"Stat Med"},{"key":"2023062008373731800_ocad066-B68","author":"Zeileis","year":"2005"},{"issue":"2","key":"2023062008373731800_ocad066-B69","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/0021-9045(76)90040-X","article-title":"Optimal error bounds for cubic spline interpolation","volume":"16","author":"Hall","year":"1976","journal-title":"J Approx Theory"},{"issue":"394","key":"2023062008373731800_ocad066-B70","doi-asserted-by":"crossref","first-page":"366","DOI":"10.1080\/01621459.1986.10478280","article-title":"Multiple imputation for interval estimation from simple random samples with ignorable nonresponse","volume":"81","author":"Rubin","year":"1986","journal-title":"J Am Stat Assoc"},{"key":"2023062008373731800_ocad066-B71","volume-title":"Multiple Imputation for Nonresponse in Surveys","author":"Rubin","year":"2004"},{"key":"2023062008373731800_ocad066-B72","doi-asserted-by":"crossref","DOI":"10.1201\/9781420010404","volume-title":"Generalized Additive Models: An Introduction with R","author":"Wood","year":"2006"},{"key":"2023062008373731800_ocad066-B73","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4614-7138-7","volume-title":"An Introduction to Statistical Learning","author":"James","year":"2013"},{"issue":"12","key":"2023062008373731800_ocad066-B74","doi-asserted-by":"crossref","first-page":"1609","DOI":"10.1093\/jamia\/ocz148","article-title":"How and when informative visit processes can bias inference when using electronic health records data for clinical research","volume":"26","author":"Goldstein","year":"2019","journal-title":"J Am Med Inform Assoc"},{"issue":"7","key":"2023062008373731800_ocad066-B75","doi-asserted-by":"crossref","first-page":"1191","DOI":"10.1093\/jamia\/ocac050","article-title":"Informative presence bias in analyses of electronic health records-derived data: a cautionary note","volume":"29","author":"Harton","year":"2022","journal-title":"J Am Med Inform Assoc"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/30\/7\/1246\/50634362\/ocad066.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/jamia\/article-pdf\/30\/7\/1246\/50634362\/ocad066.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,20]],"date-time":"2023-06-20T08:40:09Z","timestamp":1687250409000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/30\/7\/1246\/7126960"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,17]]},"references-count":75,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2023,4,17]]},"published-print":{"date-parts":[[2023,6,20]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocad066","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,7,1]]},"published":{"date-parts":[[2023,4,17]]}}}