{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,3]],"date-time":"2026-03-03T06:20:29Z","timestamp":1772518829321,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2019,3,8]],"date-time":"2019-03-08T00:00:00Z","timestamp":1552003200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Pew Charitable Trust","award":["PEW30381"],"award-info":[{"award-number":["PEW30381"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Objective<\/jats:title>\n                  <jats:p>This study evaluated the degree to which recommendations for demographic data standardization improve patient matching accuracy using real-world datasets.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Materials and Methods<\/jats:title>\n                  <jats:p>We used 4 manually reviewed datasets, containing a random selection of matches and nonmatches. Matching datasets included health information exchange (HIE) records, public health registry records, Social Security Death Master File records, and newborn screening records. Standardized fields including last name, telephone number, social security number, date of birth, and address. Matching performance was evaluated using 4 metrics: sensitivity, specificity, positive predictive value, and accuracy.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Standardizing address was independently associated with improved matching sensitivities for both the public health and HIE datasets of approximately 0.6% and 4.5%. Overall accuracy was unchanged for both datasets due to reduced match specificity. We observed no similar impact for address standardization in the death master file dataset. Standardizing last name yielded improved matching sensitivity of 0.6% for the HIE dataset, while overall accuracy remained the same due to a decrease in match specificity. We noted no similar impact for other datasets. Standardizing other individual fields (telephone, date of birth, or social security number) showed no matching improvements. As standardizing address and last name improved matching sensitivity, we examined the combined effect of address and last name standardization, which showed that standardization improved sensitivity from 81.3% to 91.6% for the HIE dataset.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Conclusions<\/jats:title>\n                  <jats:p>Data standardization can improve match rates, thus ensuring that patients and clinicians have better data on which to make decisions to enhance care quality and safety.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/jamia\/ocy191","type":"journal-article","created":{"date-parts":[[2019,1,3]],"date-time":"2019-01-03T20:11:08Z","timestamp":1546546268000},"page":"447-456","source":"Crossref","is-referenced-by-count":40,"title":["Evaluating the effect of data standardization and validation on patient matching accuracy"],"prefix":"10.1093","volume":"26","author":[{"given":"Shaun J","family":"Grannis","sequence":"first","affiliation":[{"name":"Regenstrief Institute, Inc, Center for Biomedical Informatics, Indianapolis, Indiana, USA"},{"name":"School of Medicine, Department of Family Medicine, Indiana University, Indianapolis, Indiana, USA"}]},{"given":"Huiping","family":"Xu","sequence":"additional","affiliation":[{"name":"Regenstrief Institute, Inc, Center for Biomedical Informatics, Indianapolis, Indiana, USA"},{"name":"School of Medicine, Department of Biostatistics, Indiana University, Indianapolis, Indiana, USA"},{"name":"Richard M. Fairbanks School of Public Health, Department of Biostatistics, Indiana University, Indianapolis, Indiana, USA"}]},{"given":"Joshua R","family":"Vest","sequence":"additional","affiliation":[{"name":"Regenstrief Institute, Inc, Center for Biomedical Informatics, Indianapolis, Indiana, USA"},{"name":"Richard M. Fairbanks School of Public Health, Department of Health Policy and Management, Indiana University, Indianapolis, Indiana, USA"}]},{"given":"Suranga","family":"Kasthurirathne","sequence":"additional","affiliation":[{"name":"Regenstrief Institute, Inc, Center for Biomedical Informatics, Indianapolis, Indiana, USA"},{"name":"School of Informatics and Computing, Department of BioHealth Informatics, Indiana University, Indianapolis, Indiana, USA"}]},{"given":"Na","family":"Bo","sequence":"additional","affiliation":[{"name":"School of Medicine, Department of Biostatistics, Indiana University, Indianapolis, Indiana, USA"}]},{"given":"Ben","family":"Moscovitch","sequence":"additional","affiliation":[{"name":"The Pew Charitable Trusts, Washington DC, USA"}]},{"given":"Rita","family":"Torkzadeh","sequence":"additional","affiliation":[{"name":"The Pew Charitable Trusts, Washington DC, USA"}]},{"given":"Josh","family":"Rising","sequence":"additional","affiliation":[{"name":"The Pew Charitable Trusts, Washington DC, USA"}]}],"member":"286","published-online":{"date-parts":[[2019,3,8]]},"reference":[{"issue":"15","key":"2021010612301079900_ocy191-B1","doi-asserted-by":"crossref","first-page":"1325","DOI":"10.1001\/jama.280.15.1325","article-title":"Canopy computing using the web in clinical practice","volume":"280","author":"McDonald","year":"1998","journal-title":"JAMA"},{"key":"2021010612301079900_ocy191-B2","first-page":"409","article-title":"All health care is not local: an evaluation of the distribution of emergency department care delivered in Indiana","volume":"2011","author":"Finnell","year":"2011","journal-title":"AMIA Annu Symp Proc"},{"issue":"57","key":"2021010612301079900_ocy191-B3","doi-asserted-by":"crossref","first-page":"57cm29","DOI":"10.1126\/scitranslmed.3001456","article-title":"Achieving a nationwide learning health system","volume":"2","author":"Friedman","year":"2010","journal-title":"Sci Transl Med"},{"issue":"1","key":"2021010612301079900_ocy191-B4","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1097\/NUR.0b013e3182776dcb","article-title":"The emergence of a learning health care system","volume":"27","author":"Mason","year":"2013","journal-title":"Clin Nurse Spec"},{"key":"2021010612301079900_ocy191-B5","volume-title":"Identity Crisis: An Examination of the Costs and Benefits of a Unique Patient Identifier for the US Health Care System","author":"Hillestad","year":"2008"},{"key":"2021010612301079900_ocy191-B6","first-page":"259","article-title":"Analysis of a probabilistic record linkage technique without human review","volume":"2003","author":"Grannis","year":"2003","journal-title":"AMIA Annu Symp Proc"},{"key":"2021010612301079900_ocy191-B7","author":"Grannis"},{"key":"2021010612301079900_ocy191-B8","author":"Consistent Nationwide Patient Data Matching Strategy"},{"key":"2021010612301079900_ocy191-B9","author":"Marchibroda"},{"key":"2021010612301079900_ocy191-B10","author":"Health IT: Setting the Foundation to Transform Our Future"},{"key":"2021010612301079900_ocy191-B11","author":"Linking Health Care Information: Proposed Methods for Improving Care and Protecting Privacy"},{"key":"2021010612301079900_ocy191-B12","author":"Heflin"},{"key":"2021010612301079900_ocy191-B13","author":"Morris"},{"key":"2021010612301079900_ocy191-B14","author":"Tang"},{"key":"2021010612301079900_ocy191-B15","author":"Morris"},{"issue":"5","key":"2021010612301079900_ocy191-B16","doi-asserted-by":"crossref","first-page":"738","DOI":"10.1197\/jamia.M3186","article-title":"An empiric modification to the probabilistic record linkage algorithm using frequency-based weight scaling","volume":"16","author":"Zhu","year":"2009","journal-title":"J Am Med Inform Assoc"},{"issue":"24","key":"2021010612301079900_ocy191-B17","doi-asserted-by":"crossref","first-page":"4250","DOI":"10.1002\/sim.6230","article-title":"Evaluating latent class models with conditional dependence in record linkage","volume":"33","author":"Daggy","year":"2014","journal-title":"Statist Med"},{"key":"2021010612301079900_ocy191-B18","first-page":"1524","article-title":"A practical method for predicting frequent use of emergency department care using routinely available electronic registration data","volume":"2013","author":"Wu","year":"2013","journal-title":"AMIA Annu Symp Proc"},{"issue":"Pt 1","key":"2021010612301079900_ocy191-B19","first-page":"43","article-title":"Real world performance of approximate string comparators for use in patient matching","volume":"107","author":"Grannis","year":"2004","journal-title":"Stud Health Technol Inform"},{"issue":"3","key":"2021010612301079900_ocy191-B20","doi-asserted-by":"crossref","first-page":"500","DOI":"10.1002\/sim.5946","article-title":"Optimal two-phase sampling design for comparing accuracies of two binary classification rules","volume":"33","author":"Xu","year":"2014","journal-title":"Statist Med"},{"key":"2021010612301079900_ocy191-B21","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1186\/1472-6947-13-97","article-title":"A practical approach for incorporating dependence among fields in probabilistic record linkage","volume":"13","author":"Daggy","year":"2013","journal-title":"BMC Med Inform Decis Mak"},{"key":"2021010612301079900_ocy191-B22","first-page":"305","article-title":"Analysis of identifier performance using a deterministic linkage algorithm","author":"Grannis","year":"2002","journal-title":"Proc AMIA Symp"},{"issue":"1","key":"2021010612301079900_ocy191-B23","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1377\/hlthaff.2010.0935","article-title":"Driving population health through accountable care organizations","volume":"30","author":"Devore","year":"2011","journal-title":"Health Aff (Millwood)"},{"issue":"4","key":"2021010612301079900_ocy191-B24","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1108\/JHOM-01-2015-0003","article-title":"Using health information technology to manage a patient population in accountable care organizations","volume":"30","author":"Wu","year":"2016","journal-title":"J Health Org Mgt"},{"issue":"24","key":"2021010612301079900_ocy191-B25","doi-asserted-by":"crossref","first-page":"2357","DOI":"10.1056\/NEJMsa1600142","article-title":"Early performance of accountable care organizations in medicare","volume":"374","author":"McWilliams","year":"2016","journal-title":"N Engl J Med"},{"issue":"12","key":"2021010612301079900_ocy191-B26","doi-asserted-by":"crossref","first-page":"1166","DOI":"10.1002\/ppul.21509","article-title":"Factors accounting for a missed diagnosis of cystic fibrosis after newborn screening","volume":"46","author":"Rock","year":"2011","journal-title":"Pediatr Pulmonol"},{"issue":"10","key":"2021010612301079900_ocy191-B27","doi-asserted-by":"crossref","first-page":"994","DOI":"10.1001\/archpedi.161.10.994","article-title":"Long-term follow-up data collection and use in state newborn screening programs","volume":"161","author":"Hoff","year":"2007","journal-title":"Arch Pediatr Adolesc Med"},{"key":"2021010612301079900_ocy191-B28","first-page":"440","article-title":"Learning blocking schemes for record linkage","volume-title":"Proceedings of the 21st National Conference on Artificial Intelligence\u00a0\u2013 Volume 1 (AAAI\u201906)","author":"Michelson","year":"2006"},{"key":"2021010612301079900_ocy191-B29","author":"Council for Affordable Quality Health Care 2011"},{"key":"2021010612301079900_ocy191-B30","author":"Series E: Overall Network Operation Telephone Service, Service Operation and Human Factors"},{"key":"2021010612301079900_ocy191-B31","author":"High Group List and Other Ways to Determine if an SSN is Valid"},{"key":"2021010612301079900_ocy191-B32","author":"Mailing Standards of the United States Postal Service Publication 28 \u2013 Postal Addressing Standards"},{"issue":"9","key":"2021010612301079900_ocy191-B33","doi-asserted-by":"crossref","first-page":"1537","DOI":"10.1109\/TKDE.2011.127","article-title":"A survey of indexing techniques for scalable record linkage and deduplication","volume":"24","author":"Christen","year":"2011","journal-title":"IEEE Trans Knowl Data Eng"},{"issue":"6","key":"2021010612301079900_ocy191-B34","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1109\/79.543975","article-title":"The expectation-maximization algorithm","volume":"13","author":"Moon","year":"1996","journal-title":"IEEE Signal Process Mag"},{"issue":"1","key":"2021010612301079900_ocy191-B35","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1002\/1097-0142(1950)3:1<32::AID-CNCR2820030106>3.0.CO;2-3","article-title":"Index for rating diagnostic tests","volume":"3","author":"Youden","year":"1950","journal-title":"Cancer"},{"key":"2021010612301079900_ocy191-B36","doi-asserted-by":"crossref","first-page":"3762651","DOI":"10.1155\/2017\/3762651","article-title":"Defining an optimal cut-point value in roc analysis: an alternative approach","volume":"2017","author":"Unal","year":"2017","journal-title":"Comput Math Methods Med"},{"key":"2021010612301079900_ocy191-B37","doi-asserted-by":"crossref","first-page":"617946","DOI":"10.1155\/2009\/617946","article-title":"Regularized F-Measure Maximization for Feature\u00a0Selection and Classification","volume":"2009","author":"Liu","year":"2009","journal-title":"J Biomed Biotechnol"}],"container-title":["Journal of the American Medical Informatics Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/26\/5\/447\/34946365\/ocy191.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/26\/5\/447\/34946365\/ocy191.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,1,10]],"date-time":"2021-01-10T22:57:11Z","timestamp":1610319431000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/26\/5\/447\/5372371"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,3,8]]},"references-count":37,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2019,3,8]]},"published-print":{"date-parts":[[2019,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocy191","relation":{},"ISSN":["1527-974X"],"issn-type":[{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,5]]},"published":{"date-parts":[[2019,3,8]]}}}