{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,1]],"date-time":"2026-02-01T06:44:43Z","timestamp":1769928283889,"version":"3.49.0"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"6","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Objective<\/jats:title>\n                  <jats:p>PEDSnet is a clinical data research network (CDRN) that aggregates electronic health record data from multiple children\u2019s hospitals to enable large-scale research. Assessing data quality to ensure suitability for conducting research is a key requirement in PEDSnet. This study presents a range of data quality issues identified over a period of 18 months and interprets them to evaluate the research capacity of PEDSnet.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Materials and Methods<\/jats:title>\n                  <jats:p>Results were generated by a semiautomated data quality assessment workflow. Two investigators reviewed programmatic data quality issues and conducted discussions with the data partners\u2019 extract-transform-load analysts to determine the cause for each issue.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>The results include a longitudinal summary of 2182 data quality issues identified across 9 data submission cycles. 
The metadata from the most recent cycle includes annotations for 850 issues: most frequent types, including missing data (&gt;300) and outliers (&gt;100); most complex domains, including medications (&gt;160) and lab measurements (&gt;140); and primary causes, including source data characteristics (83%) and extract-transform-load errors (9%).<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Discussion<\/jats:title>\n                  <jats:p>The longitudinal findings demonstrate the network\u2019s evolution from identifying difficulties with aligning the data to a common data model to learning norms in clinical pediatrics and determining research capability.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Conclusion<\/jats:title>\n                  <jats:p>While data quality is recognized as a critical aspect in establishing and utilizing a CDRN, the findings from data quality assessments are largely unpublished. 
This paper presents a real-world account of studying and interpreting data quality findings in a pediatric CDRN, and the lessons learned could be used by other CDRNs.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/jamia\/ocx033","type":"journal-article","created":{"date-parts":[[2017,3,16]],"date-time":"2017-03-16T20:55:04Z","timestamp":1489697704000},"page":"1072-1079","source":"Crossref","is-referenced-by-count":59,"title":["A longitudinal analysis of data quality in a large pediatric data research network"],"prefix":"10.1093","volume":"24","author":[{"given":"Ritu","family":"Khare","sequence":"first","affiliation":[{"name":"Department of Biomedical and Health Informatics, Children\u2019s Hospital of Philadelphia, Philadelphia, PA, USA"},{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia"}]},{"given":"Levon","family":"Utidjian","sequence":"additional","affiliation":[{"name":"Department of Biomedical and Health Informatics, Children\u2019s Hospital of Philadelphia, Philadelphia, PA, USA"},{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia"}]},{"given":"Byron J","family":"Ruth","sequence":"additional","affiliation":[{"name":"Department of Biomedical and Health Informatics, Children\u2019s Hospital of Philadelphia, Philadelphia, PA, USA"}]},{"given":"Michael G","family":"Kahn","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, University of Colorado Denver Anschutz Medical Campus, Aurora, CO, USA"}]},{"given":"Evanette","family":"Burrows","sequence":"additional","affiliation":[{"name":"Department of Biomedical and Health Informatics, Children\u2019s Hospital of Philadelphia, Philadelphia, PA, USA"},{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia"}]},{"given":"Keith","family":"Marsolo","sequence":"additional","affiliation":[{"name":"University of Cincinnati Department of Pediatrics, Cincinnati Children\u2019s Hospital Medical Center, Cincinnati, OH, 
USA"}]},{"given":"Nandan","family":"Patibandla","sequence":"additional","affiliation":[{"name":"Information Services Department, Children\u2019s Hospital Boston, Boston, MA, USA"}]},{"given":"Hanieh","family":"Razzaghi","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia"}]},{"given":"Ryan","family":"Colvin","sequence":"additional","affiliation":[{"name":"Department of Pediatrics, Washington University in St. Louis, St. Louis, MO, USA"}]},{"given":"Daksha","family":"Ranade","sequence":"additional","affiliation":[{"name":"Research Informatics, Seattle Children\u2019s Research Institute, Seattle, WA, USA"}]},{"given":"Melody","family":"Kitzmiller","sequence":"additional","affiliation":[{"name":"Research Information Solutions and Innovation, Nationwide Children\u2019s Hospital, Columbus, OH, USA"}]},{"given":"Daniel","family":"Eckrich","sequence":"additional","affiliation":[{"name":"Center for Pediatric Auditory and Speech Sciences, Nemours Biomedical Research, Wilmington, DE, USA"}]},{"given":"L Charles","family":"Bailey","sequence":"additional","affiliation":[{"name":"Department of Biomedical and Health Informatics, Children\u2019s Hospital of Philadelphia, Philadelphia, PA, USA"},{"name":"Department of Pediatrics, Children\u2019s Hospital of Philadelphia"},{"name":"Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA"}]}],"member":"286","published-online":{"date-parts":[[2017,4,8]]},"reference":[{"issue":"4","key":"2020110612445313400_ocx033-B1","doi-asserted-by":"crossref","first-page":"576","DOI":"10.1136\/amiajnl-2014-002864","article-title":"PCORnet: turning a dream into reality","volume":"21","author":"Collins","year":"2014","journal-title":"J Am Med Inform Assoc."},{"issue":"6","key":"2020110612445313400_ocx033-B2","doi-asserted-by":"crossref","first-page":"e66192","DOI":"10.1371\/journal.pone.0066192","article-title":"Multi-institutional sharing 
of electronic health record data to assess childhood obesity","volume":"8","author":"Bailey","year":"2013","journal-title":"PLoS One."},{"issue":"8 Suppl 3","key":"2020110612445313400_ocx033-B3","doi-asserted-by":"crossref","first-page":"S22","DOI":"10.1097\/MLR.0b013e31829b1e2c","article-title":"Data quality assessment for comparative effectiveness research in distributed data networks","volume":"51","author":"Brown","year":"2013","journal-title":"Med Care."},{"key":"2020110612445313400_ocx033-B4","doi-asserted-by":"crossref","first-page":"S21","DOI":"10.1097\/MLR.0b013e318257dd67","article-title":"A pragmatic framework for single-site and multisite data quality assessment in electronic health record\u2013based clinical research","volume":"50","author":"Kahn","year":"2012","journal-title":"Med Care."},{"issue":"1","key":"2020110612445313400_ocx033-B5","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1136\/amiajnl-2011-000681","article-title":"Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research","volume":"20","author":"Weiskopf","year":"2013","journal-title":"J Am Med Inform Assoc."},{"issue":"8 Suppl 3","key":"2020110612445313400_ocx033-B6","doi-asserted-by":"crossref","first-page":"S30","DOI":"10.1097\/MLR.0b013e31829b1dbd","article-title":"Caveats for the use of operational electronic health record data in comparative effectiveness research","volume":"51","author":"Hersh","year":"2013","journal-title":"Med Care."},{"issue":"1","key":"2020110612445313400_ocx033-B7","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1093\/fampra\/cmp068","article-title":"Using your electronic medical record for research: a primer for avoiding pitfalls","volume":"27","author":"Terry","year":"2010","journal-title":"Fam Pract."},{"issue":"6","key":"2020110612445313400_ocx033-B8","doi-asserted-by":"crossref","first-page":"600","DOI":"10.1197\/jamia.M1087","article-title":"Defining and improving data quality 
in medical registries: a literature review, case study, and generic framework","volume":"9","author":"Arts","year":"2002","journal-title":"J Am Med Inform Assoc."},{"key":"2020110612445313400_ocx033-B9","first-page":"86","article-title":"A comprehensive framework for data quality assessment in CER","volume":"2013","author":"Holve","year":"2013","journal-title":"AMIA Jt Summits Transl Sci Proc."},{"issue":"7","key":"2020110612445313400_ocx033-B10","doi-asserted-by":"crossref","first-page":"1171","DOI":"10.1377\/hlthaff.2014.0127","article-title":"PEDSnet: how a prototype pediatric learning health system is being expanded into a national network","volume":"33","author":"Forrest","year":"2014","journal-title":"Health Aff (Millwood)."},{"issue":"4","key":"2020110612445313400_ocx033-B11","doi-asserted-by":"crossref","first-page":"602","DOI":"10.1136\/amiajnl-2014-002743","article-title":"PEDSnet: a National Pediatric Learning Health System","volume":"21","author":"Forrest","year":"2014","journal-title":"J Am Med Inform Assoc."},{"issue":"8 Suppl 3","key":"2020110612445313400_ocx033-B12","doi-asserted-by":"crossref","first-page":"S80","DOI":"10.1097\/MLR.0b013e31829b1d48","article-title":"Challenges in using electronic health record data for CER: experience of 4 learning organizations and solutions applied","volume":"51","author":"Bayley","year":"2013","journal-title":"Med Care."},{"issue":"1","key":"2020110612445313400_ocx033-B13","doi-asserted-by":"crossref","DOI":"10.13063\/2327-9214.1052","article-title":"Transparent reporting of data quality in distributed data networks","volume":"3","author":"Kahn","year":"2015","journal-title":"eGEMs."},{"key":"2020110612445313400_ocx033-B14","article-title":"Identifying and understanding data quality issues in a pediatric distributed research network","volume-title":"American Medical Informatics Association Annual Symposium","author":"Khare"},{"key":"2020110612445313400_ocx033-B15","unstructured":"Center PDC. 
ETL Conventions for use with PEDSnet CDM v2.2 OMOP V5. 2015. https:\/\/pedsnet.org\/documents\/18\/ETL_Conventions_for_use_with_PEDSnet_CDM_v2_2_OMOP_V5.pdf. Accessed October 15, 2016."},{"key":"2020110612445313400_ocx033-B16","volume-title":"OMOP Common Data Model","author":"Observational Medical Outcomes Partnership"},{"key":"2020110612445313400_ocx033-B17","article-title":"Establishing Interoperability Standards between OMOP CDM v4, v5, and PCORnet CDM","volume-title":"OHDSI Symposium 2015","author":"Belenkaya"},{"issue":"1","key":"2020110612445313400_ocx033-B18","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1145\/320434.320440","article-title":"The entity-relationship model: toward a unified view of data","volume":"1","author":"Chen","year":"1976","journal-title":"ACM Transactions on Database Systems (TODS) Special Issue: Papers from the International Conference on Very Large Data Bases"},{"key":"2020110612445313400_ocx033-B19","article-title":"Promoting data quality in a clinical data research network using GitHub","volume-title":"AMIA Joint Summit on Clinical Research Informatics","author":"Browne"},{"key":"2020110612445313400_ocx033-B20","unstructured":"Bedside IfIBat. The i2b2 Data Model. https:\/\/www.i2b2.org\/about\/intro.html. 
Accessed October 15, 2016."},{"issue":"1","key":"2020110612445313400_ocx033-B21","doi-asserted-by":"crossref","DOI":"10.13063\/2327-9214.1244","article-title":"A harmonized data quality assessment terminology and framework for the secondary use of electronic health record data","volume":"4","author":"Kahn","year":"2016","journal-title":"eGEMs (Generating Evidence and Methods to improve patient outcomes)"},{"key":"2020110612445313400_ocx033-B22","article-title":"To err is human","volume-title":"Proceedings of the First Workshop on Evaluating and Architecting System Dependability (EASY\u201901)","author":"Brown"},{"key":"2020110612445313400_ocx033-B23","article-title":"Understanding the EMR error control practices among gynecologic physicians","volume-title":"iConference 2013","author":"Khare","year":"2013"},{"issue":"5","key":"2020110612445313400_ocx033-B24","doi-asserted-by":"crossref","first-page":"477","DOI":"10.1016\/S0197-2456(98)00033-6","article-title":"Guidelines for quality assurance in multicenter trials: a position paper","volume":"19","author":"Knatterud","year":"1998","journal-title":"Control Clin Trials."},{"key":"2020110612445313400_ocx033-B25","first-page":"1","article-title":"Secondary use of EHR: data quality issues and informatics opportunities","volume":"2010","author":"Botsis","year":"2010","journal-title":"AMIA Jt Summits Transl Sci Proc."},{"issue":"Suppl 10","key":"2020110612445313400_ocx033-B26","doi-asserted-by":"crossref","first-page":"K2","DOI":"10.1016\/j.vaccine.2013.06.048","article-title":"Methods for systematic reviews of administrative database studies capturing health outcomes of interest","volume":"31","author":"McPheeters","year":"2013","journal-title":"Vaccine."},{"key":"2020110612445313400_ocx033-B27","doi-asserted-by":"crossref","DOI":"10.13063\/2327-9214.1239","article-title":"Multi-site evaluation of a data quality tool for patient-level clinical 
datasets","author":"Huser","year":"2016","journal-title":"eGEMs."},{"key":"2020110612445313400_ocx033-B28","article-title":"Understanding the gaps between data quality checks and research capabilities in a pediatric data research network","volume-title":"AMIA Jt Summits Trans Sci 2017","author":"Khare","year":"2017"},{"key":"2020110612445313400_ocx033-B29","article-title":"PEDSnet: from building a high-quality CDRN to conducting science","volume-title":"AMIA Ann Symp 2016","author":"Bailey","year":"2016"},{"key":"2020110612445313400_ocx033-B30","doi-asserted-by":"crossref","first-page":"448","DOI":"10.1016\/j.jbi.2014.08.004","article-title":"LabeledIn: cataloging labeled indications for human drugs","volume":"52","author":"Khare","year":"2014","journal-title":"J Biomed Inform."},{"key":"2020110612445313400_ocx033-B31","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1080\/07421222.1996.11518099","article-title":"Beyond accuracy: what data quality means to data consumers","volume":"12","author":"Wang","year":"1996","journal-title":"J Manag Inform Syst."}],"container-title":["Journal of the American Medical Informatics 
Association"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/24\/6\/1072\/34149371\/ocx033.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/jamia\/article-pdf\/24\/6\/1072\/34149371\/ocx033.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,11,6]],"date-time":"2020-11-06T18:25:05Z","timestamp":1604687105000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/jamia\/article\/24\/6\/1072\/3238563"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,4,8]]},"references-count":31,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2017,4,8]]},"published-print":{"date-parts":[[2017,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/jamia\/ocx033","relation":{},"ISSN":["1067-5027","1527-974X"],"issn-type":[{"value":"1067-5027","type":"print"},{"value":"1527-974X","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2017,11]]},"published":{"date-parts":[[2017,4,8]]}}}