{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,17]],"date-time":"2026-04-17T07:45:38Z","timestamp":1776411938242,"version":"3.51.2"},"reference-count":35,"publisher":"Georg Thieme Verlag KG","issue":"04","license":[{"start":{"date-parts":[[2021,8,25]],"date-time":"2021-08-25T00:00:00Z","timestamp":1629849600000},"content-version":"vor","delay-in-days":24,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"funder":[{"name":"German Federal Ministry of Education and Research","award":["01ZZ1801A"],"award-info":[{"award-number":["01ZZ1801A"]}]},{"name":"German Federal Ministry of Education and Research","award":["01ZZ1801C"],"award-info":[{"award-number":["01ZZ1801C"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Appl Clin Inform"],"published-print":{"date-parts":[[2021,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>\n          Background\u2003Many research initiatives aim at using data from electronic health records (EHRs) in observational studies. Participating sites of the German Medical Informatics Initiative (MII) established data integration centers to integrate EHR data within research data repositories to support local and federated analyses. To address concerns regarding possible data quality (DQ) issues of hospital routine data compared with data specifically collected for scientific purposes, we have previously presented a data quality assessment (DQA) tool providing a standardized approach to assess DQ of the research data repositories at the MIRACUM consortium's partner sites.<\/jats:p><jats:p>\n          Objectives\u2003Major limitations of the former approach included manual interpretation of the results and hard coding of analyses, making their expansion to new data elements and databases time-consuming and error prone. We here present an enhanced version of the DQA tool by linking it to common data element definitions stored in a metadata repository (MDR), adopting the harmonized DQA framework from Kahn et al and its application within the MIRACUM consortium.<\/jats:p><jats:p>\n          Methods\u2003Data quality checks were consequently aligned to a harmonized DQA terminology. Database-specific information were systematically identified and represented in an MDR. Furthermore, a structured representation of logical relations between data elements was developed to model plausibility-statements in the MDR.<\/jats:p><jats:p>\n          Results\u2003The MIRACUM DQA tool was linked to data element definitions stored in a consortium-wide MDR. Additional databases used within MIRACUM were linked to the DQ checks by extending the respective data elements in the MDR with the required information. The evaluation of DQ checks was automated. An adaptable software implementation is provided with the R package DQAstats.<\/jats:p><jats:p>\n          Conclusion\u2003The enhancements of the DQA tool facilitate the future integration of new data elements and make the tool scalable to other databases and data models. It has been provided to all ten MIRACUM partners and was successfully deployed and integrated into their respective data integration center infrastructure.<\/jats:p>","DOI":"10.1055\/s-0041-1733847","type":"journal-article","created":{"date-parts":[[2021,8,25]],"date-time":"2021-08-25T23:03:42Z","timestamp":1629932622000},"page":"826-835","source":"Crossref","is-referenced-by-count":30,"title":["Linking a Consortium-Wide Data Quality Assessment Tool with the MIRACUM Metadata Repository"],"prefix":"10.1055","volume":"12","author":[{"given":"Lorenz A.","family":"Kapsner","sequence":"additional","affiliation":[{"name":"Medical Center for Information and Communication Technology, Universit\u00e4tsklinikum Erlangen, Erlangen, Germany"},{"name":"Department of Radiology, Universit\u00e4tsklinikum Erlangen, Friedrich-Alexander-University Erlangen-N\u00fcrnberg, Erlangen, Germany"}]},{"given":"Jonathan M.","family":"Mang","sequence":"additional","affiliation":[{"name":"Medical Center for Information and Communication Technology, Universit\u00e4tsklinikum Erlangen, Erlangen, Germany"}]},{"given":"Sebastian","family":"Mate","sequence":"additional","affiliation":[{"name":"Medical Center for Information and Communication Technology, Universit\u00e4tsklinikum Erlangen, Erlangen, Germany"}]},{"given":"Susanne A.","family":"Seuchter","sequence":"additional","affiliation":[{"name":"Medical Center for Information and Communication Technology, Universit\u00e4tsklinikum Erlangen, Erlangen, Germany"}]},{"given":"Abishaa","family":"Vengadeswaran","sequence":"additional","affiliation":[{"name":"Medical Informatics Group (MIG), Goethe University Frankfurt, University Hospital Frankfurt, Frankfurt am Main, Germany"}]},{"given":"Franziska","family":"Bathelt","sequence":"additional","affiliation":[{"name":"Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, Technical University Dresden, Dresden, Germany"}]},{"given":"Noemi","family":"Deppenwiese","sequence":"additional","affiliation":[{"name":"Medical Center for Information and Communication Technology, Universit\u00e4tsklinikum Erlangen, Erlangen, Germany"}]},{"given":"Dennis","family":"Kadioglu","sequence":"additional","affiliation":[{"name":"Medical Informatics Group (MIG), Goethe University Frankfurt, University Hospital Frankfurt, Frankfurt am Main, Germany"},{"name":"Data Integration Center, University Hospital Frankfurt, Frankfurt am Main, Germany"}]},{"given":"Detlef","family":"Kraska","sequence":"additional","affiliation":[{"name":"Medical Center for Information and Communication Technology, Universit\u00e4tsklinikum Erlangen, Erlangen, Germany"}]},{"given":"Hans-Ulrich","family":"Prokosch","sequence":"additional","affiliation":[{"name":"Medical Center for Information and Communication Technology, Universit\u00e4tsklinikum Erlangen, Erlangen, Germany"},{"name":"Department of Medical Informatics, Friedrich-Alexander-University Erlangen-N\u00fcrnberg (FAU), Erlangen, Germany"}]}],"member":"194","published-online":{"date-parts":[[2021,8,25]]},"reference":[{"issue":"04","key":"ref1","doi-asserted-by":"crossref","first-page":"416","DOI":"10.1136\/amiajnl-2010-000032","article-title":"Enabling collaborative research using the Biomedical Informatics Research Network (BIRN)","volume":"18","author":"K G Helmer","year":"2011","journal-title":"J Am Med Inform Assoc"},{"key":"ref2","doi-asserted-by":"crossref","first-page":"S7","DOI":"10.1097\/MLR.0b013e318257a66b","article-title":"The Electronic Data Methods (EDM) forum for comparative effectiveness research (CER)","volume":"50","author":"E Holve","year":"2012","journal-title":"Med Care"},{"issue":"S 03","key":"ref3","doi-asserted-by":"crossref","first-page":"e55811","DOI":"10.1371\/journal.pone.0055811","article-title":"SHRINE: enabling nationally scalable multi-site disease studies","volume":"8","author":"A J McMurry","year":"2013","journal-title":"PLoS ONE"},{"issue":"216","key":"ref4","first-page":"574","article-title":"Observational Health Data Sciences and Informatics (OHDSI): opportunities for observational researchers","volume":"216","author":"G Hripcsak","year":"2015","journal-title":"Stud Health Technol Inform"},{"key":"ref5","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1055\/s-0039-1693685","article-title":"A generic method and implementation to evaluate and improve data quality in distributed research networks","volume":"58","author":"D Ju\u00e1rez","year":"2019","journal-title":"Methods Inf Med"},{"issue":"01","key":"ref6","first-page":"e50","article-title":"German medical informatics initiative: a national approach to integrating health data from patient care and medical research","volume":"57","author":"S Semler","year":"2018","journal-title":"Methods Inf Med"},{"issue":"01","key":"ref7","first-page":"1244","article-title":"A harmonized data quality assessment terminology and framework for the secondary use of electronic health record data","volume":"4","author":"M G Kahn","year":"2016","journal-title":"EGEMS (Wash DC)"},{"issue":"01","key":"ref8","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1136\/jamia.2000.0070106","article-title":"Assessing data quality: from concordance, through correctness and completeness, to valid manipulatable representations","volume":"7","author":"P F Brennan","year":"2000","journal-title":"J Am Med Inform Assoc"},{"key":"ref10","doi-asserted-by":"crossref","first-page":"S30","DOI":"10.1097\/MLR.0b013e31829b1dbd","article-title":"Caveats for the use of operational electronic health record data in comparative effectiveness research","volume":"51","author":"W R Hersh","year":"2013","journal-title":"Med Care"},{"key":"ref11","author":"International Organization of Standardization (ISO)","year":"2013"},{"issue":"01","key":"ref12","first-page":"82","article-title":"MIRACUM: Medical Informatics in Research and Care in University Medicine: a large data sharing network to enhance translational research and medical care","volume":"57","author":"H-U Prokosch","year":"2018","journal-title":"Methods Inf Med"},{"key":"ref13","first-page":"50","article-title":"Samply.MDR\u2014a metadata repository and its application in various research networks","volume":"253","author":"D Kadioglu","year":"2018","journal-title":"Stud Health Technol Inform"},{"key":"ref15","first-page":"247","article-title":"Moving towards an EHR data quality framework: the MIRACUM approach","volume":"267","author":"L A Kapsner","year":"2019","journal-title":"Stud Health Technol Inform"},{"issue":"02","key":"ref16","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1007\/s00062-017-0656-y","article-title":"Regional differences in thrombectomy rates : secondary use of billing codes in the MIRACUM (Medical Informatics for Research and Care in University Medicine) Consortium","volume":"28","author":"C Haverkamp","year":"2018","journal-title":"Clin Neuroradiol"},{"key":"ref18","doi-asserted-by":"crossref","DOI":"10.1201\/9781138359444","volume-title":"R Markdown: The Definitive Guide","author":"Y Xie","year":"2018"},{"key":"ref19","volume-title":"Datenqualit\u00e4t in der medizinischen Forschung: Leitlinie zum adaptiven Management von Datenqualit\u00e4t in Kohortenstudien und Registern","author":"D Nasseh","year":"2014"},{"issue":"05","key":"ref21","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1145\/253769.253804","article-title":"Data quality in context","volume":"40","author":"D M Strong","year":"1997","journal-title":"Commun ACM"},{"issue":"01","key":"ref22","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1136\/amiajnl-2011-000681","article-title":"Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research","volume":"20","author":"N G Weiskopf","year":"2013","journal-title":"J Am Med Inform Assoc"},{"issue":"05","key":"ref23","doi-asserted-by":"crossref","first-page":"830","DOI":"10.1016\/j.jbi.2013.06.010","article-title":"Defining and measuring completeness of electronic health records for secondary use","volume":"46","author":"N G Weiskopf","year":"2013","journal-title":"J Biomed Inform"},{"issue":"06","key":"ref24","doi-asserted-by":"crossref","first-page":"1072","DOI":"10.1093\/jamia\/ocx033","article-title":"A longitudinal analysis of data quality in a large pediatric data research network","volume":"24","author":"R Khare","year":"2017","journal-title":"J Am Med Inform Assoc"},{"issue":"01","key":"ref26","first-page":"8","article-title":"A comparison of data quality assessment checks in six data sharing networks","volume":"5","author":"T J Callahan","year":"2017","journal-title":"EGEMS (Wash DC)"},{"issue":"01","key":"ref27","first-page":"3","article-title":"Evaluating foundational data quality in the national Patient-Centered Clinical Research Network (PCORnet\u00ae)","volume":"6","author":"L G Qualls","year":"2018","journal-title":"EGEMS (Wash DC)"},{"issue":"05","key":"ref29","doi-asserted-by":"crossref","first-page":"794","DOI":"10.1055\/s-0039-1697598","article-title":"Incrementally transforming electronic medical records into the observational medical outcomes partnership common data model: a multidimensional quality assurance approach","volume":"10","author":"K E Lynch","year":"2019","journal-title":"Appl Clin Inform"},{"issue":"04","key":"ref30","doi-asserted-by":"crossref","first-page":"622","DOI":"10.1055\/s-0040-1715567","article-title":"A rule-based data quality assessment system for electronic health record data","volume":"11","author":"Z Wang","year":"2020","journal-title":"Appl Clin Inform"},{"key":"ref31","first-page":"ocaa340","article-title":"Quality assessment of real-world data repositories across the data life cycle: A literature review","author":"S-T Liaw","year":"2021","journal-title":"J Am Med Inform Assoc"},{"key":"ref32","volume-title":"R: A Language and Environment for Statistical Computing","author":"R Core Team","year":"2020"},{"key":"ref33","volume-title":"Knitr: A Comprehensive Tool for Reproducible Research in r","author":"Y Xie","year":"2014"},{"key":"ref34","volume-title":"Dynamic Documents with R and Knitr","author":"Y Xie","year":"2015","edition":"2nd ed."},{"key":"ref36","first-page":"2","article-title":"Docker: lightweight Linux containers for consistent development and deployment","volume":"239","author":"D Merkel","year":"2014","journal-title":"Linux J"},{"issue":"04","key":"ref37","doi-asserted-by":"crossref","first-page":"e25645","DOI":"10.2196\/25645","article-title":"A framework for criteria-based selection and processing of fast healthcare interoperability resources (FHIR) data for statistical analysis: design and implementation study","volume":"9","author":"J Gruendner","year":"2021","journal-title":"JMIR Med Inform"},{"issue":"01","key":"ref40","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1055\/s-0037-1617452","article-title":"Towards Implementation of OMOP in a German University Hospital Consortium","volume":"9","author":"C Maier","year":"2018","journal-title":"Appl Clin Inform"},{"issue":"01","key":"ref42","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1186\/s12874-021-01252-7","article-title":"Facilitating harmonized data quality assessments. A data quality framework for observational health research data collections with software implementations in R","volume":"21","author":"C O Schmidt","year":"2021","journal-title":"BMC Med Res Methodol"},{"issue":"03","key":"ref44","doi-asserted-by":"crossref","first-page":"373","DOI":"10.1007\/s00103-015-2299-y","article-title":"Strategien zur Vernetzung von Biobanken. Klassifizierung verschiedener Ans\u00e4tze zur Probensuche und Ausblick auf die Zukunft in der BBMRI-ERIC","volume":"59","author":"M Lablans","year":"2016","journal-title":"Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz"},{"key":"ref46","doi-asserted-by":"crossref","first-page":"676","DOI":"10.1016\/j.procs.2019.09.223","article-title":"Data quality in ETL process: a preliminary study","volume":"159","author":"M Souibgui","year":"2019","journal-title":"Procedia Comput Sci"},{"key":"ref47","volume-title":"Juran's Quality Handbook","author":"J M Juran","year":"1999","edition":"5th ed."}],"container-title":["Applied Clinical Informatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/www.thieme-connect.de\/products\/ejournals\/pdf\/10.1055\/s-0041-1733847.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,16]],"date-time":"2021-09-16T00:21:00Z","timestamp":1631751660000},"score":1,"resource":{"primary":{"URL":"http:\/\/www.thieme-connect.de\/DOI\/DOI?10.1055\/s-0041-1733847"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8]]},"references-count":35,"journal-issue":{"issue":"04","published-online":{"date-parts":[[2021,8,4]]},"published-print":{"date-parts":[[2021,8]]}},"URL":"https:\/\/doi.org\/10.1055\/s-0041-1733847","archive":["Portico","CLOCKSS"],"relation":{},"ISSN":["1869-0327"],"issn-type":[{"value":"1869-0327","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8]]}}}