{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:20:05Z","timestamp":1772166005393,"version":"3.50.1"},"reference-count":66,"publisher":"Springer Science and Business Media LLC","issue":"S4","license":[{"start":{"date-parts":[[2020,12,1]],"date-time":"2020-12-01T00:00:00Z","timestamp":1606780800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2020,12,14]],"date-time":"2020-12-14T00:00:00Z","timestamp":1607904000000},"content-version":"vor","delay-in-days":13,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["UL1TR001427"],"award-info":[{"award-number":["UL1TR001427"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"crossref","award":["R01CA246418"],"award-info":[{"award-number":["R01CA246418"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100006093","name":"Patient-Centered Outcomes Research Institute","doi-asserted-by":"publisher","award":["ME-2018C3-14754"],"award-info":[{"award-number":["ME-2018C3-14754"]}],"id":[{"id":"10.13039\/100006093","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000009","name":"Foundation for the National Institutes of Health","doi-asserted-by":"publisher","award":["R21CA245858"],"award-info":[{"award-number":["R21CA245858"]}],"id":[{"id":"10.13039\/100000009","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Med Inform Decis Mak"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>To reduce cancer mortality and improve cancer outcomes, it is critical to understand the various cancer risk factors (RFs) across different domains (e.g., genetic, environmental, and behavioral risk factors) and levels (e.g., individual, interpersonal, and community levels). However, prior research on RFs of cancer outcomes, has primarily focused on individual level RFs due to the lack of integrated datasets that contain multi-level, multi-domain RFs. Further, the lack of a consensus and proper guidance on systematically identify RFs also increase the difficulty of RF selection from heterogenous data sources in a multi-level integrative data analysis (mIDA) study. More importantly, as mIDA studies require integrating heterogenous data sources, the data integration processes in the limited number of existing mIDA studies are inconsistently performed and poorly documented, and thus threatening transparency and reproducibility.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>Informed by the National Institute on Minority Health and Health Disparities (NIMHD) research framework, we (1) reviewed existing reporting guidelines from the Enhancing the QUAlity and Transparency Of health Research (EQUATOR) network and (2) developed a theory-driven reporting guideline to guide the RF variable selection, data source selection, and data integration process. Then, we developed an ontology to standardize the documentation of the RF selection and data integration process in mIDA studies.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We summarized the review results and created a reporting guideline\u2014ATTEST\u2014for reporting the variable selection and data source selection and integration process. We provided an ATTEST check list to help researchers to annotate and clearly document each step of their mIDA studies to ensure the transparency and reproducibility. We used the ATTEST to report two mIDA case studies and further transformed annotation results into sematic triples, so that the relationships among variables, data sources and integration processes are explicitly standardized and modeled using the classes and properties from OD-ATTEST.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusion<\/jats:title>\n                    <jats:p>Our ontology-based reporting guideline solves some key challenges in current mIDA studies for cancer outcomes research, through providing (1) a theory-driven guidance for multi-level and multi-domain RF variable and data source selection; and (2) a standardized documentation of the data selection and integration processes powered by an ontology, thus a way to enable sharing of mIDA study reports among researchers.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12911-020-01270-3","type":"journal-article","created":{"date-parts":[[2020,12,14]],"date-time":"2020-12-14T03:02:54Z","timestamp":1607914974000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["An ontology-based documentation of data discovery and integration process in cancer outcomes research"],"prefix":"10.1186","volume":"20","author":[{"given":"Hansi","family":"Zhang","sequence":"first","affiliation":[]},{"given":"Yi","family":"Guo","sequence":"additional","affiliation":[]},{"given":"Mattia","family":"Prosperi","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2238-5429","authenticated-orcid":false,"given":"Jiang","family":"Bian","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,12,14]]},"reference":[{"key":"1270_CR1","unstructured":"World Health Organization. Cancer - key facts. 2018. https:\/\/www.who.int\/news-room\/fact-sheets\/detail\/cancer. Accessed 2 Jan 2020."},{"key":"1270_CR2","volume-title":"Cancer Facts & Figures 2019","author":"Atlanta: American Cancer Society","year":"2019","unstructured":"Atlanta: American Cancer Society. Cancer Facts & Figures 2019. 2019. https:\/\/www.cancer.org\/research\/cancer-facts-statistics\/all-cancer-facts-figures\/cancer-facts-figures-2019.html. Accessed 2 Jan 2020."},{"key":"1270_CR3","doi-asserted-by":"publisher","first-page":"7","DOI":"10.3322\/caac.21551","volume":"69","author":"RL Siegel","year":"2019","unstructured":"Siegel RL, Miller KD, Jemal A. Cancer statistics, 2019. CA Cancer J Clin. 2019;69:7\u201334.","journal-title":"CA Cancer J Clin"},{"key":"1270_CR4","doi-asserted-by":"publisher","first-page":"4255","DOI":"10.1200\/JCO.2009.25.7816","volume":"28","author":"ZK Stadler","year":"2010","unstructured":"Stadler ZK, Thom P, Robson ME, Weitzel JN, Kauff ND, Hurley KE, et al. Genome-wide association studies of Cancer. J Clin Oncol. 2010;28:4255\u201367.","journal-title":"J Clin Oncol"},{"key":"1270_CR5","doi-asserted-by":"publisher","first-page":"363","DOI":"10.1158\/1055-9965.EPI-16-0794","volume":"27","author":"Y Boss\u00e9","year":"2018","unstructured":"Boss\u00e9 Y, Amos CI. A decade of GWAS results in lung Cancer. Cancer Epidemiol Biomark Prev. 2018;27:363\u201379.","journal-title":"Cancer Epidemiol Biomark Prev"},{"key":"1270_CR6","doi-asserted-by":"publisher","first-page":"e17695","DOI":"10.2196\/17695","volume":"22","author":"S Chen","year":"2020","unstructured":"Chen S, Wu S. Identifying lung Cancer risk factors in the elderly using deep neural networks: quantitative analysis of web-based survey data. J Med Internet Res. 2020;22:e17695.","journal-title":"J Med Internet Res"},{"key":"1270_CR7","doi-asserted-by":"publisher","first-page":"1311","DOI":"10.1007\/s00521-013-1359-1","volume":"24","author":"C-J Tseng","year":"2014","unstructured":"Tseng C-J, Lu C-J, Chang C-C, Chen G-D. Application of machine learning to predict the recurrence-proneness for cervical cancer. Neural Comput Appl. 2014;24:1311\u20136.","journal-title":"Neural Comput Appl"},{"key":"1270_CR8","unstructured":"National Cancer Institute. Cancer Risk Factors. https:\/\/training.seer.cancer.gov\/disease\/cancer\/risk.html. Accessed 2 Jan 2020."},{"key":"1270_CR9","doi-asserted-by":"publisher","first-page":"2100","DOI":"10.1007\/s11606-018-4648-7","volume":"33","author":"AS Andrew","year":"2018","unstructured":"Andrew AS, Parker S, Anderson JC, Rees JR, Robinson C, Riddle B, et al. Risk factors for diagnosis of colorectal Cancer at a late stage: a population-based study. J Gen Intern Med. 2018;33:2100\u20135.","journal-title":"J Gen Intern Med"},{"key":"1270_CR10","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1007\/s40615-016-0219-y","volume":"4","author":"LR Mobley","year":"2017","unstructured":"Mobley LR, Kuo T-M. Demographic disparities in late-stage diagnosis of breast and colorectal cancers across the USA. J Racial Ethn Health Disparities. 2017;4:201\u201312.","journal-title":"J Racial Ethn Health Disparities"},{"key":"1270_CR11","doi-asserted-by":"publisher","first-page":"317","DOI":"10.1080\/03630242.2012.674091","volume":"52","author":"TW Markossian","year":"2012","unstructured":"Markossian TW, Hines RB. Disparities in late stage diagnosis, treatment, and breast cancer-related death by race, age, and rural residence among women in Georgia. Women Health. 2012;52:317\u201335.","journal-title":"Women Health"},{"key":"1270_CR12","doi-asserted-by":"publisher","first-page":"170","DOI":"10.2105\/AJPH.2011.300550","volume":"103","author":"NA Chatterjee","year":"2013","unstructured":"Chatterjee NA, He Y, Keating NL. Racial differences in breast cancer stage at diagnosis in the mammography era. Am J Public Health. 2013;103:170\u20136.","journal-title":"Am J Public Health"},{"key":"1270_CR13","doi-asserted-by":"publisher","first-page":"1985","DOI":"10.1007\/s10552-013-0274-1","volume":"24","author":"JR Montealegre","year":"2013","unstructured":"Montealegre JR, Zhou R, Amirian ES, Follen M, Scheurer ME. Nativity disparities in late-stage diagnosis and cause-specific survival among Hispanic women with invasive cervical cancer: an analysis of surveillance, epidemiology, and end results data. Cancer Causes Control. 2013;24:1985\u201394.","journal-title":"Cancer Causes Control"},{"key":"1270_CR14","doi-asserted-by":"publisher","first-page":"480","DOI":"10.1016\/S0027-9684(15)31294-3","volume":"100","author":"CR Baquet","year":"2008","unstructured":"Baquet CR, Mishra SI, Commiskey P, Ellison GL, DeShields M. Breast cancer epidemiology in blacks and whites: disparities in incidence, mortality, survival rates and histology. J Natl Med Assoc. 2008;100:480\u20138.","journal-title":"J Natl Med Assoc"},{"key":"1270_CR15","doi-asserted-by":"publisher","first-page":"3252","DOI":"10.1002\/cncr.25857","volume":"117","author":"S Yasmeen","year":"2011","unstructured":"Yasmeen S, Xing G, Morris C, Chlebowski RT, Romano PS. Comorbidities and mammography use interact to explain racial\/ethnic disparities in breast cancer stage at diagnosis. Cancer. 2011;117:3252\u201361.","journal-title":"Cancer."},{"key":"1270_CR16","doi-asserted-by":"publisher","first-page":"3024","DOI":"10.1158\/1055-9965.EPI-09-0390","volume":"18","author":"SE Echeverr\u00eda","year":"2009","unstructured":"Echeverr\u00eda SE, Borrell LN, Brown D, Rhoads G. A local area analysis of racial, ethnic, and neighborhood disparities in breast cancer staging. Cancer Epidemiol Biomark Prev Publ Am Assoc Cancer Res Cosponsored Am Soc Prev Oncol. 2009;18:3024\u20139.","journal-title":"Cancer Epidemiol Biomark Prev Publ Am Assoc Cancer Res Cosponsored Am Soc Prev Oncol"},{"key":"1270_CR17","unstructured":"NIMHD. NIMHD Research Framework. https:\/\/www.nimhd.nih.gov\/about\/ overview\/research-framework.html. Accessed 28 Jun 2019."},{"key":"1270_CR18","doi-asserted-by":"publisher","first-page":"277","DOI":"10.1590\/S1413-81232006000200007","volume":"11","author":"LL Dahlberg","year":"2006","unstructured":"Dahlberg LL, Krug EG. Violence a global public health problem. Ci\u00eanc Sa\u00fade Coletiva. 2006;11:277\u201392.","journal-title":"Ci\u00eanc Sa\u00fade Coletiva"},{"key":"1270_CR19","doi-asserted-by":"publisher","first-page":"603","DOI":"10.1186\/1471-2407-10-603","volume":"10","author":"TH Keegan","year":"2010","unstructured":"Keegan TH, Quach T, Shema S, Glaser SL, Gomez SL. The influence of nativity and neighborhoods on breast cancer stage at diagnosis and survival among California Hispanic women. BMC Cancer. 2010;10:603.","journal-title":"BMC Cancer"},{"key":"1270_CR20","doi-asserted-by":"publisher","first-page":"1612","DOI":"10.1002\/cam4.509","volume":"4","author":"Y Guo","year":"2015","unstructured":"Guo Y, Logan HL, Marks JG, Shenkman EA. The relationships among individual and regional smoking, socioeconomic status, and oral and pharyngeal cancer survival: a mediation analysis. Cancer Med. 2015;4:1612\u20139.","journal-title":"Cancer Med"},{"key":"1270_CR21","volume-title":"Data integration blueprint and modeling: techniques for a scalable and sustainable architecture","author":"A Giordano","year":"2011","unstructured":"Giordano A. Data integration blueprint and modeling: techniques for a scalable and sustainable architecture. Upper Saddle River: IBM Press Pearson; 2011."},{"key":"1270_CR22","doi-asserted-by":"publisher","first-page":"e00525","DOI":"10.1128\/mBio.00525-18","volume":"9","author":"PD Schloss","year":"2018","unstructured":"Schloss PD. Identifying and Overcoming Threats to Reproducibility, Replicability, Robustness, and Generalizability in Microbiome Research. mBio. 2018;9:e00525\u201318 \/mbio\/9\/3\/mBio.00525\u201318.atom.","journal-title":"mBio"},{"key":"1270_CR23","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1016\/j.compbiomed.2017.06.005","volume":"87","author":"R Alonso-Calvo","year":"2017","unstructured":"Alonso-Calvo R, Paraiso-Medina S, Perez-Rey D, Alonso-Oset E, van Stiphout R, Yu S, et al. A semantic interoperability approach to support integration of gene expression and clinical data in breast cancer. Comput Biol Med. 2017;87:179\u201386.","journal-title":"Comput Biol Med"},{"key":"1270_CR24","doi-asserted-by":"publisher","first-page":"32","DOI":"10.1016\/j.jbi.2016.05.006","volume":"62","author":"H Kondylakis","year":"2016","unstructured":"Kondylakis H, Claerhout B, Keyur M, Koumakis L, van Leeuwen J, Marias K, et al. The INTEGRATE project: delivering solutions for efficient multi-centric clinical research and trials. J Biomed Inform. 2016;62:32\u201347.","journal-title":"J Biomed Inform"},{"key":"1270_CR25","doi-asserted-by":"publisher","unstructured":"METABRIC Group, Papatheodorou I, Crichton C, Morris L, Maccallum P, Davies J, et al. A metadata approach for clinical data management in translational genomics studies in breast cancer. BMC Med Genomics. 2009;2. doi:https:\/\/doi.org\/10.1186\/1755-8794-2-66.","DOI":"10.1186\/1755-8794-2-66"},{"key":"1270_CR26","unstructured":"Centre for Statistics in Medicine, NDORMS, University of Oxford. Enhancing the QUAlity and Transparency Of health Research. 2020. https:\/\/www.equator-network.org\/reporting-guidelines\/. Accessed 28 Jan 2020."},{"key":"1270_CR27","volume-title":"SEPDA@ISWC","author":"H Zhang","year":"2019","unstructured":"Zhang H, Guo Y, Bian J. Ontology for documentation of variable and data source selection process to support integrative data analysis in Cancer outcomes research. In: SEPDA@ISWC; 2019."},{"key":"1270_CR28","doi-asserted-by":"crossref","unstructured":"Guo Y, Bian J, Modave F, Li Q, George TJ, Prosperi M, Shenkman E. Assessing the effect of data integration on predictive ability of cancer survival models. Health Informatics J. 2020;26(1):8\u201320.","DOI":"10.1177\/1460458218824692"},{"key":"1270_CR29","doi-asserted-by":"publisher","unstructured":"Zhang H, Guo Y, Li Q, George TJ, Shenkman E, Modave F, et al. An ontology-guided semantic data integration framework to support integrative data analysis of cancer survival. BMC Med Inform Decis Mak. 2018;18. https:\/\/doi.org\/10.1186\/s12911-018-0636-4.","DOI":"10.1186\/s12911-018-0636-4"},{"key":"1270_CR30","unstructured":"Rural-Urban Commuting Area Codes. 2019. https:\/\/www.ers.usda.gov\/data-products\/rural-urban-commuting-area-codes.aspx. Accessed 28 Jan 2020."},{"key":"1270_CR31","volume-title":"NCHS Urban-Rural Classification Scheme for Counties","author":"National Center for Health Statistics, Office of Analysis and Epidemiology","year":"2017","unstructured":"National Center for Health Statistics, Office of Analysis and Epidemiology. NCHS Urban-Rural Classification Scheme for Counties. 2017. https:\/\/www.cdc.gov\/nchs\/data_access\/urban_rural.htm#2013_Urban-Rural_Classification_Scheme_for_Counties. Accessed 28 Jan 2017."},{"key":"1270_CR32","doi-asserted-by":"publisher","unstructured":"Arp R, Smith B, Spear AD. Building ontologies with basic formal ontology. The MIT Press. 2015. https:\/\/doi.org\/10.7551\/mitpress\/9780262527811.001.0001.","DOI":"10.7551\/mitpress\/9780262527811.001.0001"},{"issue":"Web Server issu","key":"1270_CR33","doi-asserted-by":"publisher","first-page":"W541","DOI":"10.1093\/nar\/gkr469","volume":"39","author":"PL Whetzel","year":"2011","unstructured":"Whetzel PL, Noy NF, Shah NH, Alexander PR, Nyulas C, Tudorache T, et al. BioPortal: enhanced functionality via new Web services from the National Center for Biomedical Ontology to access and use ontologies in software applications. Nucleic Acids Res. 2011;39(Web Server issue):W541\u20135.","journal-title":"Nucleic Acids Res"},{"key":"1270_CR34","unstructured":"David Beckett, Tim Berners-Lee, Eric Prud\u2019hommeaux, Gavin Carothers, Lex Machina. RDF 1.1 Turtle. 2014. https:\/\/www.w3.org\/TR\/2014\/RECturtle-20140225\/Overview.html. Accessed 28 Jan 2020."},{"key":"1270_CR35","doi-asserted-by":"publisher","first-page":"61","DOI":"10.1002\/j.1556-6678.2010.tb00151.x","volume":"88","author":"NL Leech","year":"2010","unstructured":"Leech NL, Onwuegbuzie AJ. Guidelines for conducting and reporting mixed research in the Field of counseling and beyond. J Couns Dev. 2010;88:61\u20139.","journal-title":"J Couns Dev"},{"key":"1270_CR36","doi-asserted-by":"publisher","first-page":"55","DOI":"10.7326\/M14-0697","volume":"162","author":"GS Collins","year":"2015","unstructured":"Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med. 2015;162:55.","journal-title":"Ann Intern Med"},{"key":"1270_CR37","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1186\/s40364-014-0027-7","volume":"3","author":"KF Kerr","year":"2015","unstructured":"Kerr KF, Meisner A, Thiessen-Philbrook H, Coca SG, Parikh CR. RiGoR: reporting guidelines to address common sources of bias in risk model development. Biomark Res. 2015;3:2.","journal-title":"Biomark Res"},{"key":"1270_CR38","doi-asserted-by":"publisher","first-page":"401","DOI":"10.1016\/j.bbi.2012.01.014","volume":"26","author":"LA Jason","year":"2012","unstructured":"Jason LA, Unger ER, Dimitrakoff JD, Fagin AP, Houghton M, Cook DB, et al. Minimum data elements for research reports on CFS. Brain Behav Immun. 2012;26:401\u20136.","journal-title":"Brain Behav Immun"},{"key":"1270_CR39","doi-asserted-by":"publisher","first-page":"e202","DOI":"10.1016\/S1473-3099(16)30082-2","volume":"16","author":"EJA Fitchett","year":"2016","unstructured":"Fitchett EJA, Seale AC, Vergnano S, Sharland M, Heath PT, Saha SK, et al. Strengthening the reporting of observational studies in epidemiology for newborn infection (STROBE-NI): an extension of the STROBE statement for neonatal infection research. Lancet Infect Dis. 2016;16:e202\u201313.","journal-title":"Lancet Infect Dis"},{"key":"1270_CR40","doi-asserted-by":"publisher","first-page":"1463","DOI":"10.1016\/j.jclinepi.2015.04.002","volume":"68","author":"RG White","year":"2015","unstructured":"White RG, Hakim AJ, Salganik MJ, Spiller MW, Johnston LG, Kerr L, et al. Strengthening the reporting of observational studies in epidemiology for respondent-driven sampling studies: \u201cSTROBE-RDS\u201d statement. J Clin Epidemiol. 2015;68:1463\u201371.","journal-title":"J Clin Epidemiol"},{"key":"1270_CR41","doi-asserted-by":"publisher","first-page":"573","DOI":"10.7326\/0003-4819-147-8-200710160-00010","volume":"147","author":"E von Elm","year":"2007","unstructured":"von Elm E, Altman DG, Egger M, Pocock SJ, G\u00f8tzsche PC, Vandenbroucke JP, et al. The Strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. Ann Intern Med. 2007;147:573\u20137.","journal-title":"Ann Intern Med"},{"key":"1270_CR42","doi-asserted-by":"publisher","first-page":"272","DOI":"10.1037\/a0020462","volume":"55","author":"DL Jackson","year":"2010","unstructured":"Jackson DL. Reporting results of latent growth modeling and multilevel modeling analyses: some recommendations for rehabilitation psychology. Rehabil Psychol. 2010;55:272\u201385.","journal-title":"Rehabil Psychol"},{"key":"1270_CR43","first-page":"484","volume":"26","author":"F Wolfe","year":"1999","unstructured":"Wolfe F, Lassere M, van der Heijde D, Stucki G, Suarez-Almazor M, Pincus T, et al. Preliminary core set of domains and reporting requirements for longitudinal observational studies in rheumatology. J Rheumatol. 1999;26:484\u20139.","journal-title":"J Rheumatol"},{"key":"1270_CR44","doi-asserted-by":"publisher","first-page":"e19","DOI":"10.1016\/S0140-6736(16)30388-9","volume":"388","author":"GA Stevens","year":"2016","unstructured":"Stevens GA, Alkema L, Black RE, Boerma JT, Collins GS, Ezzati M, et al. Guidelines for accurate and transparent health estimates reporting: the GATHER statement. Lancet. 2016;388:e19\u201323.","journal-title":"Lancet"},{"key":"1270_CR45","doi-asserted-by":"publisher","first-page":"e1000420","DOI":"10.1371\/journal.pmed.1000420","volume":"8","author":"ACJW Janssens","year":"2011","unstructured":"Janssens ACJW, Ioannidis JPA, van Duijn CM, Little J, Khoury MJ, GRIPS group. Strengthening the reporting of Genetic RIsk Prediction Studies: the GRIPS Statement. Plos Med. 2011;8:e1000420.","journal-title":"Plos Med"},{"key":"1270_CR46","doi-asserted-by":"publisher","first-page":"e22","DOI":"10.1371\/journal.pmed.1000022","volume":"6","author":"J Little","year":"2009","unstructured":"Little J, Higgins JPT, Ioannidis JPA, Moher D, Gagnon F, von Elm E, et al. STrengthening the REporting of genetic association studies (STREGA): an extension of the STROBE statement. PLoS Med. 2009;6:e22.","journal-title":"PLoS Med"},{"key":"1270_CR47","doi-asserted-by":"publisher","first-page":"333","DOI":"10.1111\/j.1399-0039.2011.01777.x","volume":"78","author":"JA Hollenbach","year":"2011","unstructured":"Hollenbach JA, Mack SJ, Gourraud P-A, Single RM, Maiers M, Middleton D, et al. A community standard for immunogenomic data reporting and analysis: proposal for a STrengthening the REporting of Immunogenomic studies statement. Tissue Antigens. 2011;78:333\u201344.","journal-title":"Tissue Antigens"},{"key":"1270_CR48","doi-asserted-by":"publisher","first-page":"341","DOI":"10.1016\/S1473-3099(13)70324-4","volume":"14","author":"N Field","year":"2014","unstructured":"Field N, Cohen T, Struelens MJ, Palm D, Cookson B, Glynn JR, et al. Strengthening the reporting of molecular epidemiology for infectious diseases (STROME-ID): an extension of the STROBE statement. Lancet Infect Dis. 2014;14:341\u201352.","journal-title":"Lancet Infect Dis"},{"key":"1270_CR49","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1111\/j.1365-2362.2011.02561.x","volume":"42","author":"V Gallo","year":"2012","unstructured":"Gallo V, Egger M, McCormack V, Farmer PB, Ioannidis JPA, Kirsch-Volders M, et al. STrengthening the reporting of OBservational studies in epidemiology - molecular epidemiology (STROBE-ME): an extension of the STROBE statement. Eur J Clin Investig. 2012;42:1\u201316.","journal-title":"Eur J Clin Investig"},{"key":"1270_CR50","doi-asserted-by":"publisher","first-page":"1596","DOI":"10.1136\/ard.2009.125526","volume":"69","author":"WG Dixon","year":"2010","unstructured":"Dixon WG, Carmona L, Finckh A, Hetland ML, Kvien TK, Landewe R, et al. EULAR points to consider when establishing, analysing and reporting safety data of biologics registers in rheumatology. Ann Rheum Dis. 2010;69:1596\u2013602.","journal-title":"Ann Rheum Dis"},{"key":"1270_CR51","doi-asserted-by":"publisher","first-page":"628","DOI":"10.1136\/annrheumdis-2013-204102","volume":"73","author":"J Zavada","year":"2014","unstructured":"Zavada J, Dixon WG, Askling J, EULAR study group on longitudinal observational registers and drug studies. Launch of a checklist for reporting longitudinal observational drug studies in rheumatology: a EULAR extension of STROBE guidelines based on experience from biologics registries. Ann Rheum Dis. 2014;73:628.","journal-title":"Ann Rheum Dis"},{"key":"1270_CR52","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1037\/lhb0000090","volume":"39","author":"JP Singh","year":"2015","unstructured":"Singh JP, Yang S, Mulvey EP, RAGEE group. Reporting guidance for violence risk assessment predictive validity studies: the RAGEE statement. Law Hum Behav. 2015;39:15\u201322.","journal-title":"Law Hum Behav"},{"key":"1270_CR53","doi-asserted-by":"publisher","first-page":"e1002036","DOI":"10.1371\/journal.pmed.1002036","volume":"13","author":"C Lachat","year":"2016","unstructured":"Lachat C, Hawwash D, Ock\u00e9 MC, Berg C, Forsum E, H\u00f6rnell A, et al. Strengthening the reporting of observational studies in epidemiology\u2014nutritional epidemiology (STROBE-nut): an extension of the STROBE statement. PLoS Med. 2016;13:e1002036.","journal-title":"PLoS Med"},{"key":"1270_CR54","doi-asserted-by":"publisher","first-page":"30","DOI":"10.7326\/M18-0543","volume":"169","author":"S De Geest","year":"2018","unstructured":"De Geest S, Zullig LL, Dunbar-Jacob J, Helmy R, Hughes DA, Wilson IB, et al. ESPACOMP medication adherence reporting guideline (EMERGE). Ann Intern Med. 2018;169:30\u20135.","journal-title":"Ann Intern Med"},{"key":"1270_CR55","doi-asserted-by":"publisher","first-page":"14","DOI":"10.1016\/j.onehlt.2017.07.001","volume":"4","author":"MF Davis","year":"2017","unstructured":"Davis MF, Rankin SC, Schurer JM, Cole S, Conti L, Rabinowitz P, et al. Checklist for one health epidemiological reporting of evidence (COHERE). One Health. 2017;4:14\u201321.","journal-title":"One Health"},{"key":"1270_CR56","doi-asserted-by":"publisher","first-page":"1009","DOI":"10.1016\/j.jval.2017.08.3018","volume":"20","author":"SV Wang","year":"2017","unstructured":"Wang SV, Schneeweiss S, Berger ML, Brown J, de Vries F, Douglas I, et al. Reporting to improve reproducibility and facilitate validity assessment for healthcare database studies V1.0. Value Health J Int Soc Pharmacoeconomics Outcomes Res. 2017;20:1009\u201322.","journal-title":"Value Health J Int Soc Pharmacoeconomics Outcomes Res"},{"key":"1270_CR57","first-page":"1052","volume":"3","author":"MG Kahn","year":"2015","unstructured":"Kahn MG, Brown JS, Chun AT, Davidson BN, Meeker D, Ryan PB, et al. Transparent reporting of data quality in distributed data networks. EGEMS Wash DC. 2015;3:1052.","journal-title":"EGEMS Wash DC"},{"key":"1270_CR58","doi-asserted-by":"crossref","unstructured":"Langan SM, Schmidt SA, Wing K, Ehrenstein V, Nicholls SG, Filion KB, Klungel O, Petersen I, Sorensen HT, Dixon WG, Guttmann A, Harron K, Hemkens LG, Moher D, Schneeweiss S, Smeeth L, Sturkenboom M, von Elm E, Wang SV, Benchimol EI. The reporting of studies conducted using observational routinely collected health data statement for pharmacoepidemiology (RECORD-PE). BMJ. 2018;363:k3532.","DOI":"10.1136\/bmj.k3532"},{"key":"1270_CR59","doi-asserted-by":"publisher","first-page":"e1001885","DOI":"10.1371\/journal.pmed.1001885","volume":"12","author":"EI Benchimol","year":"2015","unstructured":"Benchimol EI, Smeeth L, Guttmann A, Harron K, Moher D, Petersen I, et al. The REporting of studies conducted using observational routinely-collected health data (RECORD) statement. PLoS Med. 2015;12:e1001885.","journal-title":"PLoS Med"},{"key":"1270_CR60","doi-asserted-by":"publisher","first-page":"821","DOI":"10.1212\/WNL.0000000000001866","volume":"85","author":"DA Bennett","year":"2015","unstructured":"Bennett DA, Brayne C, Feigin VL, Barker-Collo S, Brainin M, Davis D, et al. Development of the standards of reporting of neurological disorders (STROND) checklist: a guideline for the reporting of incidence and prevalence studies in neuroepidemiology. Neurology. 2015;85:821\u20138.","journal-title":"Neurology."},{"key":"1270_CR61","doi-asserted-by":"publisher","first-page":"1044","DOI":"10.1111\/j.1524-4733.2009.00600.x","volume":"12","author":"ML Berger","year":"2009","unstructured":"Berger ML, Mamdani M, Atkins D, Johnson ML. Good research practices for comparative effectiveness research: defining, reporting and interpreting nonrandomized studies of treatment effects using secondary data sources: the ISPOR good research practices for retrospective database analysis task force report--part I. Value Health J Int Soc Pharmacoeconomics Outcomes Res. 2009;12:1044\u201352.","journal-title":"Value Health J Int Soc Pharmacoeconomics Outcomes Res."},{"key":"1270_CR62","doi-asserted-by":"publisher","first-page":"407","DOI":"10.1111\/jcpe.12392","volume":"42","author":"B Holtfreter","year":"2015","unstructured":"Holtfreter B, Albandar JM, Dietrich T, Dye BA, Eaton KA, Eke PI, et al. Standards for reporting chronic periodontitis prevalence and severity in epidemiologic studies: proposed standards from the joint EU\/USA periodontal epidemiology working group. J Clin Periodontol. 2015;42:407\u201312.","journal-title":"J Clin Periodontol"},{"key":"1270_CR63","doi-asserted-by":"publisher","first-page":"e010134","DOI":"10.1136\/bmjopen-2015-010134","volume":"6","author":"E Tacconelli","year":"2016","unstructured":"Tacconelli E, Cataldo MA, Paul M, Leibovici L, Kluytmans J, Schr\u00f6der W, et al. STROBE-AMS: recommendations to optimise reporting of epidemiological studies on antimicrobial resistance and informing improvement in antimicrobial stewardship. BMJ Open. 2016;6:e010134.","journal-title":"BMJ Open"},{"key":"1270_CR64","doi-asserted-by":"publisher","first-page":"16","DOI":"10.1007\/s13755-017-0039-4","volume":"5","author":"MS Barakat","year":"2017","unstructured":"Barakat MS, Field M, Ghose A, Stirling D, Holloway L, Vinod S, et al. The effect of imputing missing clinical attribute values on training lung cancer survival prediction model performance. Health Inf Sci Syst. 2017;5:16.","journal-title":"Health Inf Sci Syst"},{"key":"1270_CR65","first-page":"227","volume-title":"Datenbanksysteme in Business, Technologie und Web (BTW)","author":"B Glavic","year":"2007","unstructured":"Glavic B, Dittrich KR. Data provenance: a categorization of existing approaches. In: Datenbanksysteme in Business, Technologie und Web (BTW). Aachen: Ges. f\u00fcr Informatik; 2007. p. 227\u201341."},{"key":"1270_CR66","doi-asserted-by":"publisher","DOI":"10.17226\/25303","volume-title":"Reproducibility and Replicability in Science","author":"Committee on Reproducibility and Replicability in Science, Board on Behavioral, Cognitive, and Sensory Sciences, Committee on National Statistics, Division of Behavioral and Social Sciences and Education, Nuclear and Radiation Studies Board, Division on Earth and Life Studies","year":"2019","unstructured":"Committee on Reproducibility and Replicability in Science, Board on Behavioral, Cognitive, and Sensory Sciences, Committee on National Statistics, Division of Behavioral and Social Sciences and Education, Nuclear and Radiation Studies Board, Division on Earth and Life Studies, et al. Reproducibility and Replicability in Science. Washington, D.C: National Academies Press; 2019. https:\/\/doi.org\/10.17226\/25303."}],"container-title":["BMC Medical Informatics and Decision Making"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-020-01270-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s12911-020-01270-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12911-020-01270-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,12,14]],"date-time":"2020-12-14T03:04:09Z","timestamp":1607915049000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcmedinformdecismak.biomedcentral.com\/articles\/10.1186\/s12911-020-01270-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12]]},"references-count":66,"journal-issue":{"issue":"S4","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["1270"],"URL":"https:\/\/doi.org\/10.1186\/s12911-020-01270-3","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.05.28.20115907","asserted-by":"object"}]},"ISSN":["1472-6947"],"issn-type":[{"value":"1472-6947","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,12]]},"assertion":[{"value":"9 September 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"17 September 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 December 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"292"}}