{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,1]],"date-time":"2026-02-01T18:26:27Z","timestamp":1769970387955,"version":"3.49.0"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1013249","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2025,7,23]],"date-time":"2025-07-23T00:00:00Z","timestamp":1753228800000}}],"reference-count":36,"publisher":"Public Library of Science (PLoS)","issue":"7","license":[{"start":{"date-parts":[[2025,7,17]],"date-time":"2025-07-17T00:00:00Z","timestamp":1752710400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["NIH 5R25GM141501"],"award-info":[{"award-number":["NIH 5R25GM141501"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>The increasing availability of big data and adoption of sophisticated computational techniques in biomedical research has exciting implications for our scientific understanding of human health. However, researchers report struggling to find data science education that meets their needs, despite the fact that many training programs and online resources exist. There is a lack of evidence on the strengths and weaknesses of various training options, making selecting an educational path daunting. We created a new data science training program focused on rigorous, reproducible methods for biomedical research, making use of tightly scoped modular content that can be flexibly arranged to provide a curriculum tailored to a researcher\u2019s specific needs and skill level. Moreover, we ran a study testing the program\u2019s effectiveness, providing not only another option for data science training but also a model for collecting and sharing relevant data on data science education programs. We ran two waves of research participants, adjusting our materials in between to improve both the training program and our research design. For both waves, we pre-registered hypotheses that learners\u2019 self-reported data science ability and level of agreement with important tenets of open science would increase over the course of the program. Indeed, learners showed significant improvement in data science ability (Wave 1: <jats:italic>t<\/jats:italic>(47) = 10.18, <jats:italic>p<\/jats:italic>\u2009&lt;\u2009.001, Wave 2: <jats:italic>t<\/jats:italic>(238) = 17.12, <jats:italic>p<\/jats:italic>\u2009&lt;\u2009.001) and grea<jats:italic>t<\/jats:italic>er agreement with open science values (Wave 1: <jats:italic>t<\/jats:italic>(47) = 3.56, <jats:italic>p<\/jats:italic>\u2009&lt;\u2009.001, Wave 2: <jats:italic>t<\/jats:italic>(238) = 7.95, <jats:italic>p<\/jats:italic>\u2009&lt;\u2009.001). Follow up analyses underscore <jats:italic>t<\/jats:italic>he robustness of improvemen<jats:italic>t<\/jats:italic> in data science ability. The improvement in open science values was more moderate and was significant only in some of our pre-registered hypothesis tests, likely due to a ceiling effect as most learners reported high agreement with open science values at pretest.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1013249","type":"journal-article","created":{"date-parts":[[2025,7,17]],"date-time":"2025-07-17T17:50:33Z","timestamp":1752774633000},"page":"e1013249","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":1,"title":["Modular, focused data science education improves biomedical learners\u2019 abilities: A study of the Data and Analytics for Research Training (DART) program"],"prefix":"10.1371","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5386-5831","authenticated-orcid":true,"given":"Rose","family":"Hartman","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-0198-8125","authenticated-orcid":true,"given":"Karen Joy","family":"Payton","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5750-6473","authenticated-orcid":true,"given":"Rose","family":"Franzen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Meredith","family":"Lee","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-4262-454X","authenticated-orcid":true,"given":"Elizabeth","family":"Drellich","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ali","family":"Shokoufandeh","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jeffrey","family":"Pennington","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"340","published-online":{"date-parts":[[2025,7,17]]},"reference":[{"issue":"2","key":"pcbi.1013249.ref001","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1093\/bib\/bbx100","article-title":"A global perspective on evolving bioinformatics and data science training needs","volume":"20","author":"TK Attwood","year":"2019","journal-title":"Brief Bioinform"},{"issue":"2","key":"pcbi.1013249.ref002","doi-asserted-by":"crossref","first-page":"342","DOI":"10.1038\/s41390-022-02264-9","article-title":"Improving child health through Big Data and data science","volume":"93","author":"ZA Vesoulis","year":"2023","journal-title":"Pediatr Res"},{"issue":"3","key":"pcbi.1013249.ref003","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1080\/23808993.2020.1758062","article-title":"How can bioinformatics contribute to the routine application of personalized precision medicine?","volume":"5","author":"C Carretero-Puche","year":"2020","journal-title":"Expert Rev Precis Med Drug Dev"},{"issue":"3","key":"pcbi.1013249.ref004","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1097\/NUR.0000000000000516","article-title":"Data science and graduate nursing education: a critical literature review","volume":"34","author":"M Foster","year":"2020","journal-title":"Clin Nurse Spec"},{"issue":"1","key":"pcbi.1013249.ref005","article-title":"Research rigor and reproducibility in research education: a CTSA institutional survey","volume":"8","author":"C Axfors","year":"2024","journal-title":"J Clin Transl Sci"},{"issue":"7","key":"pcbi.1013249.ref006","first-page":"1","article-title":"Researcher attitudes toward data sharing in public data repositories: a meta-evaluation of studies on researcher data sharing","volume":"78","author":"JL Thoegersen","year":"2021","journal-title":"JD"},{"issue":"8","key":"pcbi.1013249.ref007","doi-asserted-by":"crossref","first-page":"1171","DOI":"10.1038\/s41587-023-01891-9","article-title":"Grand challenges in bioinformatics education and training","volume":"41","author":"EB I\u015f\u0131k","year":"2023","journal-title":"Nat Biotechnol"},{"key":"pcbi.1013249.ref008","doi-asserted-by":"crossref","DOI":"10.7717\/peerj.5553","article-title":"Data challenges of biomedical researchers in the age of omics","volume":"6","author":"R Garcia-Milian","year":"2018","journal-title":"PeerJ"},{"issue":"10","key":"pcbi.1013249.ref009","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1005755","article-title":"Unmet needs for analyzing biological big data: a survey of 704 NSF principal investigators","volume":"13","author":"L Barone","year":"2017","journal-title":"PLoS Comput Biol"},{"key":"pcbi.1013249.ref010","doi-asserted-by":"crossref","first-page":"62","DOI":"10.12688\/f1000research.3-62.v1","article-title":"Software Carpentry: lessons learned","volume":"3","author":"G Wilson","year":"2014","journal-title":"F1000Res"},{"key":"pcbi.1013249.ref011","doi-asserted-by":"crossref","DOI":"10.7554\/eLife.64719","article-title":"A community-led initiative for training in reproducible research","volume":"10","author":"S Auer","year":"2021","journal-title":"Elife"},{"key":"pcbi.1013249.ref012","author":"D Chen","year":"2022"},{"issue":"6","key":"pcbi.1013249.ref013","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1011160","article-title":"Teaching students to R3eason, not merely to solve problem sets: the role of philosophy and visual data communication in accessible data science education","volume":"19","author":"II Ciubotariu","year":"2023","journal-title":"PLoS Comput Biol"},{"issue":"2","key":"pcbi.1013249.ref014","article-title":"An integrated, modular approach to data science education in microbiology","volume":"17","author":"KA Dill-McFarland","year":"2021","journal-title":"PLoS Comput Biol"},{"issue":"11","key":"pcbi.1013249.ref015","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0293879","article-title":"An international consensus on effective, inclusive, and career-spanning short-format training in the life sciences and beyond","volume":"18","author":"JJ Williams","year":"2023","journal-title":"PLoS One"},{"issue":"37","key":"pcbi.1013249.ref016","doi-asserted-by":"crossref","first-page":"9854","DOI":"10.1073\/pnas.1705783114","article-title":"Null effects of boot camps and short-format training for PhD students in life sciences","volume":"114","author":"DF Feldon","year":"2017","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"2","key":"pcbi.1013249.ref017","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1111\/j.1467-8535.2010.01157.x","article-title":"Situational interest, computer self-efficacy, and self-regulation: their impact on student engagement in distance education","volume":"43","author":"JCY Sun","year":"2012","journal-title":"Br J Educ Technol"},{"key":"pcbi.1013249.ref018","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.iheduc.2015.04.007","article-title":"Self-regulated learning strategies & academic achievement in online higher education learning environments: a systematic review","volume":"27","author":"J Broadbent","year":"2015","journal-title":"Internet High Educ"},{"key":"pcbi.1013249.ref019","author":"R Hartman","year":"2023","journal-title":"OSF"},{"key":"pcbi.1013249.ref020","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v067.i01","article-title":"Fitting linear mixed-effects models using lme4","volume":"67","author":"D Bates","year":"2015","journal-title":"J Stat Softw"},{"issue":"2","key":"pcbi.1013249.ref021","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1037\/0021-9010.64.2.144","article-title":"Response-shift bias: a source of contamination of self-report measures","volume":"64","author":"GS Howard","year":"1979","journal-title":"J Appl Psychol"},{"issue":"5","key":"pcbi.1013249.ref022","doi-asserted-by":"crossref","first-page":"1202","DOI":"10.1037\/a0013314","article-title":"Knowing me, knowing you: the accuracy and unique predictive validity of self-ratings and other-ratings of daily behavior","volume":"95","author":"S Vazire","year":"2008","journal-title":"J Pers Soc Psychol"},{"key":"pcbi.1013249.ref023","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1080\/00461520.1991.9653133","article-title":"Self-efficacy and academic motivation","volume":"26","author":"D Schunk","year":"1991","journal-title":"Educational Psychologist"},{"key":"pcbi.1013249.ref024","article-title":"Can MOOC Programs Improve Student Employment Prospects?","author":"A Hadavand","year":"2018","journal-title":"SSRN Journal"},{"issue":"3","key":"pcbi.1013249.ref025","doi-asserted-by":"crossref","DOI":"10.19173\/irrodl.v16i3.2112","article-title":"Massive open online course completion rates revisited: assessment, length and attrition","volume":"16","author":"K Jordan","year":"2015","journal-title":"IRRODL"},{"key":"pcbi.1013249.ref026","doi-asserted-by":"crossref","first-page":"103961","DOI":"10.1016\/j.compedu.2020.103961","article-title":"Exploring student and teacher usage patterns associated with student attrition in an open educational resource-supported online learning platform","volume":"156","author":"D Kim","year":"2020","journal-title":"Comput Educ"},{"issue":"8","key":"pcbi.1013249.ref027","doi-asserted-by":"crossref","first-page":"860","DOI":"10.3390\/educsci14080860","article-title":"The interplay of self-regulated learning, cognitive load, and performance in learner-controlled environments","volume":"14","author":"A Gorbunova","year":"2024","journal-title":"Educ Sci"},{"key":"pcbi.1013249.ref028","doi-asserted-by":"crossref","DOI":"10.3389\/feduc.2022.851019","article-title":"Engagement in online learning: student attitudes and behavior during COVID-19","volume":"7","author":"B Hollister","year":"2022","journal-title":"Front Educ"},{"issue":"4","key":"pcbi.1013249.ref029","first-page":"51","article-title":"Asynchronous and synchronous e-learning","volume":"31","author":"S Hrastinski","year":"2008","journal-title":"Educause Quarterly"},{"key":"pcbi.1013249.ref030","first-page":"87","article-title":"Engagement in online learning: Student attitudes and behavior in asynchronous courses","volume":"3","author":"F Martin","year":"2018","journal-title":"Front Educ"},{"key":"pcbi.1013249.ref031","author":"KJ Payton"},{"key":"pcbi.1013249.ref032","author":"National Institutes of Health","year":"2018"},{"key":"pcbi.1013249.ref033","author":"A Dietrich","year":"2024"},{"issue":"2","key":"pcbi.1013249.ref034","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1016\/j.jbi.2008.08.010","article-title":"Research electronic data capture (REDCap)--a metadata-driven methodology and workflow process for providing translational research informatics support","volume":"42","author":"PA Harris","year":"2009","journal-title":"J Biomed Inform"},{"key":"pcbi.1013249.ref035","doi-asserted-by":"crossref","first-page":"103208","DOI":"10.1016\/j.jbi.2019.103208","article-title":"The REDCap consortium: Building an international community of software platform partners","volume":"95","author":"PA Harris","year":"2019","journal-title":"J Biomed Inform"},{"issue":"1","key":"pcbi.1013249.ref036","doi-asserted-by":"crossref","first-page":"52","DOI":"10.3163\/1536-5050.104.1.008","article-title":"Data literacy training needs of biomedical researchers","volume":"104","author":"LM Federer","year":"2016","journal-title":"J Med Libr Assoc"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1013249","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2025,7,23]],"date-time":"2025-07-23T00:00:00Z","timestamp":1753228800000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1013249","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,23]],"date-time":"2025-07-23T17:48:12Z","timestamp":1753292892000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1013249"}},"subtitle":[],"editor":[{"given":"Francis","family":"Ouellette","sequence":"first","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2025,7,17]]},"references-count":36,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2025,7,17]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1013249","relation":{"new_version":[{"id-type":"doi","id":"10.1371\/journal.pcbi.1013249","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,17]]}}}