{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,6]],"date-time":"2026-06-06T15:41:28Z","timestamp":1780760488340,"version":"3.54.1"},"reference-count":16,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2025,9,9]],"date-time":"2025-09-09T00:00:00Z","timestamp":1757376000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","award":["1U24MH114827-01"],"award-info":[{"award-number":["1U24MH114827-01"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,10,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Summary<\/jats:title>\n                  <jats:p>In the era of large data, the cloud is increasingly used as a computing environment, necessitating the development of cloud-compatible pipelines that can provide uniform analysis across disparate biological datasets. The Warp Analysis Research Pipelines (WARP) repository is a GitHub repository of open-source, cloud-optimized workflows for biological data processing that are semantically versioned, tested, and documented. A companion repository, WARP-Tools, hosts Docker containers and custom tools used in WARP workflows.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The WARP and WARP-Tools repositories and code are freely available at https:\/\/github.com\/broadinstitute\/WARP and https:\/\/github.com\/broadinstitute\/WARP-tools, respectively. The pipelines are available for download from the WARP repository, can be exported from Dockstore, and can be imported to a bioinformatics platform such as Terra.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf494","type":"journal-article","created":{"date-parts":[[2025,9,6]],"date-time":"2025-09-06T11:46:50Z","timestamp":1757159210000},"source":"Crossref","is-referenced-by-count":6,"title":["Warp analysis research pipelines: cloud-optimized workflows for biological data processing and reproducible analysis"],"prefix":"10.1093","volume":"41","author":[{"given":"Kylee","family":"Degatano","sequence":"first","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0578-090X","authenticated-orcid":false,"given":"Aseel","family":"Awdeh","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"suffix":"III","given":"Robert Sidney","family":"Cox","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wes","family":"Dingman","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"George","family":"Grant","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Farzaneh","family":"Khajouei","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Elizabeth","family":"Kiernan","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kishori","family":"Konwar","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kaylee L","family":"Mathews","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kevin","family":"Palis","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nikelle","family":"Petrillo","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Geraldine","family":"Van der Auwera","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chengchen (Rex)","family":"Wang","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jessica","family":"Way","sequence":"additional","affiliation":[{"name":"Data Sciences Platform, Broad Institute of MIT and Harvard , Cambridge, MA 02142,","place":["United States"]}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2025,9,9]]},"reference":[{"key":"2025101607410600100_btaf494-B1","doi-asserted-by":"crossref","first-page":"D1075","DOI":"10.1093\/nar\/gkac962","article-title":"The neuroscience multi-omic archive: a brain initiative resource for single-cell transcriptomic and epigenomic data from the mammalian brain","volume":"51","author":"Ament","year":"2023","journal-title":"Nucleic Acids Res"},{"key":"2025101607410600100_btaf494-B2","author":"Degatano","year":"2021"},{"key":"2025101607410600100_btaf494-B3","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1038\/s41587-020-0439-x","article-title":"The nf-core framework for community-curated bioinformatics pipelines","volume":"38","author":"Ewels","year":"2020","journal-title":"Nat Biotechnol"},{"key":"2025101607410600100_btaf494-B4","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1165\/rcmb.2022-0165OC","article-title":"LungMAP portal ecosystem: systems-level exploration of the lung","volume":"70","author":"Gaddis","year":"2024","journal-title":"Am J Respir Cell Mol Biol"},{"key":"2025101607410600100_btaf494-B5","doi-asserted-by":"crossref","first-page":"905","DOI":"10.1016\/j.cell.2020.09.036","article-title":"Data sanitization to reduce private information leakage from functional genomics","volume":"183","author":"G\u00fcrsoy","year":"2020","journal-title":"Cell"},{"key":"2025101607410600100_btaf494-B6","doi-asserted-by":"crossref","first-page":"e3002133","DOI":"10.1371\/journal.pbio.3002133","article-title":"A guide to the brain initiative cell census network data ecosystem","volume":"21","author":"Hawrylycz","year":"2023","journal-title":"PLoS Biol"},{"key":"2025101607410600100_btaf494-B7","doi-asserted-by":"crossref","first-page":"giz095","DOI":"10.1093\/gigascience\/giz095","article-title":"Sharing interoperable workflow provenance: a review of best practices and their practical application in cwlprov","volume":"8","author":"Khan","year":"2019","journal-title":"Gigascience"},{"key":"2025101607410600100_btaf494-B8","doi-asserted-by":"crossref","first-page":"793","DOI":"10.1038\/s41592-020-0905-x","article-title":"Cumulus provides cloud-based data analysis for large-scale single-cell and single-nucleus RNA-seq","volume":"17","author":"Li","year":"2020","journal-title":"Nat Methods"},{"key":"2025101607410600100_btaf494-B9","author":"O\u2019Connor","year":"2017"},{"key":"2025101607410600100_btaf494-B10","doi-asserted-by":"crossref","DOI":"10.14806\/ej.24.0.910","article-title":"Genomic big data hitting the storage bottleneck","volume":"24","author":"Papageorgiou","year":"2018","journal-title":"EMBnet J"},{"key":"2025101607410600100_btaf494-B11","doi-asserted-by":"crossref","DOI":"10.1101\/2022.04.05.485833","article-title":"Deploying genomics workflows on high performance computing (HPC) platforms: storage, memory, and compute considerations","author":"Powers","year":"2022"},{"key":"2025101607410600100_btaf494-B12","doi-asserted-by":"crossref","first-page":"100085","DOI":"10.1016\/j.xgen.2021.100085","article-title":"Inverting the model of genomics data sharing with the NHGRI genomic data science analysis, visualization, and informatics lab-space","volume":"2","author":"Schatz","year":"2022","journal-title":"Cell Genom"},{"key":"2025101607410600100_btaf494-B13","doi-asserted-by":"crossref","first-page":"11.10.1","DOI":"10.1002\/0471250953.bi1110s43","article-title":"From FastQ data to high confidence variant calls: the genome analysis toolkit best practices pipeline","volume":"43","author":"Van der Auwera","year":"2013","journal-title":"Curr Protoc Bioinform"},{"key":"2025101607410600100_btaf494-B14","article-title":"S et al. Single-nucleus analysis reveals oxidative stress in Down syndrome basal forebrain neurons at birth. Alzheimer's Dement 2025;21:e70445.","author":"West"},{"key":"2025101607410600100_btaf494-B15","doi-asserted-by":"crossref","first-page":"160018","DOI":"10.1038\/sdata.2016.18","article-title":"The fair guiding principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Sci Data"},{"key":"2025101607410600100_btaf494-B16","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1186\/s13059-021-02552-3","article-title":"Benchmarking UMI-based single-cell RNA-seq preprocessing workflows","volume":"22","author":"You","year":"2021","journal-title":"Genome Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf494\/64232063\/btaf494.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/10\/btaf494\/64232063\/btaf494.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/10\/btaf494\/64232063\/btaf494.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,16]],"date-time":"2025-10-16T11:41:17Z","timestamp":1760614877000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf494\/8250097"}},"subtitle":[],"editor":[{"given":"Christina","family":"Kendziorski","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"editor"}]}],"short-title":[],"issued":{"date-parts":[[2025,9,9]]},"references-count":16,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2025,10,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf494","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,10]]},"published":{"date-parts":[[2025,9,9]]},"article-number":"btaf494"}}