{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,21]],"date-time":"2025-11-21T12:25:08Z","timestamp":1763727908945,"version":"3.41.2"},"reference-count":37,"publisher":"Oxford University Press (OUP)","license":[{"start":{"date-parts":[[2021,3,30]],"date-time":"2021-03-30T00:00:00Z","timestamp":1617062400000},"content-version":"vor","delay-in-days":88,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000038","name":"U.S. Food and Drug Administration","doi-asserted-by":"publisher","award":["75F40119C10136"],"award-info":[{"award-number":["75F40119C10136"]}],"id":[{"id":"10.13039\/100000038","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000038","name":"U.S. Food and Drug Administration","doi-asserted-by":"publisher","award":["HHSF223201510129C"],"award-info":[{"award-number":["HHSF223201510129C"]}],"id":[{"id":"10.13039\/100000038","id-type":"DOI","asserted-by":"publisher"}]},{"name":"U.S. National Institute of Health, National Cancer Institute","award":["HHSN261201400008C"],"award-info":[{"award-number":["HHSN261201400008C"]}]},{"name":"U.S. National Institute of Health, National Cancer Institute","award":["HHSN261201500003I"],"award-info":[{"award-number":["HHSN261201500003I"]}]},{"name":"U.S. National Institute of Health, National Cancer Institute","award":["CA215010"],"award-info":[{"award-number":["CA215010"]}]},{"name":"U.S. National Institute of Health, Glycoscience Common Fund","award":["1U01GM125267 - 01"],"award-info":[{"award-number":["1U01GM125267 - 01"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,3,30]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Developments in high-throughput sequencing (HTS) result in an exponential increase in the amount of data generated by sequencing experiments, an increase in the complexity of bioinformatics analysis reporting and an increase in the types of data generated. These increases in volume, diversity and complexity of the data generated and their analysis expose the necessity of a structured and standardized reporting template. BioCompute Objects (BCOs) provide the requisite support for communication of HTS data analysis that includes support for workflow, as well as data, curation, accessibility and reproducibility of communication. BCOs standardize how researchers report provenance and the established verification and validation protocols used in workflows while also being robust enough to convey content integration or curation in knowledge bases. BCOs that encapsulate tools, platforms, datasets and workflows are FAIR (findable, accessible, interoperable and reusable) compliant. Providing operational workflow and data information facilitates interoperability between platforms and incorporation of future dataset within an HTS analysis for use within industrial, academic and regulatory settings. Cloud-based platforms, including High-performance Integrated Virtual Environment (HIVE), Cancer Genomics Cloud (CGC) and Galaxy, support BCO generation for users. Given the 100K+ userbase between these platforms, BioCompute can be leveraged for workflow documentation. In this paper, we report the availability of platform-dependent and platform-independent BCO tools: HIVE BCO App, CGC BCO App, Galaxy BCO API Extension and BCO Portal. Community engagement was utilized to evaluate tool efficacy. We demonstrate that these tools further advance BCO creation from text editing approaches used in earlier releases of the standard. Moreover, we demonstrate that integrating BCO generation within existing analysis platforms greatly streamlines BCO creation while capturing granular workflow details. We also demonstrate that the BCO tools described in the paper provide an approach to solve the long-standing challenge of standardizing workflow descriptions that are both human and machine readable while accommodating manual and automated curation with evidence tagging.<\/jats:p>\n               <jats:p>Database URL: \u00a0https:\/\/www.biocomputeobject.org\/resources<\/jats:p>","DOI":"10.1093\/database\/baab008","type":"journal-article","created":{"date-parts":[[2021,3,6]],"date-time":"2021-03-06T20:09:46Z","timestamp":1615061386000},"source":"Crossref","is-referenced-by-count":7,"title":["Bioinformatics tools developed to support BioCompute Objects"],"prefix":"10.1093","volume":"2021","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8824-4637","authenticated-orcid":false,"given":"Janisha A","family":"Patel","sequence":"first","affiliation":[{"name":"The Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7621-9717","authenticated-orcid":false,"given":"Dennis A","family":"Dean","sequence":"additional","affiliation":[{"name":"Seven Bridges, Charlestown, MA 02129, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1409-4549","authenticated-orcid":false,"given":"Charles Hadley","family":"King","sequence":"additional","affiliation":[{"name":"The Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA"},{"name":"The McCormick Genomic and Proteomic Center, The George Washington University, Washington, DC 20037, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0250-5673","authenticated-orcid":false,"given":"Nan","family":"Xiao","sequence":"additional","affiliation":[{"name":"Seven Bridges, Charlestown, MA 02129, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Soner","family":"Koc","sequence":"additional","affiliation":[{"name":"Seven Bridges, Charlestown, MA 02129, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4127-0456","authenticated-orcid":false,"given":"Ekaterina","family":"Minina","sequence":"additional","affiliation":[{"name":"CBER-HIVE, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD 20993, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anton","family":"Golikov","sequence":"additional","affiliation":[{"name":"CBER-HIVE, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD 20993, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Phillip","family":"Brooks","sequence":"additional","affiliation":[{"name":"Seven Bridges, Charlestown, MA 02129, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Robel","family":"Kahsay","sequence":"additional","affiliation":[{"name":"The Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4200-7409","authenticated-orcid":false,"given":"Rahi","family":"Navelkar","sequence":"additional","affiliation":[{"name":"The Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5046-3367","authenticated-orcid":false,"given":"Manisha","family":"Ray","sequence":"additional","affiliation":[{"name":"Seven Bridges, Charlestown, MA 02129, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dave","family":"Roberson","sequence":"additional","affiliation":[{"name":"Seven Bridges, Charlestown, MA 02129, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9236-472X","authenticated-orcid":false,"given":"Chris","family":"Armstrong","sequence":"additional","affiliation":[{"name":"The Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8823-9945","authenticated-orcid":false,"given":"Raja","family":"Mazumder","sequence":"additional","affiliation":[{"name":"The Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA"},{"name":"The McCormick Genomic and Proteomic Center, The George Washington University, Washington, DC 20037, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jonathon","family":"Keeney","sequence":"additional","affiliation":[{"name":"The Department of Biochemistry & Molecular Medicine, The George Washington University School of Medicine and Health Sciences, Washington, DC 20037, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2021,3,30]]},"reference":[{"key":"2021070818434340300_R1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/database\/baw022","article-title":"High-performance integrated virtual environment (HIVE): a robust infrastructure for next-generation sequence data analysis","volume":"2016","author":"Simonyan","year":"2016","journal-title":"Database (Oxford)"},{"key":"2021070818434340300_R2","doi-asserted-by":"crossref","first-page":"957","DOI":"10.3390\/genes5040957","article-title":"High-Performance Integrated Virtual Environment (HIVE) Tools and Applications for Big Data Analysis","volume":"5","author":"Simonyan","year":"2014","journal-title":"Genes (Basel)"},{"key":"2021070818434340300_R3","doi-asserted-by":"crossref","first-page":"e3","DOI":"10.1158\/0008-5472.CAN-17-0387","article-title":"The cancer genomics cloud: collaborative, reproducible, and democratized \u2013 a new paradigm in large-scale computational research","volume":"77","author":"Lau","year":"2017","journal-title":"Cancer Res."},{"key":"2021070818434340300_R4","doi-asserted-by":"crossref","first-page":"W395","DOI":"10.1093\/nar\/gkaa434","article-title":"The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2020 update","volume":"48","author":"Jalili","year":"2020","journal-title":"Nucleic Acids Res."},{"volume-title":"Genomic Knowledge Standards","year":"2017","key":"2021070818434340300_R5"},{"key":"2021070818434340300_R6","first-page":"1226","article-title":"Implementing the VMC specification to reduce ambiguity in genomic variant representation","volume":"2019","author":"Watkins","year":"2019","journal-title":"AMIA Annu. Symp. Proc."},{"volume-title":"FHIR Specification FHIR v0.0.82","year":"2014","key":"2021070818434340300_R7"},{"volume-title":"Common Workflow Language, v1.0 Common Workflow Language (CWL) Command Line Tool Description, v1.0","year":"2016","author":"Amstutz","key":"2021070818434340300_R8"},{"year":"2018","author":"Workflow Description Language","key":"2021070818434340300_R9"},{"key":"2021070818434340300_R10","doi-asserted-by":"crossref","first-page":"2520","DOI":"10.1093\/bioinformatics\/bts480","article-title":"Snakemake\u2014a scalable bioinformatics workflow engine","volume":"28","author":"Koster","year":"2012","journal-title":"Bioinformatics"},{"volume-title":"Nextflow - A DSL for Parallel and Scalable Computational Pipelines","year":"2020","key":"2021070818434340300_R11"},{"volume-title":"A lightweight approach to research object data packaging","year":"2019","author":"Carrag\u00e1in","key":"2021070818434340300_R12"},{"key":"2021070818434340300_R13","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-017-1747-0","article-title":"Investigating reproducibility and tracking provenance \u2013 a genomic workflow case study","volume":"18","author":"Kanwal","year":"2017","journal-title":"BMC Bioinform."},{"volume-title":"IEEE 2791\u20132020 - IEEE Standard for Bioinformatics Analyses Generated by High-Throughput Sequencing (HTS) to Facilitate Communication","year":"2020","key":"2021070818434340300_R14"},{"key":"2021070818434340300_R15","doi-asserted-by":"crossref","first-page":"136","DOI":"10.5731\/pdajpst.2016.006734","article-title":"Biocompute Objects-A Step towards Evaluation and Validation of Biomedical Scientific Computations","volume":"71","author":"Simonyan","year":"2017","journal-title":"PDA J. Pharm. Sci. Technol"},{"volume-title":"Repository for Support of the IEEE 2791\u20132020 Standard","year":"2018","author":"BCO_Specification","key":"2021070818434340300_R16"},{"key":"2021070818434340300_R17","first-page":"263","article-title":"Foundations of JSON schema","volume-title":"25th International World Wide Web Conference, WWW 2016","author":"Pezoa","year":"2016"},{"volume-title":"Electronic Submissions; Data Standards; Support for the International Institute of Electrical and Electronics Engineers Bioinformatics Computations and Analyses Standard for Bioinformatic Workflows","year":"2020","author":"Federal Register","key":"2021070818434340300_R18"},{"key":"2021070818434340300_R19","doi-asserted-by":"crossref","DOI":"10.12688\/f1000research.25902.1","article-title":"BCO app: tools for generating BioCompute Objects from next-generation sequencing workflows and computations","volume":"9","author":"Xiao","year":"2020","journal-title":"F1000Research"},{"key":"2021070818434340300_R20","doi-asserted-by":"crossref","first-page":"394","DOI":"10.1002\/wics.1212","article-title":"The comprehensive R archive network","volume":"4","author":"Hornik","year":"2012","journal-title":"Wiley Interdiscip. Rev. Comput. Stat."},{"key":"2021070818434340300_R21","article-title":"Strengthening the BioCompute standard by crowdsourcing on PrecisionFDA","author":"Stephens","year":"2020","journal-title":"bioRxiv"},{"key":"2021070818434340300_R22","doi-asserted-by":"crossref","DOI":"10.1038\/sdata.2016.18","article-title":"Comment: the FAIR guiding principles for scientific data management and stewardship","volume":"3","author":"Wilkinson","year":"2016","journal-title":"Sci. Data"},{"key":"2021070818434340300_R23","doi-asserted-by":"crossref","first-page":"W537","DOI":"10.1093\/nar\/gky379","article-title":"The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update","volume":"46","author":"Afgan","year":"2018","journal-title":"Nucleic Acids Res."},{"key":"2021070818434340300_R24","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1016\/j.cels.2018.03.014","article-title":"Practical computational reproducibility in the life sciences","volume":"6","author":"Gr\u00fcning","year":"2018","journal-title":"Cell Syst."},{"key":"2021070818434340300_R25","doi-asserted-by":"crossref","first-page":"1685","DOI":"10.1093\/bioinformatics\/btt199","article-title":"BioBlend: automating pipeline analyses within Galaxy and CloudMan","volume":"29","author":"Sloggett","year":"2013","journal-title":"Bioinformatics"},{"key":"2021070818434340300_R26","doi-asserted-by":"crossref","first-page":"D1128","DOI":"10.1093\/nar\/gkx907","article-title":"BioMuta and BioXpress: mutation and expression knowledgebases for cancer biomarker discovery","volume":"46","author":"Dingerdissen","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2021070818434340300_R27","doi-asserted-by":"crossref","DOI":"10.14806\/ej.17.1.200","article-title":"Cutadapt removes adapter sequences from high-throughput sequencing reads","volume":"17","author":"Martin","year":"2011","journal-title":"EMBnet.J."},{"article-title":"About AWS","year":"2015","author":"Amazon","key":"2021070818434340300_R28"},{"key":"2021070818434340300_R29","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0099033","article-title":"HIVE-hexagon: high-performance, parallelized sequence alignment for next-generation sequencing data analysis","volume":"9","author":"Santana-Quintero","year":"2014","journal-title":"PLoS One"},{"key":"2021070818434340300_R30","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1016\/j.ygeno.2017.01.002","article-title":"HIVE-heptagon: a sensible variant-calling algorithm with post-alignment quality controls","volume":"109","author":"Simonyan","year":"2017","journal-title":"Genomics"},{"key":"2021070818434340300_R31","doi-asserted-by":"publisher","DOI":"10.1186\/1471-2164-15-918","article-title":"Census-based rapid and accurate metagenome taxonomic profiling","volume":"15","author":"Shamsaddini","year":"2014","journal-title":"BMC Genomics"},{"key":"2021070818434340300_R32","article-title":"Communicating regulatory high throughput sequencing data using BioCompute Objects disclaimer","author":"Hadley","year":"2020","journal-title":"bioRxiv"},{"volume-title":"Bioinformatics - DDL Diagnostic Laboratory","key":"2021070818434340300_R33"},{"key":"2021070818434340300_R34","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pbio.3000099","article-title":"Enabling precision medicine via standard communication of HTS provenance, analysis, and results","volume":"16","author":"Alterovitz","year":"2018","journal-title":"PLoS Biol."},{"volume-title":"Use of public human genetic variant databases to support clinical validity for genetic and genomic-based in vitro diagnostics","author":"FDA","key":"2021070818434340300_R35"},{"key":"2021070818434340300_R36","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1093\/glycob\/cwz080","article-title":"GlyGen: computational and informatics resources for glycoscience","volume":"30","author":"York","year":"2019","journal-title":"Glycobiology"},{"key":"2021070818434340300_R37","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1200\/CCI.19.00117","article-title":"OncoMX: a knowledgebase for exploring cancer biomarkers in the context of related cancer and healthy data","volume":"4","author":"Dingerdissen","year":"2020","journal-title":"JCO Clin. Cancer Inform."}],"container-title":["Database"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/database\/article-pdf\/doi\/10.1093\/database\/baab008\/36815954\/baab008.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/database\/article-pdf\/doi\/10.1093\/database\/baab008\/36815954\/baab008.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,7,9]],"date-time":"2021-07-09T02:16:23Z","timestamp":1625796983000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/database\/article\/doi\/10.1093\/database\/baab008\/6204168"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,1]]},"references-count":37,"URL":"https:\/\/doi.org\/10.1093\/database\/baab008","relation":{},"ISSN":["1758-0463"],"issn-type":[{"type":"electronic","value":"1758-0463"}],"subject":[],"published-other":{"date-parts":[[2021,1,1]]},"published":{"date-parts":[[2021,1,1]]},"article-number":"baab008"}}