{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,19]],"date-time":"2026-01-19T11:07:36Z","timestamp":1768820856755,"version":"3.49.0"},"reference-count":22,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,9,3]],"date-time":"2020-09-03T00:00:00Z","timestamp":1599091200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,9,3]],"date-time":"2020-09-03T00:00:00Z","timestamp":1599091200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100000060","name":"National Institute of Allergy and Infectious Diseases","doi-asserted-by":"publisher","award":["BCBB Support Services Contract HHSN316201300006W\/HHSN27200002 to MSC, Inc"],"award-info":[{"award-number":["BCBB Support Services Contract HHSN316201300006W\/HHSN27200002 to MSC, Inc"]}],"id":[{"id":"10.13039\/100000060","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n<jats:title>Background<\/jats:title>\n<jats:p>The improvements in genomics methods coupled with readily accessible high-throughput sequencing have contributed to our understanding of microbial species, metagenomes, infectious diseases and more. To maximize the impact of these genomics studies, it is important that data from biological samples will become publicly available with standardized metadata. The availability of data at public archives provides the hope that greater insights could be obtained through integration with multi-omics data, reproducibility of published studies, or meta-analyses of large diverse datasets. These datasets should include a description of the host, organism, environmental source of the specimen, spatial-temporal information and other relevant metadata, but unfortunately these attributes are often missing and when present, they show inconsistencies in the use of metadata standards and ontologies.<\/jats:p>\n<\/jats:sec><jats:sec>\n<jats:title>Results<\/jats:title>\n<jats:p>METAGENOTE (<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/metagenote.niaid.nih.gov\">https:\/\/metagenote.niaid.nih.gov<\/jats:ext-link>) is a web portal that greatly facilitates the annotation of samples from genomic studies and streamlines the submission process of sequencing files and metadata to the Sequence Read Archive (SRA) (Leinonen R, et al, Nucleic Acids Res, 39:D19-21, 2011) for public access. This platform offers a wide selection of packages for different types of biological and experimental studies with a special emphasis on the standardization of metadata reporting. These packages follow the guidelines from the MIxS standards developed by the Genomics Standard Consortium (GSC) and adopted by the three partners of the International Nucleotides Sequencing Database Collaboration (INSDC) (Cochrane G, et al, Nucleic Acids Res, 44:D48-50, 2016) - National Center for Biotechnology Information (NCBI), European Bioinformatics Institute (EBI) and the DNA Data Bank of Japan (DDBJ). METAGENOTE then compiles, validates and manages the submission through an easy-to-use web interface minimizing submission errors and eliminating the need for submitting sequencing files via a separate file transfer mechanism.<\/jats:p>\n<\/jats:sec><jats:sec>\n<jats:title>Conclusions<\/jats:title>\n<jats:p>METAGENOTE is a public resource that focuses on simplifying the annotation and submission process of data with its corresponding metadata. Users of METAGENOTE will benefit from the easy to use annotation interface but most importantly will be encouraged to publish metadata following standards and ontologies that make the public data available for reuse.<\/jats:p>\n<\/jats:sec>","DOI":"10.1186\/s12859-020-03694-0","type":"journal-article","created":{"date-parts":[[2020,9,3]],"date-time":"2020-09-03T11:04:24Z","timestamp":1599131064000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":26,"title":["\u201cMETAGENOTE: a simplified web platform for metadata annotation of genomic samples and streamlined submission to NCBI\u2019s sequence read archive\u201d"],"prefix":"10.1186","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8520-5114","authenticated-orcid":false,"given":"Mariam","family":"Qui\u00f1ones","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David T.","family":"Liou","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Conrad","family":"Shyu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wongyu","family":"Kim","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ivan","family":"Vujkovic-Cvijin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yasmine","family":"Belkaid","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Darrell E.","family":"Hurt","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,9,3]]},"reference":[{"issue":"Database issue","key":"3694_CR1","doi-asserted-by":"publisher","first-page":"D19","DOI":"10.1093\/nar\/gkq1019","volume":"39","author":"R Leinonen","year":"2011","unstructured":"Leinonen R, Sugawara H, Shumway M. International nucleotide sequence database C: the sequence read archive. Nucleic Acids Res. 2011;39(Database issue):D19\u201321.","journal-title":"Nucleic Acids Res"},{"key":"3694_CR2","unstructured":"SRA Database Growth [https:\/\/www.ncbi.nlm.nih.gov\/sra\/docs\/sragrowth].."},{"key":"3694_CR3","unstructured":"Genomics Standards Consortium (GSC) [https:\/\/gensc.org\/]."},{"issue":"5","key":"3694_CR4","doi-asserted-by":"publisher","first-page":"415","DOI":"10.1038\/nbt.1823","volume":"29","author":"P Yilmaz","year":"2011","unstructured":"Yilmaz P, Kottmann R, Field D, Knight R, Cole JR, Amaral-Zettler L, Gilbert JA, Karsch-Mizrachi I, Johnston A, Cochrane G, et al. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications. Nat Biotechnol. 2011;29(5):415\u201320.","journal-title":"Nat Biotechnol"},{"issue":"8","key":"3694_CR5","doi-asserted-by":"publisher","first-page":"1112","DOI":"10.1093\/bioinformatics\/btq099","volume":"26","author":"J Malone","year":"2010","unstructured":"Malone J, Holloway E, Adamusiak T, Kapushesky M, Zheng J, Kolesnikov N, Zhukova A, Brazma A, Parkinson H. Modeling sample variables with an experimental factor ontology. Bioinformatics. 2010;26(8):1112\u20138.","journal-title":"Bioinformatics"},{"issue":"2","key":"3694_CR6","doi-asserted-by":"publisher","first-page":"119","DOI":"10.1016\/j.artmed.2012.11.002","volume":"57","author":"C Golbreich","year":"2013","unstructured":"Golbreich C, Grosjean J, Darmoni SJ. The foundational model of anatomy in OWL 2 and its use. Artif Intell Med. 2013;57(2):119\u201332.","journal-title":"Artif Intell Med"},{"issue":"1","key":"3694_CR7","doi-asserted-by":"publisher","first-page":"43","DOI":"10.1186\/2041-1480-4-43","volume":"4","author":"PL Buttigieg","year":"2013","unstructured":"Buttigieg PL, Morrison N, Smith B, Mungall CJ, Lewis SE, Consortium E. The environment ontology: contextualising biological and biomedical entities. J Biomed Semantics. 2013;4(1):43.","journal-title":"J Biomed Semantics"},{"key":"3694_CR8","doi-asserted-by":"publisher","first-page":"273","DOI":"10.1007\/978-1-61779-364-6_19","volume":"803","author":"P de Matos","year":"2012","unstructured":"de Matos P, Adams N, Hastings J, Moreno P, Steinbeck C. A database for chemical proteomics: ChEBI. Methods Mol Biol. 2012;803:273\u201396.","journal-title":"Methods Mol Biol"},{"issue":"D1","key":"3694_CR9","doi-asserted-by":"publisher","first-page":"D48","DOI":"10.1093\/nar\/gkv1323","volume":"44","author":"G Cochrane","year":"2016","unstructured":"Cochrane G, Karsch-Mizrachi I, Takagi T. International nucleotide sequence database C: the international nucleotide sequence database collaboration. Nucleic Acids Res. 2016;44(D1):D48\u201350.","journal-title":"Nucleic Acids Res"},{"issue":"D1","key":"3694_CR10","first-page":"D570","volume":"48","author":"AL Mitchell","year":"2020","unstructured":"Mitchell AL, Almeida A, Beracochea M, Boland M, Burgin J, Cochrane G, Crusoe MR, Kale V, Potter SC, Richardson LJ, et al. MGnify: the microbiome analysis resource in 2020. Nucleic Acids Res. 2020;48(D1):D570\u20138.","journal-title":"Nucleic Acids Res"},{"key":"3694_CR11","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1007\/978-1-4939-3369-3_13","volume":"1399","author":"KP Keegan","year":"2016","unstructured":"Keegan KP, Glass EM, Meyer F. MG-RAST, a Metagenomics Service for Analysis of microbial community structure and function. Methods Mol Biol. 2016;1399:207\u201333.","journal-title":"Methods Mol Biol"},{"issue":"10","key":"3694_CR12","doi-asserted-by":"publisher","first-page":"796","DOI":"10.1038\/s41592-018-0141-9","volume":"15","author":"A Gonzalez","year":"2018","unstructured":"Gonzalez A, Navas-Molina JA, Kosciolek T, McDonald D, V\u00e1zquez-Baeza Y, Ackermann G, Dereus J, Janssen S, Swafford AD, Orchanian SB, et al. Qiita: rapid, web-enabled microbiome meta-analysis. Nat Methods. 2018;15(10):796\u20138.","journal-title":"Nat Methods"},{"issue":"D1","key":"3694_CR13","doi-asserted-by":"publisher","first-page":"D54","DOI":"10.1093\/nar\/gkr854","volume":"40","author":"Y Kodama","year":"2012","unstructured":"Kodama Y, Shumway M, Leinonen R. The sequence read archive: explosive growth of sequencing data. Nucleic Acids Res. 2012;40(D1):D54\u20136.","journal-title":"Nucleic Acids Res"},{"issue":"8","key":"3694_CR14","doi-asserted-by":"publisher","first-page":"e2002925","DOI":"10.1371\/journal.pbio.2002925","volume":"15","author":"J Deck","year":"2017","unstructured":"Deck J, Gaither MR, Ewing R, Bird CE, Davies N, Meyer C, Riginos C, Toonen RJ, Crandall ED. The genomic observatories Metadatabase (GeOMe): a new repository for field and sampling event metadata associated with genetic samples. PLoS Biol. 2017;15(8):e2002925.","journal-title":"PLoS Biol"},{"issue":"1","key":"3694_CR15","doi-asserted-by":"publisher","first-page":"e29715","DOI":"10.1371\/journal.pone.0029715","volume":"7","author":"J Wieczorek","year":"2012","unstructured":"Wieczorek J, Bloom D, Guralnick R, Blum S, Doring M, Giovanni R, Robertson T, Vieglais D. Darwin Core: an evolving community-developed biodiversity data standard. PLoS One. 2012;7(1):e29715.","journal-title":"PLoS One"},{"key":"3694_CR16","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1186\/s13326-016-0055-3","volume":"7","author":"S Jupp","year":"2016","unstructured":"Jupp S, Burdett T, Welter D, Sarntivijai S, Parkinson H, Malone J. Webulous and the Webulous Google add-on--a web service and application for ontology building from templates. J Biomed Semantics. 2016;7:17.","journal-title":"J Biomed Semantics"},{"issue":"4","key":"3694_CR17","doi-asserted-by":"publisher","first-page":"525","DOI":"10.1093\/bioinformatics\/bts718","volume":"29","author":"E Maguire","year":"2013","unstructured":"Maguire E, Gonzalez-Beltran A, Whetzel PL, Sansone SA, Rocca-Serra P. OntoMaton: a bioportal powered ontology widget for Google spreadsheets. Bioinformatics. 2013;29(4):525\u20137.","journal-title":"Bioinformatics"},{"issue":"1","key":"3694_CR18","doi-asserted-by":"publisher","first-page":"268","DOI":"10.1186\/s12859-018-2247-6","volume":"19","author":"SAC Bukhari","year":"2018","unstructured":"Bukhari SAC, Mart\u00ednez-Romero M, O\u2019 connor MJ, Egyedi AL, Willrett D, Graybeal J, Musen MA, Cheung K-H, Kleinstein SH. CEDAR OnDemand: a browser extension to generate ontology-based scientific metadata. BMC Bioinformatics. 2018;19(1):268.","journal-title":"BMC Bioinformatics"},{"key":"3694_CR19","unstructured":"Submission Portal. Preview BioSample Types and Attributes [https:\/\/submit.ncbi.nlm.nih.gov\/biosample\/template\/]."},{"issue":"8","key":"3694_CR20","doi-asserted-by":"publisher","first-page":"1411","DOI":"10.1093\/bioinformatics\/btx617","volume":"34","author":"N Weber","year":"2018","unstructured":"Weber N, Liou D, Dommer J, MacMenamin P, Quinones M, Misner I, Oler AJ, Wan J, Kim L, Coakley McCarthy M, et al. Nephele: a cloud platform for simplified, standardized and reproducible microbiome data analysis. Bioinformatics. 2018;34(8):1411\u20133.","journal-title":"Bioinformatics"},{"issue":"1","key":"3694_CR21","doi-asserted-by":"publisher","first-page":"160018","DOI":"10.1038\/sdata.2016.18","volume":"3","author":"MD Wilkinson","year":"2016","unstructured":"Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, Blomberg N, Boiten J-W, Da Silva Santos LB, Bourne PE, et al. The FAIR guiding principles for scientific data management and stewardship. Scientific Data. 2016;3(1):160018.","journal-title":"Scientific Data"},{"key":"3694_CR22","first-page":"864","volume":"2016","author":"DT Marc","year":"2016","unstructured":"Marc DT, Beattie J, Herasevich V, Gatewood L, Zhang R. Assessing metadata quality of a federally sponsored health data repository. AMIA Annu Symp Proc. 2016;2016:864\u201373.","journal-title":"AMIA Annu Symp Proc"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03694-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-020-03694-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03694-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,3]],"date-time":"2021-09-03T08:20:21Z","timestamp":1630657221000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-020-03694-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,3]]},"references-count":22,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["3694"],"URL":"https:\/\/doi.org\/10.1186\/s12859-020-03694-0","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,9,3]]},"assertion":[{"value":"21 February 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 July 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 September 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"378"}}