{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,8]],"date-time":"2026-02-08T18:47:24Z","timestamp":1770576444802,"version":"3.49.0"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"D1","license":[{"start":{"date-parts":[[2020,11,22]],"date-time":"2020-11-22T00:00:00Z","timestamp":1606003200000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001807","name":"FAPESP","doi-asserted-by":"publisher","award":["2019\/03396-9"],"award-info":[{"award-number":["2019\/03396-9"]}],"id":[{"id":"10.13039\/501100001807","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001807","name":"FAPESP","doi-asserted-by":"publisher","award":["2013\/07375-0"],"award-info":[{"award-number":["2013\/07375-0"]}],"id":[{"id":"10.13039\/501100001807","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001656","name":"Helmholtz Association","doi-asserted-by":"publisher","award":["VH-NG-1248"],"award-info":[{"award-number":["VH-NG-1248"]}],"id":[{"id":"10.13039\/501100001656","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,1,8]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Metagenomics became a standard strategy to comprehend the functional potential of microbial communities, including the human microbiome. Currently, the number of metagenomes in public repositories is increasing exponentially. The Sequence Read Archive (SRA) and the MG-RAST are the two main repositories for metagenomic data. These databases allow scientists to reanalyze samples and explore new hypotheses. However, mining samples from them can be a limiting factor, since the metadata available in these repositories is often misannotated, misleading, and decentralized, creating an overly complex environment for sample reanalysis. The main goal of the HumanMetagenomeDB is to simplify the identification and use of public human metagenomes of interest. HumanMetagenomeDB version 1.0 contains metadata of 69 822 metagenomes. We standardized 203 attributes, based on standardized ontologies, describing host characteristics (e.g.\u00a0sex, age and body mass index), diagnosis information (e.g. cancer, Crohn's disease and Parkinson), location (e.g. country, longitude and latitude), sampling site (e.g. gut, lung and skin) and sequencing attributes (e.g. sequencing platform, average length and sequence quality). Further, HumanMetagenomeDB version 1.0 metagenomes encompass 58 countries, 9 main sample sites (i.e. body parts), 58 diagnoses and multiple ages, ranging from just born to 91 years old. The HumanMetagenomeDB is publicly available at https:\/\/webapp.ufz.de\/hmgdb\/.<\/jats:p>","DOI":"10.1093\/nar\/gkaa1031","type":"journal-article","created":{"date-parts":[[2020,10,21]],"date-time":"2020-10-21T11:22:39Z","timestamp":1603279359000},"page":"D743-D750","source":"Crossref","is-referenced-by-count":55,"title":["HumanMetagenomeDB: a public repository of curated and standardized metadata for human metagenomes"],"prefix":"10.1093","volume":"49","author":[{"given":"Jonas Coelho","family":"Kasmanas","sequence":"first","affiliation":[{"name":"Institute of Mathematics and Computer Sciences, University of S\u00e3o Paulo, S\u00e3o Carlos, Brazil"},{"name":"Department of Environmental Microbiology, Helmholtz Centre for Environmental Research \u2013 UFZ GmbH, Leipzig, Saxony\u00a004318, Germany"},{"name":"Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig, Saxony\u00a004107, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander","family":"Bartholom\u00e4us","sequence":"additional","affiliation":[{"name":"GFZ German Research Centre for Geosciences, Section 3.7 Geomicrobiology, Telegrafenberg, 14473 Potsdam, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Felipe Borim","family":"Corr\u00eaa","sequence":"additional","affiliation":[{"name":"Department of Environmental Microbiology, Helmholtz Centre for Environmental Research \u2013 UFZ GmbH, Leipzig, Saxony\u00a004318, Germany"},{"name":"Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig, Saxony\u00a004107, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tamara","family":"Tal","sequence":"additional","affiliation":[{"name":"Department of Bioanalytical Ecotoxicology, Helmholtz Centre for Environmental Research \u2013 UFZ GmbH, Leipzig, Saxony\u00a004318, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nico","family":"Jehmlich","sequence":"additional","affiliation":[{"name":"Department of Molecular Systems Biology, Helmholtz Centre for Environmental Research \u2013 UFZ GmbH, Leipzig, Saxony\u00a004318, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gunda","family":"Herberth","sequence":"additional","affiliation":[{"name":"Department of Environmental Immunology, Helmholtz Centre for Environmental Research \u2013 UFZ GmbH, Leipzig, Saxony\u00a004318, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Martin","family":"von\u00a0Bergen","sequence":"additional","affiliation":[{"name":"Department of Molecular Systems Biology, Helmholtz Centre for Environmental Research \u2013 UFZ GmbH, Leipzig, Saxony\u00a004318, Germany"},{"name":"Institute of Biochemistry, Faculty of Life Sciences, University of Leipzig, Leipzig, Saxony\u00a004107, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5016-5191","authenticated-orcid":false,"given":"Peter F","family":"Stadler","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Interdisciplinary Center of Bioinformatics, University of Leipzig, Leipzig, Saxony\u00a004107, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andr\u00e9\u00a0Carlos\u00a0Ponce\u00a0de\u00a0Leon\u00a0Ferreira\u00a0de","family":"Carvalho","sequence":"additional","affiliation":[{"name":"Institute of Mathematics and Computer Sciences, University of S\u00e3o Paulo, S\u00e3o Carlos, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6972-6692","authenticated-orcid":false,"given":"Ulisses","family":"Nunes\u00a0da\u00a0Rocha","sequence":"additional","affiliation":[{"name":"Department of Environmental Microbiology, Helmholtz Centre for Environmental Research \u2013 UFZ GmbH, Leipzig, Saxony\u00a004318, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2020,11,22]]},"reference":[{"key":"2021010313115319800_B1","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1128\/MMBR.68.4.669-685.2004","article-title":"Metagenomics: application of genomics to uncultured microorganisms","volume":"68","author":"Handelsman","year":"2004","journal-title":"Microbiol. Mol. Biol. Rev."},{"key":"2021010313115319800_B2","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1016\/j.copbio.2011.11.013","article-title":"Next generation sequencing and bioinformatic bottlenecks: the current state of metagenomic data analysis","volume":"23","author":"Scholz","year":"2012","journal-title":"Curr. Opin. Biotechnol."},{"key":"2021010313115319800_B3","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1016\/B978-0-12-809657-4.99576-0","article-title":"Bioinformatics principles for deciphering cardiovascular diseases","volume-title":"Encyclopedia of Cardiovascular Research and Medicine","author":"Shu","year":"2018"},{"key":"2021010313115319800_B4","doi-asserted-by":"crossref","first-page":"D54","DOI":"10.1093\/nar\/gkr854","article-title":"The sequence read archive: explosive growth of sequencing data","volume":"40","author":"Kodama","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B5","doi-asserted-by":"crossref","first-page":"D48","DOI":"10.1093\/nar\/gkx1097","article-title":"The international nucleotide sequence database collaboration","volume":"46","author":"Karsch-Mizrachi","year":"2018","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B6","doi-asserted-by":"crossref","first-page":"D84","DOI":"10.1093\/nar\/gky1078","article-title":"The European Nucleotide Archive in 2018","volume":"47","author":"Harrison","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B7","doi-asserted-by":"crossref","first-page":"D51","DOI":"10.1093\/nar\/gkv1105","article-title":"DNA data bank of Japan (DDBJ) progress report","volume":"44","author":"Mashima","year":"2016","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B8","doi-asserted-by":"crossref","first-page":"D590","DOI":"10.1093\/nar\/gkv1322","article-title":"The MG-RAST metagenomics database and portal in 2015","volume":"44","author":"Wilke","year":"2016","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B9","doi-asserted-by":"crossref","first-page":"D726","DOI":"10.1093\/nar\/gkx967","article-title":"EBI Metagenomics in 2017: enriching the analysis of microbial communities, from sequence reads to assemblies","volume":"46","author":"Mitchell","year":"2018","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B10","doi-asserted-by":"crossref","first-page":"D637","DOI":"10.1093\/nar\/gky1008","article-title":"GcMeta: A Global Catalogue of Metagenomics platform to support the archiving, standardization and analysis of microbiome data","volume":"47","author":"Shi","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B11","doi-asserted-by":"crossref","first-page":"e02099-18","DOI":"10.1128\/mBio.02099-18","article-title":"Identifying and predicting novelty in microbiome studies","volume":"9","author":"Su","year":"2018","journal-title":"MBio"},{"key":"2021010313115319800_B12","doi-asserted-by":"crossref","first-page":"796","DOI":"10.1038\/s41592-018-0141-9","article-title":"Qiita: rapid, web-enabled microbiome meta-analysis","volume":"15","author":"Gonzalez","year":"2018","journal-title":"Nat. Methods"},{"key":"2021010313115319800_B13","doi-asserted-by":"crossref","first-page":"667","DOI":"10.1038\/s41591-019-0405-7","article-title":"Metagenomic analysis of colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and a link with choline degradation","volume":"25","author":"Thomas","year":"2019","journal-title":"Nat. Med."},{"key":"2021010313115319800_B14","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1038\/s41591-019-0406-6","article-title":"Meta-analysis of fecal metagenomes reveals global microbial signatures that are specific for colorectal cancer","volume":"25","author":"Wirbel","year":"2019","journal-title":"Nat. Med."},{"key":"2021010313115319800_B15","doi-asserted-by":"crossref","first-page":"2389","DOI":"10.1093\/bioinformatics\/btx184","article-title":"PARTIE: a partition engine to separate metagenomic and amplicon projects in the Sequence Read Archive","volume":"33","author":"Torres","year":"2017","journal-title":"Bioinformatics"},{"key":"2021010313115319800_B16","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1038\/nbt1360","article-title":"The minimum information about a genome sequence (MIGS) specification","volume":"26","author":"Field","year":"2008","journal-title":"Nat. Biotechnol."},{"key":"2021010313115319800_B17","doi-asserted-by":"crossref","first-page":"D57","DOI":"10.1093\/nar\/gkr1163","article-title":"BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata","volume":"40","author":"Barrett","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B18","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1038\/nbt.1823","article-title":"Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications","volume":"29","author":"Yilmaz","year":"2011","journal-title":"Nat. Biotechnol."},{"key":"2021010313115319800_B19","doi-asserted-by":"crossref","first-page":"D649","DOI":"10.1093\/nar\/gky977","article-title":"Genomes OnLine database (GOLD) v.7: Updates and new features","volume":"47","author":"Mukherjee","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B20","first-page":"D626","article-title":"TerrestrialMetagenomeDB: a public repository of curated and standardized metadata for terrestrial metagenomes","volume":"48","author":"Corr\u00eaa","year":"2020","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B21","doi-asserted-by":"crossref","first-page":"2317","DOI":"10.1101\/gr.096651.109","article-title":"The NIH human microbiome project","volume":"19","author":"Peterson","year":"2009","journal-title":"Genome Res."},{"key":"2021010313115319800_B22","doi-asserted-by":"crossref","first-page":"2914","DOI":"10.1093\/bioinformatics\/btx334","article-title":"MetaSRA: normalized human sample-specific metadata for the Sequence Read Archive","volume":"33","author":"Bernstein","year":"2017","journal-title":"Bioinformatics"},{"key":"2021010313115319800_B23","doi-asserted-by":"crossref","first-page":"1023","DOI":"10.1038\/nmeth.4468","article-title":"Accessible, curated metagenomic data through ExperimentHub","volume":"14","author":"Pasolli","year":"2017","journal-title":"Nat. Methods"},{"key":"2021010313115319800_B24","doi-asserted-by":"crossref","first-page":"R80","DOI":"10.1186\/gb-2004-5-10-r80","article-title":"Bioconductor: open software development for computational biology and bioinformatics","volume":"5","author":"Gentleman","year":"2004","journal-title":"Genome Biol."},{"key":"2021010313115319800_B25","doi-asserted-by":"crossref","first-page":"D1172","DOI":"10.1093\/nar\/gky1061","article-title":"Biosamples database: an updated sample metadata hub","volume":"47","author":"Courtot","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"2021010313115319800_B26","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1186\/1471-2105-14-19","article-title":"SRAdb: query and use public next-generation sequencing data from within R","volume":"14","author":"Zhu","year":"2013","journal-title":"BMC Bioinformatics"}],"container-title":["Nucleic Acids Research"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/nar\/article-pdf\/49\/D1\/D743\/35363836\/gkaa1031.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/academic.oup.com\/nar\/article-pdf\/49\/D1\/D743\/35363836\/gkaa1031.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,1,3]],"date-time":"2021-01-03T18:14:10Z","timestamp":1609697650000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/nar\/article\/49\/D1\/D743\/5998395"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,11,22]]},"references-count":26,"journal-issue":{"issue":"D1","published-online":{"date-parts":[[2020,11,22]]},"published-print":{"date-parts":[[2021,1,8]]}},"URL":"https:\/\/doi.org\/10.1093\/nar\/gkaa1031","relation":{},"ISSN":["0305-1048","1362-4962"],"issn-type":[{"value":"0305-1048","type":"print"},{"value":"1362-4962","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,1,8]]},"published":{"date-parts":[[2020,11,22]]}}}