{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,14]],"date-time":"2026-03-14T06:39:49Z","timestamp":1773470389176,"version":"3.50.1"},"reference-count":75,"publisher":"Public Library of Science (PLoS)","issue":"12","license":[{"start":{"date-parts":[[2023,12,19]],"date-time":"2023-12-19T00:00:00Z","timestamp":1702944000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Science Foundation Biology Directorate","award":["MCB 2117477, MCB 1921439"],"award-info":[{"award-number":["MCB 2117477, MCB 1921439"]}]},{"name":"NIH","award":["1R01GM151538"],"award-info":[{"award-number":["1R01GM151538"]}]},{"name":"National Science Foundation through the Center for Living Systems","award":["2317138"],"award-info":[{"award-number":["2317138"]}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>The metabolic activity of microbial communities is central to their role in biogeochemical cycles, human health, and biotechnology. Despite the abundance of sequencing data characterizing these consortia, it remains a serious challenge to predict microbial metabolic traits from sequencing data alone. Here we culture 96 bacterial isolates individually and assay their ability to grow on 10 distinct compounds as a sole carbon source. Using these data as well as two existing datasets, we show that statistical approaches can accurately predict bacterial carbon utilization traits from genomes. First, we show that classifiers trained on gene content can accurately predict bacterial carbon utilization phenotypes by encoding phylogenetic information. These models substantially outperform predictions made by constraint-based metabolic models automatically constructed from genomes. This result solidifies our current knowledge about the strong connection between phylogeny and metabolic traits. However, phylogeny-based predictions fail to predict traits for taxa that are phylogenetically distant from any strains in the training set. To overcome this we train improved models on gene presence\/absence to predict carbon utilization traits from gene content. We show that models that predict carbon utilization traits from gene presence\/absence can generalize to taxa that are phylogenetically distant from the training set either by exploiting biochemical information for feature selection or by having sufficiently large datasets. In the latter case, we provide evidence that a statistical approach can identify putatively mechanistic genes involved in metabolic traits. Our study demonstrates the potential power for predicting microbial phenotypes from genotypes using statistical approaches.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1011705","type":"journal-article","created":{"date-parts":[[2023,12,19]],"date-time":"2023-12-19T13:23:15Z","timestamp":1702992195000},"page":"e1011705","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":15,"title":["Statistical prediction of microbial metabolic traits from genomes"],"prefix":"10.1371","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0884-8028","authenticated-orcid":true,"given":"Zeqian","family":"Li","sequence":"first","affiliation":[]},{"given":"Ahmed","family":"Selim","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4130-6845","authenticated-orcid":true,"given":"Seppe","family":"Kuehn","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2023,12,19]]},"reference":[{"issue":"5879","key":"pcbi.1011705.ref001","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.1126\/science.1153213","article-title":"The microbial engines that drive Earth\u2019s biogeochemical cycles","volume":"320","author":"PG Falkowski","year":"2008","journal-title":"Science"},{"issue":"7505","key":"pcbi.1011705.ref002","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1038\/nature13421","article-title":"Persistent gut microbiota immaturity in malnourished Bangladeshi children","volume":"510","author":"S Subramanian","year":"2014","journal-title":"Nature"},{"issue":"3","key":"pcbi.1011705.ref003","doi-asserted-by":"crossref","first-page":"490","DOI":"10.1053\/j.gastro.2011.11.037","article-title":"Fecal transplantation, through colonoscopy, is effective therapy for recurrent Clostridium difficile infection","volume":"142","author":"E Mattila","year":"2012","journal-title":"Gastroenterology"},{"key":"pcbi.1011705.ref004","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1016\/j.watres.2014.06.042","article-title":"Microbial ecology of denitrification in biological wastewater treatment","volume":"64","author":"H Lu","year":"2014","journal-title":"Water research"},{"issue":"7415","key":"pcbi.1011705.ref005","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1038\/nature11552","article-title":"Functional interactions between the gut microbiota and host metabolism","volume":"489","author":"V Tremaroli","year":"2012","journal-title":"Nature"},{"issue":"6","key":"pcbi.1011705.ref006","doi-asserted-by":"crossref","first-page":"896","DOI":"10.1016\/j.soilbio.2010.02.003","article-title":"Shifts in bacterial community structure associated with inputs of low molecular weight carbon compounds to soil","volume":"42","author":"KG Eilers","year":"2010","journal-title":"Soil Biology and Biochemistry"},{"issue":"2","key":"pcbi.1011705.ref007","doi-asserted-by":"crossref","first-page":"175","DOI":"10.3354\/ame028175","article-title":"Microbial ecology of organic aggregates in aquatic ecosystems","volume":"28","author":"M Simon","year":"2002","journal-title":"Aquatic microbial ecology"},{"issue":"7402","key":"pcbi.1011705.ref008","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1038\/nature11234","article-title":"Structure, function and diversity of the healthy human microbiome","volume":"486","author":"THMP Consortium","year":"2012","journal-title":"Nature"},{"issue":"7681","key":"pcbi.1011705.ref009","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1038\/nature24621","article-title":"A communal catalogue reveals Earth\u2019s multiscale microbial diversity","volume":"551","author":"LR Thompson","year":"2017","journal-title":"Nature"},{"issue":"6237","key":"pcbi.1011705.ref010","doi-asserted-by":"crossref","first-page":"1261359","DOI":"10.1126\/science.1261359","article-title":"Structure and function of the global ocean microbiome","volume":"348","author":"S Sunagawa","year":"2015","journal-title":"Science"},{"issue":"22","key":"pcbi.1011705.ref011","doi-asserted-by":"crossref","first-page":"E2329","DOI":"10.1073\/pnas.1319284111","article-title":"Relating the metatranscriptome and metagenome of the human gut","volume":"111","author":"EA Franzosa","year":"2014","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"1","key":"pcbi.1011705.ref012","first-page":"1","article-title":"An integrated meta-omics approach reveals substrates involved in synergistic interactions in a bisphenol A (BPA)-degrading microbial community","volume":"7","author":"K Yu","year":"2019","journal-title":"Microbiome"},{"key":"pcbi.1011705.ref013","author":"SL Salzberg","year":"2019","journal-title":"Next-generation genome annotation: we still struggle to get it right"},{"issue":"3","key":"pcbi.1011705.ref014","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1038\/nbt.1614","article-title":"What is flux balance analysis?","volume":"28","author":"JD Orth","year":"2010","journal-title":"Nature biotechnology"},{"issue":"2","key":"pcbi.1011705.ref015","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1038\/84379","article-title":"In silico predictions of Escherichia coli metabolic capabilities are consistent with experimental data","volume":"19","author":"JS Edwards","year":"2001","journal-title":"Nature biotechnology"},{"issue":"1","key":"pcbi.1011705.ref016","doi-asserted-by":"crossref","first-page":"801","DOI":"10.1038\/s41467-022-28467-6","article-title":"Whole-cell modeling in yeast predicts compartment-specific proteome constraints that drive metabolic strategies","volume":"13","author":"IE Elsemman","year":"2022","journal-title":"Nature communications"},{"issue":"15","key":"pcbi.1011705.ref017","doi-asserted-by":"crossref","first-page":"7542","DOI":"10.1093\/nar\/gky537","article-title":"Fast automated reconstruction of genome-scale metabolic models for microbial species and communities","volume":"46","author":"D Machado","year":"2018","journal-title":"Nucleic acids research"},{"issue":"2","key":"pcbi.1011705.ref018","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1038\/nrg3643","article-title":"Constraint-based models predict metabolic and associated cellular functions","volume":"15","author":"A Bordbar","year":"2014","journal-title":"Nature Reviews Genetics"},{"issue":"3","key":"pcbi.1011705.ref019","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1016\/j.cell.2021.12.036","article-title":"Genomic structure predicts metabolite dynamics in microbial communities","volume":"185","author":"K Gowda","year":"2022","journal-title":"Cell"},{"issue":"4","key":"pcbi.1011705.ref020","doi-asserted-by":"crossref","first-page":"830","DOI":"10.1038\/ismej.2012.160","article-title":"Phylogenetic conservatism of functional traits in microorganisms","volume":"7","author":"AC Martiny","year":"2013","journal-title":"The ISME journal"},{"issue":"9","key":"pcbi.1011705.ref021","doi-asserted-by":"crossref","first-page":"814","DOI":"10.1038\/nbt.2676","article-title":"Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences","volume":"31","author":"MG Langille","year":"2013","journal-title":"Nature biotechnology"},{"issue":"8","key":"pcbi.1011705.ref022","doi-asserted-by":"crossref","first-page":"1970","DOI":"10.1128\/jcm.32.8.1970-1975.1994","article-title":"Evaluation of Biolog system for identification of some gram-negative bacteria of clinical importance","volume":"32","author":"\u00e1 Holmes","year":"1994","journal-title":"Journal of Clinical Microbiology"},{"issue":"5","key":"pcbi.1011705.ref023","article-title":"The Biolog plates technique as a tool in ecological studies of microbial communities","volume":"15","author":"A Stefanowicz","year":"2006","journal-title":"Polish Journal of Environmental Studies"},{"issue":"9","key":"pcbi.1011705.ref024","doi-asserted-by":"crossref","first-page":"2183","DOI":"10.1038\/s41396-019-0427-7","article-title":"Resource heterogeneity structures aquatic bacterial communities","volume":"13","author":"ME Muscarella","year":"2019","journal-title":"The ISME journal"},{"issue":"6","key":"pcbi.1011705.ref025","doi-asserted-by":"crossref","DOI":"10.1016\/j.isci.2023.106879","article-title":"Algae drive convergent bacterial community assembly at low dilution frequency","volume":"26","author":"KH Prabhakara","year":"2023","journal-title":"IScience"},{"key":"pcbi.1011705.ref026","first-page":"1","article-title":"Genome content predicts the carbon catabolic preferences of heterotrophic bacteria","author":"M Gralka","year":"2023","journal-title":"Nature Microbiology"},{"issue":"1","key":"pcbi.1011705.ref027","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/0304-4165(71)90053-5","article-title":"The \u03b1-galactosidase from Escherichia coli K12","volume":"230","author":"C Burstein","year":"1971","journal-title":"Biochimica et Biophysica Acta (BBA)\u2014General Subjects"},{"issue":"1","key":"pcbi.1011705.ref028","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1111\/j.1432-1033.1976.tb10637.x","article-title":"Raffinose Metabolism in Escherichia coli K12","volume":"67","author":"K SCHMID","year":"1976","journal-title":"European Journal of Biochemistry"},{"issue":"1","key":"pcbi.1011705.ref029","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: kyoto encyclopedia of genes and genomes","volume":"28","author":"M Kanehisa","year":"2000","journal-title":"Nucleic acids research"},{"issue":"6261","key":"pcbi.1011705.ref030","doi-asserted-by":"crossref","first-page":"aac9323","DOI":"10.1126\/science.aac9323","article-title":"Microbiomes in light of traits: a phylogenetic perspective","volume":"350","author":"JB Martiny","year":"2015","journal-title":"Science"},{"issue":"1","key":"pcbi.1011705.ref031","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1109\/TIT.1967.1053964","article-title":"Nearest neighbor pattern classification","volume":"13","author":"T Cover","year":"1967","journal-title":"IEEE transactions on information theory"},{"issue":"19","key":"pcbi.1011705.ref032","doi-asserted-by":"crossref","first-page":"3617","DOI":"10.1016\/j.cell.2022.08.003","article-title":"Design, construction, and in vivo augmentation of a complex gut microbiome","volume":"185","author":"AG Cheng","year":"2022","journal-title":"Cell"},{"issue":"13","key":"pcbi.1011705.ref033","doi-asserted-by":"crossref","first-page":"2530","DOI":"10.1021\/bi00789a017","article-title":"Metabolism of benzoic acid by bacteria. Accumulation of (-)-3, 5-cyclohexadiene-1, 2-diol-1-carboxylic acid by a mutant strain of Alcaligenes eutrophus","volume":"10","author":"AM Reiner","year":"1971","journal-title":"Biochemistry"},{"key":"pcbi.1011705.ref034","doi-asserted-by":"crossref","unstructured":"Zhou K, Liu Z, Qiao Y, Xiang T, Loy CC. Domain generalization: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2022;.","DOI":"10.1109\/TPAMI.2022.3195549"},{"issue":"11","key":"pcbi.1011705.ref035","doi-asserted-by":"crossref","first-page":"1947","DOI":"10.1002\/pro.3715","article-title":"Toward understanding the origin and evolution of cellular organisms","volume":"28","author":"M Kanehisa","year":"2019","journal-title":"Protein Science"},{"key":"pcbi.1011705.ref036","article-title":"KEGG for taxonomy-based analysis of pathways and genomes","author":"M Kanehisa","year":"2022","journal-title":"Nucleic Acids Research"},{"issue":"2","key":"pcbi.1011705.ref037","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1016\/S0021-9258(18)65926-5","article-title":"The oxidation of L-arabinose by Pseudomonas saccharophila","volume":"217","author":"R Weimberg","year":"1955","journal-title":"Journal of Biological Chemistry"},{"issue":"2","key":"pcbi.1011705.ref038","doi-asserted-by":"crossref","first-page":"1031","DOI":"10.1016\/S0021-9258(18)70464-X","article-title":"Pentose fermentation by Lactobacillus plantarum: II. L-arabinose isomerase","volume":"231","author":"E Heath","year":"1958","journal-title":"Journal of Biological Chemistry"},{"issue":"D1","key":"pcbi.1011705.ref039","doi-asserted-by":"crossref","first-page":"D741","DOI":"10.1093\/nar\/gkab961","article-title":"Bac Dive in 2022: the knowledge base for standardized bacterial and archaeal data","volume":"50","author":"LC Reimer","year":"2022","journal-title":"Nucleic Acids Research"},{"issue":"3","key":"pcbi.1011705.ref040","doi-asserted-by":"crossref","first-page":"787","DOI":"10.1128\/jb.147.3.787-796.1981","article-title":"Nucleotide sequence of the structural gene for tryptophanase of Escherichia coli K-12","volume":"147","author":"MC Deeley","year":"1981","journal-title":"Journal of Bacteriology"},{"issue":"1","key":"pcbi.1011705.ref041","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1038\/s42003-020-01158-y","article-title":"The Rnf complex is a Na+ coupled respiratory enzyme in a fermenting bacterium, Thermotoga maritima","volume":"3","author":"M Kuhns","year":"2020","journal-title":"Communications Biology"},{"issue":"15","key":"pcbi.1011705.ref042","doi-asserted-by":"crossref","first-page":"12703","DOI":"10.1074\/jbc.M206563200","article-title":"Characterization of a functional bacterial homologue of sodium-dependent neurotransmitter transporters","volume":"278","author":"A Androutsellis-Theotokis","year":"2003","journal-title":"Journal of Biological Chemistry"},{"issue":"4","key":"pcbi.1011705.ref043","first-page":"533","article-title":"Cell biology and molecular basis of denitrification","volume":"61","author":"WG Zumft","year":"1997","journal-title":"Microbiology and Molecular Biology Reviews"},{"issue":"12","key":"pcbi.1011705.ref044","doi-asserted-by":"crossref","first-page":"e114118","DOI":"10.1371\/journal.pone.0114118","article-title":"Intergenomic comparisons highlight modularity of the denitrification pathway and underpin the importance of community structure for N2O emissions","volume":"9","author":"DR Graf","year":"2014","journal-title":"PloS one"},{"issue":"D1","key":"pcbi.1011705.ref045","doi-asserted-by":"crossref","first-page":"D638","DOI":"10.1093\/nar\/gkac1000","article-title":"The STRING database in 2023: protein\u2013protein association networks and functional enrichment analyses for any sequenced genome of interest","volume":"51","author":"D Szklarczyk","year":"2023","journal-title":"Nucleic Acids Research"},{"issue":"9","key":"pcbi.1011705.ref046","doi-asserted-by":"crossref","first-page":"1442","DOI":"10.1038\/s41591-019-0559-3","article-title":"A library of human gut bacterial isolates paired with longitudinal multiomics data enables mechanistic microbiome research","volume":"25","author":"M Poyet","year":"2019","journal-title":"Nature medicine"},{"issue":"5","key":"pcbi.1011705.ref047","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1016\/j.tim.2014.03.001","article-title":"Phylogeny, culturing, and metagenomics of the human gut microbiota","volume":"22","author":"AW Walker","year":"2014","journal-title":"Trends in microbiology"},{"issue":"26","key":"pcbi.1011705.ref048","doi-asserted-by":"crossref","first-page":"12804","DOI":"10.1073\/pnas.1900102116","article-title":"Massively parallel screening of synthetic microbial communities","volume":"116","author":"J Kehe","year":"2019","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"pcbi.1011705.ref049","first-page":"1","article-title":"High-throughput microbial culturomics using automation and machine learning","author":"Y Huang","year":"2023","journal-title":"Nature Biotechnology"},{"issue":"6","key":"pcbi.1011705.ref050","doi-asserted-by":"crossref","first-page":"521","DOI":"10.1016\/j.cels.2019.11.004","article-title":"Higher-order interaction between species inhibits bacterial invasion of a phototroph-predator microbial community","volume":"9","author":"H Mickalide","year":"2019","journal-title":"Cell systems"},{"issue":"12","key":"pcbi.1011705.ref051","doi-asserted-by":"crossref","first-page":"e3000550","DOI":"10.1371\/journal.pbio.3000550","article-title":"High-order interactions distort the functional landscape of microbial consortia","volume":"17","author":"A Sanchez-Gorostiaga","year":"2019","journal-title":"PLoS Biology"},{"issue":"6401","key":"pcbi.1011705.ref052","doi-asserted-by":"crossref","first-page":"469","DOI":"10.1126\/science.aat1168","article-title":"Emergent simplicity in microbial community assembly","volume":"361","author":"JE Goldford","year":"2018","journal-title":"Science"},{"issue":"10","key":"pcbi.1011705.ref053","doi-asserted-by":"crossref","first-page":"1424","DOI":"10.1038\/s41559-021-01535-8","article-title":"Resource\u2013diversity relationships in bacterial communities reflect the network structure of microbial metabolism","volume":"5","author":"M Dal Bello","year":"2021","journal-title":"Nature Ecology & Evolution"},{"issue":"45","key":"pcbi.1011705.ref054","doi-asserted-by":"crossref","first-page":"e2013564118","DOI":"10.1073\/pnas.2013564118","article-title":"Closed microbial communities self-organize to persistently cycle carbon","volume":"118","author":"LM de Jes\u00fas Astacio","year":"2021","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"12","key":"pcbi.1011705.ref055","doi-asserted-by":"crossref","first-page":"1306","DOI":"10.1038\/s41592-019-0616-3","article-title":"Learning representations of microbe\u2013metabolite interactions","volume":"16","author":"JT Morton","year":"2019","journal-title":"Nature methods"},{"issue":"7","key":"pcbi.1011705.ref056","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1038\/s41579-022-00695-z","article-title":"Life and death in the soil microbiome: how ecological processes influence biogeochemistry","volume":"20","author":"NW Sokol","year":"2022","journal-title":"Nature Reviews Microbiology"},{"issue":"15","key":"pcbi.1011705.ref057","doi-asserted-by":"crossref","first-page":"2114","DOI":"10.1093\/bioinformatics\/btu170","article-title":"Trimmomatic: a flexible trimmer for Illumina sequence data","volume":"30","author":"AM Bolger","year":"2014","journal-title":"Bioinformatics"},{"issue":"6","key":"pcbi.1011705.ref058","doi-asserted-by":"crossref","first-page":"e1005595","DOI":"10.1371\/journal.pcbi.1005595","article-title":"Unicycler: resolving bacterial genome assemblies from short and long sequencing reads","volume":"13","author":"RR Wick","year":"2017","journal-title":"PLoS computational biology"},{"issue":"5","key":"pcbi.1011705.ref059","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1089\/cmb.2012.0021","article-title":"SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing","volume":"19","author":"A Bankevich","year":"2012","journal-title":"Journal of computational biology"},{"issue":"1","key":"pcbi.1011705.ref060","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-11-119","article-title":"Prodigal: prokaryotic gene recognition and translation initiation site identification","volume":"11","author":"D Hyatt","year":"2010","journal-title":"BMC bioinformatics"},{"issue":"7","key":"pcbi.1011705.ref061","doi-asserted-by":"crossref","first-page":"2251","DOI":"10.1093\/bioinformatics\/btz859","article-title":"KofamKOALA: KEGG Ortholog assignment based on profile HMM and adaptive score threshold","volume":"36","author":"T Aramaki","year":"2020","journal-title":"Bioinformatics"},{"issue":"14","key":"pcbi.1011705.ref062","doi-asserted-by":"crossref","first-page":"1823","DOI":"10.1093\/bioinformatics\/bts252","article-title":"SINA: accurate high-throughput multiple sequence alignment of ribosomal RNA genes","volume":"28","author":"E Pruesse","year":"2012","journal-title":"Bioinformatics"},{"issue":"D1","key":"pcbi.1011705.ref063","doi-asserted-by":"crossref","first-page":"D590","DOI":"10.1093\/nar\/gks1219","article-title":"The SILVA ribosomal RNA gene database project: improved data processing and web-based tools","volume":"41","author":"C Quast","year":"2012","journal-title":"Nucleic acids research"},{"issue":"3","key":"pcbi.1011705.ref064","doi-asserted-by":"crossref","first-page":"e9490","DOI":"10.1371\/journal.pone.0009490","article-title":"FastTree 2\u2013approximately maximum-likelihood trees for large alignments","volume":"5","author":"MN Price","year":"2010","journal-title":"PloS one"},{"issue":"19","key":"pcbi.1011705.ref065","doi-asserted-by":"crossref","first-page":"2520","DOI":"10.1093\/bioinformatics\/bts480","article-title":"Snakemake\u2014a scalable bioinformatics workflow engine","volume":"28","author":"J K\u00f6ster","year":"2012","journal-title":"Bioinformatics"},{"issue":"3","key":"pcbi.1011705.ref066","doi-asserted-by":"crossref","first-page":"639","DOI":"10.1038\/s41596-018-0098-2","article-title":"Creation and analysis of biochemical constraint-based models using the COBRA Toolbox v. 3.0","volume":"14","author":"L Heirendt","year":"2019","journal-title":"Nature protocols"},{"key":"pcbi.1011705.ref067","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1752-0509-7-74","article-title":"COBRApy: constraints-based reconstruction and analysis for python","volume":"7","author":"A Ebrahim","year":"2013","journal-title":"BMC systems biology"},{"key":"pcbi.1011705.ref068","first-page":"gkac1078","article-title":"proGenomes3: approaching one million accurately and consistently annotated high-quality prokaryotic genomes","author":"A Fullam","year":"2022","journal-title":"Nucleic Acids Research"},{"key":"pcbi.1011705.ref069","article-title":"The European Nucleotide Archive in 2022","author":"J Burgin","year":"2022","journal-title":"Nucleic Acids Research"},{"issue":"12","key":"pcbi.1011705.ref070","doi-asserted-by":"crossref","first-page":"5825","DOI":"10.1093\/molbev\/msab293","article-title":"eggNOG-mapper v2: functional annotation, orthology assignments, and domain prediction at the metagenomic scale","volume":"38","author":"CP Cantalapiedra","year":"2021","journal-title":"Molecular biology and evolution"},{"issue":"3","key":"pcbi.1011705.ref071","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1038\/s41592-019-0686-2","article-title":"SciPy 1.0: fundamental algorithms for scientific computing in Python","volume":"17","author":"P Virtanen","year":"2020","journal-title":"Nature methods"},{"key":"pcbi.1011705.ref072","first-page":"65","article-title":"A simple sequentially rejective multiple test procedure","author":"S Holm","year":"1979","journal-title":"Scandinavian journal of statistics"},{"issue":"318","key":"pcbi.1011705.ref073","first-page":"626","article-title":"Rectangular confidence regions for the means of multivariate normal distributions","volume":"62","author":"Z \u0160id\u00e1k","year":"1967","journal-title":"Journal of the American Statistical Association"},{"key":"pcbi.1011705.ref074","first-page":"2825","article-title":"Scikit-learn: Machine learning in Python","volume":"12","author":"F Pedregosa","year":"2011","journal-title":"the Journal of machine Learning research"},{"issue":"10","key":"pcbi.1011705.ref075","doi-asserted-by":"crossref","first-page":"e1001184","DOI":"10.1371\/journal.pbio.1001184","article-title":"Structure and evolution of Streptomyces interaction networks in soil and in silico","volume":"9","author":"K Vetsigian","year":"2011","journal-title":"PLoS biology"}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1011705","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,12,19]],"date-time":"2023-12-19T13:24:23Z","timestamp":1702992263000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1011705"}},"subtitle":[],"editor":[{"given":"Christos A.","family":"Ouzounis","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,12,19]]},"references-count":75,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2023,12,19]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1011705","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.06.30.547261","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,19]]}}}