{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,16]],"date-time":"2026-01-16T06:29:25Z","timestamp":1768544965603,"version":"3.49.0"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2018,11,9]],"date-time":"2018-11-09T00:00:00Z","timestamp":1541721600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["31571358"],"award-info":[{"award-number":["31571358"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["21621004"],"award-info":[{"award-number":["21621004"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["31171238"],"award-info":[{"award-number":["31171238"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["91746119"],"award-info":[{"award-number":["91746119"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2019,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Bacteria can usually acquire certain advantageous genes that enable the bacteria to adapt to rapidly changing niches, thereby leading to a wide range of intraspecific genome content and genetic redundancy. The minimal genome of Escherichia coli, which is the most important bacterial species, and the association between E.coli and its human host are worthy of further exploration.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We used gene prediction and phylogenetic analysis to reveal a rich phylogenetic diversity among 491 E.coli strains and to reveal substantial differences between these strains with respect to gene number and genome length. We used pan-genomic analysis to accurately identify 867 core genes, in which only 243 genes are shared by essential genes. This analysis revealed that core genes mainly provide essential functions to the basic lifestyle of E.coli, and accessory genes are likely to confer selective advantages such as niche adaptation or the ability to colonize specific hosts. By association analysis, we found that E.coli strains in non-human hosts may more easily utilize foreign genetic materials to adapt to their surroundings, but the population in human hosts has higher demands for the control of population density, indicating that highly accurate quorum-sensing behavior is very important for harmony between E.coli and its human host. By considering core genes and previous deletions together, we proposed a potential direction for further reduction of the E.coli genome.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The data, analysis process and detailed information on software tools used in this study are all available in the supplementary material.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty938","type":"journal-article","created":{"date-parts":[[2018,11,8]],"date-time":"2018-11-08T12:47:27Z","timestamp":1541681247000},"page":"1987-1991","source":"Crossref","is-referenced-by-count":31,"title":["Pan-genomic analysis provides novel insights into the association of <i>E.coli<\/i> with human host and its minimal genome"],"prefix":"10.1093","volume":"35","author":[{"given":"Zhi-Kai","family":"Yang","sequence":"first","affiliation":[{"name":"Department of Physics, School of Science"},{"name":"Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China"},{"name":"SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), Tianjin, China"},{"name":"SinoGenoMax Co., Ltd.\/Chinese National Human Genome Center, Beijing, China"}]},{"given":"Hao","family":"Luo","sequence":"additional","affiliation":[{"name":"Department of Physics, School of Science"},{"name":"Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China"},{"name":"SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), Tianjin, China"}]},{"given":"Yanming","family":"Zhang","sequence":"additional","affiliation":[{"name":"SinoGenoMax Co., Ltd.\/Chinese National Human Genome Center, Beijing, China"}]},{"given":"Baijing","family":"Wang","sequence":"additional","affiliation":[{"name":"SinoGenoMax Co., Ltd.\/Chinese National Human Genome Center, Beijing, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9563-3841","authenticated-orcid":false,"given":"Feng","family":"Gao","sequence":"additional","affiliation":[{"name":"Department of Physics, School of Science"},{"name":"Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China"},{"name":"SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), Tianjin, China"}]}],"member":"286","published-online":{"date-parts":[[2018,11,9]]},"reference":[{"key":"2023012713072334500_bty938-B1","doi-asserted-by":"crossref","first-page":"2006.0008","DOI":"10.1038\/msb4100050","article-title":"Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection","volume":"2","author":"Baba","year":"2006","journal-title":"Mol. Syst. Biol"},{"key":"2023012713072334500_bty938-B2","doi-asserted-by":"crossref","first-page":"85.","DOI":"10.1038\/nature18849","article-title":"Interactions between the microbiota and pathogenic bacteria in the gut","volume":"535","author":"B\u00e4umler","year":"2016","journal-title":"Nature"},{"key":"2023012713072334500_bty938-B3","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1186\/1475-2859-11-11","article-title":"Low-mutation-rate, reduced-genome Escherichia coli: an improved host for faithful maintenance of engineered genetic constructs","volume":"11","author":"Cs\u00f6rgo","year":"2012","journal-title":"Microb. Cell Fact"},{"key":"2023012713072334500_bty938-B4","doi-asserted-by":"crossref","first-page":"811","DOI":"10.1038\/nature06245","article-title":"An ecological and evolutionary perspective on human-microbe mutualism and disease","volume":"449","author":"Dethlefsen","year":"2007","journal-title":"Nature"},{"key":"2023012713072334500_bty938-B5","doi-asserted-by":"crossref","first-page":"1345","DOI":"10.1007\/s00438-015-0998-6","article-title":"Genetic redundancy is prevalent within the 6.7 Mb Sinorhizobium meliloti genome","volume":"290","author":"Dicenzo","year":"2015","journal-title":"Mol. Genet. Genomics"},{"key":"2023012713072334500_bty938-B6","doi-asserted-by":"crossref","first-page":"3911","DOI":"10.1128\/JB.186.12.3911-3921.2004","article-title":"Extensive genomic diversity in pathogenic Escherichia coli and Shigella strains revealed by comparative genomic hybridization microarray","volume":"186","author":"Fukiya","year":"2004","journal-title":"J. Bacteriol"},{"key":"2023012713072334500_bty938-B7","doi-asserted-by":"crossref","first-page":"5673","DOI":"10.1128\/JB.185.19.5673-5684.2003","article-title":"Experimental determination and system level analysis of essential genes in Escherichia coli MG1655","volume":"185","author":"Gerdes","year":"2003","journal-title":"J. Bacteriol"},{"key":"2023012713072334500_bty938-B8","doi-asserted-by":"crossref","first-page":"611.","DOI":"10.1038\/nrg3982","article-title":"Eco-Evo-Devo: developmental symbiosis and developmental plasticity as evolutionary agents","volume":"16","author":"Gilbert","year":"2015","journal-title":"Nat. Rev. Genet"},{"key":"2023012713072334500_bty938-B9","doi-asserted-by":"crossref","first-page":"a023861.","DOI":"10.1101\/cshperspect.a023861","article-title":"Minimal cells-real and imagined","volume":"9","author":"Glass","year":"2017","journal-title":"Cold Spring Harb. Perspect. Biol"},{"key":"2023012713072334500_bty938-B10","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1111\/j.1365-2958.2004.04386.x","article-title":"Cell size and nucleoid organization of engineered Escherichia coli cells with a reduced genome","volume":"55","author":"Hashimoto","year":"2005","journal-title":"Mol. Microbiol"},{"key":"2023012713072334500_bty938-B11","doi-asserted-by":"crossref","first-page":"i89","DOI":"10.1093\/bioinformatics\/bty276","article-title":"A pan-genome-based machine learning approach for predicting antimicrobial resistance activities of the Escherichia coli strains","volume":"34","author":"Her","year":"2018","journal-title":"Bioinformatics"},{"key":"2023012713072334500_bty938-B12","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/j.jbiosc.2013.01.010","article-title":"Genetic manipulations restored the growth fitness of reduced-genome Escherichia coli","volume":"116","author":"Hirokawa","year":"2013","journal-title":"J. Biosci. Bioeng"},{"key":"2023012713072334500_bty938-B13","doi-asserted-by":"crossref","first-page":"aad6253","DOI":"10.1126\/science.aad6253","article-title":"Design and synthesis of a minimal bacterial genome","volume":"351","author":"Hutchison","year":"2016","journal-title":"Science"},{"key":"2023012713072334500_bty938-B14","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1111\/j.1574-6968.2011.02331.x","article-title":"Oxidative stress sensitivity of engineered Escherichia coli cells with a reduced genome","volume":"322","author":"Iwadate","year":"2011","journal-title":"FEMS Microbiol. Lett"},{"key":"2023012713072334500_bty938-B15","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1016\/j.mib.2014.11.014","article-title":"One chromosome, one contig: complete microbial genomes from long-read sequencing and assembly","volume":"23","author":"Koren","year":"2015","journal-title":"Curr. Opin. Microbiol"},{"key":"2023012713072334500_bty938-B16","doi-asserted-by":"crossref","first-page":"2178","DOI":"10.1101\/gr.1224503","article-title":"OrthoMCL: identification of ortholog groups for eukaryotic genomes","volume":"13","author":"Li","year":"2003","journal-title":"Genome Res"},{"key":"2023012713072334500_bty938-B17","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1038\/nmeth.3444","article-title":"A complete bacterial genome assembled de novo using only nanopore sequencing data","volume":"12","author":"Loman","year":"2015","journal-title":"Nat. Methods"},{"key":"2023012713072334500_bty938-B18","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1007\/s00248-017-1109-5","article-title":"The intestinal microbiota of Tadpoles differs from those of syntopic aquatic invertebrates","volume":"76","author":"Lyra","year":"2018","journal-title":"Microb. Ecol"},{"key":"2023012713072334500_bty938-B19","doi-asserted-by":"crossref","first-page":"17040.","DOI":"10.1038\/nmicrobiol.2017.40","article-title":"Why prokaryotes have pangenomes","volume":"2","author":"McInerney","year":"2017","journal-title":"Nat. Microbiol"},{"key":"2023012713072334500_bty938-B20","doi-asserted-by":"crossref","first-page":"3691","DOI":"10.1093\/bioinformatics\/btv421","article-title":"Roary: rapid large-scale prokaryote pan genome analysis","volume":"31","author":"Page","year":"2015","journal-title":"Bioinformatics"},{"key":"2023012713072334500_bty938-B21","doi-asserted-by":"crossref","first-page":"1044","DOI":"10.1126\/science.1126439","article-title":"Emergent properties of reduced-genome Escherichia coli","volume":"312","author":"Posfai","year":"2006","journal-title":"Science"},{"key":"2023012713072334500_bty938-B22","doi-asserted-by":"crossref","first-page":"2068","DOI":"10.1093\/bioinformatics\/btu153","article-title":"Prokka: rapid prokaryotic genome annotation","volume":"30","author":"Seemann","year":"2014","journal-title":"Bioinformatics"},{"key":"2023012713072334500_bty938-B23","doi-asserted-by":"crossref","first-page":"1180","DOI":"10.3389\/fmicb.2016.01180","article-title":"Pangenome evidence for higher codon usage bias and stronger translational selection in core genes of Escherichia coli","volume":"7","author":"Sun","year":"2016","journal-title":"Front. Microbiol"},{"key":"2023012713072334500_bty938-B24","doi-asserted-by":"crossref","first-page":"313.","DOI":"10.1038\/nature24624","article-title":"Progress in and promise of bacterial quorum sensing research","volume":"551","author":"Whiteley","year":"2017","journal-title":"Nature"},{"key":"2023012713072334500_bty938-B25","doi-asserted-by":"crossref","first-page":"R267","DOI":"10.1186\/gb-2007-8-12-r267","article-title":"Characterization of probiotic Escherichia coli isolates with a novel pan-genome microarray","volume":"8","author":"Willenbrock","year":"2007","journal-title":"Genome Biol"},{"key":"2023012713072334500_bty938-B26","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1186\/s12934-017-0681-1","article-title":"Large scale validation of an efficient CRISPR\/Cas-based multi gene editing protocol in Escherichia coli","volume":"16","author":"Zerbini","year":"2017","journal-title":"Microb. Cell Fact"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/12\/1987\/48934864\/bioinformatics_35_12_1987.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/35\/12\/1987\/48934864\/bioinformatics_35_12_1987.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T14:11:39Z","timestamp":1674828699000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/35\/12\/1987\/5168157"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,11,9]]},"references-count":26,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2019,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty938","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2019,6]]},"published":{"date-parts":[[2018,11,9]]}}}