{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T17:17:43Z","timestamp":1772903863396,"version":"3.50.1"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2006,6,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Most proteins comprise one or several domains. New domain architectures can be created by combining previously existing domains. The elementary events that create new domain architectures may be categorized into three classes, namely domain(s) insertion or deletion (indel), exchange and repetition. Using \u2018DomainTeam\u2019, a tool dedicated to the search for microsyntenies of domains, we quantified the relative contribution of these events. This tool allowed us to collect homologous bacterial genes encoding proteins that have obviously evolved by modular assembly of domains. We show that indels are the most frequent elementary events and that they occur in most cases at either the N- or C-terminus of the proteins. As revealed by the genomic neighbourhood\/context of the corresponding genes, we show that a substantial number of these terminal indels are the consequence of gene fusions\/fissions. We provide evidence showing that the contribution of gene fusion\/fission to the evolution of multi-domain bacterial proteins is lower-bounded by 27% and upper-bounded by 64%. 
We conclude that gene fusion\/fission is a major contributor to the evolution of multi-domain bacterial proteins.<\/jats:p><jats:p>Contact: pasek@genopole.cnrs.fr<\/jats:p><jats:p>Supplementary information: Supplementary data are available at<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl135","type":"journal-article","created":{"date-parts":[[2006,4,7]],"date-time":"2006-04-07T00:24:36Z","timestamp":1144369476000},"page":"1418-1423","source":"Crossref","is-referenced-by-count":136,"title":["Gene fusion\/fission is a major contributor to evolution of multi-domain bacterial proteins"],"prefix":"10.1093","volume":"22","author":[{"given":"Sophie","family":"Pasek","sequence":"first","affiliation":[{"name":"Laboratoire Statistique et G\u00e9nome, 523 Place des Terrasses, 91034 Evry cedex, France"},{"name":"Soluscience, Biop\u00f4le Clermont-Limagne, 63360 Saint-Beauzire, France"}]},{"given":"Jean-Loup","family":"Risler","sequence":"additional","affiliation":[{"name":"Laboratoire Statistique et G\u00e9nome, 523 Place des Terrasses, 91034 Evry cedex, France"}]},{"given":"Pierre","family":"Br\u00e9zellec","sequence":"additional","affiliation":[{"name":"Laboratoire Statistique et G\u00e9nome, 523 Place des Terrasses, 91034 Evry cedex, France"}]}],"member":"286","published-online":{"date-parts":[[2006,4,6]]},"reference":[{"key":"2023012408403585600_b1","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1006\/jsbi.2001.4392","article-title":"Protein repeats: structures, functions, and evolution","volume":"134","author":"Andrade","year":"2001","journal-title":"J. Struct. 
Biol."},{"key":"2023012408403585600_b2","doi-asserted-by":"crossref","first-page":"D226","DOI":"10.1093\/nar\/gkh039","article-title":"SCOP database in 2004: refinements integrate structure and sequence family data","volume":"32","author":"Andreeva","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012408403585600_b3","doi-asserted-by":"crossref","first-page":"D138","DOI":"10.1093\/nar\/gkh121","article-title":"The Pfam protein families database","volume":"32","author":"Bateman","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012408403585600_b4","doi-asserted-by":"crossref","first-page":"911","DOI":"10.1016\/j.jmb.2005.08.067","article-title":"Domain rearrangements in protein evolution","volume":"353","author":"Bj\u00f6rklund","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012408403585600_b5","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1007\/s00018-004-4416-1","article-title":"The evolution of domain arrangements in proteins and interaction networks","author":"Bornberg-Bauer","year":"2005","journal-title":"Cell. Mol. Life Sci."},{"key":"2023012408403585600_b6","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1146\/annurev.genet.32.1.339","article-title":"The diverse and dynamic structure of bacterial genomes","volume":"32","author":"Casjens","year":"1998","journal-title":"Annu. Rev. Genet."},{"key":"2023012408403585600_b7","doi-asserted-by":"crossref","first-page":"4516","DOI":"10.1073\/pnas.0737502100","article-title":"Enhanced protein domain discovery by using language modeling techniques from speech recognition","volume":"100","author":"Coin","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012408403585600_b8","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1146\/annurev.bi.64.070195.001443","article-title":"The multiplicity of domains in proteins","volume":"64","author":"Doolittle","year":"1995","journal-title":"Annu. Rev. 
Biochem."},{"key":"2023012408403585600_b9","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/S0168-9525(00)02005-9","article-title":"Homology: a personal view on some of the problems","volume":"16","author":"Fitch","year":"2000","journal-title":"Trends Genet."},{"key":"2023012408403585600_b10","doi-asserted-by":"crossref","first-page":"D277","DOI":"10.1093\/nar\/gkh063","article-title":"The KEGG resource for deciphering the genome","volume":"32","author":"Kanehisa","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012408403585600_b11","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1146\/annurev.genet.39.073003.114725","article-title":"Orthologs, paralogs, and evolutionary genomics","volume":"39","author":"Koonin","year":"2005","journal-title":"Annu. Rev. Genet."},{"key":"2023012408403585600_b12","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-3783-7","volume-title":"Sequence\u2014Evolution\u2014Function: Computational Approaches in Genomics","author":"Koonin","year":"2003"},{"key":"2023012408403585600_b13","doi-asserted-by":"crossref","first-page":"218","DOI":"10.1038\/nature01256","article-title":"The structure of the protein universe and genome evolution","volume":"420","author":"Koonin","year":"2002","journal-title":"Nature"},{"key":"2023012408403585600_b14","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/j.tig.2004.11.007","article-title":"Relative rates of gene fusion and fission in multi-domain proteins","volume":"21","author":"Kummerfeld","year":"2005","journal-title":"Trends Genet."},{"key":"2023012408403585600_b15","doi-asserted-by":"crossref","first-page":"D142","DOI":"10.1093\/nar\/gkh088","article-title":"SMART 4.0: towards genomic data integration","volume":"32","author":"Letunic","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012408403585600_b16","doi-asserted-by":"crossref","first-page":"R55","DOI":"10.1186\/gb-2003-4-9-r55","article-title":"Evolution of mosaic operons by 
horizontal gene transfer and gene displacement in situ","volume":"4","author":"Omelchenko","year":"2003","journal-title":"Genome Biol."},{"key":"2023012408403585600_b17","doi-asserted-by":"crossref","first-page":"867","DOI":"10.1146\/annurev.biochem.74.082803.133029","article-title":"Protein families and their evolution\u2014a structural perspective","author":"Orengo","year":"2005","journal-title":"Annu. Rev. Biochem."},{"key":"2023012408403585600_b18","doi-asserted-by":"crossref","first-page":"866","DOI":"10.1046\/j.1365-2958.2000.01901.x","article-title":"Novel type I restriction specificities through domain shuffling of HsdS subunits in Lactococcus lactis","volume":"36","author":"O'Sullivan","year":"2000","journal-title":"Mol. Microbiol."},{"key":"2023012408403585600_b19","doi-asserted-by":"crossref","first-page":"867","DOI":"10.1101\/gr.3638405","article-title":"Identification of genomic features using microsyntenies of domains: domain teams","volume":"15","author":"Pasek","year":"2005","journal-title":"Genome Res."},{"key":"2023012408403585600_b20","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1023\/A:1024182432483","article-title":"Modular assembly of genes and the evolution of new functions","volume":"118","author":"Patthy","year":"2003","journal-title":"Genetica"},{"key":"2023012408403585600_b21","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1006\/jmbi.1997.1003","article-title":"Protein evolution viewed through Escherichia coli protein sequences: introducing the notion of a structural segment of homology, the module","volume":"268","author":"Riley","year":"1997","journal-title":"J. Mol. Biol."},{"key":"2023012408403585600_b22","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1016\/j.mib.2004.08.006","article-title":"Order and disorder in bacterial genomes","volume":"7","author":"Rocha","year":"2004","journal-title":"Curr. Opin. 
Microbiol."},{"key":"2023012408403585600_b23","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1093\/nar\/28.1.33","article-title":"The COG database: a tool for genome-scale analysis of protein functions and evolution","volume":"28","author":"Tatusov","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012408403585600_b24","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1038\/79918","article-title":"Genome rearrangement by replication-directed translocation","volume":"26","author":"Tillier","year":"2000","journal-title":"Nat. Genet."},{"key":"2023012408403585600_b25","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1016\/j.sbi.2004.03.011","article-title":"Structure, function and evolution of multidomain proteins","volume":"14","author":"Vogel","year":"2004","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023012408403585600_b26","doi-asserted-by":"crossref","first-page":"809","DOI":"10.1016\/j.jmb.2003.12.026","article-title":"Supra-domains: evolutionary units larger than single protein domains","volume":"336","author":"Vogel","year":"2004","journal-title":"J. Mol. Biol."},{"key":"2023012408403585600_b27","doi-asserted-by":"crossref","first-page":"932","DOI":"10.1093\/bioinformatics\/bti085","article-title":"Rapid motif-based prediction of circular permutations in multidomain proteins","volume":"21","author":"Weiner","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012408403585600_b28","doi-asserted-by":"crossref","first-page":"734","DOI":"10.1093\/molbev\/msj091","article-title":"Evolution of circular permutations in multidomain proteins","volume":"23","author":"Weiner","year":"2006","journal-title":"Mol. Biol. 
Evol."},{"key":"2023012408403585600_b29","doi-asserted-by":"crossref","first-page":"7940","DOI":"10.1073\/pnas.141236298","article-title":"Genes linked by fusion events are generally of the same functional category: a systematic analysis of 30 microbial genomes","volume":"98","author":"Yanai","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012408403585600_b30","doi-asserted-by":"crossref","first-page":"research0024","DOI":"10.1186\/gb-2002-3-5-research0024","article-title":"Evolution of gene fusions: horizontal transfer versus independent events","volume":"3","author":"Yanai","year":"2002","journal-title":"Genome Biol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/12\/1418\/48838392\/bioinformatics_22_12_1418.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/12\/1418\/48838392\/bioinformatics_22_12_1418.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,2,3]],"date-time":"2024-02-03T16:46:31Z","timestamp":1706978791000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/22\/12\/1418\/207642"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,4,6]]},"references-count":30,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2006,6,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl135","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2006,6,15]]},"published":{"date-parts":[[2006,4,6]]}}}