{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,26]],"date-time":"2026-04-26T06:17:59Z","timestamp":1777184279241,"version":"3.51.4"},"reference-count":39,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>As orthologous proteins are expected to retain function more often than other homologs, they are often used for functional annotation transfer between species. However, ortholog identification methods do not take into account changes in domain architecture, which are likely to modify a protein's function. By domain architecture we refer to the sequential arrangement of domains along a protein sequence.<\/jats:p>\n            <jats:p>To assess the level of domain architecture conservation among orthologs, we carried out a large-scale study of such events between human and 40 other species spanning the entire evolutionary range. We designed a score to measure domain architecture similarity and used it to analyze differences in domain architecture conservation between orthologs and paralogs relative to the conservation of primary sequence. We also statistically characterized the extents of different types of domain swapping events across pairs of orthologs and paralogs.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>The analysis shows that orthologs exhibit greater domain architecture conservation than paralogous homologs, even when differences in average sequence divergence are compensated for, for homologs that have diverged beyond a certain threshold. We interpret this as an indication of a stronger selective pressure on orthologs than paralogs to retain the domain architecture required for the proteins to perform a specific function. In general, orthologs as well as the closest paralogous homologs have very similar domain architectures, even at large evolutionary separation.<\/jats:p>\n            <jats:p>The most common domain architecture changes observed in both ortholog and paralog pairs involved insertion\/deletion of new domains, while domain shuffling and segment duplication\/deletion were very infrequent.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>On the whole, our results support the hypothesis that function conservation between orthologs demands higher domain architecture conservation than other types of homologs, relative to primary sequence conservation. This supports the notion that orthologs are functionally more similar than other types of homologs at the same evolutionary distance.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-12-326","type":"journal-article","created":{"date-parts":[[2011,8,6]],"date-time":"2011-08-06T06:14:12Z","timestamp":1312611252000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":52,"title":["Domain architecture conservation in orthologs"],"prefix":"10.1186","volume":"12","author":[{"given":"Kristoffer","family":"Forslund","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Isabella","family":"Pekkari","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Erik LL","family":"Sonnhammer","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2011,8,5]]},"reference":[{"key":"4763_CR1","doi-asserted-by":"publisher","first-page":"99","DOI":"10.2307\/2412448","volume":"19","author":"WM Fitch","year":"1970","unstructured":"Fitch WM: Distinguishing homologous from analogous proteins. Syst Zool 1970, 19: 99\u2013113. 10.2307\/2412448","journal-title":"Syst Zool"},{"key":"4763_CR2","doi-asserted-by":"publisher","first-page":"1041","DOI":"10.1006\/jmbi.2000.5197","volume":"314","author":"M Remm","year":"2001","unstructured":"Remm M, Storm CEV, Sonnhammer ELL: Automatic Clustering of Orthologs and In-paralogs from Pairwise Species Comparisons. JMB 2001, 314: 1041\u20131052. 10.1006\/jmbi.2000.5197","journal-title":"JMB"},{"key":"4763_CR3","doi-asserted-by":"publisher","first-page":"619","DOI":"10.1016\/S0168-9525(02)02793-2","volume":"18","author":"ELL Sonnhammer","year":"2002","unstructured":"Sonnhammer ELL, Koonin EV: Orthology, paralogy and proposed classification for paralog subtypes. Trends Genet 2002, 18: 619\u2013620. 10.1016\/S0168-9525(02)02793-2","journal-title":"Trends Genet"},{"key":"4763_CR4","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1146\/annurev.genet.39.073003.114725","volume":"39","author":"EV Koonin","year":"2005","unstructured":"Koonin EV: Orthologs, Paralogs and Evolutionary Genomics. Annu Rev Genet 2005, 39: 309\u2013338. 10.1146\/annurev.genet.39.073003.114725","journal-title":"Annu Rev Genet"},{"issue":"9","key":"4763_CR5","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1186\/gb-2009-10-9-403","volume":"10","author":"T Gabald\u00f3n","year":"2009","unstructured":"Gabald\u00f3n T, Dessimoz C, Huxley-Jones J, Vilella AJ, Sonnhammer EL, Lewis S: Joining forces in the quest for orthologs. Genome Biol 2009, 10(9):403. 10.1186\/gb-2009-10-9-403","journal-title":"Genome Biol"},{"key":"4763_CR6","doi-asserted-by":"publisher","first-page":"465","DOI":"10.1146\/annurev.genet.40.110405.090439","volume":"41","author":"K Dolinski","year":"2007","unstructured":"Dolinski K, Botstein D: Orthology and Functional Conservation in Eukaryotes. Annu Rev Genet 2007, 41: 465\u2013507. 10.1146\/annurev.genet.40.110405.090439","journal-title":"Annu Rev Genet"},{"issue":"5","key":"4763_CR7","doi-asserted-by":"publisher","first-page":"210","DOI":"10.1016\/j.tig.2009.03.004","volume":"25","author":"RA Studer","year":"2009","unstructured":"Studer RA, Robinson-Rechavi M: How confident can we be that orthologs are similar, but paralogs differ? Trends in Genetics 2009, 25(5):210\u2013216. 10.1016\/j.tig.2009.03.004","journal-title":"Trends in Genetics"},{"key":"4763_CR8","doi-asserted-by":"publisher","first-page":"320","DOI":"10.1093\/nar\/26.1.320","volume":"26","author":"ELL Sonnhammer","year":"1998","unstructured":"Sonnhammer ELL, Eddy SR, Birney E, Bateman A, Durbin R: Pfam: multiple sequence alignments and HMM-profiles of protein domains. Nucleic Acids Res 1998, 26: 320\u2013322. 10.1093\/nar\/26.1.320","journal-title":"Nucleic Acids Res"},{"key":"4763_CR9","doi-asserted-by":"crossref","unstructured":"Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, Holm L, Sonnhammer ELL, Eddy SR, Bateman A: The Pfam protein families database. Nucleic Acids Res 2010, (38 Database):D211-D222.","DOI":"10.1093\/nar\/gkp985"},{"key":"4763_CR10","doi-asserted-by":"publisher","first-page":"911","DOI":"10.1016\/j.jmb.2005.08.067","volume":"353","author":"\u00c5K Bj\u00f6rklund","year":"2005","unstructured":"Bj\u00f6rklund \u00c5K, Ekman D, Light S, Frey-Sk\u00f6tt J, Elofsson A: Domain Rearrangements in Protein Evolution. J Mol Biol 2005, 353: 911\u2013923. 10.1016\/j.jmb.2005.08.067","journal-title":"J Mol Biol"},{"issue":"1","key":"4763_CR11","doi-asserted-by":"publisher","first-page":"355","DOI":"10.1016\/j.jmb.2004.11.050","volume":"346","author":"C Vogel","year":"2005","unstructured":"Vogel C, Teichmann SA, Pereira-Leal J: The Relationship Between Domain Duplication and Recombination. JMB 2005, 346(1):355\u2013365. 10.1016\/j.jmb.2004.11.050","journal-title":"JMB"},{"issue":"8","key":"4763_CR12","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1371\/journal.pcbi.0020114","volume":"2","author":"\u00c5K Bj\u00f6rklund","year":"2006","unstructured":"Bj\u00f6rklund \u00c5K, Ekman D, Elofsson A: Expansion of Protein Domain Repeats. PloS Comput Biol 2006, 2(8):114. 10.1371\/journal.pcbi.0020114","journal-title":"PloS Comput Biol"},{"key":"4763_CR13","doi-asserted-by":"publisher","first-page":"751","DOI":"10.1042\/BST0370751","volume":"37","author":"M Buljan","year":"2009","unstructured":"Buljan M, Bateman A: The evolution of protein domain families. Biochem Soc Transactions 2009, 37: 751\u2013755. 10.1042\/BST0370751","journal-title":"Biochem Soc Transactions"},{"key":"4763_CR14","doi-asserted-by":"publisher","first-page":"1632","DOI":"10.1101\/gr. 183801","volume":"11","author":"H Hegyi","year":"2001","unstructured":"Hegyi H, Gerstein M: Annotation Transfer for Genomics: Measuring Functional Divergence in Multi-Domain Proteins. Genome Res 2001, 11: 1632\u20131640. 10.1101\/gr. 183801","journal-title":"Genome Res"},{"key":"4763_CR15","first-page":"140","volume-title":"Pacific Symposium on Biocomputing","author":"B Hayete","year":"2005","unstructured":"Hayete B, Bienkowska JR: GOTrees: Predicting GO Associations from Protein Domain Composition Using Decision Trees. Pacific Symposium on Biocomputing 2005, 140\u2013151."},{"key":"4763_CR16","doi-asserted-by":"crossref","unstructured":"Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Buillard V, Cerutti L, Copley R, Courcelle E, Das U, Daugherty L, Dibley M, Finn R, Fleischmann W, Gough J, Haft D, Hulo N, Hunter S, Kahn D, Kanapin A, Kejariwal A, Labarga A, Langendijk-Genevaux PS, Lonsdale D, Lopez R, Letunic I, Madera M, Maslen J, McAnulla C, McDowall J, Mistry J, Mitchell A, Nikolskaya AN, Orchard S, Orengo C, Petryszak R, Selengut JD, Sigrist CJ, Thomas PD, Valentin F, Wilson D, Wu CH, Yeats C: New developments in the InterPro database. Nucleic Acids Res 2007, (35 Database):D224\u2013228.","DOI":"10.1093\/nar\/gkl841"},{"issue":"15","key":"4763_CR17","doi-asserted-by":"publisher","first-page":"1681","DOI":"10.1093\/bioinformatics\/btn312","volume":"24","author":"K Forslund","year":"2009","unstructured":"Forslund K, Sonnhammer ELL: Predicting protein function from domain content. Bioinformatics 2009, 24(15):1681\u20131687.","journal-title":"Bioinformatics"},{"issue":"17","key":"4763_CR18","doi-asserted-by":"publisher","first-page":"2081","DOI":"10.1093\/bioinformatics\/btl366","volume":"22","author":"K Lin","year":"2006","unstructured":"Lin K, Zhu L, Zhang DY: An initial strategy for comparing proteins at the domain architecture level. Bioinformatics 2006, 22(17):2081\u20132086. 10.1093\/bioinformatics\/btl366","journal-title":"Bioinformatics"},{"issue":"1","key":"4763_CR19","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1093\/nar\/28.1.33","volume":"28","author":"RL Tatusov","year":"2000","unstructured":"Tatusov RL, Galperin MY, Natale DA, Koonin EV: The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res 2000, 28(1):33\u201336. 10.1093\/nar\/28.1.33","journal-title":"Nucleic Acids Res"},{"issue":"4","key":"4763_CR20","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1089\/cmb.2007.A009","volume":"14","author":"N Song","year":"2007","unstructured":"Song N, Sedgewick RD, Durand D: Domain Architecture Comparison for Multidomain Homology Identification. Journal of Computational Biology 2007, 14(4):496\u2013516. 10.1089\/cmb.2007.A009","journal-title":"Journal of Computational Biology"},{"key":"4763_CR21","doi-asserted-by":"crossref","unstructured":"Ostlund G, Schmitt T, Forslund K, K\u00f6stler T, Messina DN, Roopra S, Frings O, Sonnhammer ELL: InParanoid 7: new algorithms and tools for eukaryotic orthology analysis. Nucleic Acids Res 2010, (38 Database):D196\u2013203.","DOI":"10.1093\/nar\/gkp931"},{"key":"4763_CR22","doi-asserted-by":"publisher","first-page":"D263","DOI":"10.1093\/nar\/gkm1020","volume":"36","author":"AC Berglund","year":"2008","unstructured":"Berglund AC, Sjolund E, Ostlund G, Sonnhammer ELL: InParanoid 6: eukaryotic ortholog clusters with inparalogs. Nucleic Acids Res 2008, 36: D263\u2013266.","journal-title":"Nucleic Acids Res"},{"issue":"11","key":"4763_CR23","doi-asserted-by":"publisher","first-page":"1451","DOI":"10.1093\/bioinformatics\/btg161","volume":"19","author":"P Janssen","year":"2003","unstructured":"Janssen P, Enright AJ, Audit B, Cases I, Goldovsky L, Harte N, Kunin V, Ouzounis CA: COmplete GENome Tracking (COGENT): a flexible data environment for computational genomics. Bioinformatics 2003, 19(11):1451\u20131452. 10.1093\/bioinformatics\/btg161","journal-title":"Bioinformatics"},{"issue":"5","key":"4763_CR24","doi-asserted-by":"publisher","first-page":"e1000069","DOI":"10.1371\/journal.pcbi.1000069","volume":"4","author":"SR Eddy","year":"2008","unstructured":"Eddy SR: A probabilistic model of local sequence alignment that simplifies statistical significance estimation. PLoS Comput Biol 2008, 4(5):e1000069. 10.1371\/journal.pcbi.1000069","journal-title":"PLoS Comput Biol"},{"issue":"2","key":"4763_CR25","doi-asserted-by":"publisher","first-page":"254","DOI":"10.1093\/molbev\/msm254","volume":"25","author":"K Forslund","year":"2008","unstructured":"Forslund K, Henricson A, Hollich V, Sonnhammer EL: Domain tree-based analysis of protein architecture evolution. Mol Biol Evol 2008, 25(2):254\u2013264. 10.1093\/molbev\/msm254","journal-title":"Mol Biol Evol"},{"key":"4763_CR26","doi-asserted-by":"publisher","first-page":"443","DOI":"10.1016\/0022-2836(70)90057-4","volume":"48","author":"SB Needleman","year":"1970","unstructured":"Needleman SB, Wunsch CD: A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 1970, 48: 443\u2013453. 10.1016\/0022-2836(70)90057-4","journal-title":"J Mol Biol"},{"key":"4763_CR27","doi-asserted-by":"publisher","first-page":"298","DOI":"10.1186\/1471-2105-6-298","volume":"6","author":"T Lassmann","year":"2005","unstructured":"Lassmann T, Sonnhammer EL: Kalign--an accurate and fast multiple sequence alignment algorithm. BMC Bioinformatics 2005, 6: 298. 10.1186\/1471-2105-6-298","journal-title":"BMC Bioinformatics"},{"key":"4763_CR28","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1016\/B978-1-4832-3211-9.50009-7","volume-title":"Mammalian protein metabolism","author":"TH Jukes","year":"1969","unstructured":"Jukes TH, Cantor CR: Evolution of protein molecules. In Mammalian protein metabolism. Edited by: Munro HN. New York: Academic Press; 1969:21\u2013123."},{"issue":"1","key":"4763_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1002\/hbm.1058","volume":"15","author":"TE Nichols","year":"2001","unstructured":"Nichols TE, Holmes AP: Nonparametric Permutation Tests For Functional Neuroimaging: A Primer with Examples. Human Brain Mapping 2001, 15(1):1\u201325.","journal-title":"Human Brain Mapping"},{"key":"4763_CR30","volume-title":"From MathWorld--A Wolfram Web Resource","author":"EW Weisstein","year":"1999","unstructured":"Weisstein EW: Bonferroni Correction. From MathWorld--A Wolfram Web Resource 1999. [http:\/\/mathworld.wolfram.com\/BonferroniCorrection.html]"},{"key":"4763_CR31","volume-title":"From MathWorld--A Wolfram Web Resource","author":"EW Weisstein","year":"1999","unstructured":"Weisstein EW: Chi-Squared Test. From MathWorld--A Wolfram Web Resource 1999. [http:\/\/mathworld.wolfram.com\/Chi-SquaredTest.html]"},{"key":"4763_CR32","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-86659-3","volume-title":"Evolution by gene duplication","author":"S Ohno","year":"1970","unstructured":"Ohno S: Evolution by gene duplication. New York: Springer; 1970."},{"issue":"1","key":"4763_CR33","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1093\/genetics\/154.1.459","volume":"154","author":"M Lynch","year":"2000","unstructured":"Lynch M, Force A: The probability of duplicate gene preservation by subfunctionalization. Genetics 2000, 154(1):459\u2013473.","journal-title":"Genetics"},{"issue":"10","key":"4763_CR34","doi-asserted-by":"publisher","first-page":"e314","DOI":"10.1371\/journal.pbio.0030314","volume":"3","author":"P Dehal","year":"2005","unstructured":"Dehal P, Boore JL: Two Rounds of Whole Genome Duplication in the Ancestral Vertebrate. PLoS Biol 2005, 3(10):e314. 10.1371\/journal.pbio.0030314","journal-title":"PLoS Biol"},{"key":"4763_CR35","doi-asserted-by":"publisher","first-page":"231","DOI":"10.1016\/j.jmb.2005.02.007","volume":"348","author":"D Ekman","year":"2005","unstructured":"Ekman D, Bj\u00f6rklund \u00c5K, Frey-Sk\u00f6tt J, Elofsson A: Multi-domain Proteins in the Three Kingdoms of Life: Orphan Domains and Other Unassigned Regions. J Mol Biol 2005, 348: 231\u2013243. 10.1016\/j.jmb.2005.02.007","journal-title":"J Mol Biol"},{"key":"4763_CR36","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","volume":"215","author":"SF Altschul","year":"1990","unstructured":"Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215: 403\u2013410.","journal-title":"J Mol Biol"},{"key":"4763_CR37","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1016\/j.ddtec.2006.06.002","volume":"3","author":"A Alexeyenko","year":"2006","unstructured":"Alexeyenko A, Lindberg J, Perez-Bercoff A, Sonnhammer ELL: Overview and comparison of ortholog databases. Drug Discovery Today: Technologies 2006, 3: 137\u2013143.","journal-title":"Drug Discovery Today: Technologies"},{"issue":"6","key":"4763_CR38","doi-asserted-by":"publisher","first-page":"1306","DOI":"10.1002\/pro.143","volume":"18","author":"ME Peterson","year":"2009","unstructured":"Peterson ME, Chen F, Saven JG, Roos DS, Babbitt PC, Sali A: Evolutionary constraints on structural similarity in orthologs and paralogs. Protein Sci 2009, 18(6):1306\u20131315. 10.1002\/pro.143","journal-title":"Protein Sci"},{"key":"4763_CR39","doi-asserted-by":"crossref","unstructured":"Kondrashov FA, Rogozin IB, Wolf YI, Koonin EV: Selection in the evolution of gene duplications. Genome Biol 2002., 3(2):","DOI":"10.1186\/gb-2002-3-2-research0008"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-326.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T16:01:32Z","timestamp":1630512092000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-326"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,8,5]]},"references-count":39,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4763"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-326","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,8,5]]},"assertion":[{"value":"8 July 2011","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 August 2011","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 August 2011","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"326"}}