{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T07:33:15Z","timestamp":1771572795835,"version":"3.50.1"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"14","license":[{"start":{"date-parts":[[2017,7,12]],"date-time":"2017-07-12T00:00:00Z","timestamp":1499817600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"funder":[{"DOI":"10.13039\/100010663","name":"European Research Council","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100010663","name":"European Research Council","doi-asserted-by":"publisher","award":["309834"],"award-info":[{"award-number":["309834"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>It has been argued that whole-genome duplication (WGD) exerted a profound influence on the course of evolution. For the purpose of fully understanding the impact of WGD, several formal algorithms have been developed for reconstructing pre-WGD gene order in yeast and plant. However, to the best of our knowledge, those algorithms have never been successfully applied to WGD events in teleost and vertebrate, impeded by extensive gene shuffling and gene losses.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Here, we present a probabilistic model of macrosynteny (i.e. conserved linkage or chromosome-scale distribution of orthologs), develop a variational Bayes algorithm for inferring the structure of pre-WGD genomes, and study estimation accuracy by simulation. Then, by applying the method to the teleost WGD, we demonstrate effectiveness of the algorithm in a situation where gene-order reconstruction algorithms perform relatively poorly due to a high rate of rearrangement and extensive gene losses. Our high-resolution reconstruction reveals previously overlooked small-scale rearrangements, necessitating a revision to previous views on genome structure evolution in teleost and vertebrate.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Conclusions<\/jats:title>\n                  <jats:p>We have reconstructed the structure of a pre-WGD genome by employing a variational Bayes approach that was originally developed for inferring topics from millions of text documents. Interestingly, comparison of the macrosynteny and topic model algorithms suggests that macrosynteny can be regarded as documents on ancestral genome structure. From this perspective, the present study would seem to provide a textbook example of the prevalent metaphor that genomes are documents of evolutionary history.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The analysis data are available for download at http:\/\/www.gen.tcd.ie\/molevol\/supp_data\/MacrosyntenyTGD.zip, and the software written in Java is available upon request.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx259","type":"journal-article","created":{"date-parts":[[2017,4,20]],"date-time":"2017-04-20T07:52:13Z","timestamp":1492674733000},"page":"i369-i378","source":"Crossref","is-referenced-by-count":31,"title":["Genomes as documents of evolutionary history: a probabilistic macrosynteny model for the reconstruction of ancestral genomes"],"prefix":"10.1093","volume":"33","author":[{"given":"Yoichiro","family":"Nakatani","sequence":"first","affiliation":[{"name":"Department of Genetics, Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin 2, Ireland"}]},{"given":"Aoife","family":"McLysaght","sequence":"additional","affiliation":[{"name":"Department of Genetics, Smurfit Institute of Genetics, Trinity College Dublin, University of Dublin, Dublin 2, Ireland"}]}],"member":"286","published-online":{"date-parts":[[2017,7,12]]},"reference":[{"key":"2023051506504504800_btx259-B1","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1016\/S0092-8240(89)80047-3","article-title":"Algorithms for the optimal identification of segment neighborhoods","volume":"51","author":"Auger","year":"1989","journal-title":"Bull. Math. Biol"},{"key":"2023051506504504800_btx259-B2","doi-asserted-by":"crossref","first-page":"24501","DOI":"10.1038\/srep24501","article-title":"The Asian arowana (Scleropages formosus) genome provides new insights into the evolution of an early lineage of teleosts","volume":"6","author":"Bian","year":"2016","journal-title":"Sci. Rep"},{"key":"2023051506504504800_btx259-B3","first-page":"993","article-title":"Latent Dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res"},{"key":"2023051506504504800_btx259-B4","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1145\/2133806.2133826","article-title":"Probabilistic topic models","volume":"55","author":"Blei","year":"2012","journal-title":"Commun. ACM"},{"key":"2023051506504504800_btx259-B5","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1016\/j.tree.2009.09.007","article-title":"Genomes as documents of evolutionary history","volume":"25","author":"Boussau","year":"2010","journal-title":"Trends Ecol. Evol"},{"key":"2023051506504504800_btx259-B6","doi-asserted-by":"crossref","first-page":"427","DOI":"10.1038\/ng.3526","article-title":"The spotted gar genome illuminates vertebrate evolution and facilitates human-teleost comparisons","volume":"48","author":"Braasch","year":"2016","journal-title":"Nat. Genet"},{"key":"2023051506504504800_btx259-B7","doi-asserted-by":"crossref","first-page":"891","DOI":"10.1093\/genetics\/137.4.891","article-title":"Hitoshi Kihara, Japan\u2019s pioneer geneticist","volume":"137","author":"Crow","year":"1994","journal-title":"Genetics"},{"key":"2023051506504504800_btx259-B8","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1007\/BFb0030793","article-title":"Genome halving","volume":"1448","author":"El-Mabrouk","year":"1998","journal-title":"In Combinatorial Pattern Matching, Lect. Notes Comput. Sci"},{"key":"2023051506504504800_btx259-B9","doi-asserted-by":"crossref","first-page":"754","DOI":"10.1137\/S0097539700377177","article-title":"The reconstruction of doubled genomes","volume":"32","author":"El-Mabrouk","year":"2003","journal-title":"SIAM J. Comput"},{"key":"2023051506504504800_btx259-B10","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1007\/978-1-61779-582-4_15","volume-title":"Evolutionary Genomics, Methods Mol. Biol","author":"El-Mabrouk","year":"2012"},{"key":"2023051506504504800_btx259-B11","doi-asserted-by":"crossref","first-page":"S4.","DOI":"10.1186\/1471-2105-13-S19-S4","article-title":"A flexible ancestral genome reconstruction method based on gapped adjacencies","volume":"13","author":"Gagnon","year":"2012","journal-title":"BMC Bioinform"},{"key":"2023051506504504800_btx259-B12","doi-asserted-by":"crossref","first-page":"i257","DOI":"10.1093\/bioinformatics\/btr224","article-title":"Mapping ancestral genomes with massive gene loss: a matrix sandwich problem","volume":"27","author":"Gavranovi\u0107","year":"2011","journal-title":"Bioinformatics"},{"key":"2023051506504504800_btx259-B13","doi-asserted-by":"crossref","first-page":"e1000485","DOI":"10.1371\/journal.pgen.1000485","article-title":"Additions, losses, and rearrangements on the evolutionary route from a reconstructed ancestor to the modern Saccharomyces cerevisiae Genome","volume":"5","author":"Gordon","year":"2009","journal-title":"PLoS Genet"},{"key":"2023051506504504800_btx259-B14","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1093\/icb\/38.6.829","article-title":"Major transitions in animal evolution: a developmental genetic perspective","volume":"38","author":"Holland","year":"1998","journal-title":"Am. Zool"},{"key":"2023051506504504800_btx259-B15","doi-asserted-by":"crossref","first-page":"S8.","DOI":"10.1186\/1471-2105-13-S19-S8","article-title":"A consolidation algorithm for genomes fractionated after higher order polyploidization","volume":"13","author":"Jahn","year":"2012","journal-title":"BMC Bioinform"},{"key":"2023051506504504800_btx259-B16","doi-asserted-by":"crossref","first-page":"946","DOI":"10.1038\/nature03025","article-title":"Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype","volume":"431","author":"Jaillon","year":"2004","journal-title":"Nature"},{"key":"2023051506504504800_btx259-B17","doi-asserted-by":"crossref","first-page":"714","DOI":"10.1038\/nature05846","article-title":"The medaka draft genome and insights into vertebrate genome evolution","volume":"447","author":"Kasahara","year":"2007","journal-title":"Nature"},{"key":"2023051506504504800_btx259-B18","volume-title":"Story on Wheats","author":"Kihara","year":"1946"},{"key":"2023051506504504800_btx259-B19","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1093\/bioinformatics\/15.1.38","article-title":"Bayesian inference on biopolymer models","volume":"15","author":"Liu","year":"1999","journal-title":"Bioinformatics"},{"key":"2023051506504504800_btx259-B20","doi-asserted-by":"crossref","first-page":"9270","DOI":"10.1073\/pnas.0914697107","article-title":"Ohnologs in the human genome are dosage balanced and frequently associated with disease","volume":"107","author":"Makino","year":"2010","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051506504504800_btx259-B21","doi-asserted-by":"crossref","DOI":"10.1038\/ncomms3283","article-title":"Genome-wide deserts for copy number variation in vertebrates","author":"Makino","year":"2013","journal-title":"Nat. Commun"},{"key":"2023051506504504800_btx259-B22","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1073\/pnas.1309324111","article-title":"Ohnologs are overrepresented in pathogenic copy number mutations","volume":"111","author":"McLysaght","year":"2014","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023051506504504800_btx259-B23","author":"Muffato","year":"2010"},{"key":"2023051506504504800_btx259-B24","doi-asserted-by":"crossref","first-page":"122","DOI":"10.1002\/bies.20707","article-title":"Paleogenomics in vertebrates, or the recovery of lost genomes from the mist of time","volume":"30","author":"Muffato","year":"2008","journal-title":"BioEssays"},{"key":"2023051506504504800_btx259-B25","volume-title":"Machine Learning: A Probabilistic Perspective","author":"Murphy","year":"2012"},{"key":"2023051506504504800_btx259-B26","doi-asserted-by":"crossref","first-page":"1254","DOI":"10.1101\/gr.6316407","article-title":"Reconstruction of the vertebrate ancestral genome reveals dynamic genome reorganization in early vertebrates","volume":"17","author":"Nakatani","year":"2007","journal-title":"Genome Res"},{"key":"2023051506504504800_btx259-B39","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-86659-3","volume-title":"Evolution by Gene Duplication","author":"Ohno","year":"1970"},{"key":"2023051506504504800_btx259-B27","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1007\/978-3-642-01551-9_18","article-title":"Prediction of contiguous regions in the amniote ancestral genome","volume":"5542","author":"Ouangraoua","year":"2009","journal-title":"Lect. Notes Comput. Sci"},{"key":"2023051506504504800_btx259-B28","doi-asserted-by":"crossref","first-page":"2664","DOI":"10.1093\/bioinformatics\/btr461","article-title":"Reconstructing the architecture of the ancestral amniote genome","volume":"27","author":"Ouangraoua","year":"2011","journal-title":"Bioinformatics"},{"key":"2023051506504504800_btx259-B29","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1126\/science.1139158","article-title":"Sea anemone genome reveals ancestral eumetazoan gene repertoire and genomic organization","volume":"317","author":"Putnam","year":"2007","journal-title":"Science"},{"key":"2023051506504504800_btx259-B30","doi-asserted-by":"crossref","first-page":"1064","DOI":"10.1038\/nature06967","article-title":"The amphioxus genome and the evolution of the chordate karyotype","volume":"453","author":"Putnam","year":"2008","journal-title":"Nature"},{"key":"2023051506504504800_btx259-B31","doi-asserted-by":"crossref","first-page":"14366.","DOI":"10.1038\/ncomms14366","article-title":"Dosage sensitivity is a major determinant of human copy number variant pathogenicity","volume":"8","author":"Rice","year":"2017","journal-title":"Nat. Commun"},{"key":"2023051506504504800_btx259-B32","doi-asserted-by":"crossref","first-page":"i433","DOI":"10.1093\/bioinformatics\/btm169","article-title":"Polyploids, genome halving and phylogeny","volume":"23","author":"Sankoff","year":"2007","journal-title":"Bioinformatics"},{"key":"2023051506504504800_btx259-B33","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1038\/nature04562","article-title":"Multiple rounds of speciation associated with reciprocal gene loss in polyploid yeasts","volume":"440","author":"Scannell","year":"2006","journal-title":"Nature"},{"key":"2023051506504504800_btx259-B34","doi-asserted-by":"crossref","first-page":"725","DOI":"10.1038\/nrg2600","article-title":"The evolutionary significance of ancient genome duplications","volume":"10","author":"Van de Peer","year":"2009","journal-title":"Nat. Rev. Genet"},{"key":"2023051506504504800_btx259-B35","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1038\/75560","article-title":"Robustness? it\u2019s not where you think it is","volume":"25","author":"Wolfe","year":"2000","journal-title":"Nat. Genet"},{"key":"2023051506504504800_btx259-B36","doi-asserted-by":"crossref","first-page":"i96","DOI":"10.1093\/bioinformatics\/btn146","article-title":"Guided genome halving: hardness, heuristics and the history of the Hemiascomycetes","volume":"24","author":"Zheng","year":"2008","journal-title":"Bioinformatics"},{"key":"2023051506504504800_btx259-B37","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2105-14-S15-S8","article-title":"Practical aliquoting of flowering plant genomes","volume":"14","author":"Zheng","year":"2013","journal-title":"BMC Bioinform"},{"key":"2023051506504504800_btx259-B38","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1016\/0022-5193(65)90083-4","article-title":"Molecules as documents of evolutionary history","volume":"8","author":"Zuckerkandl","year":"1965","journal-title":"J. Theor. Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/14\/i369\/50315213\/bioinformatics_33_14_i369.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/14\/i369\/50315213\/bioinformatics_33_14_i369.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,15]],"date-time":"2023-05-15T06:51:29Z","timestamp":1684133489000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/14\/i369\/3953974"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,7,12]]},"references-count":39,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2017,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx259","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2017,7,15]]},"published":{"date-parts":[[2017,7,12]]}}}