{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T08:20:24Z","timestamp":1772180424426,"version":"3.50.1"},"reference-count":39,"publisher":"Oxford University Press (OUP)","issue":"11","license":[{"start":{"date-parts":[[2017,1,20]],"date-time":"2017-01-20T00:00:00Z","timestamp":1484870400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Accurate alignments between entire genomes are crucial for comparative genomics. However, computing sensitive and accurate genome alignments is a challenging problem, complicated by genomic rearrangements.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Here we present a fast approach, called chainCleaner, that improves the specificity in genome alignments by accurately detecting and removing local alignments that obscure the evolutionary history of genomic rearrangements. Systematic tests on alignments between the human and other vertebrate genomes show that chainCleaner (i) improves the alignment of numerous orthologous genes, (ii) exposes alignments between exons of orthologous genes that were masked before by alignments to pseudogenes, and (iii) recovers hundreds of kilobases in local alignments that otherwise would fall below a minimum score threshold. Our approach has broad applicability to improve the sensitivity and specificity of genome alignments.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and Implementation<\/jats:title>\n                  <jats:p>http:\/\/bds.mpi-cbg.de\/hillerlab\/chainCleaner\/ or https:\/\/github.com\/ucscGenomeBrowser\/kent<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx024","type":"journal-article","created":{"date-parts":[[2017,1,21]],"date-time":"2017-01-21T01:17:14Z","timestamp":1484961434000},"page":"1596-1603","source":"Crossref","is-referenced-by-count":60,"title":["chainCleaner improves genome alignment specificity and sensitivity"],"prefix":"10.1093","volume":"33","author":[{"given":"Hernando G","family":"Suarez","sequence":"first","affiliation":[{"name":"Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany"},{"name":"Max Planck Institute for the Physics of Complex Systems, Dresden, Germany"}]},{"given":"Bjoern E","family":"Langer","sequence":"additional","affiliation":[{"name":"Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany"},{"name":"Max Planck Institute for the Physics of Complex Systems, Dresden, Germany"}]},{"given":"Pradnya","family":"Ladde","sequence":"additional","affiliation":[{"name":"Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany"},{"name":"Max Planck Institute for the Physics of Complex Systems, Dresden, Germany"}]},{"given":"Michael","family":"Hiller","sequence":"additional","affiliation":[{"name":"Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany"},{"name":"Max Planck Institute for the Physics of Complex Systems, Dresden, Germany"}]}],"member":"286","published-online":{"date-parts":[[2017,1,20]]},"reference":[{"key":"2023020205304255800_btx024-B1","doi-asserted-by":"crossref","first-page":"1711","DOI":"10.1126\/science.282.5394.1711","article-title":"Zebrafish hox clusters and vertebrate genome evolution","volume":"282","author":"Amores","year":"1998","journal-title":"Science"},{"key":"2023020205304255800_btx024-B2","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1093\/bioinformatics\/btq665","article-title":"Mugsy: fast multiple alignment of closely related whole genomes","volume":"27","author":"Angiuoli","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020205304255800_btx024-B3","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1038\/nature05874","article-title":"Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project","volume":"447","author":"Birney","year":"2007","journal-title":"Nature"},{"key":"2023020205304255800_btx024-B4","doi-asserted-by":"crossref","first-page":"2412","DOI":"10.1101\/gr.2800104","article-title":"Reconstructing large regions of an ancestral mammalian genome in silico","volume":"14","author":"Blanchette","year":"2004","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B5","doi-asserted-by":"crossref","first-page":"708","DOI":"10.1101\/gr.1933104","article-title":"Aligning multiple genomic sequences with the threaded blockset aligner","volume":"14","author":"Blanchette","year":"2004","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B6","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1101\/gr.1960404","article-title":"MAVID: constrained ancestral alignment of multiple sequences","volume":"14","author":"Bray","year":"2004","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B7","doi-asserted-by":"crossref","first-page":"66","DOI":"10.1186\/1471-2105-4-66","article-title":"Fast and sensitive multiple alignment of large genomic sequences","volume":"4","author":"Brudno","year":"2003","journal-title":"BMC Bioinformatics"},{"key":"2023020205304255800_btx024-B8","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1101\/gr.926603","article-title":"LAGAN and Multi-LAGAN: efficient tools for large-scale multiple alignment of genomic DNA","volume":"13","author":"Brudno","year":"2003","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B9","doi-asserted-by":"crossref","first-page":"e1002852","DOI":"10.1371\/journal.pgen.1002852","article-title":"Human developmental enhancers conserved between deuterostomes and protostomes","volume":"8","author":"Clarke","year":"2012","journal-title":"PLoS Genet"},{"key":"2023020205304255800_btx024-B10","doi-asserted-by":"crossref","first-page":"901","DOI":"10.1101\/gr.3577405","article-title":"Distribution and intensity of constraint in mammalian genomic sequence","volume":"15","author":"Cooper","year":"2005","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B11","doi-asserted-by":"crossref","first-page":"e11147.","DOI":"10.1371\/journal.pone.0011147","article-title":"progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement","volume":"5","author":"Darling","year":"2010","journal-title":"PLoS One"},{"key":"2023020205304255800_btx024-B12","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1007\/978-1-61779-582-4_8","article-title":"Whole-genome alignment","volume":"855","author":"Dewey","year":"2012","journal-title":"Methods Mol. Biol"},{"key":"2023020205304255800_btx024-B13","doi-asserted-by":"crossref","first-page":"682","DOI":"10.1101\/gr.081778.108","article-title":"Multiple whole-genome alignments without a reference organism","volume":"19","author":"Dubchak","year":"2009","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B14","doi-asserted-by":"crossref","first-page":"2077","DOI":"10.1101\/gr.174920.114","article-title":"Alignathon: a competitive assessment of whole-genome alignment methods","volume":"24","author":"Earl","year":"2014","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B15","doi-asserted-by":"crossref","first-page":"106.","DOI":"10.1186\/s13059-015-0670-9","article-title":"Split-alignment of genomes finds orthologies more accurately","volume":"16","author":"Frith","year":"2015","journal-title":"Genome Biol"},{"key":"2023020205304255800_btx024-B16","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1093\/bioinformatics\/btq102","article-title":"Genome-wide synteny through highly sensitive sequence alignment: Satsuma","volume":"26","author":"Grabherr","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020205304255800_btx024-B17","author":"Harris","year":"2007"},{"key":"2023020205304255800_btx024-B18","doi-asserted-by":"crossref","DOI":"10.1093\/database\/baw053","article-title":"Ensembl comparative genomics resources","volume":"2016","author":"Herrero","year":"2016","journal-title":"Database: J Biol Databases Curat"},{"key":"2023020205304255800_btx024-B19","doi-asserted-by":"crossref","first-page":"e151","DOI":"10.1093\/nar\/gkt557","article-title":"Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish","volume":"41","author":"Hiller","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023020205304255800_btx024-B20","doi-asserted-by":"crossref","first-page":"817","DOI":"10.1016\/j.celrep.2012.08.032","article-title":"\"forward genomics\" approach links genotype to phenotype using independent phenotypic losses among related species","volume":"2","author":"Hiller","year":"2012","journal-title":"Cell Rep"},{"key":"2023020205304255800_btx024-B21","doi-asserted-by":"crossref","first-page":"695","DOI":"10.1038\/nature03154","article-title":"Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution","volume":"432","author":"Hillier","year":"2004","journal-title":"Nature"},{"key":"2023020205304255800_btx024-B22","doi-asserted-by":"crossref","first-page":"11484","DOI":"10.1073\/pnas.1932072100","article-title":"Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes","volume":"100","author":"Kent","year":"2003","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023020205304255800_btx024-B23","doi-asserted-by":"crossref","first-page":"bar030","DOI":"10.1093\/database\/bar030","article-title":"Ensembl BioMarts: a hub for data retrieval across taxonomic space","volume":"2011","author":"Kinsella","year":"2011","journal-title":"Database: J Biol Databases Curat"},{"key":"2023020205304255800_btx024-B24","doi-asserted-by":"crossref","first-page":"1916","DOI":"10.1101\/gr.108753.110","article-title":"Locating protein-coding sequences under selection for additional, overlapping functions in 29 mammalian genomes","volume":"21","author":"Lin","year":"2011","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B25","doi-asserted-by":"crossref","first-page":"476","DOI":"10.1038\/nature10530","article-title":"A high-resolution map of human evolutionary constraint using 29 mammals","volume":"478","author":"Lindblad-Toh","year":"2011","journal-title":"Nature"},{"key":"2023020205304255800_btx024-B26","doi-asserted-by":"crossref","first-page":"8005","DOI":"10.1073\/pnas.0611223104","article-title":"Thousands of human mobile element fragments undergo strong purifying selection near developmental genes","volume":"104","author":"Lowe","year":"2007","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023020205304255800_btx024-B27","doi-asserted-by":"crossref","first-page":"1557","DOI":"10.1101\/gr.5383506","article-title":"Reconstructing contiguous regions of an ancestral genome","volume":"16","author":"Ma","year":"2006","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B28","doi-asserted-by":"crossref","first-page":"216","DOI":"10.1038\/nature09774","article-title":"Human-specific loss of regulatory DNA and the evolution of human-specific traits","volume":"471","author":"McLean","year":"2011","journal-title":"Nature"},{"key":"2023020205304255800_btx024-B29","doi-asserted-by":"crossref","first-page":"469","DOI":"10.1089\/cmb.2010.0252","article-title":"Cactus graphs for genome comparisons","volume":"18","author":"Paten","year":"2011","journal-title":"J. Comp. Biol: J. Comp. Mol. Cell Biol"},{"key":"2023020205304255800_btx024-B30","doi-asserted-by":"crossref","first-page":"1512","DOI":"10.1101\/gr.123356.111","article-title":"Cactus: algorithms for genome multiple sequence alignment","volume":"21","author":"Paten","year":"2011","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B31","doi-asserted-by":"crossref","first-page":"1814","DOI":"10.1101\/gr.076554.108","article-title":"Enredo and Pecan: genome-wide mammalian consistency-based multiple alignment with paralogs","volume":"18","author":"Paten","year":"2008","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B32","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1038\/nature05113","article-title":"An RNA gene expressed during cortical development evolved rapidly in humans","volume":"443","author":"Pollard","year":"2006","journal-title":"Nature"},{"key":"2023020205304255800_btx024-B33","doi-asserted-by":"crossref","first-page":"786","DOI":"10.1126\/science.1130738","article-title":"Accelerated evolution of conserved noncoding sequences in humans","volume":"314","author":"Prabhakar","year":"2006","journal-title":"Science"},{"key":"2023020205304255800_btx024-B34","doi-asserted-by":"crossref","first-page":"2135","DOI":"10.1093\/molbev\/msw098","article-title":"Controlling for phylogenetic relatedness and evolutionary rates improves the discovery of associations between species' phenotypic and genomic differences","volume":"33","author":"Prudent","year":"2016","journal-title":"Mol. Biol. Evol"},{"key":"2023020205304255800_btx024-B35","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1186\/1471-2105-12-77","article-title":"pROC: an open-source package for R and S+ to analyze and compare ROC curves","volume":"12","author":"Robin","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023020205304255800_btx024-B36","doi-asserted-by":"crossref","first-page":"D670","DOI":"10.1093\/nar\/gku1177","article-title":"The UCSC Genome Browser database: 2015 update","volume":"43","author":"Rosenbloom","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020205304255800_btx024-B37","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.1101\/gr.3715005","article-title":"Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes","volume":"15","author":"Siepel","year":"2005","journal-title":"Genome Res"},{"key":"2023020205304255800_btx024-B38","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1038\/nature06340","article-title":"Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures","volume":"450","author":"Stark","year":"2007","journal-title":"Nature"},{"key":"2023020205304255800_btx024-B39","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1038\/nature01262","article-title":"Initial sequencing and comparative analysis of the mouse genome","volume":"420","author":"Waterston","year":"2002","journal-title":"Nature"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/11\/1596\/49039938\/bioinformatics_33_11_1596.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/11\/1596\/49039938\/bioinformatics_33_11_1596.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T05:32:47Z","timestamp":1675315967000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/11\/1596\/2929344"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,1,20]]},"references-count":39,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2017,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx024","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2017,6,1]]},"published":{"date-parts":[[2017,1,20]]}}}