{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:44:03Z","timestamp":1753875843278,"version":"3.41.2"},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2025,2,5]],"date-time":"2025-02-05T00:00:00Z","timestamp":1738713600000},"content-version":"vor","delay-in-days":4,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,2,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>The emergence of the \u2018telomere-to-telomere\u2019 genomics brought the challenge of identifying segmental duplications (SDs) in complete genomes. It further opened a possibility for identifying the differences in SDs across individual human genomes and studying the SD evolution. These newly emerged challenges require algorithms for reconstructing SDs in the most complex genomic regions that evaded all previous attempts to analyze their architecture, such as rapidly evolving immunoglobulin loci.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We describe the GenomeDecoder algorithm for inferring SDs and apply it to analyzing genomic architectures of various loci in primate genomes. Our analysis revealed that multiple duplications\/deletions led to a rapid birth\/death of immunoglobulin genes within the human population and large changes in genomic architecture of immunoglobulin loci across primate genomes. Comparison of immunoglobulin loci across primate genomes suggests that they are subjected to diversifying selection.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>GenomeDecoder is available at https:\/\/github.com\/ZhangZhenmiao\/GenomeDecoder. The software version and test data used in this paper are uploaded to https:\/\/doi.org\/10.5281\/zenodo.14753844.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf058","type":"journal-article","created":{"date-parts":[[2025,2,3]],"date-time":"2025-02-03T20:24:48Z","timestamp":1738614288000},"source":"Crossref","is-referenced-by-count":0,"title":["GenomeDecoder: inferring segmental duplications in highly repetitive genomic regions"],"prefix":"10.1093","volume":"41","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3748-1664","authenticated-orcid":false,"given":"Zhenmiao","family":"Zhang","sequence":"first","affiliation":[{"name":"Department of Computer Science and Engineering, University of California San Diego , La Jolla, CA 92093,","place":["United States"]},{"name":"Department of Computer Science, Hong Kong Baptist University , Hong Kong SAR,","place":["China"]}]},{"given":"Ishaan","family":"Gupta","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, University of California San Diego , La Jolla, CA 92093,","place":["United States"]}]},{"given":"Pavel A","family":"Pevzner","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, University of California San Diego , La Jolla, CA 92093,","place":["United States"]}]}],"member":"286","published-online":{"date-parts":[[2025,2,5]]},"reference":[{"key":"2025030422260671400_btaf058-B1","doi-asserted-by":"crossref","first-page":"943","DOI":"10.1101\/gr.082784.108","article-title":"Breakpoint graphs and ancestral genome reconstructions","volume":"19","author":"Alekseyev","year":"2009","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B2","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1089\/cmb.2015.0160","article-title":"Reconstruction of ancestral genomes in presence of gene gain and loss","volume":"23","author":"Avdeyev","year":"2016","journal-title":"J Comput Biol"},{"key":"2025030422260671400_btaf058-B3","doi-asserted-by":"crossref","first-page":"1075","DOI":"10.1038\/s41587-022-01220-6","article-title":"Multiplex de Bruijn graphs enable genome assembly from long, high-fidelity reads","volume":"40","author":"Bankevich","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2025030422260671400_btaf058-B4","first-page":"26","article-title":"Genome-scale evolution: reconstructing gene orders in the ancestral species","volume":"12","author":"Bourque","year":"2002","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B5","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1101\/gr.1975204","article-title":"Reconstructing the genomic architecture of ancestral mammals: lessons from human, mouse, and rat genomes","volume":"14","author":"Bourque","year":"2004","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B6","doi-asserted-by":"crossref","first-page":"1346","DOI":"10.1038\/s41592-023-01970-4","article-title":"UniAligner: a parameter-free framework for fast sequence alignment","volume":"20","author":"Bzikadze","year":"2023","journal-title":"Nat Methods"},{"key":"2025030422260671400_btaf058-B7","doi-asserted-by":"crossref","first-page":"1394","DOI":"10.1101\/gr.2289704","article-title":"Mauve: multiple alignment of conserved genomic sequence with rearrangements","volume":"14","author":"Darling","year":"2004","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B8","doi-asserted-by":"crossref","first-page":"2708","DOI":"10.1093\/bioinformatics\/bty172","article-title":"ASGART: fast and parallel genome scale segmental duplications mapping","volume":"34","author":"Delehelle","year":"2018","journal-title":"Bioinformatics"},{"key":"2025030422260671400_btaf058-B9","doi-asserted-by":"crossref","first-page":"e92621","DOI":"10.1371\/journal.pone.0092621","article-title":"SynChro: a fast and easy tool to reconstruct and visualize synteny blocks along eukaryotic chromosomes","volume":"9","author":"Drillon","year":"2014","journal-title":"PLoS One"},{"key":"2025030422260671400_btaf058-B10","doi-asserted-by":"crossref","first-page":"eabf7117","DOI":"10.1126\/science.abf7117","article-title":"Haplotype-resolved diverse human genomes and integrated analysis of structural variation","volume":"372","author":"Ebert","year":"2021","journal-title":"Science"},{"key":"2025030422260671400_btaf058-B11","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1093\/bioinformatics\/btq102","article-title":"Genome-wide synteny through highly sensitive sequence alignment: satsuma","volume":"26","author":"Grabherr","year":"2010","journal-title":"Bioinformatics"},{"key":"2025030422260671400_btaf058-B12","doi-asserted-by":"crossref","first-page":"3643","DOI":"10.1093\/bioinformatics\/bth397","article-title":"DAGchainer: a tool for mining segmental genome duplications and synteny","volume":"20","author":"Haas","year":"2004","journal-title":"Bioinformatics"},{"key":"2025030422260671400_btaf058-B13","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1186\/s13015-022-00210-2","article-title":"Fast characterization of segmental duplication structure in multiple genome assemblies","volume":"17","author":"I\u0161eri\u0107","year":"2022","journal-title":"Algorithms Mol Biol"},{"key":"2025030422260671400_btaf058-B14","doi-asserted-by":"crossref","first-page":"i748","DOI":"10.1093\/bioinformatics\/bty597","article-title":"A fast adaptive algorithm for computing whole-genome homology maps","volume":"34","author":"Jain","year":"2018","journal-title":"Bioinformatics"},{"key":"2025030422260671400_btaf058-B15","doi-asserted-by":"crossref","first-page":"1361","DOI":"10.1038\/ng.2007.9","article-title":"Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution","volume":"39","author":"Jiang","year":"2007","journal-title":"Nat Genet"},{"key":"2025030422260671400_btaf058-B16","doi-asserted-by":"crossref","first-page":"11484","DOI":"10.1073\/pnas.1932072100","article-title":"Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes","volume":"100","author":"Kent","year":"2003","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2025030422260671400_btaf058-B17","first-page":"5048","article-title":"Gene duplication as a mechanism of genomic adaptation to a changing environment","volume":"279","author":"Kondrashov","year":"2012","journal-title":"Proc Biol Sci"},{"key":"2025030422260671400_btaf058-B18","doi-asserted-by":"crossref","first-page":"giaa047","DOI":"10.1093\/gigascience\/giaa047","article-title":"halSynteny: a fast, easy-to-use conserved synteny block construction method for multiple whole-genome alignments","volume":"9","author":"Krasheninnikova","year":"2020","journal-title":"Gigascience"},{"key":"2025030422260671400_btaf058-B19","doi-asserted-by":"crossref","first-page":"btae144","DOI":"10.1093\/bioinformatics\/btae144","article-title":"Digger: directed annotation of immunoglobulin and T cell receptor V, D, and J gene sequences and assemblies","volume":"40","author":"Lees","year":"2024","journal-title":"Bioinformatics"},{"key":"2025030422260671400_btaf058-B20","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1038\/s41586-023-05896-x","article-title":"A draft human pangenome reference","volume":"617","author":"Liao","year":"2023","journal-title":"Nature"},{"key":"2025030422260671400_btaf058-B21","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1186\/s12859-018-2026-4","article-title":"Inferring synteny between genome assemblies: a systematic evaluation","volume":"19","author":"Liu","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2025030422260671400_btaf058-B22","doi-asserted-by":"crossref","first-page":"1557","DOI":"10.1101\/gr.5383506","article-title":"Reconstructing contiguous regions of an ancestral genome","volume":"16","author":"Ma","year":"2006","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B23","first-page":"215","volume-title":"Lecture Notes in Computer Science, 8126 Proceedings of Workshop on Algorithms in Bioinformatics (WABI 2013)","author":"Minkin","year":"2013"},{"key":"2025030422260671400_btaf058-B24","doi-asserted-by":"crossref","first-page":"i706","DOI":"10.1093\/bioinformatics\/bty586","article-title":"Fast characterization of segmental duplications in genome assemblies","volume":"34","author":"Numanagi\u0107","year":"2018","journal-title":"Bioinformatics"},{"key":"2025030422260671400_btaf058-B25","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1126\/science.abj6987","article-title":"The complete sequence of a human genome","volume":"376","author":"Nurk","year":"2022","journal-title":"Science"},{"key":"2025030422260671400_btaf058-B26","doi-asserted-by":"crossref","first-page":"1814","DOI":"10.1101\/gr.076554.108","article-title":"Enredo and pecan: genome-wide mammalian consistency-based multiple alignment with paralogs","volume":"18","author":"Paten","year":"2008","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B27","doi-asserted-by":"crossref","first-page":"e14","DOI":"10.1371\/journal.pcbi.0020014","article-title":"The fragile breakage versus random breakage models of chromosome evolution","volume":"2","author":"Peng","year":"2006","journal-title":"PLoS Comput Biol"},{"key":"2025030422260671400_btaf058-B28","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1101\/gr.757503","article-title":"Genome rearrangements in mammalian evolution: lessons from human and mouse genomes","volume":"13","author":"Pevzner","year":"2003","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B29","doi-asserted-by":"crossref","first-page":"1786","DOI":"10.1101\/gr.2395204","article-title":"De novo repeat classification and fragment assembly","volume":"14","author":"Pevzner","year":"2004","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B30","doi-asserted-by":"crossref","first-page":"2509","DOI":"10.1093\/bioinformatics\/btq465","article-title":"DRIMM-Synteny: decomposing genomes into evolutionary conserved segments","volume":"26","author":"Pham","year":"2010","journal-title":"Bioinformatics"},{"key":"2025030422260671400_btaf058-B31","doi-asserted-by":"crossref","first-page":"901","DOI":"10.1101\/gr.228718.117","article-title":"Detection and analysis of ancient segmental duplications in mammalian genomes","volume":"28","author":"Pu","year":"2018","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B32","doi-asserted-by":"crossref","first-page":"4419","DOI":"10.1038\/s41467-023-40070-x","article-title":"Genetic variation in the immunoglobulin heavy chain locus shapes the human antibody repertoire","volume":"14","author":"Rodriguez","year":"2023","journal-title":"Nat Commun"},{"key":"2025030422260671400_btaf058-B33","doi-asserted-by":"crossref","first-page":"737","DOI":"10.1038\/s41586-021-03451-0","article-title":"Towards complete and error-free genome assemblies of all vertebrate species","volume":"592","author":"Rhie","year":"2021","journal-title":"Nature"},{"key":"2025030422260671400_btaf058-B34","doi-asserted-by":"crossref","first-page":"1547","DOI":"10.1101\/gr.259598.119","article-title":"V (DD) J recombination is an important and evolutionarily conserved mechanism for generating antibodies with unusually long CDR3s","volume":"30","author":"Safonova","year":"2020","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B35","doi-asserted-by":"crossref","first-page":"e35","DOI":"10.1371\/journal.pcbi.0020035","article-title":"The signal in the genomes","volume":"2","author":"Sankoff","year":"2006","journal-title":"PLoS Comput Biol"},{"key":"2025030422260671400_btaf058-B36","doi-asserted-by":"crossref","first-page":"1152","DOI":"10.1101\/gr.276676.122","article-title":"Gene prediction in the immunoglobulin loci","volume":"32","author":"Sirupurapu","year":"2022","journal-title":"Genome Res"},{"key":"2025030422260671400_btaf058-B37","doi-asserted-by":"crossref","first-page":"1394","DOI":"10.1093\/bioinformatics\/btw753","article-title":"Edlib: a C\/C\u2009++ library for fast, exact sequence alignment using edit distance","volume":"33","author":"Sosic","year":"2017","journal-title":"Bioinformatics"},{"key":"2025030422260671400_btaf058-B38","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1038\/nature01262","article-title":"Initial sequencing and comparative analysis of the mouse genome","volume":"420","author":"Waterston","year":"2002","journal-title":"Nature"},{"key":"2025030422260671400_btaf058-B39","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1016\/j.it.2017.04.003","article-title":"The individual and population genetics of antibody immunity","volume":"38","author":"Watson","year":"2017","journal-title":"Trends Immunol"},{"year":"2024","author":"Yoo","key":"2025030422260671400_btaf058-B40"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf058\/61771221\/btaf058.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/2\/btaf058\/61771221\/btaf058.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/2\/btaf058\/61771221\/btaf058.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,4]],"date-time":"2025-03-04T22:26:32Z","timestamp":1741127192000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf058\/8002818"}},"subtitle":[],"editor":[{"given":"Janet","family":"Kelso","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,2]]},"references-count":40,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2025,2,4]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf058","relation":{},"ISSN":["1367-4811"],"issn-type":[{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2025,2]]},"published":{"date-parts":[[2025,2]]},"article-number":"btaf058"}}