{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T07:05:29Z","timestamp":1780383929050,"version":"3.54.1"},"reference-count":51,"publisher":"Oxford University Press (OUP)","issue":"11","license":[{"start":{"date-parts":[[2023,10,25]],"date-time":"2023-10-25T00:00:00Z","timestamp":1698192000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Australian Government National Collaborative Research"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Whole genome alignment of eukaryote species remains an important method for the determination of sequence and structural variations and can also be used to ascertain the representative non-redundant core-genome sequence of a population. Many whole genome alignment tools were first developed for the more mature analysis of prokaryote species with few current tools containing the functionality to process larger genomes of eukaryotes as well as genomes of more divergent species. In addition, the functionality of these tools becomes computationally prohibitive due to the significant compute resources needed to handle larger genomes.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this research, we present CoreDetector, an easy-to-use general-purpose program that can align the core-genome sequences for a range of genome sizes and divergence levels. To illustrate the flexibility of CoreDetector, we conducted alignments of a large set of closely related fungal pathogen and hexaploid wheat cultivar genomes as well as more divergent fly and rodent species genomes. In all cases, compared to existing multiple genome alignment tools, CoreDetector exhibited improved flexibility, efficiency, and competitive accuracy in tested cases.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>CoreDetector was developed in the cross platform, and easily deployable, Java language. A packaged pipeline is readily executable in a bash terminal without any external need for Perl or Python environments. Installation, example data, and usage instructions for CoreDetector are freely available from https:\/\/github.com\/mfruzan\/CoreDetector.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad628","type":"journal-article","created":{"date-parts":[[2023,10,25]],"date-time":"2023-10-25T19:01:17Z","timestamp":1698260477000},"source":"Crossref","is-referenced-by-count":5,"title":["CoreDetector: a flexible and efficient program for core-genome alignment of evolutionary diverse genomes"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3525-8338","authenticated-orcid":false,"given":"Mario","family":"Fruzangohar","sequence":"first","affiliation":[{"name":"The Biometry Hub, School of Agriculture, Food and Wine, University of Adelaide , Urrbrae, South Australia 5064, Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3502-7612","authenticated-orcid":false,"given":"Paula","family":"Moolhuijzen","sequence":"additional","affiliation":[{"name":"Centre for Crop Disease Management, School of Molecular and Life Sciences, Curtin University , Bentley, Western Australia 6102, Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nicolette","family":"Bakaj","sequence":"additional","affiliation":[{"name":"The Biometry Hub, School of Agriculture, Food and Wine, University of Adelaide , Urrbrae, South Australia 5064, Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Julian","family":"Taylor","sequence":"additional","affiliation":[{"name":"The Biometry Hub, School of Agriculture, Food and Wine, University of Adelaide , Urrbrae, South Australia 5064, Australia"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2023,10,25]]},"reference":[{"key":"2023112210173635000_btad628-B1","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1016\/j.jda.2004.08.011","article-title":"Chaining algorithms for multiple genome comparison","volume":"3","author":"Abouelhoda","year":"2005","journal-title":"J Discrete Algorithms"},{"key":"2023112210173635000_btad628-B2","doi-asserted-by":"crossref","first-page":"6257","DOI":"10.3390\/ijms22126257","article-title":"Comparative genomics of eight Fusarium graminearum strains with contrasting aggressiveness reveals an expanded open pangenome and extended effector content signatures","volume":"22","author":"Alouane","year":"2021","journal-title":"Int J Mol Sci"},{"key":"2023112210173635000_btad628-B3","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1093\/bioinformatics\/btq665","article-title":"Mugsy: fast multiple alignment of closely related whole genomes","volume":"27","author":"Angiuoli","year":"2011","journal-title":"Bioinformatics"},{"key":"2023112210173635000_btad628-B4","doi-asserted-by":"crossref","first-page":"eaar7191","DOI":"10.1126\/science.aar7191","article-title":"Shifting the limits in wheat research and breeding using a fully annotated reference genome","volume":"361","author":"Appels","year":"2018","journal-title":"Science"},{"key":"2023112210173635000_btad628-B5","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1146\/annurev-animal-020518-115005","article-title":"Whole-genome alignment and comparative annotation","volume":"7","author":"Armstrong","year":"2019","journal-title":"Annu Rev Anim Biosci"},{"key":"2023112210173635000_btad628-B6","doi-asserted-by":"crossref","first-page":"246","DOI":"10.1038\/s41586-020-2871-y","article-title":"Progressive cactus is a multiple-genome aligner for the thousand-genome era","volume":"587","author":"Armstrong","year":"2020","journal-title":"Nature"},{"key":"2023112210173635000_btad628-B7","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1186\/s12915-020-0744-3","article-title":"A 19-isolate reference-quality global pangenome for the fungal wheat pathogen Zymoseptoria tritici","volume":"18","author":"Badet","year":"2020","journal-title":"BMC Biol"},{"key":"2023112210173635000_btad628-B8","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1016\/j.jtbi.2014.04.040","article-title":"Neural network and SVM classifiers accurately predict lipid binding proteins, irrespective of sequence homology","volume":"356","author":"Bakhtiarizadeh","year":"2014","journal-title":"J Theor Biol"},{"key":"2023112210173635000_btad628-B9","doi-asserted-by":"crossref","first-page":"382","DOI":"10.1186\/s12864-021-07699-8","article-title":"Chromosome-level genome assembly and manually-curated proteome of model necrotroph Parastagonospora nodorum Sn15 reveals a genome-wide trove of candidate effector homologs, and redundancy of virulence-related functions within an accessory chromosome","volume":"22","author":"Bertazzoni","year":"2021","journal-title":"BMC Genomics"},{"key":"2023112210173635000_btad628-B10","doi-asserted-by":"crossref","first-page":"1009","DOI":"10.1093\/bib\/bbv099","article-title":"Multiple sequence alignment modeling: methods and applications","volume":"17","author":"Chatzou","year":"2016","journal-title":"Brief Bioinform"},{"key":"2023112210173635000_btad628-B11","first-page":"Unit 4.10","article-title":"Using RepeatMasker to identify repetitive elements in genomic sequences","volume":"Chapter 4","author":"Chen","year":"2004","journal-title":"Curr Protoc Bioinformatics"},{"key":"2023112210173635000_btad628-B12","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1038\/nature06341","article-title":"Evolution of genes and genomes on the Drosophila phylogeny","volume":"450","author":"Clark","year":"2007","journal-title":"Nature"},{"key":"2023112210173635000_btad628-B14","doi-asserted-by":"crossref","first-page":"e11147","DOI":"10.1371\/journal.pone.0011147","article-title":"progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement","volume":"5","author":"Darling","year":"2010","journal-title":"PLoS One"},{"key":"2023112210173635000_btad628-B15","first-page":"Unit 10.3","article-title":"Using MUMmer to identify similar regions in large sequence sets","volume":"Chapter 10","author":"Delcher","year":"2003","journal-title":"Curr Protoc Bioinformatics"},{"key":"2023112210173635000_btad628-B16","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1186\/1471-2105-5-113","article-title":"MUSCLE: a multiple sequence alignment method with reduced time and space complexity","volume":"5","author":"Edgar","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023112210173635000_btad628-B17","doi-asserted-by":"crossref","first-page":"494","DOI":"10.1111\/2041-210X.13358","article-title":"Visual TreeCmp: comprehensive comparison of phylogenetic trees on the web","volume":"11","author":"Goluch","year":"2020","journal-title":"Methods Ecol Evol"},{"key":"2023112210173635000_btad628-B18","doi-asserted-by":"crossref","first-page":"iyac035","DOI":"10.1093\/genetics\/iyac035","article-title":"FlyBase: a guided tour of highlighted features","volume":"220","author":"Gramates","year":"2022","journal-title":"Genetics"},{"key":"2023112210173635000_btad628-B19","volume-title":"Improved Pairwise Alignment of Genomic DNA","author":"Harris","year":"2007"},{"key":"2023112210173635000_btad628-B20","doi-asserted-by":"crossref","first-page":"1169","DOI":"10.1093\/bioinformatics\/btu815","article-title":"Andi: fast and accurate estimation of evolutionary distances between closely related genomes","volume":"31","author":"Haubold","year":"2015","journal-title":"Bioinformatics"},{"key":"2023112210173635000_btad628-B21","doi-asserted-by":"crossref","first-page":"1341","DOI":"10.1093\/bioinformatics\/btt128","article-title":"HAL: a hierarchical format for storing and analyzing multiple genome alignments","volume":"29","author":"Hickey","year":"2013","journal-title":"Bioinformatics"},{"key":"2023112210173635000_btad628-B22","doi-asserted-by":"crossref","first-page":"3167","DOI":"10.1534\/g3.119.400560","article-title":"Genome size evolution differs between drosophila subgenera with striking differences in male and female genome size in Sophophora","volume":"9","author":"Hjelmen","year":"2019","journal-title":"G3 (Bethesda)"},{"key":"2023112210173635000_btad628-B23","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1007\/BF02257378","article-title":"The alignment of sets of sequences and the construction of phyletic trees: an integrated method","volume":"20","author":"Hogeweg","year":"1984","journal-title":"J Mol Evol"},{"key":"2023112210173635000_btad628-B24","doi-asserted-by":"crossref","first-page":"i748","DOI":"10.1093\/bioinformatics\/bty597","article-title":"A fast adaptive algorithm for computing whole-genome homology maps","volume":"34","author":"Jain","year":"2018","journal-title":"Bioinformatics"},{"key":"2023112210173635000_btad628-B25","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1186\/s13059-022-02735-6","article-title":"Multiple genome alignment in the telomere-to-telomere assembly era","volume":"23","author":"Kille","year":"2022","journal-title":"Genome Biol"},{"key":"2023112210173635000_btad628-B26","doi-asserted-by":"crossref","first-page":"2040","DOI":"10.1093\/bioinformatics\/btz903","article-title":"Phylonium: fast estimation of evolutionary distances from large samples of similar genomes","volume":"36","author":"Kl\u00f6tzl","year":"2020","journal-title":"Bioinformatics"},{"key":"2023112210173635000_btad628-B27","doi-asserted-by":"crossref","first-page":"474","DOI":"10.1186\/1471-2105-7-474","article-title":"Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor","volume":"7","author":"Kohany","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023112210173635000_btad628-B28","doi-asserted-by":"crossref","first-page":"1812","DOI":"10.1093\/molbev\/msx116","article-title":"TimeTree: a resource for timelines, timetrees, and divergence times","volume":"34","author":"Kumar","year":"2017","journal-title":"Mol Biol Evol"},{"key":"2023112210173635000_btad628-B29","doi-asserted-by":"crossref","first-page":"3094","DOI":"10.1093\/bioinformatics\/bty191","article-title":"Minimap2: pairwise alignment for nucleotide sequences","volume":"34","author":"Li","year":"2018","journal-title":"Bioinformatics"},{"key":"2023112210173635000_btad628-B30","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1186\/s12864-020-6569-1","article-title":"GSAlign: an efficient sequence alignment tool for intra-species genomes","volume":"21","author":"Lin","year":"2020","journal-title":"BMC Genomics"},{"key":"2023112210173635000_btad628-B31","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1534\/g3.112.004044","article-title":"Comparative genomics of a plant-pathogenic fungus, Pyrenophora tritici-repentis, reveals transduplication and the impact of repeat elements on pathogenicity and population divergence","volume":"3","author":"Manning","year":"2013","journal-title":"G3 (Bethesda)"},{"key":"2023112210173635000_btad628-B32","doi-asserted-by":"crossref","first-page":"e1005944","DOI":"10.1371\/journal.pcbi.1005944","article-title":"MUMmer4: a fast and versatile genome alignment system","volume":"14","author":"Mar\u00e7ais","year":"2018","journal-title":"PLoS Comput Biol"},{"key":"2023112210173635000_btad628-B33","doi-asserted-by":"crossref","first-page":"6327","DOI":"10.1038\/s41467-020-19777-8","article-title":"Scalable multiple whole-genome alignment and locally collinear block construction with SibeliaZ","volume":"11","author":"Minkin","year":"2020","journal-title":"Nat Commun"},{"key":"2023112210173635000_btad628-B34","first-page":"1","article-title":"Comparative genomics of the wheat fungal pathogen Pyrenophora tritici-repentis reveals chromosomal variations and genome plasticity","volume":"19","author":"Moolhuijzen","year":"2018","journal-title":"BMC Genomics"},{"key":"2023112210173635000_btad628-B36","first-page":"mgen000872","article-title":"A global pangenome for the wheat fungal pathogen Pyrenophora tritici-repentis and prediction of effector protein structural homology","volume":"8","author":"Moolhuijzen","year":"2022","journal-title":"Microb Genom"},{"key":"2023112210173635000_btad628-B37","first-page":"101","article-title":"Complexities of hierarchic clustering algorithms: state of the art","volume":"1","author":"Murtagh","year":"1984","journal-title":"Comput Stat Q"},{"key":"2023112210173635000_btad628-B38","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","article-title":"T-Coffee: a novel method for fast and accurate multiple sequence alignment","volume":"302","author":"Notredame","year":"2000","journal-title":"J Mol Biol"},{"key":"2023112210173635000_btad628-B39","doi-asserted-by":"crossref","first-page":"1814","DOI":"10.1101\/gr.076554.108","article-title":"Enredo and Pecan: genome-wide mammalian consistency-based multiple alignment with paralogs","volume":"18","author":"Paten","year":"2008","journal-title":"Genome Res"},{"key":"2023112210173635000_btad628-B40","doi-asserted-by":"crossref","first-page":"11.12.1","DOI":"10.1002\/0471250953.bi1112s47","article-title":"BEDTools: the Swiss-army tool for genome feature analysis","volume":"47","author":"Quinlan","year":"2014","journal-title":"Curr Protoc Bioinformatics"},{"key":"2023112210173635000_btad628-B41","doi-asserted-by":"crossref","first-page":"2336","DOI":"10.1101\/gr.2657504","article-title":"A novel method for multiple alignment of sequences with repeated and shuffled elements","volume":"14","author":"Raphael","year":"2004","journal-title":"Genome Res"},{"key":"2023112210173635000_btad628-B42","first-page":"243","volume-title":"Bioinformatics Methods and Protocols","author":"Retief","year":"2000"},{"key":"2023112210173635000_btad628-B43","first-page":"406","article-title":"The neighbor-joining method: a new method for reconstructing phylogenetic trees","volume":"4","author":"Saitou","year":"1987","journal-title":"Mol Biol Evol"},{"key":"2023112210173635000_btad628-B44","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1186\/s13059-019-1632-4","article-title":"Skmer: assembly-free and alignment-free sample identification using genome skims","volume":"20","author":"Sarmashghi","year":"2019","journal-title":"Genome Biol"},{"key":"2023112210173635000_btad628-B45","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/1471-2105-6-31","article-title":"Automated generation of heuristics for biological sequence comparison","volume":"6","author":"Slater","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023112210173635000_btad628-B46","doi-asserted-by":"crossref","first-page":"e18093","DOI":"10.1371\/journal.pone.0018093","article-title":"A comprehensive benchmark study of multiple sequence alignment methods: current challenges and future perspectives","volume":"6","author":"Thompson","year":"2011","journal-title":"PLoS One"},{"key":"2023112210173635000_btad628-B47","doi-asserted-by":"crossref","first-page":"e0143465","DOI":"10.1371\/journal.pone.0143465","article-title":"Machine learning based classification of microsatellite variation: an effective approach for phylogeographic characterization of olive populations","volume":"10","author":"Torkzaban","year":"2015","journal-title":"PLoS One"},{"key":"2023112210173635000_btad628-B48","doi-asserted-by":"crossref","first-page":"524","DOI":"10.1186\/s13059-014-0524-x","article-title":"The harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes","volume":"15","author":"Treangen","year":"2014","journal-title":"Genome Biol"},{"key":"2023112210173635000_btad628-B49","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1038\/514550a","article-title":"The top 100 papers","volume":"514","author":"Van Noorden","year":"2014","journal-title":"Nature"},{"key":"2023112210173635000_btad628-B50","doi-asserted-by":"crossref","first-page":"737","DOI":"10.1101\/gr.214270.116","article-title":"Fast and accurate de novo genome assembly from long uncorrected reads","volume":"27","author":"Vaser","year":"2017","journal-title":"Genome Res"},{"key":"2023112210173635000_btad628-B51","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1016\/j.mib.2014.11.016","article-title":"Ten years of pan-genome analyses","volume":"23","author":"Vernikos","year":"2015","journal-title":"Curr Opin Microbiol"},{"key":"2023112210173635000_btad628-B52","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1186\/s13059-019-1755-7","article-title":"Benchmarking of alignment-free sequence comparison methods","volume":"20","author":"Zielezinski","year":"2019","journal-title":"Genome Biol"},{"key":"2023112210173635000_btad628-B53","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1038\/s41586-020-2876-6","article-title":"A comparative genomics multitool for scientific discovery and conservation","volume":"587","author":"Zoonomia","year":"2020","journal-title":"Nature"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad628\/52543857\/btad628.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/11\/btad628\/53674443\/btad628.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/11\/btad628\/53674443\/btad628.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,22]],"date-time":"2023-11-22T17:22:36Z","timestamp":1700673756000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad628\/7329718"}},"subtitle":[],"editor":[{"given":"Alfonso","family":"Valencia","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"editor"}]}],"short-title":[],"issued":{"date-parts":[[2023,10,25]]},"references-count":51,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2023,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad628","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,11,1]]},"published":{"date-parts":[[2023,10,25]]},"article-number":"btad628"}}