{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,12]],"date-time":"2026-04-12T08:53:23Z","timestamp":1775984003370,"version":"3.50.1"},"reference-count":50,"publisher":"Oxford University Press (OUP)","issue":"Supplement_2","license":[{"start":{"date-parts":[[2020,12,1]],"date-time":"2020-12-01T00:00:00Z","timestamp":1606780800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100006489","name":"French Alternative Energies and Atomic Energy Commission","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100006489","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,12,30]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Horizontal gene transfer (HGT) is a major source of variability in prokaryotic genomes. Regions of genome plasticity (RGPs) are clusters of genes located in highly variable genomic regions. Most of them arise from HGT and correspond to genomic islands (GIs). The study of those regions at the species level has become increasingly difficult with the data deluge of genomes. To date, no methods are available to identify GIs using hundreds of genomes to explore their diversity.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We present here the panRGP method that predicts RGPs using pangenome graphs made of all available genomes for a given species. It allows the study of thousands of genomes in order to access the diversity of RGPs and to predict spots of insertions. It gave the best predictions when benchmarked along other GI detection tools against a reference dataset. In addition, we illustrated its use on metagenome assembled genomes by redefining the borders of the leuX tRNA hotspot, a well-studied spot of insertion in Escherichia coli. panRPG is a scalable and reliable tool to predict GIs and spots making it an ideal approach for large comparative studies.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The methods presented in the current work are available through the following software: https:\/\/github.com\/labgem\/PPanGGOLiN. Detailed results and scripts to compute the benchmark metrics are available at https:\/\/github.com\/axbazin\/panrgp_supdata.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa792","type":"journal-article","created":{"date-parts":[[2020,9,9]],"date-time":"2020-09-09T15:27:03Z","timestamp":1599665223000},"page":"i651-i658","source":"Crossref","is-referenced-by-count":48,"title":["panRGP: a pangenome-based method to predict genomic islands and explore their diversity"],"prefix":"10.1093","volume":"36","author":[{"given":"Adelme","family":"Bazin","sequence":"first","affiliation":[{"name":"LABGeM, G\u00e9nomique M\u00e9tabolique, CEA, Genoscope, Institut Fran\u00e7ois Jacob, Universit\u00e9 d\u2019\u00c9vry, Universit\u00e9 Paris-Saclay , CNRS, Evry, France"}]},{"given":"Guillaume","family":"Gautreau","sequence":"additional","affiliation":[{"name":"LABGeM, G\u00e9nomique M\u00e9tabolique, CEA, Genoscope, Institut Fran\u00e7ois Jacob, Universit\u00e9 d\u2019\u00c9vry, Universit\u00e9 Paris-Saclay , CNRS, Evry, France"}]},{"given":"Claudine","family":"M\u00e9digue","sequence":"additional","affiliation":[{"name":"LABGeM, G\u00e9nomique M\u00e9tabolique, CEA, Genoscope, Institut Fran\u00e7ois Jacob, Universit\u00e9 d\u2019\u00c9vry, Universit\u00e9 Paris-Saclay , CNRS, Evry, France"}]},{"given":"David","family":"Vallenet","sequence":"additional","affiliation":[{"name":"LABGeM, G\u00e9nomique M\u00e9tabolique, CEA, Genoscope, Institut Fran\u00e7ois Jacob, Universit\u00e9 d\u2019\u00c9vry, Universit\u00e9 Paris-Saclay , CNRS, Evry, France"}]},{"given":"Alexandra","family":"Calteau","sequence":"additional","affiliation":[{"name":"LABGeM, G\u00e9nomique M\u00e9tabolique, CEA, Genoscope, Institut Fran\u00e7ois Jacob, Universit\u00e9 d\u2019\u00c9vry, Universit\u00e9 Paris-Saclay , CNRS, Evry, France"}]}],"member":"286","published-online":{"date-parts":[[2020,12,29]]},"reference":[{"key":"2023062409322604700_btaa792-B1","first-page":"361","author":"Bastian","year":"2009"},{"key":"2023062409322604700_btaa792-B2","doi-asserted-by":"crossref","first-page":"2161","DOI":"10.1093\/bioinformatics\/bty095","article-title":"Improved genomic island predictions with IslandPath-DIMOB","volume":"34","author":"Bertelli","year":"2018","journal-title":"Bioinformatics"},{"key":"2023062409322604700_btaa792-B3","doi-asserted-by":"crossref","first-page":"W30","DOI":"10.1093\/nar\/gkx343","article-title":"IslandViewer 4: expanded prediction of genomic islands for larger-scale datasets","volume":"45","author":"Bertelli","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023062409322604700_btaa792-B4","doi-asserted-by":"crossref","first-page":"1685","DOI":"10.1093\/bib\/bby042","article-title":"Microbial genomic island discovery, visualization and analysis","volume":"20","author":"Bertelli","year":"2019","journal-title":"Brief. Bioinformatics"},{"key":"2023062409322604700_btaa792-B5","doi-asserted-by":"crossref","first-page":"606","DOI":"10.1128\/iai.62.2.606-614.1994","article-title":"Excision of large DNA regions termed pathogenicity islands from tRNA-specific loci in the chromosome of an Escherichia coli wild-type pathogen","volume":"62","author":"Blum","year":"1994","journal-title":"Infect. Immun"},{"key":"2023062409322604700_btaa792-B6","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1186\/s12859-018-2038-0","article-title":"xenoGI: reconstructing the history of genomic island insertions in clades of closely related bacteria","volume":"19","author":"Bush","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2023062409322604700_btaa792-B7","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1186\/s13059-015-0701-6","article-title":"A novel method of consensus pan-chromosome assembly and large-scale comparative analysis reveal the highly flexible pan-genome of Acinetobacter baumannii","volume":"16","author":"Chan","year":"2015","journal-title":"Genome Biol"},{"key":"2023062409322604700_btaa792-B8","doi-asserted-by":"crossref","first-page":"414","DOI":"10.1038\/nrmicro884","article-title":"Genomic islands in pathogenic and environmental microorganisms","volume":"2","author":"Dobrindt","year":"2004","journal-title":"Nat. Rev. Microbiol"},{"key":"2023062409322604700_btaa792-B9","doi-asserted-by":"crossref","first-page":"e172","DOI":"10.1093\/nar\/gks757","article-title":"PanOCT: automated clustering of orthologs using conserved gene neighborhood for pan-genomic analysis of bacterial strains and closely related species","volume":"40","author":"Fouts","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023062409322604700_btaa792-B10","doi-asserted-by":"crossref","first-page":"e1007732","DOI":"10.1371\/journal.pcbi.1007732","article-title":"PPanGGOLiN: depicting microbial diversity via a partitioned pangenome graph","volume":"16","author":"Gautreau","year":"2020","journal-title":"PLoS Comput. Biol"},{"key":"2023062409322604700_btaa792-B11","doi-asserted-by":"crossref","first-page":"475","DOI":"10.1038\/s41592-018-0046-7","article-title":"Bioconda: sustainable and comprehensive software distribution for the life sciences","volume":"15","author":"Gr\u00fcning","year":"2018","journal-title":"Nat. Methods"},{"key":"2023062409322604700_btaa792-B12","doi-asserted-by":"crossref","first-page":"2334","DOI":"10.1093\/bioinformatics\/btq413","article-title":"genoPlotR: comparative gene and genome visualization in R","volume":"26","author":"Guy","year":"2010","journal-title":"Bioinformatics"},{"key":"2023062409322604700_btaa792-B13","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1093\/embo-reports\/kve097","article-title":"Ecological fitness, genomic islands and bacterial pathogenicity","volume":"2","author":"Hacker","year":"2001","journal-title":"EMBO Rep"},{"key":"2023062409322604700_btaa792-B14","doi-asserted-by":"crossref","first-page":"641","DOI":"10.1146\/annurev.micro.54.1.641","article-title":"Pathogenicity islands and the evolution of microbes","volume":"54","author":"Hacker","year":"2000","journal-title":"Annu. Rev. Microbiol"},{"key":"2023062409322604700_btaa792-B15","doi-asserted-by":"crossref","first-page":"D851","DOI":"10.1093\/nar\/gkx1068","article-title":"RefSeq: an update on prokaryotic genome annotation and curation","volume":"46","author":"Haft","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023062409322604700_btaa792-B16","doi-asserted-by":"crossref","first-page":"D48","DOI":"10.1093\/nar\/gku1072","article-title":"Islander: a database of precisely mapped genomic islands in tRNA and tmRNA genes","volume":"43","author":"Hudson","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023062409322604700_btaa792-B17","doi-asserted-by":"crossref","first-page":"e98679","DOI":"10.1371\/journal.pone.0098679","article-title":"ForceAtlas2, a continuous graph layout algorithm for handy network visualization designed for the Gephi software","volume":"9","author":"Jacomy","year":"2014","journal-title":"PLoS One"},{"key":"2023062409322604700_btaa792-B18","doi-asserted-by":"crossref","first-page":"3273","DOI":"10.1534\/g3.119.400562","article-title":"IslandCafe: compositional anomaly and feature enrichment assessment for delineation of genomic islands","volume":"9","author":"Jani","year":"2019","journal-title":"G3 (Bethesda)"},{"key":"2023062409322604700_btaa792-B19","doi-asserted-by":"crossref","first-page":"1762","DOI":"10.21105\/joss.01762","article-title":"Mashtree: a rapid comparison of whole genome sequence files","volume":"4","author":"Katz","year":"2019","journal-title":"J. Open Source Softw"},{"key":"2023062409322604700_btaa792-B20","doi-asserted-by":"crossref","first-page":"e22420","DOI":"10.1371\/journal.pone.0022420","article-title":"Identification of the pangenome and its components in 14 distinct Aggregatibacter actinomycetemcomitans strains by comparative genomic analysis","volume":"6","author":"Kittichotirat","year":"2011","journal-title":"PLoS One"},{"key":"2023062409322604700_btaa792-B21","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1186\/1471-2105-9-329","article-title":"Evaluation of genomic island predictors using a comparative genomics approach","volume":"9","author":"Langille","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023062409322604700_btaa792-B22","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1007\/PL00006158","article-title":"Amelioration of bacterial genomes: rates of change and exchange","volume":"44","author":"Lawrence","year":"1997","journal-title":"J. Mol. Evol"},{"key":"2023062409322604700_btaa792-B23","doi-asserted-by":"crossref","first-page":"2283","DOI":"10.1128\/AAC.00123-09","article-title":"A module located at a chromosomal integration hot spot is responsible for the multidrug resistance of a reference strain from Escherichia coli clonal group A","volume":"53","author":"Lescat","year":"2009","journal-title":"Antimicrob. Agents Chemother"},{"key":"2023062409322604700_btaa792-B24","doi-asserted-by":"crossref","first-page":"200","DOI":"10.1016\/j.csbj.2016.05.001","article-title":"Computational methods for predicting genomic islands in microbial genomes","volume":"14","author":"Lu","year":"2016","journal-title":"Comput. Struct. Biotechnol. J"},{"key":"2023062409322604700_btaa792-B25","doi-asserted-by":"crossref","first-page":"1840010","DOI":"10.1142\/S0219720018400103","article-title":"GI-Cluster: detecting genomic islands via consensus clustering on multiple features","volume":"16","author":"Lu","year":"2018","journal-title":"J. Bioinf. Comput. Biol"},{"key":"2023062409322604700_btaa792-B26","doi-asserted-by":"crossref","first-page":"3100","DOI":"10.1073\/pnas.0711982105","article-title":"Dynamics of Pseudomonas aeruginosa genome evolution","volume":"105","author":"Mathee","year":"2008","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023062409322604700_btaa792-B27","doi-asserted-by":"crossref","first-page":"8924","DOI":"10.1038\/ncomms9924","article-title":"Migration and horizontal gene transfer divide microbial genomes into multiple niches","volume":"6","author":"Niehus","year":"2015","journal-title":"Nat. Commun"},{"key":"2023062409322604700_btaa792-B28","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1038\/35012500","article-title":"Lateral gene transfer and the nature of bacterial innovation","volume":"405","author":"Ochman","year":"2000","journal-title":"Nature"},{"key":"2023062409322604700_btaa792-B29","doi-asserted-by":"crossref","first-page":"568","DOI":"10.1186\/1471-2164-11-568","article-title":"Units of plasticity in bacterial genomes: new insight from the comparative genomics of two bacteria interacting with invertebrates, Photorhabdus and Xenorhabdus","volume":"11","author":"Ogier","year":"2010","journal-title":"BMC Genomics"},{"key":"2023062409322604700_btaa792-B30","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1038\/s41467-017-00808-w","article-title":"The chromosomal organization of horizontal gene transfer in bacteria","volume":"8","author":"Oliveira","year":"2017","journal-title":"Nat. Commun"},{"key":"2023062409322604700_btaa792-B31","doi-asserted-by":"crossref","first-page":"3691","DOI":"10.1093\/bioinformatics\/btv421","article-title":"Roary: rapid large-scale prokaryote pan genome analysis","volume":"31","author":"Page","year":"2015","journal-title":"Bioinformatics"},{"key":"2023062409322604700_btaa792-B32","doi-asserted-by":"crossref","first-page":"996","DOI":"10.1038\/nbt.4229","article-title":"A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life","volume":"36","author":"Parks","year":"2018","journal-title":"Nat. Biotechnol"},{"key":"2023062409322604700_btaa792-B33","doi-asserted-by":"crossref","first-page":"649","DOI":"10.1016\/j.cell.2019.01.001","article-title":"Extensive unexplored human microbiome diversity revealed by over 150,000 genomes from metagenomes spanning age, geography, and lifestyle","volume":"176","author":"Pasolli","year":"2019","journal-title":"Cell"},{"key":"2023062409322604700_btaa792-B34","first-page":"223","article-title":"PredictBias: a server for the identification of genomic and pathogenicity islands in prokaryotes","volume":"8","author":"Pundhir","year":"2008","journal-title":"In Silico Biol"},{"key":"2023062409322604700_btaa792-B35","doi-asserted-by":"crossref","first-page":"16","DOI":"10.12688\/f1000research.1-16.v1","article-title":"Is the pan-genome also a pan-selectome?","volume":"1","author":"Rodriguez-Valera","year":"2012","journal-title":"F1000Res"},{"key":"2023062409322604700_btaa792-B36","doi-asserted-by":"crossref","first-page":"D94","DOI":"10.1093\/nar\/gky989","article-title":"GenBank","volume":"47","author":"Sayers","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023062409322604700_btaa792-B37","doi-asserted-by":"crossref","first-page":"2068","DOI":"10.1093\/bioinformatics\/btu153","article-title":"Prokka: rapid prokaryotic genome annotation","volume":"30","author":"Seemann","year":"2014","journal-title":"Bioinformatics"},{"key":"2023062409322604700_btaa792-B38","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1186\/s12859-015-0517-0","article-title":"Micropan: an R-package for microbial pan-genomics","volume":"16","author":"Snipen","year":"2015","journal-title":"BMC Bioinformatics"},{"key":"2023062409322604700_btaa792-B39","doi-asserted-by":"crossref","first-page":"1026","DOI":"10.1038\/nbt.3988","article-title":"MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets","volume":"35","author":"Steinegger","year":"2017","journal-title":"Nat. Biotechnol"},{"key":"2023062409322604700_btaa792-B40","doi-asserted-by":"crossref","first-page":"13950","DOI":"10.1073\/pnas.0506758102","article-title":"Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial \u201cpan-genome\u201d","volume":"102","author":"Tettelin","year":"2005","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023062409322604700_btaa792-B41","doi-asserted-by":"crossref","first-page":"711","DOI":"10.1038\/nrmicro1234","article-title":"Mechanisms of, and barriers to, horizontal gene transfer between bacteria","volume":"3","author":"Thomas","year":"2005","journal-title":"Nat. Rev. Microbiol"},{"key":"2023062409322604700_btaa792-B42","doi-asserted-by":"crossref","first-page":"e1000344","DOI":"10.1371\/journal.pgen.1000344","article-title":"Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths","volume":"5","author":"Touchon","year":"2009","journal-title":"PLoS Genet"},{"key":"2023062409322604700_btaa792-B43","doi-asserted-by":"crossref","first-page":"e1001284","DOI":"10.1371\/journal.pgen.1001284","article-title":"Horizontal transfer, not duplication, drives the expansion of protein families in prokaryotes","volume":"7","author":"Treangen","year":"2011","journal-title":"PLoS Genet"},{"key":"2023062409322604700_btaa792-B44","doi-asserted-by":"crossref","first-page":"D506","DOI":"10.1093\/nar\/gky1049","article-title":"UniProt: a worldwide hub of protein knowledge","volume":"47","author":"UniProt Consortium","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023062409322604700_btaa792-B45","doi-asserted-by":"crossref","first-page":"bap021","DOI":"10.1093\/database\/bap021","article-title":"MicroScope: a platform for microbial genome annotation and comparative genomics","volume":"2009","author":"Vallenet","year":"2009","journal-title":"Database"},{"key":"2023062409322604700_btaa792-B46","first-page":"D579","article-title":"MicroScope: an integrated platform for the annotation and exploration of microbial gene functions through genomic, pangenomic and metabolic comparative analysis","volume":"48","author":"Vallenet","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023062409322604700_btaa792-B47","doi-asserted-by":"crossref","first-page":"2196","DOI":"10.1093\/bioinformatics\/btl369","article-title":"Interpolated variable order motifs for identification of horizontally acquired DNA: revisiting the Salmonella pathogenicity islands","volume":"22","author":"Vernikos","year":"2006","journal-title":"Bioinformatics"},{"key":"2023062409322604700_btaa792-B48","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1186\/1471-2105-7-142","article-title":"Score-based prediction of genomic islands in prokaryotic genomes using hidden Markov models","volume":"7","author":"Waack","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023062409322604700_btaa792-B49","first-page":"357","article-title":"Zisland Explorer: detect genomic islands by combining homogeneity and heterogeneity properties","volume":"18","author":"Wei","year":"2017","journal-title":"Brief. Bioinform"},{"key":"2023062409322604700_btaa792-B50","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1186\/s12864-018-5366-6","article-title":"Comparative analysis reveals the Genomic Islands in Pasteurella multocida population genetics: on Symbiosis and adaptability","volume":"20","author":"Zhu","year":"2019","journal-title":"BMC Genomics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/Supplement_2\/i651\/50693344\/btaa792.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/Supplement_2\/i651\/50693344\/btaa792.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,24]],"date-time":"2023-06-24T19:52:57Z","timestamp":1687636377000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/Supplement_2\/i651\/6055938"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12]]},"references-count":50,"journal-issue":{"issue":"Supplement_2","published-print":{"date-parts":[[2020,12,30]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa792","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.03.26.007484","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,12]]},"published":{"date-parts":[[2020,12]]}}}