{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,18]],"date-time":"2025-11-18T01:37:01Z","timestamp":1763429821429,"version":"3.37.3"},"reference-count":18,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,7,8]],"date-time":"2022-07-08T00:00:00Z","timestamp":1657238400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,7,8]],"date-time":"2022-07-08T00:00:00Z","timestamp":1657238400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"crossref","award":["2015-FED-186"],"award-info":[{"award-number":["2015-FED-186"]}],"id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2022,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                <jats:title>Background<\/jats:title>\n                <jats:p>Biological sequences are increasing rapidly and exponentially worldwide. Nucleotide sequence databases play an important role in providing meaningful genomic information on a variety of biological organisms.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Results<\/jats:title>\n                <jats:p>The <jats:italic>getSequenceInfo<\/jats:italic> software tool allows to access sequence information from various public repositories (GenBank, RefSeq, and the European Nucleotide Archive), and is compatible with different operating systems (Linux, MacOS, and Microsoft Windows) in a programmatic way (command line) or as a graphical user interface. <jats:italic>getSequenceInfo<\/jats:italic> or gSeqI v1.0 should help users to get some information on queried sequences that could be useful for specific studies (e.g. the country of origin\/isolation or the release date of queried sequences). Queries can be made to retrieve sequence data based on a given kingdom and species, or from a given date. This program allows the separation between chromosomes and plasmids (or other genetic elements\/components) by arranging each component in a given folder. Some basic statistics are also performed by the program (such as the calculation of GC content for queried assemblies). An empirically designed nucleotide ratio is calculated using nucleotide information in order to tentatively provide a \u201cNucleScore\u201d for studied genome assemblies. Besides the main gSeqI tool, other additional tools have been developed to perform various tasks related to sequence analysis.<\/jats:p>\n              <\/jats:sec><jats:sec>\n                <jats:title>Conclusion<\/jats:title>\n                <jats:p>The aim of this study is to democratize the use of public repositories in programmatic ways, and to facilitate sequence data analysis in a pedagogical perspective. Output results are available in FASTA, FASTQ, Excel\/TSV or HTML formats. The program is freely available at: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/karubiotools\/getSequenceInfo\">https:\/\/github.com\/karubiotools\/getSequenceInfo<\/jats:ext-link>. getSequenceInfo and supplementary tools are partly available through the recently released Galaxy KaruBioNet platform (<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"http:\/\/calamar.univ-ag.fr\/c3i\/galaxy_karubionet.html\">http:\/\/calamar.univ-ag.fr\/c3i\/galaxy_karubionet.html<\/jats:ext-link>).<\/jats:p>\n              <\/jats:sec>","DOI":"10.1186\/s12859-022-04809-5","type":"journal-article","created":{"date-parts":[[2022,7,8]],"date-time":"2022-07-08T13:05:05Z","timestamp":1657285505000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["getSequenceInfo: a suite of tools allowing to get genome sequence information from public repositories"],"prefix":"10.1186","volume":"23","author":[{"given":"Vincent","family":"Moco","sequence":"first","affiliation":[]},{"given":"Damien","family":"Cazenave","sequence":"additional","affiliation":[]},{"given":"Ma\u00eblle","family":"Garnier","sequence":"additional","affiliation":[]},{"given":"Matthieu","family":"Pot","sequence":"additional","affiliation":[]},{"given":"Isabel","family":"Marcelino","sequence":"additional","affiliation":[]},{"given":"Antoine","family":"Talarmin","sequence":"additional","affiliation":[]},{"given":"St\u00e9phanie","family":"Guyomard-Rabenirina","sequence":"additional","affiliation":[]},{"given":"S\u00e9bastien","family":"Breurec","sequence":"additional","affiliation":[]},{"given":"S\u00e9verine","family":"Ferdinand","sequence":"additional","affiliation":[]},{"given":"Alexis","family":"Dereeper","sequence":"additional","affiliation":[]},{"given":"Yann","family":"Reynaud","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7897-9023","authenticated-orcid":false,"given":"David","family":"Couvin","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,7,8]]},"reference":[{"key":"4809_CR1","doi-asserted-by":"publisher","first-page":"D48","DOI":"10.1093\/nar\/gkx1097","volume":"46","author":"I Karsch-Mizrachi","year":"2018","unstructured":"Karsch-Mizrachi I, Takagi T, Cochrane G. International nucleotide sequence database collaboration. Int Nucleotide Seq Database Collab Nucleic Acids Res. 2018;46:D48\u201351. https:\/\/doi.org\/10.1093\/nar\/gkx1097.","journal-title":"Int Nucleotide Seq Database Collab Nucleic Acids Res"},{"key":"4809_CR2","doi-asserted-by":"publisher","first-page":"D45","DOI":"10.1093\/nar\/gkz982","volume":"48","author":"O Ogasawara","year":"2020","unstructured":"Ogasawara O, Kodama Y, Mashima J, Kosuge T, Fujisawa T. DDBJ Database updates and computational infrastructure enhancement. Nucleic Acids Res. 2020;48:D45\u201350. https:\/\/doi.org\/10.1093\/nar\/gkz982.","journal-title":"Nucleic Acids Res"},{"key":"4809_CR3","doi-asserted-by":"publisher","first-page":"D70","DOI":"10.1093\/nar\/gkz1063","volume":"48","author":"C Amid","year":"2020","unstructured":"Amid C, Alako BTF, BalavenkataramanKadhirvelu V, Burdett T, Burgin J, Fan J, Harrison PW, Holt S, Hussein A, Ivanov E, Jayathilaka S, Kay S, Keane T, Leinonen R, Liu X, Martinez-Villacorta J, Milano A, Pakseresht A, Rahman N, Rajan J, Reddy K, Richards E, Smirnov D, Sokolov A, Vijayaraja S, Cochrane G. The European nucleotide archive in 2019. Nucleic Acids Res. 2020;48:D70\u20136. https:\/\/doi.org\/10.1093\/nar\/gkz1063.","journal-title":"Nucleic Acids Res"},{"key":"4809_CR4","doi-asserted-by":"publisher","first-page":"D84","DOI":"10.1093\/nar\/gkz956","volume":"48","author":"EW Sayers","year":"2020","unstructured":"Sayers EW, Cavanaugh M, Clark K, Ostell J, Pruitt KD, Karsch-Mizrachi I. GenBank. Nucleic Acids Res. 2020;48:D84\u20136. https:\/\/doi.org\/10.1093\/nar\/gkz956.","journal-title":"Nucleic Acids Res"},{"key":"4809_CR5","doi-asserted-by":"publisher","first-page":"19","DOI":"10.1186\/1471-2105-14-19","volume":"14","author":"Y Zhu","year":"2013","unstructured":"Zhu Y, Stephens RM, Meltzer PS, Davis SR. SRAdb: query and use public next-generation sequencing data from within R. BMC Bioinform. 2013;14:19. https:\/\/doi.org\/10.1186\/1471-2105-14-19.","journal-title":"BMC Bioinform"},{"key":"4809_CR6","doi-asserted-by":"publisher","first-page":"160018","DOI":"10.1038\/sdata.2016.18","volume":"3","author":"MD Wilkinson","year":"2016","unstructured":"Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, Blomberg N, Boiten JW, da Silva Santos LB, Bourne PE, Bouwman J, Brookes AJ, Clark T, Crosas M, Dillo I, Dumon O, Edmunds S, Evelo CT, Finkers R, Gonzalez-Beltran A, Gray AJ, Groth P, Goble C, Grethe JS, Heringa J, \u2019tHoen PA, Hooft R, Kuhn T, Kok R, Kok J, Lusher SJ, Martone ME, Mons A, Packer AL, Persson B, Rocca-Serra P, Roos M, van Schaik R, Sansone SA, Schultes E, Sengstag T, Slater T, Strawn G, Swertz MA, Thompson M, van der Lei J, van Mulligen E, Velterop J, Waagmeester A, Wittenburg P, Wolstencroft K, Zhao J, Mons B. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016;3:160018. https:\/\/doi.org\/10.1038\/sdata.2016.18.","journal-title":"Sci Data"},{"issue":"D1","key":"4809_CR7","doi-asserted-by":"publisher","first-page":"D73","DOI":"10.1093\/nar\/gkv1226","volume":"44","author":"PA Kitts","year":"2016","unstructured":"Kitts PA, Church DM, Thibaud-Nissen F, Choi J, Hem V, Sapojnikov V, Smith RG, Tatusova T, Xiang C, Zherikov A, DiCuccio M, Murphy TD, Pruitt KD, Kimchi A. Assembly: a resource for assembled genomes at NCBI. Nucleic Acids Res. 2016;44(D1):D73\u201380. https:\/\/doi.org\/10.1093\/nar\/gkv1226.","journal-title":"Nucleic Acids Res"},{"key":"4809_CR8","doi-asserted-by":"publisher","first-page":"2386","DOI":"10.1099\/ijsem.0.002809","volume":"68","author":"S Ciufo","year":"2018","unstructured":"Ciufo S, Kannan S, Sharma S, Badretdin A, Clark K, Turner S, Brover S, Schoch CL, Kimchi A, DiCuccio M. Using average nucleotide identity to improve taxonomic assignments in prokaryotic genomes at the NCBI. Int J Syst Evol Microbiol. 2018;68:2386\u201392. https:\/\/doi.org\/10.1099\/ijsem.0.002809.","journal-title":"Int J Syst Evol Microbiol"},{"key":"4809_CR9","doi-asserted-by":"publisher","first-page":"baz136","DOI":"10.1093\/database\/baz136","volume":"2019","author":"C Amid","year":"2019","unstructured":"Amid C, Pakseresht N, Silvester N, Jayathilaka S, Lund O, Dynovski LD, Pataki B\u00c1, Visontai D, Xavier BB, Alako BTF, Belka A, Cisneros JLB, Cotten M, Haringhuizen GB, Harrison PW, H\u00f6per D, Holt S, Hundahl C, Hussein A, Kaas RS, Liu X, Leinonen R, Malhotra-Kumar S, Nieuwenhuijse DF, Rahman N, Dos Ribeiro SC, Skiby JE, Schmitz D, St\u00e9ger J, Szalai-Gindl JM, Thomsen MCF, Cacci\u00f2 SM, Csabai I, Kroneman A, Koopmans M, Aarestrup F, Cochrane G. The COMPARE data hubs. Database (Oxford). 2019;2019:baz136. https:\/\/doi.org\/10.1093\/database\/baz136.","journal-title":"Database (Oxford)."},{"issue":"10","key":"4809_CR10","doi-asserted-by":"publisher","first-page":"1611","DOI":"10.1101\/gr.361602","volume":"12","author":"JE Stajich","year":"2002","unstructured":"Stajich JE, Block D, Boulez K, Brenner SE, Chervitz SA, Dagdigian C, Fuellen G, Gilbert JG, Korf I, Lapp H, Lehv\u00e4slaiho H, Matsalla C, Mungall CJ, Osborne BI, Pocock MR, Schattner P, Senger M, Stein LD, Stupka E, Wilkinson MD, Birney E. The bioperl toolkit: perl modules for the life sciences. Genome Res. 2002;12(10):1611\u20138. https:\/\/doi.org\/10.1101\/gr.361602.","journal-title":"Genome Res"},{"key":"4809_CR11","doi-asserted-by":"publisher","unstructured":"Couvin D, Dereeper A, Meyer DF, Noroy C, Gaete S, Bhakkan B, Poullet N, Gaspard S, Bezault E, Marcelino I, Pruneau L, Segretier W, Stattner E, Cazenave D, Garnier M, Pot M, Tressi\u00e8res B, Deloumeaux J, Breurec S, Ferdinand S, Gonzalez-Rizzo S, Reynaud Y for the KaruBioNet Team. KaruBioNet: a network and discussion group for a better collaboration and structuring of bioinformatics in Guadeloupe (French West Indies). Bioinform Adv. 2022;2(1):vbac010. https:\/\/doi.org\/10.1093\/bioadv\/vbac010","DOI":"10.1093\/bioadv\/vbac010"},{"key":"4809_CR12","doi-asserted-by":"publisher","first-page":"W3","DOI":"10.1093\/nar\/gkw343","volume":"44","author":"E Afgan","year":"2016","unstructured":"Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, \u010cech M, Chilton J, Clements D, Coraor N, Eberhard C, Gr\u00fcning B, Guerler A, Hillman-Jackson J, Von Kuster G, Rasche E, Soranzo N, Turaga N, Taylor J, Nekrutenko A, Goecks J. The galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016 update. Nucleic Acids Res. 2016;44:W3\u201310. https:\/\/doi.org\/10.1093\/nar\/gkw343.","journal-title":"Nucleic Acids Res"},{"key":"4809_CR13","doi-asserted-by":"publisher","first-page":"124","DOI":"10.12688\/wellcomeopenres.14826.1","volume":"3","author":"KA Jolley","year":"2018","unstructured":"Jolley KA, Bray JE, Maiden MCJ. Open-access bacterial population genomics: BIGSdb software, the PubMLST.org website and their applications. Wellcome Open Res. 2018;3:124. https:\/\/doi.org\/10.12688\/wellcomeopenres.14826.1.","journal-title":"Wellcome Open Res."},{"issue":"11","key":"4809_CR14","doi-asserted-by":"publisher","first-page":"2640","DOI":"10.1093\/jac\/dks261","volume":"67","author":"E Zankari","year":"2012","unstructured":"Zankari E, Hasman H, Cosentino S, Vestergaard M, Rasmussen S, Lund O, Aarestrup FM, Larsen MV. Identification of acquired antimicrobial resistance genes. J Antimicrob Chemother. 2012;67(11):2640\u20134. https:\/\/doi.org\/10.1093\/jac\/dks261.","journal-title":"J Antimicrob Chemother"},{"issue":"7","key":"4809_CR15","doi-asserted-by":"publisher","first-page":"3895","DOI":"10.1128\/AAC.02412-14","volume":"58","author":"A Carattoli","year":"2014","unstructured":"Carattoli A, Zankari E, Garc\u00eda-Fern\u00e1ndez A, Voldby Larsen M, Lund O, Villa L, M\u00f8llerAarestrup F, Hasman H. In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrob Agents Chemother. 2014;58(7):3895\u2013903. https:\/\/doi.org\/10.1128\/AAC.02412-14.","journal-title":"Antimicrob Agents Chemother"},{"issue":"D1","key":"4809_CR16","doi-asserted-by":"publisher","first-page":"D687","DOI":"10.1093\/nar\/gky1080","volume":"47","author":"B Liu","year":"2019","unstructured":"Liu B, Zheng D, Jin Q, Chen L, Yang J. VFDB 2019: a comparative pathogenomic platform with an interactive web interface. Nucleic Acids Res. 2019;47(D1):D687\u201392. https:\/\/doi.org\/10.1093\/nar\/gky1080.","journal-title":"Nucleic Acids Res"},{"issue":"5","key":"4809_CR17","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0177459","volume":"12","author":"GM Kurtzer","year":"2017","unstructured":"Kurtzer GM, Sochat V, Bauer MW. Singularity: Scientific containers for mobility of compute. PLoS ONE. 2017;12(5): e0177459. https:\/\/doi.org\/10.1371\/journal.pone.0177459.","journal-title":"PLoS ONE"},{"key":"4809_CR18","doi-asserted-by":"publisher","first-page":"118","DOI":"10.1016\/j.csbj.2018.12.006","volume":"17","author":"D Maderankova","year":"2019","unstructured":"Maderankova D, Jugas R, Sedlar K, Vitek M, Skutkova H. Rapid bacterial species delineation based on parameters derived from genome numerical representations. Comput Struct Biotechnol J. 2019;17:118\u201326. https:\/\/doi.org\/10.1016\/j.csbj.2018.12.006.","journal-title":"Comput Struct Biotechnol J"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04809-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-022-04809-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-022-04809-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,8]],"date-time":"2022-07-08T13:05:34Z","timestamp":1657285534000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-022-04809-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,8]]},"references-count":18,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2022,12]]}},"alternative-id":["4809"],"URL":"https:\/\/doi.org\/10.1186\/s12859-022-04809-5","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2022,7,8]]},"assertion":[{"value":"26 October 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 June 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 July 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"268"}}