{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T15:57:13Z","timestamp":1759939033565,"version":"3.41.2"},"reference-count":37,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2025,6,30]],"date-time":"2025-06-30T00:00:00Z","timestamp":1751241600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:sec><jats:title>Background<\/jats:title><jats:p>Understanding the structure and function of microbial genomes is crucial for uncovering their ecological roles, evolutionary trajectories, and potential applications in health, biotechnology, agriculture, food production, and environmental science. However, genome reconstruction and annotation remain computationally demanding and technically complex.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We introduce a bioinformatics platform designed explicitly for long-read microbial sequencing data to address these challenges. Developed as a service of the Italian MIRRI ERIC node, the platform provides a comprehensive solution for analyzing both prokaryotic and eukaryotic genomes, from assembly to functional protein annotation. It integrates state-of-the-art tools (e.g., Canu, Flye, BRAKER3, Prokka, InterProScan) within a reproducible, scalable workflow built on the Common Workflow Language and accelerated through high-performance computing infrastructure. A user-friendly web interface ensures accessibility, even for non-specialists.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>Through case studies involving three environmentally and clinically significant microorganisms, we demonstrate the ability of the platform to produce reliable, biologically meaningful insights, positioning it as a valuable tool for routine genome analysis and advanced microbial research.<\/jats:p><\/jats:sec>","DOI":"10.3389\/fbinf.2025.1632189","type":"journal-article","created":{"date-parts":[[2025,6,30]],"date-time":"2025-06-30T05:27:03Z","timestamp":1751261223000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Long-read microbial genome assembly, gene prediction and functional annotation: a service of the MIRRI ERIC Italian node"],"prefix":"10.3389","volume":"5","author":[{"given":"Sandro Gepiro","family":"Contaldo","sequence":"first","affiliation":[]},{"given":"Antonio","family":"d\u2019Acierno","sequence":"additional","affiliation":[]},{"given":"Lorenzo","family":"Bosio","sequence":"additional","affiliation":[]},{"given":"Francesco","family":"Venice","sequence":"additional","affiliation":[]},{"given":"Elisa Li","family":"Perottino","sequence":"additional","affiliation":[]},{"given":"Janneth Estefania","family":"Hoyos Rea","sequence":"additional","affiliation":[]},{"given":"Giovanna Cristina","family":"Varese","sequence":"additional","affiliation":[]},{"given":"Francesca","family":"Cordero","sequence":"additional","affiliation":[]},{"given":"Marco","family":"Beccuti","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2025,6,30]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"126790","DOI":"10.1016\/j.micres.2021.126790","article-title":"Siderophores: importance in bacterial pathogenesis and applications in medicine and industry","volume":"250","author":"Behnoush","year":"2021","journal-title":"Microbiol Res."},{"key":"B2","doi-asserted-by":"publisher","first-page":"D344","DOI":"10.1093\/nar\/gkaa977","article-title":"The interpro protein families and domains database: 20 years on","volume":"49","author":"Blum","year":"2020","journal-title":"Nucleic Acids Res."},{"key":"B3","doi-asserted-by":"publisher","first-page":"lqaa108","DOI":"10.1093\/nargab\/lqaa108","article-title":"Braker2: automatic eukaryotic genome annotation with genemark-ep+ and augustus supported by a protein database","volume":"3","author":"Bruna","year":"2021","journal-title":"NAR Genomics Bioinforma."},{"key":"B4","doi-asserted-by":"publisher","first-page":"e147","DOI":"10.1093\/nar\/gkw654","article-title":"Contiguous and accurate de novo assembly of metazoan genomes with modest long read coverage","volume":"44","author":"Chakraborty","year":"2016","journal-title":"Nucleic Acids Res."},{"key":"B5","doi-asserted-by":"publisher","first-page":"e1006290","DOI":"10.1371\/journal.ppat.1006290","article-title":"Candida auris: a rapidly emerging cause of hospital-acquired multidrug-resistant fungal infections globally","volume":"13","author":"Chowdhary","year":"2017","journal-title":"PLOS Pathog."},{"key":"B6","unstructured":"Common workflow language documentation\n          \n          \n          2025"},{"key":"B7","doi-asserted-by":"publisher","first-page":"316","DOI":"10.1038\/nbt.3820","article-title":"Nextflow enables reproducible computational workflows","volume":"35","author":"Di Tommaso","year":"2017","journal-title":"Nat. Biotechnol."},{"key":"B8","unstructured":"Docker official site\n          \n          \n          2025"},{"key":"B9","doi-asserted-by":"publisher","first-page":"2404","DOI":"10.1128\/AAC.47.8.2404-2412.2003","article-title":"Candida albicans mutations in the ergosterol biosynthetic pathway and resistance to several antifungal agents","volume":"47","author":"Dominique","year":"2003","journal-title":"Antimicrob. Agents Chemother."},{"key":"B10","doi-asserted-by":"publisher","first-page":"927","DOI":"10.3390\/jof8090927","article-title":"The culturable mycobiota of sediments and associated microplastics: from a harbor to a marine protected area, a comparative study","volume":"8","author":"Florio Furno","year":"2022","journal-title":"J. Fungi"},{"key":"B11","doi-asserted-by":"publisher","DOI":"10.48546\/WORKFLOWHUB.WORKFLOW.567.2","article-title":"CLAWS (CNAG\u2019s long-read assembly workflow in Snakemake)","author":"Gomez-Garrido","year":"2024","journal-title":"WorkflowHub"},{"key":"B12","doi-asserted-by":"publisher","first-page":"268","DOI":"10.1093\/nar\/30.1.268","article-title":"Superfamily: hmms representing all proteins of known structure. scop sequence searches, alignments and genome assignments","volume":"30","author":"Gough","year":"2002","journal-title":"Nucleic Acids Res."},{"key":"B13","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2410.06941","article-title":"WorkflowHub: a registry for computational workflows","author":"Gustafsson","year":"2024","journal-title":"arXiv Prepr. arXiv:2410.06941"},{"key":"B14","doi-asserted-by":"publisher","first-page":"2253","DOI":"10.1093\/bioinformatics\/btz891","article-title":"Nextpolish: a fast and efficient genome polishing tool for long-read assembly","volume":"36","author":"Hu","year":"2019","journal-title":"Bioinformatics"},{"key":"B15","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1186\/s13059-024-03252-4","article-title":"NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads","volume":"25","author":"Hu","year":"2024","journal-title":"Genome Biol."},{"key":"B16","doi-asserted-by":"publisher","first-page":"109406","DOI":"10.1016\/j.celrep.2021.109406","article-title":"The histone chaperone hir maintains chromatin states to control nitrogen assimilation and fungal virulence","volume":"36","author":"Jenull","year":"2021","journal-title":"Cell. Rep."},{"key":"B17","doi-asserted-by":"publisher","first-page":"1236","DOI":"10.1093\/bioinformatics\/btu031","article-title":"Interproscan 5: genome-scale protein function classification","volume":"30","author":"Jones","year":"2014","journal-title":"Bioinformatics"},{"key":"B18","doi-asserted-by":"publisher","first-page":"757","DOI":"10.1093\/bioinformatics\/btr010","article-title":"A novel hybrid gene prediction method employing protein multiple sequence alignments","volume":"27","author":"Keller","year":"2011","journal-title":"Bioinformatics"},{"key":"B19","doi-asserted-by":"publisher","first-page":"540","DOI":"10.1038\/s41587-019-0072-8","article-title":"Assembly of long, error-prone reads using repeat graphs","volume":"37","author":"Kolmogorov","year":"2019","journal-title":"Nat. Biotechnol."},{"key":"B20","doi-asserted-by":"publisher","first-page":"722","DOI":"10.1101\/gr.215087.116","article-title":"Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation","volume":"27","author":"Koren","year":"2017","journal-title":"Genome Res."},{"key":"B21","doi-asserted-by":"publisher","first-page":"2520","DOI":"10.1093\/bioinformatics\/bts480","article-title":"Snakemake\u2014a scalable bioinformatics workflow engine","volume":"28","author":"K\u00f6ster","year":"2012","journal-title":"Bioinformatics"},{"key":"B22","doi-asserted-by":"publisher","DOI":"10.1101\/2019.12.19.882506","article-title":"Hypo: super fast and accurate polisher for long read genome assemblies","author":"Kundu","year":"2019","journal-title":"bioRxiv"},{"key":"B23","doi-asserted-by":"publisher","first-page":"146","DOI":"10.1128\/AAC.01486-12","article-title":"Elevated chitin content reduces the susceptibility of candida species to caspofungin","volume":"57","author":"Louise","year":"2013","journal-title":"Antimicrob. Agents Chemother."},{"key":"B24","doi-asserted-by":"publisher","first-page":"4647","DOI":"10.1093\/molbev\/msab199","article-title":"Busco update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes","volume":"38","author":"Manni","year":"","journal-title":"Mol. Biol. Evol."},{"key":"B25","doi-asserted-by":"publisher","first-page":"e323","DOI":"10.1002\/cpz1.323","article-title":"Busco: assessing genomic data quality and beyond","volume":"1","author":"Manni","year":"","journal-title":"Curr. Protoc."},{"key":"B26","unstructured":"Minio official site\n          \n          \n          2025"},{"key":"B27","doi-asserted-by":"publisher","first-page":"e482","DOI":"10.1016\/S2666-5247(23)00114-3","article-title":"Candida auris: an emerging antimicrobial-resistant organism with the highest level of concern","volume":"4","author":"Mishra","year":"2023","journal-title":"Lancet Microbe"},{"key":"B28","doi-asserted-by":"publisher","first-page":"D412","DOI":"10.1093\/nar\/gkaa913","article-title":"Pfam: the protein families database in 2021","volume":"49","author":"Mistry","year":"2021","journal-title":"Nucleic Acids Res."},{"key":"B29","unstructured":"Oxford nanopore medaka\n          \n          \n          2025"},{"key":"B30","doi-asserted-by":"publisher","first-page":"23171","DOI":"10.1038\/s41598-024-74517-y","article-title":"Unveiling fungal strategies: mycoremediation in multi-metal pesticide environment using proteomics","volume":"14","author":"Priyadarshini","year":"2024","journal-title":"Sci. Rep."},{"key":"B31","doi-asserted-by":"publisher","first-page":"D753","DOI":"10.1093\/nar\/gkac1080","article-title":"Mgnify: the microbiome sequence data analysis resource in 2023","volume":"51","author":"Richardson","year":"2022","journal-title":"Nucleic Acids Res."},{"key":"B32","doi-asserted-by":"publisher","first-page":"155","DOI":"10.1038\/s41592-019-0669-3","article-title":"Fast and accurate long-read assembly with wtdbg2","volume":"17","author":"Ruan","year":"2020","journal-title":"Nat. Methods"},{"key":"B33","doi-asserted-by":"publisher","first-page":"2068","DOI":"10.1093\/bioinformatics\/btu153","article-title":"Prokka: rapid prokaryotic genome annotation","volume":"30","author":"Seemann","year":"2014","journal-title":"Bioinformatics"},{"key":"B34","doi-asserted-by":"publisher","first-page":"W83","DOI":"10.1093\/nar\/gkae410","article-title":"The Galaxy platform for accessible, reproducible, and collaborative data analyses: 2024 update","volume":"52","year":"2024","journal-title":"Nucleic Acids Res."},{"key":"B35","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1093\/nar\/gkg115","article-title":"Panther: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification","volume":"31","author":"Thomas","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"B36","doi-asserted-by":"publisher","first-page":"1258","DOI":"10.3390\/microorganisms8091258","article-title":"Genome sequence of trichoderma lixii mut3171, a promising strain for mycoremediation of pah-contaminated sites","volume":"8","author":"Venice","year":"2020","journal-title":"Microorganisms"},{"key":"B37","doi-asserted-by":"publisher","first-page":"944","DOI":"10.1016\/j.tim.2016.09.007","article-title":"Klebsiella pneumoniae population genomics and antimicrobial-resistant clones","volume":"24","author":"Wyres","year":"2016","journal-title":"Trends Microbiol."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1632189\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,30]],"date-time":"2025-06-30T05:27:04Z","timestamp":1751261224000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1632189\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,30]]},"references-count":37,"alternative-id":["10.3389\/fbinf.2025.1632189"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2025.1632189","relation":{},"ISSN":["2673-7647"],"issn-type":[{"type":"electronic","value":"2673-7647"}],"subject":[],"published":{"date-parts":[[2025,6,30]]},"article-number":"1632189"}}