{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,3]],"date-time":"2024-08-03T21:02:24Z","timestamp":1722718944410},"reference-count":25,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2014,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Automated gene-calling is still an error-prone process, particularly for the highly plastic genomes of fungal species. Improvement through quality control and manual curation of gene models is a time-consuming process that requires skilled biologists and is only marginally performed. The wealth of available fungal genomes has not yet been exploited by an automated method that applies quality control of gene models in order to obtain more accurate genome annotations.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We provide a novel method named alignment-based fungal gene prediction (ABFGP) that is particularly suitable for plastic genomes like those of fungi. It can assess gene models on a gene-by-gene basis making use of informant gene loci. Its performance was benchmarked on 6,965 gene models confirmed by full-length unigenes from ten different fungi. 79.4% of all gene models were correctly predicted by ABFGP. It improves the output of <jats:italic>ab initio<\/jats:italic> gene prediction software due to a higher sensitivity and precision for all gene model components. Applicability of the method was shown by revisiting the annotations of six different fungi, using gene loci from up to 29 fungal genomes as informants. Between 7,231 and 8,337 genes were assessed by ABFGP and for each genome between 1,724 and 3,505 gene model revisions were proposed. The reliability of the proposed gene models is assessed by an <jats:italic>a posteriori<\/jats:italic> introspection procedure of each intron and exon in the multiple gene model alignment. The total number and type of proposed gene model revisions in the six fungal genomes is correlated to the quality of the genome assembly, and to sequencing strategies used in the sequencing centre, highlighting different types of errors in different annotation pipelines. The ABFGP method is particularly successful in discovering sequence errors and\/or disruptive mutations causing truncated and erroneous gene models.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>The ABFGP method is an accurate and fully automated quality control method for fungal gene catalogues that can be easily implemented into existing annotation pipelines. With the exponential release of new genomes, the ABFGP method will help decreasing the number of gene models that require additional manual curation.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-15-19","type":"journal-article","created":{"date-parts":[[2014,1,16]],"date-time":"2014-01-16T22:02:04Z","timestamp":1389909724000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Automated alignment-based curation of gene models in filamentous fungi"],"prefix":"10.1186","volume":"15","author":[{"given":"Ate","family":"van der Burgt","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Edouard","family":"Severing","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"J\u00e9r\u00f4me","family":"Collemare","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pierre JGM","family":"de Wit","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2014,1,16]]},"reference":[{"issue":"Database issue","key":"6267_CR1","doi-asserted-by":"publisher","first-page":"D26","DOI":"10.1093\/nar\/gkr947","volume":"40","author":"IV Grigoriev","year":"2012","unstructured":"Grigoriev IV, Nordberg H, Shabalov I, Aerts A, Cantor M, Goodstein D, Kuo A, Minovitsky S, Nikitin R, Ohm RA, et al: The genome portal of the Department of Energy Joint Genome Institute. Nucleic Acids Res. 2012, 40 (Database issue): D26-D32.","journal-title":"Nucleic Acids Res"},{"key":"6267_CR2","doi-asserted-by":"publisher","first-page":"833","DOI":"10.1016\/S0076-6879(10)70034-3","volume":"470","author":"CA Cuomo","year":"2010","unstructured":"Cuomo CA, Birren BW: The fungal genome initiative and lessons learned from genome sequencing. Methods Enzymol. 2010, 470: 833-855.","journal-title":"Methods Enzymol"},{"key":"6267_CR3","doi-asserted-by":"publisher","first-page":"269","DOI":"10.1007\/978-1-60327-241-4_16","volume":"609","author":"E Picardi","year":"2010","unstructured":"Picardi E, Pesole G: Computational methods for ab initio and comparative gene finding. Methods Mol Biol. 2010, 609: 269-284. 10.1007\/978-1-60327-241-4_16.","journal-title":"Methods Mol Biol"},{"issue":"1","key":"6267_CR4","doi-asserted-by":"publisher","first-page":"S11","DOI":"10.1186\/gb-2006-7-s1-s11","volume":"7","author":"M Stanke","year":"2006","unstructured":"Stanke M, Tzvetkova A, Morgenstern B: AUGUSTUS at EGASP: using EST, protein and genomic alignments for improved gene prediction in the human genome. Genome Biol. 2006, 7 (1): S11-11\u201318","journal-title":"Genome Biol"},{"issue":"12","key":"6267_CR5","doi-asserted-by":"publisher","first-page":"1979","DOI":"10.1101\/gr.081612.108","volume":"18","author":"V Ter-Hovhannisyan","year":"2008","unstructured":"Ter-Hovhannisyan V, Lomsadze A, Chernoff YO, Borodovsky M: Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training. Genome Res. 2008, 18 (12): 1979-1990. 10.1101\/gr.081612.108.","journal-title":"Genome Res"},{"issue":"11","key":"6267_CR6","doi-asserted-by":"publisher","first-page":"2330","DOI":"10.1101\/gr.2816704","volume":"14","author":"AE Tenney","year":"2004","unstructured":"Tenney AE, Brown RH, Vaske C, Lodge JK, Doering TL, Brent MR: Gene prediction and verification in a compact genome with numerous small introns. Genome Res. 2004, 14 (11): 2330-2335. 10.1101\/gr.2816704.","journal-title":"Genome Res"},{"key":"6267_CR7","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1186\/1471-2105-7-62","volume":"7","author":"M Stanke","year":"2006","unstructured":"Stanke M, Schoffmann O, Morgenstern B, Waack S: Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinformatics. 2006, 7: 62-10.1186\/1471-2105-7-62.","journal-title":"BMC Bioinformatics"},{"issue":"12","key":"6267_CR8","doi-asserted-by":"publisher","first-page":"e1003037","DOI":"10.1371\/journal.ppat.1003037","volume":"8","author":"RA Ohm","year":"2012","unstructured":"Ohm RA, Feau N, Henrissat B, Schoch CL, Horwitz BA, Barry KW, Condon BJ, Copeland AC, Dhillon B, Glaser F, et al: Diverse lifestyles and strategies of plant pathogenesis encoded in the genomes of eighteen Dothideomycetes fungi. PLoS Pathog. 2012, 8 (12): e1003037-10.1371\/journal.ppat.1003037.","journal-title":"PLoS Pathog"},{"issue":"4","key":"6267_CR9","doi-asserted-by":"publisher","first-page":"1015","DOI":"10.1111\/j.1469-8137.2012.04330.x","volume":"196","author":"R Oliver","year":"2012","unstructured":"Oliver R: Genomic tillage and the harvest of fungal phytopathogens. New Phytol. 2012, 196 (4): 1015-1023. 10.1111\/j.1469-8137.2012.04330.x.","journal-title":"New Phytol"},{"issue":"6","key":"6267_CR10","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1038\/nrmicro2790","volume":"10","author":"S Raffaele","year":"2012","unstructured":"Raffaele S, Kamoun S: Genome evolution in filamentous plant pathogens: why bigger can be better. Nat Rev Microbiol. 2012, 10 (6): 417-430.","journal-title":"Nat Rev Microbiol"},{"issue":"5","key":"6267_CR11","doi-asserted-by":"publisher","first-page":"597","DOI":"10.1093\/bioinformatics\/btn004","volume":"24","author":"Q Liu","year":"2008","unstructured":"Liu Q, Mackey AJ, Roos DS, Pereira FC: Evigan: a hidden variable model for integrating gene evidence for eukaryotic gene prediction. Bioinformatics. 2008, 24 (5): 597-605. 10.1093\/bioinformatics\/btn004.","journal-title":"Bioinformatics"},{"issue":"12","key":"6267_CR12","doi-asserted-by":"publisher","first-page":"1571","DOI":"10.1093\/bioinformatics\/bts176","volume":"28","author":"A Bernal","year":"2012","unstructured":"Bernal A, Crammer K, Pereira F: Automated gene-model curation using global discriminative learning. Bioinformatics. 2012, 28 (12): 1571-1578. 10.1093\/bioinformatics\/bts176.","journal-title":"Bioinformatics"},{"key":"6267_CR13","doi-asserted-by":"publisher","first-page":"433","DOI":"10.1186\/1471-2105-9-433","volume":"9","author":"Q Liu","year":"2008","unstructured":"Liu Q, Crammer K, Pereira FC, Roos DS: Reranking candidate gene models with cross-species comparison for improved gene prediction. BMC Bioinformatics. 2008, 9: 433-10.1186\/1471-2105-9-433.","journal-title":"BMC Bioinformatics"},{"issue":"5","key":"6267_CR14","doi-asserted-by":"publisher","first-page":"988","DOI":"10.1101\/gr.1865504","volume":"14","author":"E Birney","year":"2004","unstructured":"Birney E, Clamp M, Durbin R: GeneWise and Genomewise. Genome Res. 2004, 14 (5): 988-995. 10.1101\/gr.1865504.","journal-title":"Genome Res"},{"key":"6267_CR15","doi-asserted-by":"publisher","first-page":"278","DOI":"10.1186\/1471-2105-9-278","volume":"9","author":"O Keller","year":"2008","unstructured":"Keller O, Odronitz F, Stanke M, Kollmar M, Waack S: Scipio: using protein sequences to determine the precise exon\/intron structures of genes and their orthologs in closely related species. BMC Bioinformatics. 2008, 9: 278-10.1186\/1471-2105-9-278.","journal-title":"BMC Bioinformatics"},{"issue":"Database issue","key":"6267_CR16","doi-asserted-by":"publisher","first-page":"D637","DOI":"10.1093\/nar\/gkq1016","volume":"39","author":"P Wong","year":"2011","unstructured":"Wong P, Walter M, Lee W, Mannhaupt G, Munsterkotter M, Mewes HW, Adam G, Guldener U: FGDB: revisiting the genome annotation of the plant pathogen Fusarium graminearum. Nucleic Acids Res. 2011, 39 (Database issue): D637-D639.","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"6267_CR17","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1186\/1471-2164-14-21","volume":"14","author":"C Zhao","year":"2013","unstructured":"Zhao C, Waalwijk C, de Wit PJ, Tang D, van der Lee T: RNA-Seq analysis reveals new gene models and alternative splicing in the fungal pathogen Fusarium graminearum. BMC Genomics. 2013, 14 (1): 21-10.1186\/1471-2164-14-21.","journal-title":"BMC Genomics"},{"issue":"5","key":"6267_CR18","doi-asserted-by":"publisher","first-page":"1088","DOI":"10.1128\/EC.3.5.1088-1100.2004","volume":"3","author":"DM Kupfer","year":"2004","unstructured":"Kupfer DM, Drabenstot SD, Buchanan KL, Lai H, Zhu H, Dyer DW, Roe BA, Murphy JW: Introns and splicing elements of five diverse fungi. Eukaryot Cell. 2004, 3 (5): 1088-1100. 10.1128\/EC.3.5.1088-1100.2004.","journal-title":"Eukaryot Cell"},{"issue":"12","key":"6267_CR19","doi-asserted-by":"publisher","first-page":"e422","DOI":"10.1371\/journal.pbio.0020422","volume":"2","author":"CB Nielsen","year":"2004","unstructured":"Nielsen CB, Friedman B, Birren B, Burge CB, Galagan JE: Patterns of intron gain and loss in fungi. PLoS Biology. 2004, 2 (12): e422-10.1371\/journal.pbio.0020422.","journal-title":"PLoS Biology"},{"issue":"11","key":"6267_CR20","doi-asserted-by":"publisher","first-page":"e1003088","DOI":"10.1371\/journal.pgen.1003088","volume":"8","author":"PJ de Wit","year":"2012","unstructured":"de Wit PJ, van der Burgt A, Okmen B, Stergiopoulos I, Abd-Elsalam KA, Aerts AL, Bahkali AH, Beenen HG, Chettri P, Cox MP, et al: The genomes of the fungal plant pathogens Cladosporium fulvum and Dothistroma septosporum reveal adaptation to different hosts and lifestyles but also signatures of common ancestry. PLoS Genetics. 2012, 8 (11): e1003088-10.1371\/journal.pgen.1003088.","journal-title":"PLoS Genetics"},{"issue":"8","key":"6267_CR21","doi-asserted-by":"publisher","first-page":"e1002230","DOI":"10.1371\/journal.pgen.1002230","volume":"7","author":"J Amselem","year":"2011","unstructured":"Amselem J, Cuomo CA, van Kan JA, Viaud M, Benito EP, Couloux A, Coutinho PM, de Vries RP, Dyer PS, Fillinger S, et al: Genomic analysis of the necrotrophic fungal pathogens Sclerotinia sclerotiorum and Botrytis cinerea. PLoS Genetics. 2011, 7 (8): e1002230-10.1371\/journal.pgen.1002230.","journal-title":"PLoS Genetics"},{"issue":"7","key":"6267_CR22","doi-asserted-by":"publisher","first-page":"e1002137","DOI":"10.1371\/journal.ppat.1002137","volume":"7","author":"SJ Klosterman","year":"2011","unstructured":"Klosterman SJ, Subbarao KV, Kang S, Veronese P, Gold SE, Thomma BP, Chen Z, Henrissat B, Lee YH, Park J, et al: Comparative genomics yields insights into niche adaptation of plant vascular wilt pathogens. PLoS Pathogens. 2011, 7 (7): e1002137-10.1371\/journal.ppat.1002137.","journal-title":"PLoS Pathogens"},{"issue":"6","key":"6267_CR23","doi-asserted-by":"publisher","first-page":"e1002070","DOI":"10.1371\/journal.pgen.1002070","volume":"7","author":"SB Goodwin","year":"2011","unstructured":"Goodwin SB, M'Barek SB, Dhillon B, Wittenberg AH, Crane CF, Hane JK, Foster AJ, Van der Lee TA, Grimwood J, Aerts A, et al: Finished genome of the fungal wheat pathogen Mycosphaerella graminicola reveals dispensome structure, chromosome plasticity, and stealth pathogenesis. PLoS Genetics. 2011, 7 (6): e1002070-10.1371\/journal.pgen.1002070.","journal-title":"PLoS Genetics"},{"key":"6267_CR24","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1111\/mpp.12072","volume":"15","author":"A van der Burgt","year":"2013","unstructured":"van der Burgt A, Karimi M, Bahkali AH, de Wit PJ: Pseudogenization in pathogenic fungi with different host plants and lifestyles might reflect their evolutionary past. Mol Plant Pathol. 2013, 15: 133-144. in press","journal-title":"Mol Plant Pathol"},{"issue":"11","key":"6267_CR25","doi-asserted-by":"publisher","first-page":"1413","DOI":"10.1128\/EC.00164-12","volume":"11","author":"M Staats","year":"2012","unstructured":"Staats M, van Kan JA: Genome update of Botrytis cinerea strains B05.10 and T4. Eukaryot Cell. 2012, 11 (11): 1413-1414. 10.1128\/EC.00164-12.","journal-title":"Eukaryot Cell"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-15-19.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,2]],"date-time":"2021-09-02T02:50:51Z","timestamp":1630551051000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-15-19"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,1,16]]},"references-count":25,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2014,12]]}},"alternative-id":["6267"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-15-19","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2014,1,16]]},"assertion":[{"value":"15 July 2013","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 January 2014","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 January 2014","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"19"}}