{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T01:50:16Z","timestamp":1768441816119,"version":"3.49.0"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2016,10,12]],"date-time":"2016-10-12T00:00:00Z","timestamp":1476230400000},"content-version":"vor","delay-in-days":268,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,5,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Summary: Genomes of emerging model organisms are now being sequenced at very low cost. However, obtaining accurate gene predictions remains challenging: even the best gene prediction algorithms make substantial errors and can jeopardize subsequent analyses. Therefore, many predicted genes must be time-consumingly visually inspected and manually curated. We developed GeneValidator (GV) to automatically identify problematic gene predictions and to aid manual curation. For each gene, GV performs multiple analyses based on comparisons to gene sequences from large databases. The resulting report identifies problematic gene predictions and includes extensive statistics and graphs for each prediction to guide manual curation efforts. GV thus accelerates and enhances the work of biocurators and researchers who need accurate gene predictions from newly sequenced genomes.<\/jats:p>\n               <jats:p>Availability and implementation: GV can be used through a web interface or in the command-line. GV is open-source (AGPL), available at https:\/\/wurmlab.github.io\/tools\/genevalidator.<\/jats:p>\n               <jats:p>Contact: y.wurm@qmul.ac.uk<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btw015","type":"journal-article","created":{"date-parts":[[2016,2,15]],"date-time":"2016-02-15T01:09:07Z","timestamp":1455498547000},"page":"1559-1561","source":"Crossref","is-referenced-by-count":38,"title":["GeneValidator: identify problems with protein-coding gene predictions"],"prefix":"10.1093","volume":"32","author":[{"given":"Monica-Andreea","family":"Dr\u0103gan","sequence":"first","affiliation":[{"name":"1 Department of Computer Science, ETH Z\u00fcrich, Z\u00fcrich, Switzerland,"}]},{"given":"Ismail","family":"Moghul","sequence":"additional","affiliation":[{"name":"2 School of Biological and Chemical Sciences, Queen Mary University of London, London, UK and"}]},{"given":"Anurag","family":"Priyam","sequence":"additional","affiliation":[{"name":"2 School of Biological and Chemical Sciences, Queen Mary University of London, London, UK and"}]},{"given":"Claudio","family":"Bustos","sequence":"additional","affiliation":[{"name":"3 Departamento de Psiquiatr\u00eda y Salud Mental, University of Concepci\u00f3n, Concepci\u00f3n, Chile"}]},{"given":"Yannick","family":"Wurm","sequence":"additional","affiliation":[{"name":"2 School of Biological and Chemical Sciences, Queen Mary University of London, London, UK and"}]}],"member":"286","published-online":{"date-parts":[[2016,1,18]]},"reference":[{"key":"2023020112262362200_btw015-B1","doi-asserted-by":"crossref","first-page":"502827","DOI":"10.1155\/2013\/502827","article-title":"ASPic-GeneID: a lightweight pipeline for gene prediction and alternative isoforms detection","volume":"2013","author":"Alioto","year":"2013","journal-title":"BioMed Res. Int"},{"key":"2023020112262362200_btw015-B2","doi-asserted-by":"crossref","first-page":"D46","DOI":"10.1093\/nar\/gkp1024","article-title":"GenBank","volume":"38","author":"Benson","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023020112262362200_btw015-B3","doi-asserted-by":"crossref","first-page":"1035","DOI":"10.1093\/bioinformatics\/bts080","article-title":"Biogem: an effective tool-based approach for scaling up open source software development in bioinformatics","volume":"28","author":"Bonnal","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020112262362200_btw015-B4","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1186\/1471-2105-10-421","article-title":"BLAST+: architecture and applications","volume":"10","author":"Camacho","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023020112262362200_btw015-B5","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1101\/gr.6743907","article-title":"MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes","volume":"18","author":"Cantarel","year":"2008","journal-title":"Genome Res"},{"key":"2023020112262362200_btw015-B6","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1186\/1471-2105-10-67","article-title":"Quantitative measures for the management and comparison of annotated genomes","volume":"10","author":"Eilbeck","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023020112262362200_btw015-B7","doi-asserted-by":"crossref","first-page":"e50609","DOI":"10.1371\/journal.pone.0050609","article-title":"Evaluating high-throughput ab initio gene finders to discover proteins encoded in eukaryotic pathogen genomes missed by laboratory techniques","volume":"7","author":"Goodswen","year":"2012","journal-title":"PLoS One"},{"key":"2023020112262362200_btw015-B8","first-page":"59","article-title":"A cost-effective RNA sequencing protocol for large-scale gene expression studies","volume":"5","author":"Hou","year":"2015","journal-title":"Nat. Sci. Rep"},{"key":"2023020112262362200_btw015-B9","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1038\/455047a","article-title":"Big data: the future of biocuration","volume":"455","author":"Howe","year":"2008","journal-title":"Nature"},{"key":"2023020112262362200_btw015-B10","doi-asserted-by":"crossref","first-page":"757","DOI":"10.1093\/bioinformatics\/btr010","article-title":"A novel hybrid gene prediction method employing protein multiple sequence alignments","volume":"27","author":"Keller","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020112262362200_btw015-B11","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1186\/1471-2105-5-59","article-title":"Gene finding in novel genomes","volume":"5","author":"Korf","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023020112262362200_btw015-B12","doi-asserted-by":"crossref","first-page":"e119","DOI":"10.1093\/nar\/gku557","article-title":"Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm","volume":"42","author":"Lomsadze","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023020112262362200_btw015-B13","doi-asserted-by":"crossref","first-page":"S5","DOI":"10.1038\/nmeth.1422","article-title":"Visualizing genomes: techniques and challenges","volume":"7","author":"Nielsen","year":"2010","journal-title":"Nat. Methods"},{"key":"2023020112262362200_btw015-B14","first-page":"59","article-title":"Ant genomics (Hymenoptera: Formicidae): challenges to overcome and opportunities to seize","volume":"21","author":"Nygaard","year":"2015","journal-title":"Myrmecol. News"},{"key":"2023020112262362200_btw015-B15","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1093\/bioinformatics\/btm071","article-title":"CEGMA: A pipeline to accurately annotate core genes in eukaryotic genomes","volume":"23","author":"Parra","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020112262362200_btw015-B16","first-page":"96","article-title":"Eukaryotic genome complexity","volume-title":"Nat. Educ","author":"Pray","year":"2008"},{"key":"2023020112262362200_btw015-B17","doi-asserted-by":"crossref","first-page":"3425","DOI":"10.1093\/bioinformatics\/btr569","article-title":"Visualization and quality assessment of de novo genome assemblies","volume":"27","author":"Riba-Grognuz","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020112262362200_btw015-B18","doi-asserted-by":"crossref","first-page":"1235","DOI":"10.1101\/gr.155408.113","article-title":"Social insect genomes exhibit dramatic evolution in gene composition and regulation while preserving regulatory features linked to sociality","volume":"23","author":"Simola","year":"2013","journal-title":"Genome Res"},{"key":"2023020112262362200_btw015-B19","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1093\/bioinformatics\/btn013","article-title":"Using native and syntenically mapped cDNA alignments to improve de novo gene finding","volume":"24","author":"Stanke","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020112262362200_btw015-B20","first-page":"gku989","article-title":"Uniprot: a hub for protein information","author":"The UniProt Consortium","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2023020112262362200_btw015-B21","author":"Wetterstrand","year":"2015"},{"key":"2023020112262362200_btw015-B22","doi-asserted-by":"crossref","first-page":"r58","DOI":"10.1186\/gb-2006-7-7-r58","article-title":"yrGATE: a web-based gene-structure annotation tool for the identification and dissemination of eukaryotic genes","volume":"7","author":"Wilkerson","year":"2006","journal-title":"Genome Biol"},{"key":"2023020112262362200_btw015-B23","first-page":"e143696.68941","article-title":"Avoid having to retract your genomics analysis","volume":"2","author":"Wurm","year":"2015","journal-title":"The Winnower"},{"key":"2023020112262362200_btw015-B24","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1038\/nrg3174","article-title":"A beginner\u2019s guide to eukaryotic genome annotation","volume":"13","author":"Yandell","year":"2012","journal-title":"Nat. Rev. Genet"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/10\/1559\/49019347\/bioinformatics_32_10_1559.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/10\/1559\/49019347\/bioinformatics_32_10_1559.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T22:30:42Z","timestamp":1675290642000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/10\/1559\/1742817"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,1,18]]},"references-count":24,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2016,5,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw015","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2016,5,15]]},"published":{"date-parts":[[2016,1,18]]}}}