{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T21:31:24Z","timestamp":1781127084540,"version":"3.54.1"},"reference-count":23,"publisher":"Oxford University Press (OUP)","issue":"6","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,3,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: As improved DNA sequencing techniques have increased enormously the speed of producing new eukaryotic genome assemblies, the further development of automated gene prediction methods continues to be essential.<\/jats:p><jats:p>While the classification of proteins into families is a task heavily relying on correct gene predictions, it can at the same time provide a source of additional information for the prediction, complementary to those presently used.<\/jats:p><jats:p>Results: We extended the gene prediction software AUGUSTUS by a method that employs block profiles generated from multiple sequence alignments as a protein signature to improve the accuracy of the prediction. Equipped with profiles modelling human dynein heavy chain (DHC) proteins and other families, AUGUSTUS was run on the genomic sequences known to contain members of these families. Compared with AUGUSTUS' ab initio version, the rate of genes predicted with high accuracy showed a dramatic increase.<\/jats:p><jats:p>Availability: The AUGUSTUS project web page is located at http:\/\/augustus.gobics.de, with the executable program as well as the source code available for download.<\/jats:p><jats:p>Contact: \u00a0keller@cs.uni-goettingen.de; mario.stanke@uni-greifswald.de<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr010","type":"journal-article","created":{"date-parts":[[2011,1,8]],"date-time":"2011-01-08T01:26:38Z","timestamp":1294449998000},"page":"757-763","source":"Crossref","is-referenced-by-count":567,"title":["A novel hybrid gene prediction method employing protein multiple sequence alignments"],"prefix":"10.1093","volume":"27","author":[{"given":"Oliver","family":"Keller","sequence":"first","affiliation":[{"name":"1 Institute of Computer Science, University of G\u00f6ttingen, Goldschmidtstrasse 7, 2Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, 37077 G\u00f6ttingen and 3Institute of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Strasse 47, 17487 Greifswald, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Martin","family":"Kollmar","sequence":"additional","affiliation":[{"name":"1 Institute of Computer Science, University of G\u00f6ttingen, Goldschmidtstrasse 7, 2Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, 37077 G\u00f6ttingen and 3Institute of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Strasse 47, 17487 Greifswald, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mario","family":"Stanke","sequence":"additional","affiliation":[{"name":"1 Institute of Computer Science, University of G\u00f6ttingen, Goldschmidtstrasse 7, 2Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, 37077 G\u00f6ttingen and 3Institute of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Strasse 47, 17487 Greifswald, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Stephan","family":"Waack","sequence":"additional","affiliation":[{"name":"1 Institute of Computer Science, University of G\u00f6ttingen, Goldschmidtstrasse 7, 2Department of NMR-based Structural Biology, Max-Planck-Institute for Biophysical Chemistry, Am Fassberg 11, 37077 G\u00f6ttingen and 3Institute of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Strasse 47, 17487 Greifswald, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2011,1,6]]},"reference":[{"key":"2023012511540559400_B1","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1093\/protein\/7.7.841","article-title":"Prints\u2013a protein motif fingerprint database","volume":"7","author":"Attwood","year":"1994","journal-title":"Protein Eng."},{"key":"2023012511540559400_B2","doi-asserted-by":"crossref","first-page":"400","DOI":"10.1093\/nar\/gkg030","article-title":"Prints and its automatic supplement, preprints","volume":"31","author":"Attwood","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012511540559400_B3","doi-asserted-by":"crossref","first-page":"988","DOI":"10.1101\/gr.1865504","article-title":"Genewise and genomewise","volume":"14","author":"Birney","year":"2004","journal-title":"Genome Res."},{"key":"2023012511540559400_B4","doi-asserted-by":"crossref","first-page":"21034","DOI":"10.1073\/pnas.0811066106","article-title":"Discovery and revision of arabidopsis genes by proteogenomics","volume":"105","author":"Castellana","year":"2008","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511540559400_B5","doi-asserted-by":"crossref","first-page":"i97","DOI":"10.1093\/bioinformatics\/btm225","article-title":"Homology search for genes","volume":"23","author":"Cui","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012511540559400_B6","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1186\/gb-2009-10-1-201","article-title":"Identifying protein-coding genes in genomic sequences","volume":"10","author":"Harrow","year":"2009","journal-title":"Genome Biol."},{"key":"2023012511540559400_B7","doi-asserted-by":"crossref","first-page":"6565","DOI":"10.1093\/nar\/19.23.6565","article-title":"Automated assembly of protein blocks for database searching","volume":"19","author":"Henikoff","year":"1991","journal-title":"Nucleic Acids Res."},{"key":"2023012511540559400_B8","doi-asserted-by":"crossref","first-page":"111","DOI":"10.1016\/0076-6879(90)83009-X","article-title":"Finding protein similarities with nucleotide sequence databases","volume":"183","author":"Henikoff","year":"1990","journal-title":"Methods Enzymol."},{"key":"2023012511540559400_B9","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1093\/bioinformatics\/15.6.471","article-title":"Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations","volume":"15","author":"Henikoff","year":"1999","journal-title":"Bioinformatics"},{"key":"2023012511540559400_B10","doi-asserted-by":"crossref","first-page":"D211","DOI":"10.1093\/nar\/gkn785","article-title":"Interpro: the integrative protein signature database","volume":"37","author":"Hunter","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012511540559400_B11","doi-asserted-by":"crossref","first-page":"278","DOI":"10.1186\/1471-2105-9-278","article-title":"Scipio: using protein sequences to determine the precise exon\/intron structures of genes and their orthologs in closely related species","volume":"9","author":"Keller","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012511540559400_B12","first-page":"656","article-title":"Blat\u2013the blast-like alignment tool","volume":"12","author":"Kent","year":"2002","journal-title":"Genome Res."},{"key":"2023012511540559400_B13","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1038\/nrg2626","article-title":"Sequencing technologies - the next generation","volume":"11","author":"Metzker","year":"2010","journal-title":"Nat. Rev. Genet."},{"key":"2023012511540559400_B14","doi-asserted-by":"crossref","first-page":"776","DOI":"10.1093\/nar\/gkh211","article-title":"Gene structure conservation aids similarity based gene prediction","volume":"32","author":"Meyer","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012511540559400_B15","doi-asserted-by":"crossref","first-page":"300","DOI":"10.1186\/1471-2164-7-300","article-title":"Pfarao: a web application for protein family analysis customized for cytoskeletal and motor proteins (cymobase)","volume":"7","author":"Odronitz","year":"2006","journal-title":"BMC Genomics"},{"key":"2023012511540559400_B16","doi-asserted-by":"crossref","first-page":"422","DOI":"10.1186\/1471-2164-9-422","article-title":"Webscipio: An online tool for the determination of gene structures using protein sequences","volume":"9","author":"Odronitz","year":"2008","journal-title":"BMC Genomics"},{"key":"2023012511540559400_B17","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1093\/nar\/24.1.197","article-title":"The blocks database\u2013a system for protein classification","volume":"24","author":"Pietrokovski","year":"1996","journal-title":"Nucleic Acids Res."},{"key":"2023012511540559400_B18","doi-asserted-by":"crossref","first-page":"W116","DOI":"10.1093\/nar\/gki442","article-title":"Interproscan: protein domains identifier","volume":"33","author":"Quevillon","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012511540559400_B19","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/1471-2105-6-31","article-title":"Automated generation of heuristics for biological sequence comparison","volume":"6","author":"Slater","year":"2005","journal-title":"BMC Bioinformatics"},{"issue":"Suppl. 2","key":"2023012511540559400_B20","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1093\/bioinformatics\/btg1080","article-title":"Gene prediction with a hidden Markov model and a new intron submodel","volume":"19","author":"Stanke","year":"2003","journal-title":"Bioinformatics"},{"issue":"Suppl. 1","key":"2023012511540559400_B21","first-page":"1","article-title":"Augustus at egasp: using est, protein and genomic alignments for improved gene prediction in the human genome","volume":"7","author":"Stanke","year":"2006","journal-title":"Genome Biol."},{"key":"2023012511540559400_B22","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1186\/1471-2105-7-62","article-title":"Gene prediction in eukaryotes with a generalized hidden markov model that uses hints from external sources","volume":"7","author":"Stanke","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012511540559400_B23","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1093\/bioinformatics\/btn013","article-title":"Using native and syntenically mapped cdna alignments to improve de novo gene finding","volume":"24","author":"Stanke","year":"2008","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/6\/757\/48863720\/bioinformatics_27_6_757.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/6\/757\/48863720\/bioinformatics_27_6_757.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,4]],"date-time":"2023-06-04T19:54:46Z","timestamp":1685908486000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/6\/757\/234821"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,1,6]]},"references-count":23,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2011,3,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr010","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2011,3,15]]},"published":{"date-parts":[[2011,1,6]]}}}