{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T23:05:58Z","timestamp":1773270358407,"version":"3.50.1"},"reference-count":16,"publisher":"Springer Science and Business Media LLC","issue":"S3","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2013,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Any method that <jats:italic>de novo<\/jats:italic> predicts protein function should do better than random. More challenging, it also ought to outperform simple homology-based inference.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Methods<\/jats:title>\n            <jats:p>Here, we describe a few methods that predict protein function exclusively through homology. Together, they set the bar or lower limit for future improvements.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results and conclusions<\/jats:title>\n            <jats:p>During the development of these methods, we faced two surprises. Firstly, our most successful implementation for the baseline ranked very high at CAFA1. In fact, our best combination of homology-based methods fared only slightly worse than the top-of-the-line prediction method from the Jones group. Secondly, although the concept of homology-based inference is simple, this work revealed that the precise details of the implementation are crucial: not only did the methods span from top to bottom performers at CAFA, but also the reasons for these differences were unexpected. In this work, we also propose a new rigorous measure to compare predicted and experimental annotations. It puts more emphasis on the details of protein function than the other measures employed by CAFA and may best reflect the expectations of users. Clearly, the definition of proper goals remains one major objective for CAFA.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-14-s3-s7","type":"journal-article","created":{"date-parts":[[2013,2,28]],"date-time":"2013-02-28T17:09:37Z","timestamp":1362071377000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":37,"title":["Homology-based inference sets the bar high for protein function prediction"],"prefix":"10.1186","volume":"14","author":[{"given":"Tobias","family":"Hamp","sequence":"first","affiliation":[]},{"given":"Rebecca","family":"Kassner","sequence":"additional","affiliation":[]},{"given":"Stefan","family":"Seemayer","sequence":"additional","affiliation":[]},{"given":"Esmeralda","family":"Vicedo","sequence":"additional","affiliation":[]},{"given":"Christian","family":"Schaefer","sequence":"additional","affiliation":[]},{"given":"Dominik","family":"Achten","sequence":"additional","affiliation":[]},{"given":"Florian","family":"Auer","sequence":"additional","affiliation":[]},{"given":"Ariane","family":"Boehm","sequence":"additional","affiliation":[]},{"given":"Tatjana","family":"Braun","sequence":"additional","affiliation":[]},{"given":"Maximilian","family":"Hecht","sequence":"additional","affiliation":[]},{"given":"Mark","family":"Heron","sequence":"additional","affiliation":[]},{"given":"Peter","family":"H\u00f6nigschmid","sequence":"additional","affiliation":[]},{"given":"Thomas A","family":"Hopf","sequence":"additional","affiliation":[]},{"given":"Stefanie","family":"Kaufmann","sequence":"additional","affiliation":[]},{"given":"Michael","family":"Kiening","sequence":"additional","affiliation":[]},{"given":"Denis","family":"Krompass","sequence":"additional","affiliation":[]},{"given":"Cedric","family":"Landerer","sequence":"additional","affiliation":[]},{"given":"Yannick","family":"Mahlich","sequence":"additional","affiliation":[]},{"given":"Manfred","family":"Roos","sequence":"additional","affiliation":[]},{"given":"Burkhard","family":"Rost","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2013,2,28]]},"reference":[{"key":"5696_CR1","doi-asserted-by":"publisher","first-page":"D214","DOI":"10.1093\/nar\/gkq1020","volume":"39","author":"Consortium TU","year":"2011","unstructured":"Consortium TU: Ongoing and Future Developments at the Universal Protein Resource. Nucleic Acids Research. 2011, 39: D214-219.","journal-title":"Nucleic Acids Research"},{"key":"5696_CR2","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1038\/75556","volume":"25","author":"M Ashburner","year":"2000","unstructured":"Ashburner M, Ball CA, Blake JA, Botstein D, Butler H: Gene Ontology: Tool for the Unification of Biology. The Gene Ontology Consortium. Nat Genet. 2000, 25: 25-29. 10.1038\/75556.","journal-title":"Nat Genet"},{"key":"5696_CR3","doi-asserted-by":"publisher","first-page":"210","DOI":"10.1016\/j.tibtech.2009.01.002","volume":"27","author":"R Rentzsch","year":"2009","unstructured":"Rentzsch R, Orengo CA: Protein function prediction--the power of multiplicity. Trends Biotechnol. 2009, 27: 210-219. 10.1016\/j.tibtech.2009.01.002.","journal-title":"Trends Biotechnol"},{"key":"5696_CR4","volume-title":"Nature Methods","author":"P Radivojac","year":"2012","unstructured":"Radivojac P, Clark WT, Friedberg I: A Large-scale Evaluation of Computational Protein Function Prediction. Nature Methods. 2012"},{"key":"5696_CR5","doi-asserted-by":"publisher","first-page":"798","DOI":"10.1093\/bioinformatics\/btn037","volume":"24","author":"MN Wass","year":"2008","unstructured":"Wass MN, Sternberg MJ: ConFunc--Functional Annotation in the Twilight Zone. Bioinformatics. 2008, 24: 798-806. 10.1093\/bioinformatics\/btn037.","journal-title":"Bioinformatics"},{"key":"5696_CR6","doi-asserted-by":"publisher","first-page":"2628","DOI":"10.1093\/bioinformatics\/btn486","volume":"24","author":"CE Jones","year":"2008","unstructured":"Jones CE, Schwerdt J, Bretag TA, Baumann U, Brown AL: GOSLING: A Rule-Based Protein Annotator Using BLAST and GO. Bioinformatics. 2008, 24: 2628-2629. 10.1093\/bioinformatics\/btn486.","journal-title":"Bioinformatics"},{"key":"5696_CR7","doi-asserted-by":"publisher","first-page":"1550","DOI":"10.1110\/ps.062153506","volume":"15","author":"T Hawkins","year":"2006","unstructured":"Hawkins T, Luban S, Kihara D: Enhanced Automated Function Prediction Using Distantly Related Sequences and Contextual Association by PFP. Protein Science. 2006, 15: 1550-1556. 10.1110\/ps.062153506.","journal-title":"Protein Science"},{"key":"5696_CR8","doi-asserted-by":"publisher","first-page":"178","DOI":"10.1186\/1471-2105-5-178","volume":"5","author":"DM Martin","year":"2004","unstructured":"Martin DM, Berriman M, Barton GJ: GOtcha: A New Method for Prediction of Protein Function Assessed by the Annotation of Seven Genomes. BMC Bioinformatics. 2004, 5: 178-10.1186\/1471-2105-5-178.","journal-title":"BMC Bioinformatics"},{"key":"5696_CR9","doi-asserted-by":"publisher","first-page":"1739","DOI":"10.1093\/bioinformatics\/btp309","volume":"25","author":"M Chitale","year":"2009","unstructured":"Chitale M, Hawkins T, Park C, Kihara D: ESG: Extended Similarity Group Method for Automated Protein Function Prediction. Bioinformatics. 2009, 25: 1739-1745. 10.1093\/bioinformatics\/btp309.","journal-title":"Bioinformatics"},{"key":"5696_CR10","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1142\/S0219720010004744","volume":"8","author":"A Sokolov","year":"2010","unstructured":"Sokolov A, Ben-Hur A: Hierarchical Classification of Gene Ontology Terms Using the GOstruct Method. Journal of Bioinformatics and Computational Biology. 2010, 8: 357-376. 10.1142\/S0219720010004744.","journal-title":"Journal of Bioinformatics and Computational Biology"},{"key":"5696_CR11","doi-asserted-by":"publisher","first-page":"2086","DOI":"10.1002\/prot.23029","volume":"79","author":"WT Clark","year":"2011","unstructured":"Clark WT, Radivojac P: Analysis of Protein Function and its Prediction from Amino Acid Sequence. Proteins. 2011, 79: 2086-2096. 10.1002\/prot.23029.","journal-title":"Proteins"},{"key":"5696_CR12","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"SF Altschul","year":"1997","unstructured":"Altschul SF, Madden TL, Schaeffer AA, Zhang J, Zhang Z: Gapped Blast and PSI-Blast: A New Generation of Protein Database Search Programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093\/nar\/25.17.3389.","journal-title":"Nucleic Acids Res"},{"key":"5696_CR13","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1613\/jair.594","volume":"10","author":"K Ming","year":"1999","unstructured":"Ming K, Witten I: Issues in Stacked Generalization. Journal of Artificial Intelligence Research. 1999, 10: 271-280.http:\/\/citeseerx.ist.psu.edu\/viewdoc\/summary?doi=10.1.1.16.1519http:\/\/dl.acm.org\/citation.cfm?id=1622868,","journal-title":"Journal of Artificial Intelligence Research"},{"issue":"Suppl 6","key":"5696_CR14","doi-asserted-by":"publisher","first-page":"548","DOI":"10.1002\/prot.10534","volume":"53","author":"VA Eyrich","year":"2003","unstructured":"Eyrich VA, Przybylski D, Koh IY, Grana O, Pazos F: CAFASP3 in the Spotlight of EVA. Proteins. 2003, 53 (Suppl 6): 548-560.","journal-title":"Proteins"},{"key":"5696_CR15","doi-asserted-by":"publisher","first-page":"3311","DOI":"10.1093\/nar\/gkg619","volume":"31","author":"IY Koh","year":"2003","unstructured":"Koh IY, Eyrich VA, Marti-Renom MA, Przybylski D, Madhusudhan MS: EVA: Evaluation of Protein Structure Prediction Servers. Nucleic Acids Research. 2003, 31: 3311-3315. 10.1093\/nar\/gkg619.","journal-title":"Nucleic Acids Research"},{"key":"5696_CR16","doi-asserted-by":"publisher","first-page":"435","DOI":"10.1016\/S0969-2126(02)00731-1","volume":"10","author":"MA Marti-Renom","year":"2002","unstructured":"Marti-Renom MA, Madhusudhan MS, Fiser A, Rost B, Sali A: Reliability of Assessment of Protein Structure Prediction Methods. Structure. 2002, 10: 435-440. 10.1016\/S0969-2126(02)00731-1.","journal-title":"Structure"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-14-S3-S7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T22:24:32Z","timestamp":1630535072000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-14-S3-S7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,2]]},"references-count":16,"journal-issue":{"issue":"S3","published-print":{"date-parts":[[2013,2]]}},"alternative-id":["5696"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-14-s3-s7","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,2]]},"assertion":[{"value":"28 February 2013","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S7"}}