{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,21]],"date-time":"2025-09-21T17:12:04Z","timestamp":1758474724530,"version":"3.38.0"},"reference-count":0,"publisher":"SAGE Publications","issue":"1-2","license":[{"start":{"date-parts":[[2009,1,1]],"date-time":"2009-01-01T00:00:00Z","timestamp":1230768000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["In Silico Biology: Journal of Biological Systems Modeling and Multi-Scale Simulation"],"published-print":{"date-parts":[[2009,2]]},"abstract":"<jats:p> A Naive Bayes classifier tool is presented for annotating proteins on the basis of amino acid motifs, cellular localization and protein-protein interactions. Annotations take the form of posterior probabilities within the Molecular Function hierarchy of the Gene Ontology (GO). Experiments with the data available for yeast, Saccharomyces cerevisiae, show that our prediction method can yield a relatively high level of accuracy. Several apparent challenges and possibilities for future developments are also discussed. <\/jats:p><jats:p> A common approach to functional characterization is to use sequence similarities at varying levels, by utilizing several existing databases and local alignment\/identification algorithms. Such an approach is typically quite labor-intensive when performed by an expert in a manual fashion. Integration of several sources of information is in this context generally considered as the only possibility to obtain valuable predictions with practical implications. However, some improvements in the prediction accuracy of the molecular functions, and thereby also savings in the computational effort, can be achieved by restricting attention to only those data sources that involve a higher degree of specificity. We employ here a Naive Bayes model in order to provide probabilistic predictions, and to enable a computationally efficient approach to data integration. <\/jats:p>","DOI":"10.3233\/isb-2009-0382","type":"journal-article","created":{"date-parts":[[2019,12,3]],"date-time":"2019-12-03T03:36:53Z","timestamp":1575344213000},"page":"23-34","source":"Crossref","is-referenced-by-count":8,"title":["A Naive Bayes Classifier for Protein Function Prediction"],"prefix":"10.1177","volume":"9","author":[{"given":"Jukka","family":"Kohonen","sequence":"first","affiliation":[{"name":"Department of Mathematics and Statistics, University\r\t\t\t of Helsinki, Helsinki, FI-00014, Finland"}]},{"given":"Sarish","family":"Talikota","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Statistics, University\r\t\t\t of Helsinki, Helsinki, FI-00014, Finland"}]},{"given":"Jukka","family":"Corander","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Statistics, University\r\t\t\t of Helsinki, Helsinki, FI-00014, Finland"}]},{"given":"Petri","family":"Auvinen","sequence":"additional","affiliation":[{"name":"Institute of Biotechnology, University of Helsinki,\r\t\t\t Helsinki, FI-00014, Finland"}]},{"given":"Elja","family":"Arjas","sequence":"additional","affiliation":[{"name":"Department of Mathematics and Statistics, University\r\t\t\t of Helsinki, Helsinki, FI-00014, Finland"}]}],"member":"179","published-online":{"date-parts":[[2009,1,1]]},"container-title":["In Silico Biology: Journal of Biological Systems Modeling and Multi-Scale Simulation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/ISB-2009-0382","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.3233\/ISB-2009-0382","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,11]],"date-time":"2025-03-11T06:57:59Z","timestamp":1741676279000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.3233\/ISB-2009-0382"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,1,1]]},"references-count":0,"journal-issue":{"issue":"1-2","published-print":{"date-parts":[[2009,2]]}},"alternative-id":["10.3233\/ISB-2009-0382"],"URL":"https:\/\/doi.org\/10.3233\/isb-2009-0382","relation":{},"ISSN":["1386-6338","1434-3207"],"issn-type":[{"type":"print","value":"1386-6338"},{"type":"electronic","value":"1434-3207"}],"subject":[],"published":{"date-parts":[[2009,1,1]]}}}