{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T00:07:24Z","timestamp":1775606844378,"version":"3.50.1"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2019,7,27]],"date-time":"2019-07-27T00:00:00Z","timestamp":1564185600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Office of Sponsored Research","award":["URF\/1\/3454-01-01"],"award-info":[{"award-number":["URF\/1\/3454-01-01"]}]},{"name":"Office of Sponsored Research","award":["FCC\/1\/1976-08-01"],"award-info":[{"award-number":["FCC\/1\/1976-08-01"]}]},{"name":"Office of Sponsored Research","award":["FCS\/1\/3657-02-01"],"award-info":[{"award-number":["FCS\/1\/3657-02-01"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Protein function prediction is one of the major tasks of bioinformatics that can help in wide range of biological problems such as understanding disease mechanisms or finding drug targets. Many methods are available for predicting protein functions from sequence based features, protein\u2013protein interaction networks, protein structure or literature. However, other than sequence, most of the features are difficult to obtain or not available for many proteins thereby limiting their scope. Furthermore, the performance of sequence-based function prediction methods is often lower than methods that incorporate multiple features and predicting protein functions may require a lot of time.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We developed a novel method for predicting protein functions from sequence alone which combines deep convolutional neural network (CNN) model with sequence similarity based predictions. Our CNN model scans the sequence for motifs which are predictive for protein functions and combines this with functions of similar proteins (if available). We evaluate the performance of DeepGOPlus using the CAFA3 evaluation measures and achieve an Fmax of 0.390, 0.557 and 0.614 for BPO, MFO and CCO evaluations, respectively. These results would have made DeepGOPlus one of the three best predictors in CCO and the second best performing method in the BPO and MFO evaluations. We also compare DeepGOPlus with state-of-the-art methods such as DeepText2GO and GOLabeler on another dataset. DeepGOPlus can annotate around 40 protein sequences per second on common hardware, thereby making fast and accurate function predictions available for a wide range of proteins.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>http:\/\/deepgoplus.bio2vec.net\/ .<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz595","type":"journal-article","created":{"date-parts":[[2019,7,24]],"date-time":"2019-07-24T15:17:27Z","timestamp":1563981447000},"page":"422-429","source":"Crossref","is-referenced-by-count":428,"title":["DeepGOPlus: improved protein function prediction from sequence"],"prefix":"10.1093","volume":"36","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1710-1820","authenticated-orcid":false,"given":"Maxat","family":"Kulmanov","sequence":"first","affiliation":[{"name":"Computational Bioscience Research Center , Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8149-5890","authenticated-orcid":false,"given":"Robert","family":"Hoehndorf","sequence":"additional","affiliation":[{"name":"Computational Bioscience Research Center , Computer, Electrical and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, Thuwal 23955, Saudi Arabia"}]}],"member":"286","published-online":{"date-parts":[[2019,7,27]]},"reference":[{"key":"2023013112064031400_btz595-B1","author":"Abadi","year":"2016"},{"key":"2023013112064031400_btz595-B2","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2023013112064031400_btz595-B3","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet"},{"key":"2023013112064031400_btz595-B4","doi-asserted-by":"crossref","first-page":"59.","DOI":"10.1038\/nmeth.3176","article-title":"Fast and sensitive protein alignment using diamond","volume":"12","author":"Buchfink","year":"2015","journal-title":"Nat. Methods"},{"key":"2023013112064031400_btz595-B5","first-page":"233","author":"Davis","year":"2006"},{"key":"2023013112064031400_btz595-B6","doi-asserted-by":"crossref","first-page":"D427","DOI":"10.1093\/nar\/gky995","article-title":"The Pfam protein families database in 2019","volume":"47","author":"El-Gebali","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023013112064031400_btz595-B7","doi-asserted-by":"crossref","first-page":"e0198216","DOI":"10.1371\/journal.pone.0198216","article-title":"Predicting human protein function with multi-task deep neural networks","volume":"13","author":"Fa","year":"2018","journal-title":"PLoS One"},{"key":"2023013112064031400_btz595-B8","doi-asserted-by":"crossref","first-page":"D190.","DOI":"10.1093\/nar\/gkw1107","article-title":"Interpro in 2017\u2014beyond protein family and domain annotations","volume":"45","author":"Finn","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023013112064031400_btz595-B9","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1287\/opre.15.3.537","article-title":"Additive utilities with incomplete product sets: application to priorities and assignments","volume":"15","author":"Fishburn","year":"1967","journal-title":"Operations Res"},{"key":"2023013112064031400_btz595-B10","doi-asserted-by":"crossref","first-page":"184.","DOI":"10.1186\/s13059-016-1037-6","article-title":"An expanded evaluation of protein function prediction methods shows an improvement in accuracy","volume":"17","author":"Jiang","year":"2016","journal-title":"Genome Biol"},{"key":"2023013112064031400_btz595-B11","first-page":"60","author":"Kahanda","year":"2017"},{"key":"2023013112064031400_btz595-B13","author":"Kingma","year":"2014"},{"key":"2023013112064031400_btz595-B14","doi-asserted-by":"crossref","first-page":"660","DOI":"10.1093\/bioinformatics\/btx624","article-title":"DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier","volume":"34","author":"Kulmanov","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013112064031400_btz595-B15","doi-asserted-by":"crossref","first-page":"1236","DOI":"10.1093\/bioinformatics\/btu031","article-title":"InterProScan 5: genome-scale protein function classification","volume":"30","author":"Mitchell","year":"2014","journal-title":"Bioinformatics"},{"key":"2023013112064031400_btz595-B16","author":"Nair","year":"2010"},{"key":"2023013112064031400_btz595-B17","doi-asserted-by":"crossref","first-page":"i53","DOI":"10.1093\/bioinformatics\/btt228","article-title":"Information-theoretic evaluation of predicted ontological annotations","volume":"29","author":"Radivojac","year":"2013","journal-title":"Bioinformatics"},{"key":"2023013112064031400_btz595-B18","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1038\/nmeth.2340","article-title":"A large-scale evaluation of computational protein function prediction","volume":"10","author":"Radivojac","year":"2013","journal-title":"Nat. Methods"},{"key":"2023013112064031400_btz595-B19","author":"Rives","year":"2019"},{"key":"2023013112064031400_btz595-B20","article-title":"The universal protein resource (uniprot)","volume":"35","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023013112064031400_btz595-B21","doi-asserted-by":"crossref","first-page":"e1005324","DOI":"10.1371\/journal.pcbi.1005324","article-title":"Accurate de novo prediction of protein contact map by ultra-deep learning model","volume":"13","author":"Wang","year":"2017","journal-title":"PLOS Comput. Biol"},{"key":"2023013112064031400_btz595-B22","doi-asserted-by":"crossref","first-page":"7.","DOI":"10.1038\/nmeth.3213","article-title":"The I-TASSER Suite: protein structure and function prediction","volume":"12","author":"Yang","year":"2015","journal-title":"Nat. Methods"},{"key":"2023013112064031400_btz595-B23","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1016\/j.ymeth.2018.05.026","article-title":"DeepText2GO: improving large-scale protein function prediction with deep semantic text representation","volume":"145","author":"You","year":"2018","journal-title":"Methods"},{"key":"2023013112064031400_btz595-B24","doi-asserted-by":"crossref","first-page":"2465","DOI":"10.1093\/bioinformatics\/bty130","article-title":"GOLabeler: improving sequence-based large-scale protein function prediction by learning to rank","volume":"34","author":"You","year":"2018","journal-title":"Bioinformatics"},{"key":"2023013112064031400_btz595-B25","author":"Zhou","year":"2019"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz595\/29192371\/btz595.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/2\/422\/48990969\/btz595.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/2\/422\/48990969\/btz595.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T16:23:05Z","timestamp":1675182185000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/2\/422\/5539866"}},"subtitle":[],"editor":[{"given":"Lenore","family":"Cowen","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,7,27]]},"references-count":24,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz595","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/615260","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,1,15]]},"published":{"date-parts":[[2019,7,27]]}}}