{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,30]],"date-time":"2026-05-30T01:02:28Z","timestamp":1780102948325,"version":"3.54.0"},"reference-count":38,"publisher":"Oxford University Press (OUP)","issue":"10","license":[{"start":{"date-parts":[[2024,9,23]],"date-time":"2024-09-23T00:00:00Z","timestamp":1727049600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Proteins play pivotal roles in biological systems, and precise prediction of their functions is indispensable for practical applications. Despite the surge in protein sequence data facilitated by high-throughput techniques, unraveling the exact functionalities of proteins still demands considerable time and resources. Currently, numerous methods rely on protein sequences for prediction, while methods targeting protein structures are scarce, often employing convolutional neural networks (CNN) or graph convolutional networks (GCNs) individually.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>To address these challenges, our approach starts from protein structures and proposes a method that combines CNN and GCN into a unified framework called the two-model adaptive weight fusion network (TAWFN) for protein function prediction. First, amino acid contact maps and sequences are extracted from the protein structure. Then, the sequence is used to generate one-hot encoded features and deep semantic features. These features, along with the constructed graph, are fed into the adaptive graph convolutional networks (AGCN) module and the multi-layer convolutional neural network (MCNN) module as needed, resulting in preliminary classification outcomes. Finally, the preliminary classification results are inputted into the adaptive weight computation network, where adaptive weights are calculated to fuse the initial predictions from both networks, yielding the final prediction result. To evaluate the effectiveness of our method, experiments were conducted on the PDBset and AFset datasets. For molecular function, biological process, and cellular component tasks, TAWFN achieved area under the precision-recall curve (AUPR) values of 0.718, 0.385, and 0.488 respectively, with corresponding Fmax scores of 0.762, 0.628, and 0.693, and Smin scores of 0.326, 0.483, and 0.454. The experimental results demonstrate that TAWFN exhibits promising performance, outperforming existing methods.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The TAWFN source code can be found at: https:\/\/github.com\/ss0830\/TAWFN.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae571","type":"journal-article","created":{"date-parts":[[2024,9,23]],"date-time":"2024-09-23T19:08:08Z","timestamp":1727118488000},"source":"Crossref","is-referenced-by-count":18,"title":["TAWFN: a deep learning framework for protein function prediction"],"prefix":"10.1093","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2442-8354","authenticated-orcid":false,"given":"Lu","family":"Meng","sequence":"first","affiliation":[{"name":"College of Information Science and Engineering, Northeastern University , Shenyang, Liaoning, 110000,","place":["China"]}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoran","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Information Science and Engineering, Northeastern University , Shenyang, Liaoning, 110000,","place":["China"]}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2024,9,23]]},"reference":[{"key":"2024100123513599900_btae571-B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res"},{"key":"2024100123513599900_btae571-B2","author":"Baek","year":"2021"},{"key":"2024100123513599900_btae571-B3","first-page":"27","article-title":"Scale invariant feature transform plus hue feature, the international archives of photogrammetry, remote sensing and spatial","volume":"42","author":"Bdaneshvar","year":"2017","journal-title":"Inform Sci"},{"key":"2024100123513599900_btae571-B4","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2020","journal-title":"Nucleic Acids Res"},{"key":"2024100123513599900_btae571-B5","doi-asserted-by":"crossref","first-page":"i318","DOI":"10.1093\/bioinformatics\/btad208","article-title":"Combining protein sequences and structures with transformers and equivariant graph neural networks to predict protein function","volume":"39","author":"Boadu","year":"2023","journal-title":"Bioinformatics"},{"key":"2024100123513599900_btae571-B6","doi-asserted-by":"crossref","first-page":"i53","DOI":"10.1093\/bioinformatics\/btt228","article-title":"Information-theoretic evaluation of predicted ontological annotations","volume":"29","author":"Clark","year":"2013","journal-title":"Bioinformatics"},{"key":"2024100123513599900_btae571-B7","doi-asserted-by":"crossref","first-page":"D482","DOI":"10.1093\/nar\/gky1114","article-title":"Sifts: updated structure integration with function, taxonomy and sequences resource allows 40-fold increase in coverage of structure-based annotations for proteins","volume":"47","author":"Dana","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024100123513599900_btae571-B8","doi-asserted-by":"crossref","first-page":"3460","DOI":"10.1093\/bioinformatics\/btv398","article-title":"Functional classification of CATH superfamilies: a domain-based approach for protein function annotation","volume":"31","author":"Das","year":"2015","journal-title":"Bioinformatics"},{"key":"2024100123513599900_btae571-B9","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1145\/1143844.1143874","volume-title":"Proceedings of the 23rd International Conference on Machine Learning (ICML)","author":"Davis","year":"2006"},{"key":"2024100123513599900_btae571-B10","doi-asserted-by":"crossref","first-page":"823","DOI":"10.1038\/35015694","article-title":"Protein function in the post-genomic era","volume":"405","author":"Eisenberg","year":"2000","journal-title":"Nature"},{"key":"2024100123513599900_btae571-B11","doi-asserted-by":"crossref","first-page":"7112","DOI":"10.1109\/TPAMI.2021.3095381","article-title":"ProtTrans: toward understanding the language of life through self-supervised learning","volume":"44","author":"Elnaggar","year":"2022","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2024100123513599900_btae571-B12","author":"Fey","year":"2019"},{"key":"2024100123513599900_btae571-B13","doi-asserted-by":"crossref","first-page":"3168","DOI":"10.1038\/s41467-021-23303-9","article-title":"Structure-based protein function prediction using graph convolutional networks","volume":"12","author":"Gligorijevi\u0107","year":"2021","journal-title":"Nat Commun"},{"key":"2024100123513599900_btae571-B14","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btad410","article-title":"Hierarchical graph transformer with contrastive learning for protein function prediction","volume":"39","author":"Gu","year":"2023","journal-title":"Bioinformatics"},{"key":"2024100123513599900_btae571-B15","first-page":"12595","author":"Guo","year":"2020"},{"key":"2024100123513599900_btae571-B16","author":"Huikai","year":"2019"},{"key":"2024100123513599900_btae571-B17","first-page":"4171","author":"Kenton","year":"2019"},{"key":"2024100123513599900_btae571-B18","author":"Kingma","year":"2015"},{"key":"2024100123513599900_btae571-B19","doi-asserted-by":"crossref","first-page":"422","DOI":"10.1093\/bioinformatics\/btz595","article-title":"DeepGOPlus: Improved protein function prediction from sequence","volume":"36","author":"Kulmanov","year":"2020","journal-title":"Bioinformatics"},{"key":"2024100123513599900_btae571-B20","doi-asserted-by":"crossref","first-page":"660","DOI":"10.1093\/bioinformatics\/btx624","article-title":"DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier","volume":"34","author":"Kulmanov","year":"2018","journal-title":"Bioinformatics"},{"key":"2024100123513599900_btae571-B21","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1038\/s42256-024-00795-w","article-title":"Protein function prediction as approximate semantic entailment","volume":"6","author":"Kulmanov","year":"2024","journal-title":"Nat Mach Intell"},{"key":"2024100123513599900_btae571-B22","doi-asserted-by":"crossref","first-page":"bbab502","DOI":"10.1093\/bib\/bbab502","article-title":"Accurate protein function prediction via graph attention networks with predicted structure information","volume":"23","author":"Lai","year":"2022","journal-title":"Brief Bioinform"},{"key":"2024100123513599900_btae571-B23","first-page":"2117","author":"Lin","year":"2017"},{"key":"2024100123513599900_btae571-B24","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1016\/j.jbiotec.2016.03.038","article-title":"A facile and efficient transposon mutagenesis method for generation of multi-codon deletions in protein sequences","volume":"227","author":"Liu","year":"2016","journal-title":"J Biotechnol"},{"key":"2024100123513599900_btae571-B25","doi-asserted-by":"crossref","first-page":"601","DOI":"10.1038\/35001165","article-title":"Guilt-by-association goes global","volume":"403","author":"Oliver","year":"2020","journal-title":"Nature"},{"key":"2024100123513599900_btae571-B26","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1038\/nmeth.2340","article-title":"A large-scale evaluation of computational protein function prediction","volume":"10","author":"Radivojac","year":"2013","journal-title":"Nat Methods"},{"key":"2024100123513599900_btae571-B27","doi-asserted-by":"crossref","first-page":"e2016239118","DOI":"10.1073\/pnas.2016239118","article-title":"Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences","volume":"118","author":"Rives","year":"2021","journal-title":"Proc Natl Acad Sci USA"},{"key":"2024100123513599900_btae571-B28","doi-asserted-by":"crossref","first-page":"2401","DOI":"10.1093\/bioinformatics\/btaa003","article-title":"UDSMProt: universal deep sequence models for protein classification","volume":"36","author":"Strodthoff","year":"2020","journal-title":"Bioinformatics"},{"key":"2024100123513599900_btae571-B29","doi-asserted-by":"crossref","first-page":"bbad201","DOI":"10.1093\/bib\/bbad201","article-title":"MMSMAPlus: a multi-view multi-scale multi-attention embedding model for protein function prediction","volume":"24","author":"Wang","year":"2023","journal-title":"Brief Bioinform"},{"key":"2024100123513599900_btae571-B30","doi-asserted-by":"crossref","first-page":"W296","DOI":"10.1093\/nar\/gky427","article-title":"SWISS-MODEL: homology modelling of protein structures and complexes","volume":"46","author":"Waterhouse","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024100123513599900_btae571-B31","doi-asserted-by":"crossref","first-page":"4668","DOI":"10.1093\/bioinformatics\/btab551","article-title":"PhosIDN: an integrated deep neural network for improving protein phosphorylation site prediction by combining sequence and protein\u2013protein interaction information","volume":"37","author":"Yang","year":"2021","journal-title":"Bioinformatics"},{"key":"2024100123513599900_btae571-B32","doi-asserted-by":"crossref","first-page":"W469","DOI":"10.1093\/nar\/gkab398","article-title":"NetGO 2.0: improving large-scale protein function prediction with massive sequence, text, domain, family and network information","volume":"49","author":"Yao","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2024100123513599900_btae571-B33","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1016\/j.ymeth.2018.05.026","article-title":"DeepText2GO: improving large-scale protein function prediction with deep semantic text representation","volume":"145","author":"You","year":"2018","journal-title":"Methods"},{"key":"2024100123513599900_btae571-B34","doi-asserted-by":"crossref","first-page":"W379","DOI":"10.1093\/nar\/gkz388","article-title":"NetGO: improving large-scale protein function prediction with massive network information","volume":"47","author":"You","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2024100123513599900_btae571-B35","doi-asserted-by":"crossref","first-page":"bbad117","DOI":"10.1093\/bib\/bbad117","article-title":"Fast and accurate protein function prediction from sequence through pretrained language model and homology-based label diffusion","volume":"24","author":"Yuan","year":"2023","journal-title":"Brief Bioinform"},{"key":"2024100123513599900_btae571-B36","doi-asserted-by":"crossref","first-page":"2208","DOI":"10.1109\/TCBB.2020.2968882","article-title":"A deep learning framework for gene ontology annotations with sequence- and network-based information","volume":"18","author":"Zhang","year":"2021","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2024100123513599900_btae571-B37","doi-asserted-by":"crossref","first-page":"755","DOI":"10.1109\/TNB.2023.3278033","article-title":"Protein function prediction with functional and topological knowledge of gene ontology","volume":"22","author":"Zhao","year":"2023","journal-title":"IEEE Trans Nanobiosci"},{"key":"2024100123513599900_btae571-B38","doi-asserted-by":"crossref","first-page":"e1010793","DOI":"10.1371\/journal.pcbi.1010793","article-title":"Integrating unsupervised language model with triplet neural networks for protein gene ontology prediction","volume":"18","author":"Zhu","year":"2022","journal-title":"PLoS Comput Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae571\/59253526\/btae571.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/10\/btae571\/59491578\/btae571.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/10\/btae571\/59491578\/btae571.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,1]],"date-time":"2024-10-01T23:51:54Z","timestamp":1727826714000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae571\/7766190"}},"subtitle":[],"editor":[{"given":"Xin","family":"Gao","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"editor"}]}],"short-title":[],"issued":{"date-parts":[[2024,9,23]]},"references-count":38,"journal-issue":{"issue":"10","published-print":{"date-parts":[[2024,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae571","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,10]]},"published":{"date-parts":[[2024,9,23]]},"article-number":"btae571"}}