{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T17:00:12Z","timestamp":1778173212792,"version":"3.51.4"},"reference-count":89,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2025,7,31]],"date-time":"2025-07-31T00:00:00Z","timestamp":1753920000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001824","name":"Czech Science Foundation","doi-asserted-by":"publisher","award":["23-07349S"],"award-info":[{"award-number":["23-07349S"]}],"id":[{"id":"10.13039\/501100001824","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2025,8,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Predicting protein\u2013ligand binding sites is crucial in studying protein interactions with applications in biotechnology and drug discovery. Two distinct paradigms have emerged for this purpose: sequence-based methods, which leverage protein sequence information, and structure-based methods, which rely on the three-dimensional (3D) structure of the protein. Here, we analyze a hybrid approach that combines the strengths of both paradigms by integrating two recent deep learning architectures: protein language models (pLMs) from the sequence-based paradigm and Graph Neural Networks (GNNs) from the structure-based paradigm. Specifically, we construct a residue-level Graph Attention Network (GAT) model based on the protein\u2019s 3D structure that uses pre-trained pLM embeddings as node features. This integration enables us to study the interplay between the sequential information encoded in the protein sequence and the spatial relationships within the protein structure on the model performance.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>By exploiting a benchmark dataset over a range of ligands and ligand types, we have shown that using the structure information consistently enhances the predictive power of the baselines in absolute terms. Nevertheless, as more complex pLMs are used to represent node features, the relative impact of the structure information represented by the GNN architecture diminishes. The above observations suggest that although the use of the experimental protein structure almost always improves the accuracy of the prediction of the binding site, complex pLMs still contain structural information that leads to good predictive performance even without the use of 3D structure.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>The datasets generated and\/or analyzed during the current study, as well as pretrained models, are available in the following Zenodo link https:\/\/zenodo.org\/records\/15184302. The source code that was used to generate the results of the current study is available in the following GitHub repository https:\/\/github.com\/hamzagamouh\/pt-lm-gnn as well as in the following Zenodo link https:\/\/zenodo.org\/records\/15192327.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf431","type":"journal-article","created":{"date-parts":[[2025,7,30]],"date-time":"2025-07-30T11:48:18Z","timestamp":1753876098000},"source":"Crossref","is-referenced-by-count":10,"title":["Hybrid protein\u2013ligand binding residue prediction with protein language models: does the structure matter?"],"prefix":"10.1093","volume":"41","author":[{"ORCID":"https:\/\/orcid.org\/0009-0005-2584-727X","authenticated-orcid":false,"given":"Hamza","family":"Gamouh","sequence":"first","affiliation":[{"name":"Faculty of Mathematics and Physics, Charles University , 118 00 Prague,","place":["Czech Republic"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8788-3202","authenticated-orcid":false,"given":"Marian","family":"Novotn\u00fd","sequence":"additional","affiliation":[{"name":"Faculty of Science, Charles University , 128 00 Prague,","place":["Czech Republic"]}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Hoksza","sequence":"additional","affiliation":[{"name":"Faculty of Mathematics and Physics, Charles University , 118 00 Prague,","place":["Czech Republic"]}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2025,7,31]]},"reference":[{"key":"2025082519465337300_btaf431-B12","doi-asserted-by":"crossref","first-page":"5069","DOI":"10.1021\/acs.jcim.1c00799","article-title":"Deeppocket: ligand binding site detection and segmentation using 3d convolutional neural networks","volume":"62","author":"Aggarwal","year":"2022","journal-title":"J Chem Inf Model"},{"key":"2025082519465337300_btaf431-B13","doi-asserted-by":"crossref","first-page":"831","DOI":"10.1038\/nbt.3300","article-title":"Predicting the sequence specificities of dna-and rna-binding proteins by deep learning","volume":"33","author":"Alipanahi","year":"2015","journal-title":"Nat Biotechnol"},{"key":"2025082519465337300_btaf431-B14","doi-asserted-by":"crossref","first-page":"W529","DOI":"10.1093\/nar\/gkq399","article-title":"Consurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids","volume":"38","author":"Ashkenazy","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B15","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B16","first-page":"1877","article-title":"Language models are few-shot learners","volume":"33","author":"Brown","year":"2020","journal-title":"Adv Neural Info Process Syst"},{"key":"2025082519465337300_btaf431-B17","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1073\/pnas.0707684105","article-title":"A threading-based method (findsite) for ligand-binding site prediction and functional annotation","volume":"105","author":"Brylinski","year":"2008","journal-title":"Proc Natl Acad Sci USA"},{"key":"2025082519465337300_btaf431-B18","first-page":"100134","article-title":"Deep learning in computer vision: a critical review of emerging techniques and application scenarios","volume":"6","author":"Chai","year":"2021","journal-title":"Mach Learn Appl"},{"key":"2025082519465337300_btaf431-B19","doi-asserted-by":"crossref","first-page":"434","DOI":"10.1186\/1471-2105-10-434","article-title":"Identification of atp binding residues of a protein from its primary sequence","volume":"10","author":"Chauhan","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2025082519465337300_btaf431-B20","doi-asserted-by":"crossref","first-page":"331","DOI":"10.1093\/bioinformatics\/btr657","article-title":"Prediction and analysis of nucleotide-binding residues using sequence and sequence-derived structural descriptors","volume":"28","author":"Chen","year":"2012","journal-title":"Bioinformatics"},{"key":"2025082519465337300_btaf431-B21","doi-asserted-by":"crossref","first-page":"S4","DOI":"10.1186\/1471-2105-15-S15-S4","article-title":"Ligandrfs: random Forest ensemble to identify ligand-binding residues from sequence information alone","volume":"15","author":"Chen","year":"2014","journal-title":"BMC Bioinfo"},{"key":"2025082519465337300_btaf431-B22","doi-asserted-by":"crossref","first-page":"901","DOI":"10.1109\/TCBB.2015.2505286","article-title":"A sequence-based dynamic ensemble learning system for protein ligand-binding site prediction","volume":"13","author":"Chen","year":"2016","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2025082519465337300_btaf431-B23","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1186\/s12864-019-6413-7","article-title":"The advantages of the matthews correlation coefficient (mcc) over f1 score and accuracy in binary classification evaluation","volume":"21","author":"Chicco","year":"2020","journal-title":"BMC Genomics"},{"key":"2025082519465337300_btaf431-B24","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1186\/s12859-019-2672-1","article-title":"Predicting protein-ligand binding residues with deep convolutional neural networks","volume":"20","author":"Cui","year":"2019","journal-title":"BMC Bioinfo"},{"key":"2025082519465337300_btaf431-B25","author":"Devlin","year":"2018"},{"key":"2025082519465337300_btaf431-B26","doi-asserted-by":"publisher","first-page":"3149","DOI":"10.1021\/acs.jcim.7b00307","article-title":"Identification of protein\u2013ligand binding sites by sequence information and ensemble classifier","volume":"57","author":"Ding","year":"2017","journal-title":"J Chem Inf Model"},{"key":"2025082519465337300_btaf431-B27","doi-asserted-by":"crossref","first-page":"7112","DOI":"10.1109\/TPAMI.2021.3095381","article-title":"Prottrans: toward understanding the language of life through self-supervised learning","volume":"44","author":"Elnaggar","year":"2022","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"2025082519465337300_btaf431-B28","doi-asserted-by":"crossref","first-page":"1269","DOI":"10.1002\/prot.26816","article-title":"Skittles: gnn-assisted pseudo-ligands generation and its application for binding sites classification and affinity prediction","volume":"93","author":"Evteev","year":"2025","journal-title":"Prot Struct Funct Bioinfo"},{"key":"2025082519465337300_btaf431-B29","doi-asserted-by":"crossref","first-page":"1124","DOI":"10.1021\/acs.jcim.2c01413","article-title":"Siteradar: utilizing graph machine learning for precise mapping of protein\u2013ligand-binding sites","volume":"63","author":"Evteev","year":"2023","journal-title":"J Chem Inf Model"},{"key":"2025082519465337300_btaf431-B30","doi-asserted-by":"crossref","first-page":"13384","DOI":"10.3390\/molecules200713384","article-title":"Molecular docking and structure-based drug design strategies","volume":"20","author":"Ferreira","year":"2015","journal-title":"Molecules"},{"key":"2025082519465337300_btaf431-B31","doi-asserted-by":"crossref","first-page":"521","DOI":"10.1038\/s42256-022-00499-z","article-title":"Controllable protein design with language models","volume":"4","author":"Ferruz","year":"2022","journal-title":"Nat Mach Intell"},{"key":"2025082519465337300_btaf431-B32","article-title":"Protein interface prediction using graph convolutional networks","volume":"30","author":"Fout","year":"2017","journal-title":"Adv Neural Info Process Syst"},{"key":"2025082519465337300_btaf431-B33","doi-asserted-by":"crossref","first-page":"1323","DOI":"10.1093\/bioinformatics\/btw006","article-title":"Mmseqs software suite for fast and deep clustering and searching of large protein sequence sets","volume":"32","author":"Hauser","year":"2016","journal-title":"Bioinformatics"},{"key":"2025082519465337300_btaf431-B34","first-page":"770","author":"He","year":"2016"},{"key":"2025082519465337300_btaf431-B35","doi-asserted-by":"crossref","first-page":"723","DOI":"10.1186\/s12859-019-3220-8","article-title":"Modeling aspects of the language of life through transfer-learning protein sequences","volume":"20","author":"Heinzinger","year":"2019","journal-title":"BMC Bioinformatics"},{"key":"2025082519465337300_btaf431-B36","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1016\/S1093-3263(98)00002-3","article-title":"Ligsite: automatic and efficient detection of potential small molecule-binding sites in proteins","volume":"15","author":"Hendlich","year":"1997","journal-title":"J Mol Graph Model"},{"key":"2025082519465337300_btaf431-B37","doi-asserted-by":"crossref","first-page":"W510","DOI":"10.1093\/nar\/gkac439","article-title":"Netsurfp-3.0: accurate and fast prediction of protein structural features by protein language models and deep learning","volume":"50","author":"H\u00f8ie","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B38","first-page":"3356","author":"Hoksza","year":"2022"},{"key":"2025082519465337300_btaf431-B39","first-page":"448","author":"Ioffe","year":"2015"},{"key":"2025082519465337300_btaf431-B40","doi-asserted-by":"crossref","first-page":"5663","DOI":"10.1038\/s41598-023-31612-w","article-title":"Graph-bert and language model-based framework for protein\u2013protein interaction identification","volume":"13","author":"Jha","year":"2023","journal-title":"Sci Rep"},{"key":"2025082519465337300_btaf431-B41","doi-asserted-by":"publisher","DOI":"10.1007\/s11704-025-41426-w","article-title":"A survey of geometric graph neural networks: data structures, models and applications","volume":"19","author":"Han","year":"2025","journal-title":"Front Comp Sci"},{"key":"2025082519465337300_btaf431-B42","doi-asserted-by":"crossref","first-page":"3036","DOI":"10.1093\/bioinformatics\/btx350","article-title":"Deepsite: protein-binding site predictor using 3d-convolutional neural networks","volume":"33","author":"Jim\u00e9nez","year":"2017","journal-title":"Bioinformatics"},{"key":"2025082519465337300_btaf431-B43","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with alphafold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2025082519465337300_btaf431-B44","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1186\/s13321-021-00547-7","article-title":"Puresnet: prediction of protein-ligand binding sites using deep residual neural network","volume":"13","author":"Kandel","year":"2021","journal-title":"J Cheminform"},{"key":"2025082519465337300_btaf431-B45","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1093\/nar\/28.1.374","article-title":"Aaindex: amino acid index database","volume":"28","author":"Kawashima","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B46","doi-asserted-by":"crossref","first-page":"3713","DOI":"10.1007\/s11042-022-13428-4","article-title":"Natural language processing: state of the art, current trends and challenges","volume":"82","author":"Khurana","year":"2023","journal-title":"Multimed Tools Appl"},{"key":"2025082519465337300_btaf431-B47","doi-asserted-by":"crossref","first-page":"D256","DOI":"10.1093\/nar\/gkw905","article-title":"Mutlbsgenedb: mutated ligand binding site gene database","volume":"45","author":"Kim","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B48","author":"Kipf","year":"2016"},{"key":"2025082519465337300_btaf431-B49","doi-asserted-by":"crossref","first-page":"1413","DOI":"10.1007\/s12551-022-01028-3","article-title":"Protein binding sites for drug design","volume":"14","author":"Konc","year":"2022","journal-title":"Biophys Rev"},{"key":"2025082519465337300_btaf431-B50","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1186\/s13321-018-0285-8","article-title":"P2rank: machine learning based tool for rapid and accurate prediction of ligand binding sites from protein structure","volume":"10","author":"Kriv\u00e1k","year":"2018","journal-title":"J Cheminform"},{"key":"2025082519465337300_btaf431-B51","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1016\/0263-7855(95)00073-9","article-title":"Surfnet: a program for visualizing molecular surfaces, cavities, and intermolecular interactions","volume":"13","author":"Laskowski","year":"1995","journal-title":"J Mol Graph"},{"key":"2025082519465337300_btaf431-B52","doi-asserted-by":"crossref","first-page":"1908","DOI":"10.1093\/bioinformatics\/bti315","article-title":"Q-sitefinder: an energy-based method for the prediction of protein\u2013ligand binding sites","volume":"21","author":"Laurie","year":"2005","journal-title":"Bioinformatics"},{"key":"2025082519465337300_btaf431-B53","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1186\/1471-2105-10-168","article-title":"Fpocket: an open source platform for ligand pocket detection","volume":"10","author":"Le Guilloux","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2025082519465337300_btaf431-B54","doi-asserted-by":"crossref","first-page":"e60","DOI":"10.1093\/nar\/gkad288","article-title":"Geobind: segmentation of nucleic acid binding interface on protein surface with geometric deep learning","volume":"51","author":"Li","year":"2023","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B55","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1016\/j.ymeth.2019.04.008","article-title":"Deep learning in bioinformatics: introduction, application, and perspective in the big data era","volume":"166","author":"Li","year":"2019","journal-title":"Methods"},{"key":"2025082519465337300_btaf431-B56","doi-asserted-by":"crossref","first-page":"1172","DOI":"10.1093\/bioinformatics\/bts095","article-title":"Sitecomp: a server for ligand binding site analysis in protein structures","volume":"28","author":"Lin","year":"2012","journal-title":"Bioinformatics"},{"key":"2025082519465337300_btaf431-B57","author":"Lin","year":"2022"},{"key":"2025082519465337300_btaf431-B58","author":"Lin","year":"2022"},{"key":"2025082519465337300_btaf431-B59","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1038\/s41401-019-0228-6","article-title":"Cb-dock: a web server for cavity detection-guided protein\u2013ligand blind docking","volume":"41","author":"Liu","year":"2020","journal-title":"Acta Pharmacol Sin"},{"key":"2025082519465337300_btaf431-B60","author":"Loshchilov","year":"2017"},{"key":"2025082519465337300_btaf431-B61","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3605943","article-title":"Recent advances in natural language processing via large pre-trained language models: A survey","volume":"56","author":"Min","year":"2024","journal-title":"ACM Comput Surv"},{"key":"2025082519465337300_btaf431-B62","doi-asserted-by":"crossref","first-page":"1681","DOI":"10.1093\/bioinformatics\/btab009","article-title":"Deepsurf: a surface-based deep learning approach for the prediction of ligand binding sites on proteins","volume":"37","author":"Mylonas","year":"2021","journal-title":"Bioinformatics"},{"key":"2025082519465337300_btaf431-B63","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1093\/bioinformatics\/btr651","article-title":"Ftsite: high accuracy detection of ligand binding sites on unbound protein structures","volume":"28","author":"Ngan","year":"2012","journal-title":"Bioinformatics"},{"key":"2025082519465337300_btaf431-B64","author":"O\u2019Shea","year":"2015"},{"key":"2025082519465337300_btaf431-B65","first-page":"2825","article-title":"Scikit-learn: machine learning in python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J Mach Learn Res"},{"key":"2025082519465337300_btaf431-B66","doi-asserted-by":"crossref","first-page":"16933","DOI":"10.1038\/s41598-022-21366-2","article-title":"Improving protein succinylation sites prediction using embeddings from protein language model","volume":"12","author":"Pokharel","year":"2022","journal-title":"Sci Rep"},{"key":"2025082519465337300_btaf431-B67","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1186\/s12859-023-05164-9","article-title":"Plmsnosite: an ensemble-based approach for predicting protein s-nitrosylation sites by integrating supervised word embedding and embedding from pre-trained protein language model","volume":"24","author":"Pratyush","year":"2023","journal-title":"BMC Bioinfo"},{"key":"2025082519465337300_btaf431-B68","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1186\/s12859-014-0379-x","article-title":"Anatomy of enzyme channels","volume":"15","author":"Pravda","year":"2014","journal-title":"BMC Bioinfo"},{"key":"2025082519465337300_btaf431-B69","doi-asserted-by":"crossref","first-page":"e1006718","DOI":"10.1371\/journal.pcbi.1006718","article-title":"Deepdrug3d: classification of ligand-binding pockets in proteins with a convolutional neural network","volume":"15","author":"Pu","year":"2019","journal-title":"PLoS Comput Biol"},{"key":"2025082519465337300_btaf431-B70","author":"Rao","year":"2020"},{"key":"2025082519465337300_btaf431-B71","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1186\/1471-2105-12-160","article-title":"Funfold: an improved automated method for the prediction of ligand binding residues using 3d models of proteins","volume":"12","author":"Roche","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2025082519465337300_btaf431-B72","doi-asserted-by":"crossref","first-page":"29829","DOI":"10.3390\/ijms161226202","article-title":"Proteins and their interacting partners: an introduction to protein\u2013ligand binding site prediction methods","volume":"16","author":"Roche","year":"2015","journal-title":"Int J Mol Sci"},{"key":"2025082519465337300_btaf431-B73","doi-asserted-by":"crossref","first-page":"e27","DOI":"10.1093\/nar\/gkae039","article-title":"Equipnas: improved protein\u2013nucleic acid binding site prediction using protein-language-model-informed equivariant deep graph neural networks","volume":"52","author":"Roche","year":"2024","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B74","doi-asserted-by":"crossref","first-page":"742","DOI":"10.1021\/ci100050t","article-title":"Extended-connectivity fingerprints","volume":"50","author":"Rogers","year":"2010","journal-title":"J Chem Inf Model"},{"key":"2025082519465337300_btaf431-B75","author":"Rusch","year":"2023"},{"key":"2025082519465337300_btaf431-B76","doi-asserted-by":"crossref","first-page":"e1248","DOI":"10.1002\/widm.1248","article-title":"Machine learning for bioinformatics and neuroimaging","volume":"8","author":"Serra","year":"2018","journal-title":"Wiley Interdiscip Rev Data Min Knowl Discov"},{"key":"2025082519465337300_btaf431-B77","first-page":"1929","article-title":"Dropout: a simple way to prevent neural networks from overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J Mach Learn Res"},{"key":"2025082519465337300_btaf431-B78","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1038\/s41592-019-0437-4","article-title":"Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold","volume":"16","author":"Steinegger","year":"2019","journal-title":"Nat Methods"},{"key":"2025082519465337300_btaf431-B79","doi-asserted-by":"crossref","first-page":"895","DOI":"10.1021\/acs.jcim.8b00545","article-title":"Comparative assessment of scoring functions: the casf-2016 update","volume":"59","author":"Su","year":"2019","journal-title":"J Chem Inf Model"},{"key":"2025082519465337300_btaf431-B80","doi-asserted-by":"crossref","first-page":"926","DOI":"10.1093\/bioinformatics\/btu739","article-title":"Uniref clusters: a comprehensive and scalable alternative for improving sequence similarity searches","volume":"31","author":"Suzek","year":"2015","journal-title":"Bioinformatics"},{"key":"2025082519465337300_btaf431-B81","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1007\/978-981-16-4241-8","volume-title":"Bioinformatics and Computational Biology: A Primer for Biologists","author":"Tiwary","year":"2022"},{"key":"2025082519465337300_btaf431-B82","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1038\/s42256-022-00457-9","article-title":"Learning functional properties of proteins with language models","volume":"4","author":"Unsal","year":"2022","journal-title":"Nat Mach Intell"},{"key":"2025082519465337300_btaf431-B83","doi-asserted-by":"crossref","first-page":"D439","DOI":"10.1093\/nar\/gkab1061","article-title":"Alphafold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models","volume":"50","author":"Varadi","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B84","article-title":"Attention is all you need","volume":"30","author":"Vaswani","year":"2017","journal-title":"Adv Neural Info Process Syst"},{"key":"2025082519465337300_btaf431-B85","doi-asserted-by":"crossref","first-page":"102538","DOI":"10.1016\/j.sbi.2023.102538","article-title":"Everything is connected: graph neural networks","volume":"79","author":"Veli\u010dkovi\u0107","year":"2023","journal-title":"Curr Opin Struct Biol"},{"key":"2025082519465337300_btaf431-B86","author":"Veli\u010dkovi\u0107","year":"2017"},{"key":"2025082519465337300_btaf431-B87","doi-asserted-by":"crossref","first-page":"2977","DOI":"10.1021\/jm030580l","article-title":"The pdbbind database: collection of binding affinities for protein- ligand complexes with known three-dimensional structures","volume":"47","author":"Wang","year":"2004","journal-title":"J Med Chem"},{"key":"2025082519465337300_btaf431-B88","doi-asserted-by":"crossref","first-page":"2223","DOI":"10.1109\/TCBB.2023.3239983","article-title":"Graphplbr: protein-ligand binding residue prediction with deep graph convolution network","volume":"20","author":"Wang","year":"2023","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2025082519465337300_btaf431-B89","doi-asserted-by":"crossref","first-page":"122","DOI":"10.3390\/cells8020122","article-title":"A high efficient biological language model for predicting protein\u2013protein interactions","volume":"8","author":"Wang","year":"2019","journal-title":"Cells"},{"key":"2025082519465337300_btaf431-B90","doi-asserted-by":"crossref","first-page":"W469","DOI":"10.1093\/nar\/gkq406","article-title":"3dligandsite: predicting ligand-binding sites using similar structures","volume":"38","author":"Wass","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B91","doi-asserted-by":"crossref","first-page":"e51","DOI":"10.1093\/nar\/gkab044","article-title":"Graphbind: protein structural context embedded rules learned by hierarchical graph neural networks for recognizing nucleic-acid-binding residues","volume":"49","author":"Xia","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B92","doi-asserted-by":"crossref","first-page":"D1096","DOI":"10.1093\/nar\/gks966","article-title":"Biolip: a semi-manually curated database for biologically relevant ligand\u2013protein interactions","volume":"41","author":"Yang","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2025082519465337300_btaf431-B93","doi-asserted-by":"crossref","first-page":"2588","DOI":"10.1093\/bioinformatics\/btt447","article-title":"Protein\u2013ligand binding site recognition using complementary binding-specific substructure comparison and sequence profile alignment","volume":"29","author":"Yang","year":"2013","journal-title":"Bioinformatics"},{"key":"2025082519465337300_btaf431-B94","doi-asserted-by":"crossref","first-page":"994","DOI":"10.1109\/TCBB.2013.104","article-title":"Designing template-free predictor for targeting protein-ligand binding sites with classifier ensemble and spatial clustering","volume":"10","author":"Yu","year":"2013","journal-title":"IEEE\/ACM Trans Comput Biol Bioinform"},{"key":"2025082519465337300_btaf431-B95","doi-asserted-by":"crossref","first-page":"bbab564","DOI":"10.1093\/bib\/bbab564","article-title":"Alphafold2-aware protein\u2013DNA binding site prediction using graph transformer","volume":"23","author":"Yuan","year":"2022","journal-title":"Brief Bioinform"},{"key":"2025082519465337300_btaf431-B96","doi-asserted-by":"crossref","first-page":"690049","DOI":"10.3389\/fgene.2021.690049","article-title":"Graph neural networks and their current applications in bioinformatics","volume":"12","author":"Zhang","year":"2021","journal-title":"Front Genet"},{"key":"2025082519465337300_btaf431-B97","author":"Zhang","year":"2023"},{"key":"2025082519465337300_btaf431-B98","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1016\/j.csbj.2020.02.008","article-title":"Exploring the computational methods for protein-ligand binding site prediction","volume":"18","author":"Zhao","year":"2020","journal-title":"Comput Struct Biotechnol J"},{"key":"2025082519465337300_btaf431-B99","doi-asserted-by":"publisher","first-page":"965","DOI":"10.3390\/genes10120965","article-title":"Sxgbsite: prediction of protein\u2013ligand binding sites using sequence information and extreme gradient boosting","volume":"10","author":"Zhao","year":"2019","journal-title":"Genes (Basel)"},{"key":"2025082519465337300_btaf431-B100","author":"Zheng","year":"2023"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf431\/63902656\/btaf431.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/8\/btaf431\/63902656\/btaf431.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/41\/8\/btaf431\/63902656\/btaf431.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,25]],"date-time":"2025-08-25T23:47:12Z","timestamp":1756165632000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf431\/8220314"}},"subtitle":[],"editor":[{"given":"Lenore","family":"Cowen","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2025,7,31]]},"references-count":89,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2025,8,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf431","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.08.11.553028","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025,8]]},"published":{"date-parts":[[2025,7,31]]},"article-number":"btaf431"}}