{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,13]],"date-time":"2026-01-13T05:04:30Z","timestamp":1768280670226,"version":"3.49.0"},"reference-count":22,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T00:00:00Z","timestamp":1719532800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"High-level and Urgently Needed Overseas Talent Programs of Jiangxi Province","award":["20232BCJ25026"],"award-info":[{"award-number":["20232BCJ25026"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62102133"],"award-info":[{"award-number":["62102133"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Kaifeng Major Science and Technology","award":["21ZD011"],"award-info":[{"award-number":["21ZD011"]}]},{"name":"Ji\u2019an Finance and Science Foundation","award":["20211085454"],"award-info":[{"award-number":["20211085454"]}]},{"name":"Ji\u2019an Finance and Science Foundation","award":["20222151746"],"award-info":[{"award-number":["20222151746"]}]},{"name":"Ji\u2019an Finance and Science Foundation","award":["20222151704"],"award-info":[{"award-number":["20222151704"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,6,28]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>In drug discovery, it is crucial to assess the drug\u2013target binding affinity (DTA). Although molecular docking is widely used, computational efficiency limits its application in large-scale virtual screening. Deep learning-based methods learn virtual scoring functions from labeled datasets and can quickly predict affinity. However, there are three limitations. First, existing methods only consider the atom-bond graph or one-dimensional sequence representations of compounds, ignoring the information about functional groups (pharmacophores) with specific biological activities. Second, relying on limited labeled datasets fails to learn comprehensive embedding representations of compounds and proteins, resulting in poor generalization performance in complex scenarios. Third, existing feature fusion methods cannot adequately capture contextual interaction information.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Therefore, we propose a novel DTA prediction method named HeteroDTA. Specifically, a multi-view compound feature extraction module is constructed to model the atom\u2013bond graph and pharmacophore graph. The residue concat graph and protein sequence are also utilized to model protein structure and function. Moreover, to enhance the generalization capability and reduce the dependence on task-specific labeled data, pre-trained models are utilized to initialize the atomic features of the compounds and the embedding representations of the protein sequence. A context-aware nonlinear feature fusion method is also proposed to learn interaction patterns between compounds and proteins. Experimental results on public benchmark datasets show that HeteroDTA significantly outperforms existing methods. In addition, HeteroDTA shows excellent generalization performance in cold-start experiments and superiority in the representation learning ability of drug\u2013target pairs. Finally, the effectiveness of HeteroDTA is demonstrated in a real-world drug discovery study.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The source code and data are available at https:\/\/github.com\/daydayupzzl\/HeteroDTA.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae240","type":"journal-article","created":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T09:35:34Z","timestamp":1719567334000},"page":"i539-i547","source":"Crossref","is-referenced-by-count":5,"title":["Enhancing generalizability and performance in drug\u2013target interaction identification by integrating pharmacophore and pre-trained models"],"prefix":"10.1093","volume":"40","author":[{"given":"Zuolong","family":"Zhang","sequence":"first","affiliation":[{"name":"School of Software, Henan University , Kaifeng, Henan Province 475000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xin","family":"He","sequence":"additional","affiliation":[{"name":"School of Software, Henan University , Kaifeng, Henan Province 475000, China"},{"name":"Henan International Joint Laboratory of Intelligent Network Theory and Key Technology, Henan University , Kaifeng, Henan Province 475000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dazhi","family":"Long","sequence":"additional","affiliation":[{"name":"Department of Urology, Ji\u2019an Third People\u2019s Hospital , Ji\u2019an, Jiangxi Province 343000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gang","family":"Luo","sequence":"additional","affiliation":[{"name":"School of Mathematics and Computer Science, Nanchang University , Nanchang, Jiangxi Province 330031, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shengbo","family":"Chen","sequence":"additional","affiliation":[{"name":"Henan Engineering Research Center of Intelligent Technology and Application, Henan University , Kaifeng, Henan Province 475000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2024,6,28]]},"reference":[{"key":"2024062809081445000_btae240-B1","doi-asserted-by":"crossref","first-page":"4633","DOI":"10.1093\/bioinformatics\/btaa544","article-title":"DeepCDA: deep cross-domain compound-protein affinity prediction through LSTM and convolutional neural networks","volume":"36","author":"Abbasi","year":"2020","journal-title":"Bioinformatics"},{"key":"2024062809081445000_btae240-B2","first-page":"25","volume-title":"Pharmacophore Modelling and Screening: Concepts, Recent Developments and Applications in Rational Drug Design","author":"Choudhury","year":"2019"},{"key":"2024062809081445000_btae240-B3","author":"Dai","year":"2021"},{"key":"2024062809081445000_btae240-B4","doi-asserted-by":"crossref","first-page":"1046","DOI":"10.1038\/nbt.1990","article-title":"Comprehensive analysis of kinase inhibitor selectivity","volume":"29","author":"Davis","year":"2011","journal-title":"Nat Biotechnol"},{"key":"2024062809081445000_btae240-B5","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1038\/s42256-021-00438-4","article-title":"Geometry-enhanced molecular representation learning for property prediction","volume":"4","author":"Fang","year":"2022","journal-title":"Nat Mach Intell"},{"key":"2024062809081445000_btae240-B6","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1186\/s13321-017-0209-z","article-title":"Simboost: a read-across approach for predicting drug\u2013target binding affinities using gradient boosting machines","volume":"9","author":"He","year":"2017","journal-title":"J Cheminform"},{"key":"2024062809081445000_btae240-B7","doi-asserted-by":"crossref","first-page":"e23172","DOI":"10.1016\/j.heliyon.2023.e23172","article-title":"FDA-approved heterocyclic molecules for cancer treatment: synthesis, dosage, mechanism of action and their adverse effect","volume":"10","author":"Hossain","year":"2023","journal-title":"Heliyon"},{"key":"2024062809081445000_btae240-B8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/pcmedi\/pbab001","article-title":"Computational molecular docking and virtual screening revealed promising SARS-CoV-2 drugs","volume":"4","author":"Hosseini","year":"2021","journal-title":"Precis Clin Med"},{"key":"2024062809081445000_btae240-B9","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1186\/s12864-022-08648-9","article-title":"Sequence-based drug\u2013target affinity prediction using weighted graph neural networks","volume":"23","author":"Jiang","year":"2022","journal-title":"BMC Genomics"},{"key":"2024062809081445000_btae240-B10","author":"Kipf","year":"2017"},{"key":"2024062809081445000_btae240-B11","author":"Landrum","year":"2014"},{"key":"2024062809081445000_btae240-B12","first-page":"1","article-title":"Molecular docking for ligand\u2013receptor binding process based on heterogeneous computing","volume":"2022","author":"Li","year":"2022","journal-title":"Sci Program"},{"key":"2024062809081445000_btae240-B13","doi-asserted-by":"crossref","first-page":"106145","DOI":"10.1016\/j.compbiomed.2022.106145","article-title":"GSAML-DTA: an interpretable drug\u2013target binding affinity prediction model based on graph neural networks with self-attention mechanism and mutual information","volume":"150","author":"Liao","year":"2022","journal-title":"Comput Biol Med"},{"key":"2024062809081445000_btae240-B14","doi-asserted-by":"crossref","first-page":"i221","DOI":"10.1093\/bioinformatics\/btv256","article-title":"Improving compound\u2013protein interaction prediction by building up highly credible negative samples","volume":"31","author":"Liu","year":"2015","journal-title":"Bioinformatics"},{"key":"2024062809081445000_btae240-B15","doi-asserted-by":"crossref","first-page":"541","DOI":"10.13005\/bbra\/2659","article-title":"Frequency and importance of six functional groups that play a role in drug discovery","volume":"15","author":"Maslehat","year":"2018","journal-title":"Biosci Biotech Res Asia"},{"key":"2024062809081445000_btae240-B16","doi-asserted-by":"crossref","first-page":"1140","DOI":"10.1093\/bioinformatics\/btaa921","article-title":"GraphDTA: predicting drug\u2013target binding affinity with graph neural networks","volume":"37","author":"Nguyen","year":"2020","journal-title":"Bioinformatics"},{"key":"2024062809081445000_btae240-B17","doi-asserted-by":"crossref","first-page":"i821","DOI":"10.1093\/bioinformatics\/bty593","article-title":"DeepDTA: deep drug\u2013target binding affinity prediction","volume":"34","author":"\u00d6zt\u00fcrk","year":"2018","journal-title":"Bioinformatics"},{"key":"2024062809081445000_btae240-B18","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1093\/bib\/bbu010","article-title":"Toward more realistic drug\u2013target interaction predictions","volume":"16","author":"Pahikkala","year":"2014","journal-title":"Brief Bioinform"},{"key":"2024062809081445000_btae240-B19","doi-asserted-by":"crossref","first-page":"e2016239118","DOI":"10.1073\/pnas.2016239118","article-title":"Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences","volume":"118","author":"Rives","year":"2021","journal-title":"Proc Natl Acad Sci U S A"},{"key":"2024062809081445000_btae240-B20","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1038\/s41586-023-05905-z","article-title":"Computational approaches streamlining drug discovery","volume":"616","author":"Sadybekov","year":"2023","journal-title":"Nature"},{"key":"2024062809081445000_btae240-B21","doi-asserted-by":"crossref","first-page":"735","DOI":"10.1021\/ci400709d","article-title":"Making sense of large-scale kinase inhibitor bioactivity data sets: a comparative and integrative analysis","volume":"54","author":"Tang","year":"2014","journal-title":"J Chem Inf Model"},{"key":"2024062809081445000_btae240-B22","author":"Veli\u010dkovi\u0107","year":"2018"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/Supplement_1\/i539\/58355122\/btae240.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/Supplement_1\/i539\/58355122\/btae240.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T09:36:02Z","timestamp":1719567362000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/40\/Supplement_1\/i539\/7700904"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,28]]},"references-count":22,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2024,6,28]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae240","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,7]]},"published":{"date-parts":[[2024,6,28]]}}}