{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T04:05:04Z","timestamp":1769832304483,"version":"3.49.0"},"reference-count":32,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2021,2,27]],"date-time":"2021-02-27T00:00:00Z","timestamp":1614384000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"Hong Kong Innovation and Technology Commission"},{"name":"Hong Kong Research Grants Council","award":["11200818"],"award-info":[{"award-number":["11200818"]}]},{"DOI":"10.13039\/100007567","name":"City University of Hong Kong","doi-asserted-by":"publisher","award":["9610460"],"award-info":[{"award-number":["9610460"]}],"id":[{"id":"10.13039\/100007567","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,9,9]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Reliable predictive models of protein\u2013ligand binding affinity are required in many areas of biomedical research. Accurate prediction based on current descriptors or molecular fingerprints (FPs) remains a challenge. We develop novel interaction FPs (IFPs) to encode protein\u2013ligand interactions and use them to improve the prediction.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>Proteo-chemometrics IFPs (PrtCmm IFPs) formed by combining extended connectivity fingerprints (ECFPs) with the proteo-chemometrics concept. Combining PrtCmm IFPs with machine-learning models led to efficient scoring models, which were validated on the PDBbind v2019 core set and CSAR-HiQ sets. The PrtCmm IFP Score outperformed several other models in predicting protein\u2013ligand binding affinities. Besides, conventional ECFPs were simplified to generate new IFPs, which provided consistent but faster predictions. The relationship between the base atom properties of ECFPs and the accuracy of predictions was also investigated.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability<\/jats:title>\n                  <jats:p>PrtCmm IFP has been implemented in the IFP Score Toolkit on github (https:\/\/github.com\/debbydanwang\/IFPscore).<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab132","type":"journal-article","created":{"date-parts":[[2021,2,25]],"date-time":"2021-02-25T17:46:00Z","timestamp":1614275160000},"page":"2570-2579","source":"Crossref","is-referenced-by-count":17,"title":["Proteo-chemometrics interaction fingerprints of protein\u2013ligand complexes predict binding affinity"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3755-8943","authenticated-orcid":false,"given":"Debby D.","family":"Wang","sequence":"first","affiliation":[{"name":"Institute of Medical and Information Engineering, School of Medical Instrument and Food Engineering, University of Shanghai for Science and Technology , Shanghai 200093, China"}]},{"given":"Haoran","family":"Xie","sequence":"additional","affiliation":[{"name":"Department of Computing and Decision Sciences, Lingnan University , Tuen Mun, Hong Kong"}]},{"given":"Hong","family":"Yan","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, City University of Hong Kong , Kowloon, Hong Kong"}]}],"member":"286","published-online":{"date-parts":[[2021,2,27]]},"reference":[{"key":"2023051609215656500_btab132-B1","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1021\/ci700224e","article-title":"Distance dependent scoring function for describing protein\u2013ligand intermolecular interactions","volume":"48","author":"Artemenko","year":"2008","journal-title":"J. Chem. Inf. Model"},{"key":"2023051609215656500_btab132-B2","doi-asserted-by":"crossref","first-page":"1169","DOI":"10.1093\/bioinformatics\/btq112","article-title":"A machine learning approach to predicting protein\u2013ligand binding affinity with applications to molecular docking","volume":"26","author":"Ballester","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051609215656500_btab132-B3","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The protein data bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023051609215656500_btab132-B4","doi-asserted-by":"crossref","first-page":"792","DOI":"10.1021\/ci200615h","article-title":"Comparison of random forest and pipeline pilot naive bayes in prospective qsar predictions","volume":"52","author":"Chen","year":"2012","journal-title":"J. Chem. Inf. Model"},{"key":"2023051609215656500_btab132-B5","doi-asserted-by":"crossref","first-page":"719","DOI":"10.2174\/1386207013330670","article-title":"Bindingdb: a Web-Accessible Molecular Recognition Database","volume":"4","author":"Chen","year":"2001","journal-title":"Comb. Chem. High Throughput Screen"},{"key":"2023051609215656500_btab132-B6","doi-asserted-by":"crossref","first-page":"2555","DOI":"10.1021\/ci500319f","article-title":"Structural protein\u2013ligand interaction fingerprints (splif) for structure-based virtual screening: method and benchmark study","volume":"54","author":"Da","year":"2014","journal-title":"J. Chem. Inf. Model"},{"key":"2023051609215656500_btab132-B7","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1021\/ci9004139","article-title":"Binding affinity prediction with property-encoded shape distribution signatures","volume":"50","author":"Das","year":"2010","journal-title":"J. Chem. Inf. Model"},{"key":"2023051609215656500_btab132-B8","doi-asserted-by":"crossref","first-page":"699","DOI":"10.1021\/ci034246+","article-title":"Predicting protein\u2013ligand binding affinities using novel geometrical descriptors and machine-learning methods","volume":"44","author":"Deng","year":"2004","journal-title":"J. Chem. Inf. Comput. Sci"},{"key":"2023051609215656500_btab132-B9","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1021\/jm030331x","article-title":"Structural interaction fingerprint (sift): a novel method for analyzing three-dimensional protein\u2013ligand binding interactions","volume":"47","author":"Deng","year":"2004","journal-title":"J. Med. Chem"},{"key":"2023051609215656500_btab132-B10","doi-asserted-by":"crossref","first-page":"2036","DOI":"10.1021\/ci200082t","article-title":"Csar benchmark exercise of 2010: selection of the protein\u2013ligand complexes","volume":"51","author":"Dunbar","year":"2011","journal-title":"J. Chem. Inf. Model"},{"key":"2023051609215656500_btab132-B11","doi-asserted-by":"crossref","first-page":"3032","DOI":"10.1021\/jm030489h","article-title":"Assessing scoring functions for protein\u2013ligand interactions","volume":"47","author":"Ferrara","year":"2004","journal-title":"J. Med. Chem"},{"key":"2023051609215656500_btab132-B12","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1016\/S0167-9473(02)00257-8","article-title":"Comparison of the performance of multi-layer perceptron and linear regression for epidemiological data","volume":"44","author":"Gaudart","year":"2004","journal-title":"Comput. Stat. Data Anal"},{"key":"2023051609215656500_btab132-B13","doi-asserted-by":"crossref","first-page":"333","DOI":"10.1002\/prot.20512","article-title":"Binding moad (mother of all databases)","volume":"60","author":"Hu","year":"2005","journal-title":"Proteins"},{"key":"2023051609215656500_btab132-B14","doi-asserted-by":"crossref","first-page":"935","DOI":"10.1038\/nrd1549","article-title":"Docking and scoring in virtual screening for drug discovery: methods and applications","volume":"3","author":"Kitchen","year":"2004","journal-title":"Nat. Rev. Drug Discov"},{"key":"2023051609215656500_btab132-B15","doi-asserted-by":"crossref","first-page":"180","DOI":"10.1016\/S0304-4165(00)00187-2","article-title":"Development of proteo-chemometrics: a novel technology for the analysis of drug-receptor interactions","volume":"1525","author":"Lapinsh","year":"2001","journal-title":"Biochim. Biophys. Acta"},{"key":"2023051609215656500_btab132-B16","volume-title":"Annual Meeting of the Society for Academic Emergency Medicine","author":"Lewis","year":"2000"},{"key":"2023051609215656500_btab132-B17","doi-asserted-by":"crossref","first-page":"592","DOI":"10.1021\/ci300493w","article-title":"Id-score: a new empirical scoring function based on a comprehensive set of descriptors related to protein\u2013ligand interactions","volume":"53","author":"Li","year":"2013","journal-title":"J. Chem. Inf. Model"},{"key":"2023051609215656500_btab132-B18","doi-asserted-by":"crossref","first-page":"3076","DOI":"10.1021\/ci400450h","article-title":"Binding affinity prediction for protein\u2013ligand complexes based on \u03b2 contacts and b factor","volume":"53","author":"Liu","year":"2013","journal-title":"J. Chem. Inf. Model"},{"key":"2023051609215656500_btab132-B19","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1142\/S021972001100577X","article-title":"Cscore: a simple yet effective scoring function for protein\u2013ligand binding affinity prediction using modified cmac learning architecture","volume":"09","author":"Ouyang","year":"2011","journal-title":"J. Bioinform. Comput. Biol"},{"key":"2023051609215656500_btab132-B20","author":"Prettenhofer","year":"2014"},{"key":"2023051609215656500_btab132-B21","doi-asserted-by":"crossref","first-page":"325","DOI":"10.6026\/97320630009325","article-title":"Pyplif: python-based protein\u2013ligand interaction fingerprinting","volume":"9","author":"Radifar","year":"2013","journal-title":"Bioinformation"},{"key":"2023051609215656500_btab132-B22","author":"Rivest","year":"1992"},{"key":"2023051609215656500_btab132-B23","doi-asserted-by":"crossref","first-page":"742","DOI":"10.1021\/ci100050t","article-title":"Extended-connectivity fingerprints","volume":"50","author":"Rogers","year":"2010","journal-title":"J. Chem. Inf. Model"},{"key":"2023051609215656500_btab132-B24","doi-asserted-by":"crossref","first-page":"3666","DOI":"10.1093\/bioinformatics\/bty374","article-title":"Development and evaluation of a deep learning model for protein\u2013ligand binding affinity prediction","volume":"34","author":"Stepniewska-Dziubinska","year":"2018","journal-title":"Bioinformatics"},{"key":"2023051609215656500_btab132-B25","article-title":"Computationally predicting binding affinity in protein\u2013ligand complexes: free energy-based simulations and machine learning-based scoring functions","author":"Wang","year":"2020","journal-title":"Brief. Bioinform"},{"key":"2023051609215656500_btab132-B26","doi-asserted-by":"crossref","first-page":"2977","DOI":"10.1021\/jm030580l","article-title":"The pdbbind database: collection of binding affinities for protein\u2013ligand complexes with known three-dimensional structures","volume":"47","author":"Wang","year":"2004","journal-title":"J. Med. Chem"},{"key":"2023051609215656500_btab132-B27","doi-asserted-by":"crossref","first-page":"5912","DOI":"10.1021\/jm050362n","article-title":"A critical assessment of docking programs and scoring functions","volume":"49","author":"Warren","year":"2006","journal-title":"J. Med. Chem"},{"key":"2023051609215656500_btab132-B28","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1021\/ci00062a008","article-title":"Smiles. 2. algorithm for generation of unique smiles notation","volume":"29","author":"Weininger","year":"1989","journal-title":"J. Chem. Inf. Comp. Sci"},{"key":"2023051609215656500_btab132-B29","doi-asserted-by":"crossref","first-page":"D901","DOI":"10.1093\/nar\/gkm958","article-title":"Drugbank: a knowledgebase for drugs, drug actions and drug targets","volume":"36","author":"Wishart","year":"2008","journal-title":"Nucleic Acids Res"},{"key":"2023051609215656500_btab132-B30","doi-asserted-by":"crossref","first-page":"1334","DOI":"10.1093\/bioinformatics\/bty757","article-title":"Development of a protein\u2013ligand extended connectivity (plec) fingerprint and its application for binding affinity predictions","volume":"35","author":"W\u00f3jcikowski","year":"2019","journal-title":"Bioinformatics"},{"key":"2023051609215656500_btab132-B31","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1039\/C7SC02664A","article-title":"Moleculenet: a benchmark for molecular machine learning","volume":"9","author":"Wu","year":"2018","journal-title":"Chem. Sci"},{"key":"2023051609215656500_btab132-B32","doi-asserted-by":"crossref","first-page":"1923","DOI":"10.1021\/ci400120b","article-title":"Sfcscore rf: a random forest-based scoring function for improved affinity prediction of protein\u2013ligand complexes","volume":"53","author":"Zilian","year":"2013","journal-title":"J. Chem. Inf. Model"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab132\/36622586\/btab132.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/17\/2570\/50339425\/btab132.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/17\/2570\/50339425\/btab132.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T09:27:44Z","timestamp":1684229264000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/17\/2570\/6154300"}},"subtitle":[],"editor":[{"given":"Yann","family":"Ponty","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,2,27]]},"references-count":32,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2021,9,9]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab132","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,9,1]]},"published":{"date-parts":[[2021,2,27]]}}}