{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,4]],"date-time":"2026-04-04T21:40:26Z","timestamp":1775338826844,"version":"3.50.1"},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":480,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,6,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Computational prediction of compound\u2013protein interactions (CPIs) is of great importance for drug design and development, as genome-scale experimental validation of CPIs is not only time-consuming but also prohibitively expensive. With the availability of an increasing number of validated interactions, the performance of computational prediction approaches is severely impended by the lack of reliable negative CPI samples. A systematic method of screening reliable negative sample becomes critical to improving the performance of in silico prediction methods.<\/jats:p>\n               <jats:p>Results: This article aims at building up a set of highly credible negative samples of CPIs via an in silico screening method. As most existing computational models assume that similar compounds are likely to interact with similar target proteins and achieve remarkable performance, it is rational to identify potential negative samples based on the converse negative proposition that the proteins dissimilar to every known\/predicted target of a compound are not much likely to be targeted by the compound and vice versa. We integrated various resources, including chemical structures, chemical expression profiles and side effects of compounds, amino acid sequences, protein\u2013protein interaction network and functional annotations of proteins, into a systematic screening framework. We first tested the screened negative samples on six classical classifiers, and all these classifiers achieved remarkably higher performance on our negative samples than on randomly generated negative samples for both human and Caenorhabditis elegans. We then verified the negative samples on three existing prediction models, including bipartite local model, Gaussian kernel profile and Bayesian matrix factorization, and found that the performances of these models are also significantly improved on the screened negative samples. Moreover, we validated the screened negative samples on a drug bioactivity dataset. Finally, we derived two sets of new interactions by training an support vector machine classifier on the positive interactions annotated in DrugBank and our screened negative interactions. The screened negative samples and the predicted interactions provide the research community with a useful resource for identifying new drug targets and a helpful supplement to the current curated compound\u2013protein databases.<\/jats:p>\n               <jats:p>Availability: Supplementary files are available at: http:\/\/admis.fudan.edu.cn\/negative-cpi\/.<\/jats:p>\n               <jats:p>Contact: \u00a0sgzhou@fudan.edu.cn<\/jats:p>\n               <jats:p>Supplementary Information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv256","type":"journal-article","created":{"date-parts":[[2015,6,13]],"date-time":"2015-06-13T17:12:36Z","timestamp":1434215556000},"page":"i221-i229","source":"Crossref","is-referenced-by-count":322,"title":["Improving compound\u2013protein interaction prediction by building up highly credible negative samples"],"prefix":"10.1093","volume":"31","author":[{"given":"Hui","family":"Liu","sequence":"first","affiliation":[{"name":"1 Lab of Information Management, Changzhou University, Jiangsu 213164, China, 2School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore, 3Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, Shanghai 200433, China and 4Department of Computer Science and Technology, Tongji University, Shanghai 201804, China"},{"name":"1 Lab of Information Management, Changzhou University, Jiangsu 213164, China, 2School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore, 3Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, Shanghai 200433, China and 4Department of Computer Science and Technology, Tongji University, Shanghai 201804, China"}]},{"given":"Jianjiang","family":"Sun","sequence":"additional","affiliation":[{"name":"1 Lab of Information Management, Changzhou University, Jiangsu 213164, China, 2School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore, 3Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, Shanghai 200433, China and 4Department of Computer Science and Technology, Tongji University, Shanghai 201804, China"}]},{"given":"Jihong","family":"Guan","sequence":"additional","affiliation":[{"name":"1 Lab of Information Management, Changzhou University, Jiangsu 213164, China, 2School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore, 3Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, Shanghai 200433, China and 4Department of Computer Science and Technology, Tongji University, Shanghai 201804, China"}]},{"given":"Jie","family":"Zheng","sequence":"additional","affiliation":[{"name":"1 Lab of Information Management, Changzhou University, Jiangsu 213164, China, 2School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore, 3Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, Shanghai 200433, China and 4Department of Computer Science and Technology, Tongji University, Shanghai 201804, China"}]},{"given":"Shuigeng","family":"Zhou","sequence":"additional","affiliation":[{"name":"1 Lab of Information Management, Changzhou University, Jiangsu 213164, China, 2School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore, 3Shanghai Key Lab of Intelligent Information Processing, and School of Computer Science, Fudan University, Shanghai 200433, China and 4Department of Computer Science and Technology, Tongji University, Shanghai 201804, China"}]}],"member":"286","published-online":{"date-parts":[[2015,6,10]]},"reference":[{"key":"2023020115432543200_btv256-B1","doi-asserted-by":"crossref","first-page":"2004","DOI":"10.1093\/bioinformatics\/btt307","article-title":"Drug-target interaction prediction through domain-tuned network-based inference","volume":"29","author":"Alaimo","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B2","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology","volume":"25","author":"Ashburner","year":"2013","journal-title":"Nat. Genet."},{"key":"2023020115432543200_btv256-B3","first-page":"CD001190","article-title":"Donepezil for dementia due to Alzheimer\u2019s disease","volume":"1","author":"Birks","year":"2006","journal-title":"Cochrane Database Syst. Rev."},{"key":"2023020115432543200_btv256-B4","doi-asserted-by":"crossref","first-page":"2397","DOI":"10.1093\/bioinformatics\/btp433","article-title":"Supervised prediction of drug-target interactions using bipartite local models","volume":"25","author":"Bleakley","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B5","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1126\/science.1158140","article-title":"Drug target identification using side-effect similarity","volume":"321","author":"Campillos","year":"2008","journal-title":"Science"},{"key":"2023020115432543200_btv256-B6","doi-asserted-by":"crossref","first-page":"1787","DOI":"10.1093\/bioinformatics\/btu058","article-title":"Mantra 2.0: an online collaborative resource for drug mode of action and repurposing by network analysis","volume":"30","author":"Carrella","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B7","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/1961189.1961199","article-title":"LIBSVM: a library for support vector machines","volume":"2","author":"Chang","year":"2011","journal-title":"ACM Trans. Intell. Syst. Technol."},{"key":"2023020115432543200_btv256-B8","doi-asserted-by":"crossref","first-page":"e62975","DOI":"10.1371\/journal.pone.0062975","article-title":"A semi-supervised method for drug-target interaction prediction with consistency in networks","volume":"8","author":"Chen","year":"2013","journal-title":"PLoS One"},{"key":"2023020115432543200_btv256-B9","doi-asserted-by":"crossref","first-page":"e1002503","DOI":"10.1371\/journal.pcbi.1002503","article-title":"Prediction of drug-target interactions and drug repositioning via network-based inference","volume":"8","author":"Cheng","year":"2012","journal-title":"PLoS Comput. Biol."},{"key":"2023020115432543200_btv256-B10","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/j.datak.2006.05.003","article-title":"Measuring semantic similarity between Gene Ontology terms","volume":"61","author":"Coutoa","year":"2007","journal-title":"Data Knowl. Eng."},{"key":"2023020115432543200_btv256-B11","doi-asserted-by":"crossref","first-page":"1046","DOI":"10.1038\/nbt.1990","article-title":"Comprehensive analysis of kinase inhibitor selectivity","volume":"29","author":"Davis","year":"2011","journal-title":"Nat. Biotechnol."},{"key":"2023020115432543200_btv256-B12","doi-asserted-by":"crossref","first-page":"734","DOI":"10.1093\/bib\/bbt056","article-title":"Similarity-based machine learning methods for predicting drug-target interactions: a brief review","volume":"15","author":"Ding","year":"2014","journal-title":"Brief Bioinform."},{"key":"2023020115432543200_btv256-B13","first-page":"1871","article-title":"LIBLINEAR: a library for large linear classification","volume":"9","author":"Fan","year":"2008","journal-title":"J. Machine Learning Res."},{"issue":"Database issue","key":"2023020115432543200_btv256-B14","first-page":"D808","article-title":"STRING v9.1: protein-protein interaction networks, with increased coverage and integration","volume":"41","author":"Franceschini","year":"2013","journal-title":"Nucleic Acids Res."},{"key":"2023020115432543200_btv256-B15","doi-asserted-by":"crossref","first-page":"2304","DOI":"10.1093\/bioinformatics\/bts360","article-title":"Predicting drug-target interactions from chemical and genomic kernels using Bayesian matrix factorization","volume":"28","author":"Gonen","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B16","doi-asserted-by":"crossref","first-page":"496","DOI":"10.1038\/msb.2011.26","article-title":"PREDICT: a method for inferring novel drug indications with application to personalized medicine","volume":"7","author":"Gottlieb","year":"2011","journal-title":"Mol. Syst. Biol."},{"key":"2023020115432543200_btv256-B17","doi-asserted-by":"crossref","first-page":"592","DOI":"10.1038\/msb.2012.26","article-title":"INDI: a computational framework for inferring drug interactions and their associated recommendations","volume":"8","author":"Gottlieb","year":"2012","journal-title":"Mol. Syst. biol."},{"key":"2023020115432543200_btv256-B18","doi-asserted-by":"crossref","first-page":"D919","DOI":"10.1093\/nar\/gkm862","article-title":"SuperTarget and Matador: resources for exploring drug-target relationships","volume":"36","author":"Gnther","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023020115432543200_btv256-B19","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1145\/1656274.1656278","article-title":"The WEKA data mining software: an update","volume":"11","author":"Hall","year":"2009","journal-title":"SIGKDD Explor. Newsl."},{"key":"2023020115432543200_btv256-B21","doi-asserted-by":"crossref","first-page":"e9603","DOI":"10.1371\/journal.pone.0009603","article-title":"Predicting drug-target interaction networks based on functional groups and biological features","volume":"5","author":"He","year":"2010","journal-title":"PloS One"},{"key":"2023020115432543200_btv256-B41","doi-asserted-by":"crossref","first-page":"1806","DOI":"10.1021\/ci300274c","article-title":"Extending the activity cliff concept: structural categorization of activity cliffs and systematic identification of different types of cliffs in the ChEMBL database","volume":"52","author":"Hu","year":"2012","journal-title":"J. Chem. Inf. Model."},{"key":"2023020115432543200_btv256-B22","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1038\/nprot.2008.211","article-title":"Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources","volume":"4","author":"Huang","year":"2009","journal-title":"Nat. Protoc."},{"key":"2023020115432543200_btv256-B24","doi-asserted-by":"crossref","first-page":"14621","DOI":"10.1073\/pnas.1000138107","article-title":"Discovery of drug mode of action and drug repositioning from transcriptional responses","volume":"107","author":"Iorio","year":"2010","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020115432543200_btv256-B25","first-page":"223","article-title":"Nouvelles recherches sur la distribution florale","volume":"44","author":"Jaccard","year":"1908","journal-title":"Bul. Soc. Vaudoise Sci. Nat."},{"key":"2023020115432543200_btv256-B26","doi-asserted-by":"crossref","first-page":"2149","DOI":"10.1093\/bioinformatics\/btn409","article-title":"Protein-ligand interaction prediction: an improved chemogenomics approach","volume":"24","author":"Jacob","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B27","doi-asserted-by":"crossref","first-page":"791","DOI":"10.1177\/1087057114522690","article-title":"Causal network models for predicting compound targets and driving pathways in cancer","volume":"19","author":"Jaeger","year":"2014","journal-title":"J. Biomol. Screen."},{"key":"2023020115432543200_btv256-B28","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-540-37635-4","volume-title":"Chemical genomics: small molecule probes to study cellular function","author":"Jaroch","year":"2006"},{"key":"2023020115432543200_btv256-B29","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1038\/msb.2009.98","article-title":"A side effect resource to capture phenotypic effects of drugs","volume":"6","author":"Kuhn","year":"2010","journal-title":"Mol. Syst. Biol"},{"key":"2023020115432543200_btv256-B30","doi-asserted-by":"crossref","DOI":"10.1093\/nar\/gkt1207","article-title":"STITCH 4: integration of protein-chemical interactions with user data","volume":"42","author":"Kuhn","year":"2014","journal-title":"Nucleic Acids Res."},{"key":"2023020115432543200_btv256-B31","doi-asserted-by":"crossref","first-page":"1929","DOI":"10.1126\/science.1132939","article-title":"The connectivity map: using gene-expression signatures to connect small molecules, genes, and disease","volume":"313","author":"Lamb","year":"2006","journal-title":"Science"},{"key":"2023020115432543200_btv256-B32","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1093\/bioinformatics\/bts670","article-title":"Drug-target interaction prediction by learning from local information and neighbors","volume":"29","author":"Mei","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B33","doi-asserted-by":"crossref","first-page":"200","DOI":"10.1038\/nchembio.530","article-title":"Navigating the kinome","volume":"7","author":"Metz","year":"2011","journal-title":"Nat. Chem. Biol."},{"key":"2023020115432543200_btv256-B34","doi-asserted-by":"crossref","first-page":"i522","DOI":"10.1093\/bioinformatics\/bts383","article-title":"Relating drug-protein interaction network with drug side effects","volume":"28","author":"Mizutani","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B35","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1093\/bib\/bbu010","article-title":"Toward more realistic drug-target interaction predictions","volume":"16","author":"Pahikkala","year":"2015","journal-title":"Brief Bioinform."},{"key":"2023020115432543200_btv256-B36","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1186\/1471-2105-12-169","article-title":"Predicting drug side-effect profiles: a chemical fragment-based approach","volume":"12","author":"Pauwels","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023020115432543200_btv256-B37","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1089\/cmb.2010.0213","article-title":"Combining drug and gene similarity measures for drug-target elucidation","volume":"18","author":"Perlman","year":"2011","journal-title":"J. Comput. Biol."},{"issue":"Database issue","key":"2023020115432543200_btv256-B38","doi-asserted-by":"crossref","first-page":"D290","DOI":"10.1093\/nar\/gkr1065","article-title":"The Pfam protein families database","volume":"40","author":"Punta","year":"2012","journal-title":"Nucleic Acids Res."},{"key":"2023020115432543200_btv256-B39","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","article-title":"Identification of common molecular subsequences","volume":"147","author":"Smith","year":"1981","journal-title":"J. Mol. Biol."},{"key":"2023020115432543200_btv256-B40","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1016\/j.drudis.2011.10.024","article-title":"Classification of scaffold-hopping approaches","volume":"17","author":"Sun","year":"2012","journal-title":"Drug. Discov. Today"},{"key":"2023020115432543200_btv256-B42","doi-asserted-by":"crossref","first-page":"S3","DOI":"10.1186\/1752-0509-7-S6-S3","article-title":"Scalable prediction of compound-protein interactions using minwise hashing","volume":"7","author":"Tabei","year":"2013","journal-title":"BMC Syst. Biol."},{"key":"2023020115432543200_btv256-B43","doi-asserted-by":"crossref","first-page":"e66952","DOI":"10.1371\/journal.pone.0066952","article-title":"Predicting drug-target interactions for new drug compounds using a weighted nearest neighbor profile","volume":"8","author":"van Laarhoven","year":"2013","journal-title":"PLoS One"},{"key":"2023020115432543200_btv256-B44","doi-asserted-by":"crossref","first-page":"3036","DOI":"10.1093\/bioinformatics\/btr500","article-title":"Gaussian interaction profile kernels for predicting drug-target interaction","volume":"27","author":"van Laarhoven","year":"2011","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B45","doi-asserted-by":"crossref","first-page":"i126","DOI":"10.1093\/bioinformatics\/btt234","article-title":"Predicting drug-target interactions using restricted Boltzmann machines","volume":"29","author":"Wang","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B46","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1016\/j.compbiolchem.2011.10.003","article-title":"Kernel-based data fusion improves the drug-protein interaction prediction","volume":"35","author":"Wang","year":"2011","journal-title":"Comput. Biol Chem."},{"key":"2023020115432543200_btv256-B47","doi-asserted-by":"crossref","first-page":"D173","DOI":"10.1093\/nar\/gkj158","article-title":"Database resources of the national center for biotechnology information","volume":"34","author":"Wheeler","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023020115432543200_btv256-B48","doi-asserted-by":"crossref","first-page":"D901","DOI":"10.1093\/nar\/gkm958","article-title":"DrugBank: a knowledgebase for drugs, drug actions and drug targets","volume":"36","author":"Wishart","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023020115432543200_btv256-B49","doi-asserted-by":"crossref","first-page":"E771","DOI":"10.1073\/pnas.1106149108","article-title":"Modulatory profiling identifies mechanisms of small molecule-induced cell death","volume":"108","author":"Wolpaw","year":"2011","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023020115432543200_btv256-B50","doi-asserted-by":"crossref","first-page":"S6","DOI":"10.1186\/1752-0509-4-S2-S6","article-title":"Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces","volume":"4","author":"Xia","year":"2010","journal-title":"BMC Syst. Biol."},{"key":"2023020115432543200_btv256-B51","doi-asserted-by":"crossref","first-page":"472","DOI":"10.1038\/msb.2011.5","article-title":"Analysis of multiple compound-protein interactions reveals novel bioactive molecules","volume":"7","author":"Yabuuchi","year":"2011","journal-title":"Mol. Syst. Biol."},{"key":"2023020115432543200_btv256-B52","doi-asserted-by":"crossref","first-page":"i232","DOI":"10.1093\/bioinformatics\/btn162","article-title":"Prediction of drug-target interaction networks from the integration of chemical and genomic spaces","volume":"24","author":"Yamanishi","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B53","doi-asserted-by":"crossref","first-page":"i246","DOI":"10.1093\/bioinformatics\/btq176","article-title":"Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework","volume":"26","author":"Yamanishi","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020115432543200_btv256-B54","doi-asserted-by":"crossref","first-page":"W39","DOI":"10.1093\/nar\/gku337","article-title":"DINIES: drug-target interaction network inference engine based on supervised analysis","volume":"42","author":"Yamanishi","year":"2014","journal-title":"Nucleic Acids Res."},{"key":"2023020115432543200_btv256-B55","doi-asserted-by":"crossref","first-page":"046115","DOI":"10.1103\/PhysRevE.76.046115","article-title":"Bipartite network projection and personal recommendation","volume":"76","author":"Zhou","year":"2010","journal-title":"Phys. Rev. E Stat. Nonlin. Soft Matter Phys."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/12\/i221\/49014164\/bioinformatics_31_12_i221.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/12\/i221\/49014164\/bioinformatics_31_12_i221.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T00:08:31Z","timestamp":1675296511000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/12\/i221\/216307"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,6,10]]},"references-count":53,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2015,6,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv256","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,6,15]]},"published":{"date-parts":[[2015,6,10]]}}}