{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,5]],"date-time":"2026-01-05T21:54:59Z","timestamp":1767650099279},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"22","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2181,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,11,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Most of the previous data mining studies based on the NCI-60 dataset, due to its intrinsic cell-based nature, can hardly provide insights into the molecular targets for screened compounds. On the other hand, the abundant information of the compound\u2013target associations in PubChem can offer extensive experimental evidence of molecular targets for tested compounds. Therefore, by taking advantages of the data from both public repositories, one may investigate the correlations between the bioactivity profiles of small molecules from the NCI-60 dataset (cellular level) and their patterns of interactions with relevant protein targets from PubChem (molecular level) simultaneously.<\/jats:p>\n               <jats:p>Results: We investigated a set of 37 small molecules by providing links among their bioactivity profiles, protein targets and chemical structures. Hierarchical clustering of compounds was carried out based on their bioactivity profiles. We found that compounds were clustered into groups with similar mode of actions, which strongly correlated with chemical structures. Furthermore, we observed that compounds similar in bioactivity profiles also shared similar patterns of interactions with relevant protein targets, especially when chemical structures were related. The current work presents a new strategy for combining and data mining the NCI-60 dataset and PubChem. This analysis shows that bioactivity profile comparison can provide insights into the mode of actions at the molecular level, thus will facilitate the knowledge-based discovery of novel compounds with desired pharmacological properties.<\/jats:p>\n               <jats:p>Availability: The bioactivity profiling data and the target annotation information are publicly available in the PubChem BioAssay database (ftp:\/\/ftp.ncbi.nlm.nih.gov\/pubchem\/Bioassay\/).<\/jats:p>\n               <jats:p>Contact: \u00a0ywang@ncbi.nlm.nih.gov; bryant@ncbi.nlm.nih.gov<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq550","type":"journal-article","created":{"date-parts":[[2010,10,19]],"date-time":"2010-10-19T04:15:50Z","timestamp":1287461750000},"page":"2881-2888","source":"Crossref","is-referenced-by-count":26,"title":["Investigating the correlations among the chemical structures, bioactivity profiles and molecular targets of small molecules"],"prefix":"10.1093","volume":"26","author":[{"given":"Tiejun","family":"Cheng","sequence":"first","affiliation":[{"name":"National Center for Biotechnology Information, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA"}]},{"given":"Yanli","family":"Wang","sequence":"additional","affiliation":[{"name":"National Center for Biotechnology Information, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA"}]},{"given":"Stephen H.","family":"Bryant","sequence":"additional","affiliation":[{"name":"National Center for Biotechnology Information, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,10,13]]},"reference":[{"key":"2023012507555700200_B1","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The Protein Data Bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012507555700200_B2","doi-asserted-by":"crossref","first-page":"420","DOI":"10.1016\/j.jmgm.2009.10.001","article-title":"PubChem BioAssays as a data source for predictive models","volume":"28","author":"Chen","year":"2010","journal-title":"J. Mol. Graphics Model."},{"key":"2023012507555700200_B3","doi-asserted-by":"crossref","first-page":"2044","DOI":"10.1021\/ci9001876","article-title":"PubChem as a source of polypharmacology","volume":"49","author":"Chen","year":"2009","journal-title":"J. Chem. Inf. Model."},{"key":"2023012507555700200_B4","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1016\/S0167-6296(02)00126-1","article-title":"The price of innovation: new estimates of drug development costs","volume":"22","author":"DiMasi","year":"2003","journal-title":"J. Health Econ."},{"key":"2023012507555700200_B5","doi-asserted-by":"crossref","first-page":"456","DOI":"10.1021\/ci700188u","article-title":"Flexible web service infrastructure for the development and deployment of predictive models","volume":"48","author":"Guha","year":"2008","journal-title":"J. Chem. Inf. Model."},{"key":"2023012507555700200_B6","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1186\/1471-2105-9-401","article-title":"Developing and validating predictive decision tree models from mining chemical structural fingerprints and high-throughput screening data in PubChem","volume":"9","author":"Han","year":"2008","journal-title":"BMC Bioinf."},{"key":"2023012507555700200_B7","doi-asserted-by":"crossref","first-page":"D277","DOI":"10.1093\/nar\/gkh063","article-title":"The KEGG resource for deciphering the genome","volume":"32","author":"Kanehisa","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012507555700200_B8","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1055\/s-2006-951767","article-title":"Identification of a new natural camptothecin analogue in targeted screening for HIF-1\u03b1 inhibitors","volume":"73","author":"Klausmeyer","year":"2007","journal-title":"Planta Med."},{"key":"2023012507555700200_B9","doi-asserted-by":"crossref","first-page":"1379","DOI":"10.1021\/ci800097k","article-title":"Data mining the NCI60 to predict generalized cytotoxicity","volume":"48","author":"Lee","year":"2008","journal-title":"J. Chem. Inf. Model."},{"key":"2023012507555700200_B10","doi-asserted-by":"crossref","first-page":"3310","DOI":"10.1093\/bioinformatics\/btp589","article-title":"A novel method for mining highly imbalanced high-throughput screening data in PubChem","volume":"25","author":"Li","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012507555700200_B11","doi-asserted-by":"crossref","first-page":"326","DOI":"10.1002\/jcb.1163","article-title":"JNK phosphorylates the HSF1 transcriptional activation domain: role of JNK in the regulation of the heat shock response","volume":"82","author":"Park","year":"2001","journal-title":"J. Cell. Biochem."},{"key":"2023012507555700200_B12","doi-asserted-by":"crossref","first-page":"1088","DOI":"10.1093\/jnci\/81.14.1088","article-title":"Display and analysis of patterns of differential activity of drugs against human tumor cell lines: development of mean graph and COMPARE algorithm","volume":"81","author":"Paull","year":"1989","journal-title":"J. Natl Cancer Inst."},{"key":"2023012507555700200_B13","first-page":"1273","article-title":"Anticancer effects of low-dose 10-hydroxycamptothecin in human colon cancer","volume":"15","author":"Ping","year":"2006","journal-title":"Oncol. Rep."},{"key":"2023012507555700200_B14","doi-asserted-by":"crossref","first-page":"2235","DOI":"10.1016\/S0140-6736(03)13780-4","article-title":"The camptothecins","volume":"361","author":"Pizzolato","year":"2003","journal-title":"Lancet"},{"key":"2023012507555700200_B15","doi-asserted-by":"crossref","first-page":"818","DOI":"10.1021\/jm010385b","article-title":"Mining the National Cancer Institute's tumor-screening database: identification of compounds with similar cellular activities","volume":"45","author":"Rabow","year":"2002","journal-title":"J. Med. Chem."},{"key":"2023012507555700200_B16","first-page":"4316","article-title":"Identification of small molecule inhibitors of hypoxia-inducible factor 1 transcriptional activation pathway","volume":"62","author":"Rapisarda","year":"2002","journal-title":"Cancer Res."},{"key":"2023012507555700200_B17","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1021\/ci8002649","article-title":"Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data","volume":"49","author":"Rohrer","year":"2009","journal-title":"J. Chem. Inf. Model."},{"key":"2023012507555700200_B18","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1109\/MC.2002.1016905","article-title":"Interactively exploring hierarchical clustering results","volume":"35","author":"Seo","year":"2002","journal-title":"IEEE Computer"},{"key":"2023012507555700200_B19","doi-asserted-by":"crossref","first-page":"2498","DOI":"10.1101\/gr.1239303","article-title":"Cytoscape: a software environment for integrated models of biomolecular interaction networks","volume":"13","author":"Shannon","year":"2003","journal-title":"Genome Res."},{"key":"2023012507555700200_B20","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1021\/ci970085w","article-title":"Mining the NCI anticancer drug discovery databases: genetic function approximation for the QSAR Study of anticancer ellipticine analogues","volume":"38","author":"Shi","year":"1998","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023012507555700200_B21","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1124\/mol.53.2.241","article-title":"Mining the National Cancer Institute anticancer drug discovery database: cluster analysis of ellipticine analogs with p53-inverse and central nervous system-selective patterns of activity","volume":"53","author":"Shi","year":"1998","journal-title":"Mol. Pharmacol."},{"key":"2023012507555700200_B22","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1021\/ci990087b","article-title":"Mining and visualizing large anticancer drug discovery databases","volume":"40","author":"Shi","year":"1999","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023012507555700200_B23","doi-asserted-by":"crossref","first-page":"813","DOI":"10.1038\/nrc1951","article-title":"The NCI60 human tumour cell line anticancer drug screen","volume":"6","author":"Shoemaker","year":"2006","journal-title":"Nat. Rev. Cancer."},{"key":"2023012507555700200_B24","doi-asserted-by":"crossref","first-page":"2336","DOI":"10.1021\/jm049146p","article-title":"Structures of three classes of anticancer agents bound to the human topoisomerase I-DNA covalent complex","volume":"48","author":"Staker","year":"2005","journal-title":"J. Med. Chem."},{"key":"2023012507555700200_B25","doi-asserted-by":"crossref","first-page":"15387","DOI":"10.1073\/pnas.242259599","article-title":"The mechanism of topoisomerase I poisoning by a camptothecin analog","volume":"99","author":"Staker","year":"2002","journal-title":"Proc. Nat. Acad. Sci. USA."},{"key":"2023012507555700200_B26","doi-asserted-by":"crossref","first-page":"1853","DOI":"10.1093\/jnci\/86.24.1853","article-title":"Use of the Kohonen self-organizing map to study the mechanisms of action of chemotherapeutic agents","volume":"86","author":"van Osdol","year":"1994","journal-title":"J. Natl Cancer Inst."},{"key":"2023012507555700200_B27","doi-asserted-by":"crossref","first-page":"2212","DOI":"10.1093\/bioinformatics\/btg302","article-title":"Linking the growth inhibition response from the National Cancer Institute's anticancer screen to gene expression levels and other molecular target data","volume":"19","author":"Wallqvist","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012507555700200_B28","doi-asserted-by":"crossref","first-page":"2063","DOI":"10.1021\/ci700141x","article-title":"Chemical data mining of the NCI human tumor cell line database","volume":"47","author":"Wang","year":"2007","journal-title":"J. Chem. Inf. Model."},{"key":"2023012507555700200_B29","doi-asserted-by":"crossref","first-page":"W623","DOI":"10.1093\/nar\/gkp456","article-title":"PubChem: a public information system for analyzing bioactivities of small molecules","volume":"37","author":"Wang","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012507555700200_B30","doi-asserted-by":"crossref","first-page":"D255","DOI":"10.1093\/nar\/gkp965","article-title":"An overview of the PubChem BioAssay resource","volume":"38","author":"Wang","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012507555700200_B31","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1126\/science.1411538","article-title":"Neural computing in cancer drug development: predicting mechanism of action","volume":"258","author":"Weinstein","year":"1992","journal-title":"Science"},{"key":"2023012507555700200_B32","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1126\/science.275.5298.343","article-title":"An information-intensive approach to the molecular pharmacology of cancer","volume":"275","author":"Weinstein","year":"1997","journal-title":"Science"},{"key":"2023012507555700200_B33","doi-asserted-by":"crossref","first-page":"466","DOI":"10.1016\/j.jmgm.2008.08.004","article-title":"Data mining PubChem using a support vector machine with the signature molecular descriptor: classification of factor XIa inhibitors","volume":"27","author":"Weis","year":"2008","journal-title":"J. Mol. Graphics Model."},{"key":"2023012507555700200_B34","doi-asserted-by":"crossref","first-page":"9616","DOI":"10.1074\/jbc.M512044200","article-title":"Triptolide, an inhibitor of the human heat shock response that enhances stress-induced cell death","volume":"281","author":"Westerheide","year":"2006","journal-title":"J. Biol. Chem."},{"key":"2023012507555700200_B35","doi-asserted-by":"crossref","first-page":"819","DOI":"10.1586\/14737140.8.5.819","article-title":"Key role of topoisomerase I inhibitors in the treatment of recurrent and refractory epithelial ovarian carcinoma","volume":"8","author":"Wethington","year":"2008","journal-title":"Expert Rev. Anticancer Ther."},{"key":"2023012507555700200_B36","doi-asserted-by":"crossref","first-page":"D668","DOI":"10.1093\/nar\/gkj067","article-title":"DrugBank: a comprehensive resource for in silico drug discovery and exploration","volume":"34","author":"Wishart","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012507555700200_B37","doi-asserted-by":"crossref","first-page":"D901","DOI":"10.1093\/nar\/gkm958","article-title":"DrugBank: a knowledgebase for drugs, drug actions and drug targets","volume":"36","author":"Wishart","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012507555700200_B38","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1021\/ci700193u","article-title":"Data mining a small molecule drug screening representative subset from NIH PubChem","volume":"48","author":"Xie","year":"2008","journal-title":"J. Chem. Inf. Model."},{"key":"2023012507555700200_B39","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1016\/S1093-3263(01)00126-7","article-title":"COMPARE: a web accessible tool for investigating mechanisms of cell growth inhibition","volume":"20","author":"Zaharevitz","year":"2002","journal-title":"J. Mol. Graphics Model."},{"key":"2023012507555700200_B40","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1126\/science.1091867","article-title":"The NIH roadmap","volume":"302","author":"Zerhouni","year":"2003","journal-title":"Science"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/22\/2881\/48852228\/bioinformatics_26_22_2881.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/22\/2881\/48852228\/bioinformatics_26_22_2881.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T07:56:36Z","timestamp":1674633396000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/22\/2881\/228099"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,10,13]]},"references-count":40,"journal-issue":{"issue":"22","published-print":{"date-parts":[[2010,11,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq550","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,11,15]]},"published":{"date-parts":[[2010,10,13]]}}}