{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T11:48:04Z","timestamp":1774439284563,"version":"3.50.1"},"reference-count":34,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Drug side-effects, or adverse drug reactions, have become a major public health concern. They are one of the main causes of failure in the drug development process, and of drug withdrawal once drugs have reached the market. Therefore, <jats:italic>in silico<\/jats:italic> prediction of potential side-effects early in the drug discovery process, before reaching the clinical stages, is of great interest to improve this long and expensive process and to provide new efficient and safe therapies for patients.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>In the present work, we propose a new method to predict potential side-effects of drug candidate molecules based on their chemical structures, applicable to large molecular databanks. A unique feature of the proposed method is its ability to extract correlated sets of chemical substructures (or chemical fragments) and side-effects. This is made possible using sparse canonical correlation analysis (SCCA). In the results, we show the usefulness of the proposed method by predicting 1385 side-effects in the SIDER database from the chemical structures of 888 approved drugs. These predictions are performed with simultaneous extraction of correlated ensembles formed by a set of chemical substructures shared by drugs that are likely to have a set of side-effects. 
We also conduct a comprehensive side-effect prediction for many uncharacterized drug molecules stored in DrugBank, and are able to confirm interesting predictions using independent sources of information.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>The proposed method is expected to be useful in various stages of the drug development process.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-12-169","type":"journal-article","created":{"date-parts":[[2011,6,28]],"date-time":"2011-06-28T18:17:44Z","timestamp":1309285064000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":209,"title":["Predicting drug side-effect profiles: a chemical fragment-based approach"],"prefix":"10.1186","volume":"12","author":[{"given":"Edouard","family":"Pauwels","sequence":"first","affiliation":[]},{"given":"V\u00e9ronique","family":"Stoven","sequence":"additional","affiliation":[]},{"given":"Yoshihiro","family":"Yamanishi","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,5,18]]},"reference":[{"issue":"7139","key":"4595_CR1","doi-asserted-by":"publisher","first-page":"975","DOI":"10.1038\/446975a","volume":"446","author":"KM Giacomini","year":"2007","unstructured":"Giacomini KM, Krauss RM, Roden DM, Eichelbaum M, Hayden MR, Nakamura Y: When good drugs go bad. Nature 2007, 446(7139):975\u2013977. 10.1038\/446975a","journal-title":"Nature"},{"issue":"2","key":"4595_CR2","doi-asserted-by":"publisher","first-page":"155","DOI":"10.2174\/138161210790112719","volume":"16","author":"D Houtsma","year":"2010","unstructured":"Houtsma D, Guchelaar H, Gelderblom H: Pharmacogenetics in oncology: a promising field. Curr Pharm Des 2010, 16(2):155\u2013163. 
10.2174\/138161210790112719","journal-title":"Curr Pharm Des"},{"key":"4595_CR3","first-page":"10","volume-title":"Mol Cancer Ther","author":"S McWhinney","year":"2009","unstructured":"McWhinney S, Goldberg R, McLeod H: Platinum neurotoxicity pharmacogenetics. Mol Cancer Ther 2009, (8):10\u201316."},{"key":"4595_CR4","doi-asserted-by":"publisher","first-page":"238","DOI":"10.1186\/gb-2009-10-9-238","volume":"10","author":"N Tatonetti","year":"2009","unstructured":"Tatonetti N, Liu T, Altman R: Predicting drug side-effects by chemical systems biology. Genome Biol 2009, 10: 238. 10.1186\/gb-2009-10-9-238","journal-title":"Genome Biol"},{"issue":"2","key":"4595_CR5","doi-asserted-by":"publisher","first-page":"308","DOI":"10.1021\/ci800344p","volume":"49","author":"J Scheiber","year":"2009","unstructured":"Scheiber J, Chen B, Milik M, Sukuru S, Bender A, Mikhailov D, Whitebread S, Hamon J, Azzaoui K, Urban L, Glick M, Davies J, Jenkins J: Gaining insight into off-target mediated effects of drug candidates with a comprehensive systems chemical biology analysis. J Chem Inf Model 2009, 49(2):308\u201317. 10.1021\/ci800344p","journal-title":"J Chem Inf Model"},{"issue":"21","key":"4595_CR6","doi-asserted-by":"publisher","first-page":"1421","DOI":"10.1016\/S1359-6446(05)03632-9","volume":"10","author":"S Whitebread","year":"2005","unstructured":"Whitebread S, Hamon J, Bojanic D, Urban L: Keynote review: in vitro safety pharmacology profiling: an essential tool for successful drug development. Drug Discovery Today 2005, 10(21):1421\u20131433. 10.1016\/S1359-6446(05)03632-9","journal-title":"Drug Discovery Today"},{"issue":"3","key":"4595_CR7","doi-asserted-by":"publisher","first-page":"213","DOI":"10.1016\/S0300-483X(97)03631-7","volume":"119","author":"E Benfenatia","year":"1997","unstructured":"Benfenatia E, Gini G: Computational predictive programs (expert systems) in toxicology. Toxicology 1997, 119(3):213\u2013225. 
10.1016\/S0300-483X(97)03631-7","journal-title":"Toxicology"},{"issue":"5886","key":"4595_CR8","doi-asserted-by":"publisher","first-page":"263","DOI":"10.1126\/science.1158140","volume":"321","author":"M Campillos","year":"2008","unstructured":"Campillos M, Kuhn M, Gavin A, Jensen L, Bork P: Drug target identification using side-effect similarity. Science 2008, 321(5886):263\u20136. 10.1126\/science.1158140","journal-title":"Science"},{"key":"4595_CR9","doi-asserted-by":"publisher","first-page":"142","DOI":"10.1109\/BIBM.2009.26","volume-title":"IEEE International Conference on Bioinformatics and Biomedicine 2009 (IEEE BIBM 2009)","author":"M Fukuzaki","year":"2009","unstructured":"Fukuzaki M, Seki M, Kashima H, Sese J: Side Effect Prediction using Cooperative Pathways. IEEE International Conference on Bioinformatics and Biomedicine 2009 (IEEE BIBM 2009) 2009, 142\u2013147."},{"key":"4595_CR10","doi-asserted-by":"publisher","first-page":"e1000387","DOI":"10.1371\/journal.pcbi.1000387","volume":"5","author":"L Xie","year":"2009","unstructured":"Xie L, Li J, Xie L, Bourne P: Drug discovery using chemical systems biology: identification of the protein-ligand binding network to explain the side effects of CETP inhibitors. PLoS Comput Biol 2009, 5: e1000387. 10.1371\/journal.pcbi.1000387","journal-title":"PLoS Comput Biol"},{"issue":"9","key":"4595_CR11","doi-asserted-by":"publisher","first-page":"3103","DOI":"10.1021\/jm801546k","volume":"52","author":"J Scheiber","year":"2009","unstructured":"Scheiber J, Jenkins J, Sukuru S, Bender A, Mikhailov D, Milik M, Azzaoui K, Whitebread S, Hamon J, Urban L, Glick M, Davies J: Mapping adverse drug reactions in chemical space. J Med Chem 2009, 52(9):3103\u20137. 
10.1021\/jm801546k","journal-title":"J Med Chem"},{"key":"4595_CR12","doi-asserted-by":"publisher","first-page":"i246","DOI":"10.1093\/bioinformatics\/btq176","volume":"26","author":"Y Yamanishi","year":"2010","unstructured":"Yamanishi Y, Kotera M, Kanehisa M, Goto S: Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework. Bioinformatics 2010, 26: i246-i254. 10.1093\/bioinformatics\/btq176","journal-title":"Bioinformatics"},{"key":"4595_CR13","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1038\/msb.2009.98","volume":"6","author":"M Kuhn","year":"2010","unstructured":"Kuhn M, Campillos M, Letunic I, Jensen L, Bork P: A side effect resource to capture phenotypic effects of drugs. Mol Syst Biol 2010, 6: 343.","journal-title":"Mol Syst Biol"},{"issue":"9","key":"4595_CR14","doi-asserted-by":"publisher","first-page":"2044","DOI":"10.1021\/ci9001876","volume":"49","author":"B Chen","year":"2009","unstructured":"Chen B, Wild D, Guha R: PubChem as a Source of Polypharmacology. Journal of chemical information and modeling 2009, 49(9):2044\u20132055. 10.1021\/ci9001876","journal-title":"Journal of chemical information and modeling"},{"key":"4595_CR15","doi-asserted-by":"publisher","first-page":"D668","DOI":"10.1093\/nar\/gkj067","volume":"34","author":"D Wishart","year":"2006","unstructured":"Wishart D, Knox C, Guo A, Shrivastava S, Hassanali M, Stothard P, Chang Z, Woolsey J: DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Research 2006, 34: D668-D672. 10.1093\/nar\/gkj067","journal-title":"Nucleic Acids Research"},{"key":"4595_CR16","doi-asserted-by":"publisher","first-page":"25","DOI":"10.1016\/S0097-8485(96)80004-0","volume":"20","author":"M Gribskov","year":"1996","unstructured":"Gribskov M, Robinson N: Use of receiver operating characteristic (ROC) analysis to evaluate sequence matching. Comput Chem 1996, 20: 25\u201333. 
10.1016\/S0097-8485(96)80004-0","journal-title":"Comput Chem"},{"key":"4595_CR17","first-page":"457","volume":"3","author":"M Kravette","year":"1986","unstructured":"Kravette M: Perilymphatic atrophy of skin. An adverse side effect of intralesional steroid injections. Clin Podiatr Med Surg 1986, 3: 457\u201362.","journal-title":"Clin Podiatr Med Surg"},{"key":"4595_CR18","doi-asserted-by":"publisher","first-page":"437","DOI":"10.1016\/S1474-4422(05)70121-6","volume":"4","author":"K Sander","year":"2005","unstructured":"Sander K, Sander D: New insights into transient global amnesia: recent imaging and clinical findings. The Lancet Neurology 2005, 4: 437\u2013444. 10.1016\/S1474-4422(05)70121-6","journal-title":"The Lancet Neurology"},{"issue":"8","key":"4595_CR19","doi-asserted-by":"publisher","first-page":"1079","DOI":"10.1016\/0002-9149(84)90641-6","volume":"53","author":"C Richard","year":"1984","unstructured":"Richard C, Klein M: Ventricular arrhythmias in aortic valve disease: Analysis of 102 patients. The American Journal of Cardiology 1984, 53(8):1079\u20131083. 10.1016\/0002-9149(84)90641-6","journal-title":"The American Journal of Cardiology"},{"key":"4595_CR20","first-page":"4","volume-title":"International Journal of Computational Cognition","author":"T Yang","year":"2006","unstructured":"Yang T: Computational verb decision trees. International Journal of Computational Cognition 2006., 4(4):"},{"issue":"5","key":"4595_CR21","doi-asserted-by":"publisher","first-page":"509","DOI":"10.1080\/10629360290023340","volume":"13","author":"S Kramer","year":"2002","unstructured":"Kramer S, Frank E, Helma C: Fragment generation and support vector machines for inducing SARs. SAR QSAR Environ Res 2002, 13(5):509\u2013523. 
10.1080\/10629360290023340","journal-title":"SAR QSAR Environ Res"},{"key":"4595_CR22","first-page":"853","volume":"17","author":"L De Raedt","year":"2001","unstructured":"De Raedt L, Kramer S: The levelwise version space algorithm and its application to molecular fragment finding. In International Joint Conference on Artificial Intelligence 2001, 17: 853\u2013862.","journal-title":"In International Joint Conference on Artificial Intelligence"},{"key":"4595_CR23","first-page":"313","volume-title":"IEEE International Conference on Data Mining","author":"M Kuramochi","year":"2001","unstructured":"Kuramochi M, Karypis : Frequent Subgraph Discovery. IEEE International Conference on Data Mining 2001, 313."},{"issue":"6","key":"4595_CR24","first-page":"1052","volume":"15","author":"A Inokuchi","year":"2000","unstructured":"Inokuchi A, Washio T, Motoda H, Kumazawa K, Arai N: Fast and Complete Mining Method for Frequent Graph Patterns. Journal of Japanese Society for Artificial Intelligence 2000, 15(6):1052\u20131063.","journal-title":"Journal of Japanese Society for Artificial Intelligence"},{"key":"4595_CR25","doi-asserted-by":"publisher","first-page":"564","DOI":"10.1145\/967900.968018","volume-title":"In Proceedings of the 2004 ACM symposium on Applied computing","author":"U R\u00fcckert","year":"2004","unstructured":"R\u00fcckert U, Kramer S: Frequent free tree discovery in graph data. In Proceedings of the 2004 ACM symposium on Applied computing 2004, 564\u2013570."},{"key":"4595_CR26","first-page":"721","volume-title":"Proceedings of the 2002 IEEE International Conference on Data Mining","author":"X Yan","year":"2002","unstructured":"Yan X, Han J: gSpan: Graph-Based Substructure Pattern Mining. In Proceedings of the 2002 IEEE International Conference on Data Mining. 
ICDM '02, Washington, DC, USA: IEEE Computer Society; 2002:721."},{"key":"4595_CR27","doi-asserted-by":"publisher","first-page":"578","DOI":"10.1145\/1401890.1401961","volume-title":"Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD2008)","author":"H Saigo","year":"2008","unstructured":"Saigo H, Kr\u00e4mer N, Tsuda K: Partial Least Squares Regression for Graph Mining. Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD2008) 2008, 578\u2013586."},{"issue":"8","key":"4595_CR28","doi-asserted-by":"publisher","first-page":"956","DOI":"10.2174\/138955709788681645","volume":"9","author":"R Gozalbes","year":"2009","unstructured":"Gozalbes R, Carbajo R, Pineda-Lucena A: From fragment screening to potent binders: strategies for fragment-to-lead evolution. Mini Reviews in Medicinal Chemistry 2009, 9(8):956\u2013961.","journal-title":"Mini Reviews in Medicinal Chemistry"},{"issue":"10","key":"4595_CR29","first-page":"906","volume":"16","author":"T Furey","year":"2000","unstructured":"Furey T, Cristianini N, Duffy N, Bednarski D, Schummer M, Haussler D: Support vector machine classification and validation of cancer tissue samples using microarray expression data. Toxicology 2000, 16(10):906\u2013914.","journal-title":"Toxicology"},{"key":"4595_CR30","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/4057.001.0001","volume-title":"Kernel Methods in Computational Biology","author":"B Sch\u00f6lkopf","year":"2004","unstructured":"Sch\u00f6lkopf B, Tsuda K, Vert J: Kernel Methods in Computational Biology. MIT Press; 2004."},{"key":"4595_CR31","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1093\/biomet\/28.3-4.321","volume":"28","author":"H Hotelling","year":"1936","unstructured":"Hotelling H: Relations between two sets of variates. 
Biometrika 1936, 28: 321\u2013377.","journal-title":"Biometrika"},{"key":"4595_CR32","first-page":"1151","volume-title":"J Amer Statist Assoc","author":"S Dudoit","year":"2001","unstructured":"Dudoit S, Fridlyand J, Speed T: Comparison of discrimination methods for the classification of tumors using gene expression data. J Amer Statist Assoc 2001, 1151\u20131160."},{"key":"4595_CR33","first-page":"104","volume-title":"Statistical Science","author":"R Tibshirani","year":"2003","unstructured":"Tibshirani R, Hastie T, Narasimhan B, Chu G: Class prediction by nearest shrunken centroids, with applications to DNA microarrays. Statistical Science 2003, 104\u2013117."},{"issue":"3","key":"4595_CR34","doi-asserted-by":"publisher","first-page":"515","DOI":"10.1093\/biostatistics\/kxp008","volume":"10","author":"D Witten","year":"2009","unstructured":"Witten D, Tibshirani R, Hastie T: A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics 2009, 10(3):515. 
10.1093\/biostatistics\/kxp008","journal-title":"Biostatistics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-169.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T14:10:38Z","timestamp":1630505438000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-169"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,5,18]]},"references-count":34,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4595"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-169","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,5,18]]},"assertion":[{"value":"1 August 2010","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 May 2011","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 May 2011","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"169"}}