{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T02:19:08Z","timestamp":1775096348981,"version":"3.50.1"},"reference-count":48,"publisher":"Springer Science and Business Media LLC","issue":"5","license":[{"start":{"date-parts":[[2022,3,19]],"date-time":"2022-03-19T00:00:00Z","timestamp":1647648000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,3,19]],"date-time":"2022-03-19T00:00:00Z","timestamp":1647648000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100008131","name":"Rheinische Friedrich-Wilhelms-Universit\u00e4t Bonn","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100008131","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Comput Aided Mol Des"],"published-print":{"date-parts":[[2022,5]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The support vector machine (SVM) algorithm is one of the most widely used machine learning (ML) methods for predicting active compounds and molecular properties. In chemoinformatics and drug discovery, SVM has been a state-of-the-art ML approach for more than a decade. A unique attribute of SVM is that it operates in feature spaces of increasing dimensionality. Hence, SVM conceptually departs from the paradigm of low dimensionality that applies to many other methods for chemical space navigation. The SVM approach is applicable to compound classification, and ranking, multi-class predictions, and \u2013in algorithmically modified form\u2013 regression modeling. In the emerging era of deep learning (DL), SVM retains its relevance as one of the premier ML methods in chemoinformatics, for reasons discussed herein. We describe the SVM methodology including strengths and weaknesses and discuss selected applications that have contributed to the evolution of SVM as a premier approach for compound classification, property predictions, and virtual compound screening.<\/jats:p>","DOI":"10.1007\/s10822-022-00442-9","type":"journal-article","created":{"date-parts":[[2022,3,19]],"date-time":"2022-03-19T02:02:26Z","timestamp":1647655346000},"page":"355-362","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":203,"title":["Evolution of Support Vector Machine and Regression Modeling in Chemoinformatics and Drug Discovery"],"prefix":"10.1007","volume":"36","author":[{"given":"Raquel","family":"Rodr\u00edguez-P\u00e9rez","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0557-5714","authenticated-orcid":false,"given":"J\u00fcrgen","family":"Bajorath","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,3,19]]},"reference":[{"key":"442_CR1","volume-title":"Estimation of dependencies based on empirical data [in Russian]","author":"V Vapnik","year":"1979","unstructured":"Vapnik V (1979) Estimation of dependencies based on empirical data [in Russian]. Nauka, Moscow"},{"key":"442_CR2","volume-title":"Estimation of dependencies based on empirical data","author":"V Vapnik","year":"1982","unstructured":"Vapnik V (1982) Estimation of dependencies based on empirical data. Springer, New York"},{"key":"442_CR3","first-page":"273","volume":"20","author":"C Cortes","year":"1995","unstructured":"Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20:273\u2013297","journal-title":"Mach Learn"},{"key":"442_CR4","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-2440-0","volume-title":"The nature of statistical learning theory","author":"V Vapnik","year":"1995","unstructured":"Vapnik V (1995) The nature of statistical learning theory. Springer, New York"},{"key":"442_CR5","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1023\/B:STCO.0000035301.49549.88","volume":"14","author":"AJ Smola","year":"2004","unstructured":"Smola AJ, Sch\u00f6lkopf B (2004) A tutorial on support vector regression. Stat Comput 14:199\u2013222","journal-title":"Stat Comput"},{"key":"442_CR6","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1016\/S0097-8485(01)00094-8","volume":"26","author":"R Burbridge","year":"2001","unstructured":"Burbridge R, Trotter M, Buxton B, Holden S (2001) Drug design by machine learning: support vector machines for pharmaceutical data analysis. Comput Chem 26:5\u201314","journal-title":"Comput Chem"},{"key":"442_CR7","first-page":"667","volume":"43","author":"MK Warmuth","year":"2003","unstructured":"Warmuth MK, Liao J, R\u00e4tsch G, Mathieson M, Putta S, Lemmen C (2003) Active learning with support vector machines in the drug discovery process. J Chem Inf Model 43:667\u2013673","journal-title":"J Chem Inf Model"},{"key":"442_CR8","first-page":"2048","volume":"43","author":"VV Zernov","year":"2003","unstructured":"Zernov VV, Balakin KV, Ivaschenko AA, Savchuk NP, Pletnev IV (2003) Drug discovery using support vector machines. The case studies of drug-likeness, agrochemical-likeness, and enzyme inhibition predictions. J Chem Inf Model 43:2048\u20132056","journal-title":"J Chem Inf Model"},{"key":"442_CR9","doi-asserted-by":"publisher","first-page":"1947","DOI":"10.1021\/ci034160g","volume":"43","author":"V Svetnik","year":"2003","unstructured":"Svetnik V, Liaw A, Tong C, Culberson JC, Sheridan RP, Feuston BP (2003) Random forest: a classification and regression tool for compound classification and QSAR modeling. J Chem Inf Comput Sci 43:1947\u20131958","journal-title":"J Chem Inf Comput Sci"},{"key":"442_CR10","doi-asserted-by":"publisher","first-page":"370","DOI":"10.1016\/j.chembiol.2013.01.011","volume":"20","author":"S Ekins","year":"2013","unstructured":"Ekins S, Reynolds RC, Kim H, Koo MS, Ekonomidis M, Talaue M, Paget SD, Woolhiser LK, Lenaerts AJ, Bunin BA, Connell N (2013) Bayesian models leveraging bioactivity and cytotoxicity information for drug discovery. Chem Biol 20:370\u2013378","journal-title":"Chem Biol"},{"key":"442_CR11","doi-asserted-by":"publisher","first-page":"1413","DOI":"10.1021\/ci200409x","volume":"52","author":"A Varnek","year":"2012","unstructured":"Varnek A, Baskin I (2012) Machine learning methods for property prediction in chemoinformatics. Quo vadis? J Chem Inf Model 52:1413\u20131437","journal-title":"Quo vadis? J Chem Inf Model"},{"key":"442_CR12","doi-asserted-by":"publisher","first-page":"785","DOI":"10.1080\/17460441.2016.1201262","volume":"11","author":"II Baskin","year":"2016","unstructured":"Baskin II, Winkler D, Tetko IV (2016) A Renaissance of Neural Networks in Drug Discovery. Expert Opin Drug Discov 11:785\u2013795","journal-title":"Expert Opin Drug Discov"},{"key":"442_CR13","doi-asserted-by":"publisher","first-page":"1241","DOI":"10.1016\/j.drudis.2018.01.039","volume":"23","author":"H Chen","year":"2018","unstructured":"Chen H, Engkvist O, Wang Y, Olivecrona M, Blaschke T (2018) The Rise of Deep Learning in Drug Discovery. Drug Discov Today 23:1241\u20131250","journal-title":"Drug Discov Today"},{"key":"442_CR14","doi-asserted-by":"publisher","first-page":"0119301","DOI":"10.1371\/journal.pone.0119301","volume":"10","author":"J Balfer","year":"2015","unstructured":"Balfer J, Bajorath J (2015) Systematic artifacts in support vector regression-based compound potency prediction revealed by statistical and activity landscape analysis. PLoS ONE 10:0119301","journal-title":"PLoS ONE"},{"key":"442_CR15","doi-asserted-by":"crossref","unstructured":"Boser BE, Guyon IM, Vapnik VN (1992) A training algorithm for optimal margin classifiers. In: Proceedings of the 5th annual workshop on computational learning theory: Pittsburgh, Pennsylvania, pp\u00a0144\u2013152","DOI":"10.1145\/130385.130401"},{"key":"442_CR16","doi-asserted-by":"publisher","first-page":"1093","DOI":"10.1016\/j.neunet.2005.07.009","volume":"18","author":"L Ralaivola","year":"2005","unstructured":"Ralaivola L, Swamidass SJ, Saigo H, Baldi P (2005) Graph kernels for chemical informatics. Neural Netw 18:1093\u20131110","journal-title":"Neural Netw"},{"key":"442_CR17","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1517\/17460441.2014.866943","volume":"9","author":"K Heikamp","year":"2014","unstructured":"Heikamp K, Bajorath J (2014) Support vector machines for drug discovery. Expert Opin Drug Discov 9:93\u2013104","journal-title":"Expert Opin Drug Discov"},{"key":"442_CR18","doi-asserted-by":"publisher","first-page":"24","DOI":"10.2174\/157340910790980124","volume":"6","author":"K Hasegawa","year":"2010","unstructured":"Hasegawa K, Funatsu K (2010) Non-linear modeling and chemical interpretation with aid of support vector machine and regression. Curr Comput-Aided Drug Des 6:24\u201336","journal-title":"Curr Comput-Aided Drug Des"},{"key":"442_CR19","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1517\/17460441.2013.761204","volume":"8","author":"S Kar","year":"2013","unstructured":"Kar S, Roy K (2013) How far can virtual screening take us in drug discovery? Expert Opin Drug Discov 8:245\u2013261","journal-title":"Expert Opin Drug Discov"},{"key":"442_CR20","doi-asserted-by":"publisher","first-page":"1122","DOI":"10.1021\/ci049732r","volume":"45","author":"J Saeh","year":"2005","unstructured":"Saeh J, Lyne PD, Takasaki BK, Cosgrove DA (2005) Lead hopping using SVM and 3D pharmacophore fingerprints. J Chem Inf Model 45:1122\u20131133","journal-title":"J Chem Inf Model"},{"key":"442_CR21","doi-asserted-by":"publisher","first-page":"1227","DOI":"10.1021\/ci800022e","volume":"48","author":"XH Ma","year":"2008","unstructured":"Ma XH, Wang R, Yang SY, Xue Y, Wei YC, Low BC, Chen YZ (2008) Evaluation of virtual screening performance using support vector machines trained by sparsely distributed active compounds. J Chem Inf Model 48:1227\u20131237","journal-title":"J Chem Inf Model"},{"key":"442_CR22","doi-asserted-by":"publisher","first-page":"1545","DOI":"10.1021\/mp100179t","volume":"7","author":"XH Ma","year":"2010","unstructured":"Ma XH, Wang R, Tan CY, Jiang YY, Lu T, Rao HB, Li XY, Go ML, Low BC, Chen YZ (2010) Virtual screening of selective multitarget kinase inhibitors by combinatorial support vector machines. Mol Pharm 7:1545\u20131560","journal-title":"Mol Pharm"},{"key":"442_CR23","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1021\/ci800366f","volume":"49","author":"H Tang","year":"2009","unstructured":"Tang H, Wang XS, Huang X, Roth BL, Butler KV, Kozikowski AP, Jung M, Tropsha A (2009) Novel inhibitors of human histone deacetylase (HDAC) identified by QSAR modeling of known inhibitors, virtual screening, and experimental validation. J Chem Inf Model 49:461\u2013476","journal-title":"J Chem Inf Model"},{"key":"442_CR24","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: Machine Learning in Python. J Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"key":"442_CR25","doi-asserted-by":"publisher","first-page":"1545","DOI":"10.1021\/mp100179t","volume":"7","author":"XH Ma","year":"2010","unstructured":"Ma XH, Wang R, Tan CY, Jiang YY, Lu T, Rao HB, Li XY, Go ML, Low BC, Chen YZ (2010) Virtual screening of selective multitarget kinase inhibitors by combinatorial support vector machines. Mol Pharm 7:1545\u20131560","journal-title":"Mol Pharm"},{"key":"442_CR26","doi-asserted-by":"publisher","first-page":"49","DOI":"10.1016\/j.jmgm.2011.09.002","volume":"32","author":"Z Shi","year":"2012","unstructured":"Shi Z, Ma XH, Qin C, Jia J, Jiang YY, Tan CY, Chen YZ (2012) Combinatorial support vector machines approach for virtual screening of selective multi-target serotonin reuptake inhibitors from large compounds libraries. J Mol Graph Model 32:49\u201366","journal-title":"J Mol Graph Model"},{"key":"442_CR27","doi-asserted-by":"publisher","first-page":"1152","DOI":"10.1021\/ci7004753","volume":"48","author":"K Kawai","year":"2008","unstructured":"Kawai K, Fujishima S, Takahashi Y (2008) Predictive activity profiling of drugs by topological-fragment-spectra-based support vector machines. J Chem Inf Model 48:1152\u20131160","journal-title":"J Chem Inf Model"},{"key":"442_CR28","doi-asserted-by":"publisher","first-page":"75","DOI":"10.1111\/cbdd.12294","volume":"84","author":"J Balfer","year":"2014","unstructured":"Balfer J, Heikamp K, Laufer S, Bajorath J (2014) Modeling of compound profiling experiments using support vector machines. Chem Biol Drug Des 84:75\u201385","journal-title":"Chem Biol Drug Des"},{"key":"442_CR29","doi-asserted-by":"publisher","first-page":"2149","DOI":"10.1093\/bioinformatics\/btn409","volume":"24","author":"L Jacob","year":"2008","unstructured":"Jacob L, Vert JP (2008) Protein-ligand interaction prediction: an improved chemogenomics approach. Bioinformatics 24:2149\u20132156","journal-title":"Bioinformatics"},{"key":"442_CR30","doi-asserted-by":"publisher","first-page":"391","DOI":"10.1021\/ci025569t","volume":"43","author":"A Schuffenhauer","year":"2003","unstructured":"Schuffenhauer A, Floersheim P, Acklin P, Jacoby E (2003) Similarity metrics for ligands reflecting the similarity of the target proteins. J Chem Inf Comput Sci 43:391\u2013405","journal-title":"J Chem Inf Comput Sci"},{"key":"442_CR31","doi-asserted-by":"publisher","first-page":"767","DOI":"10.1021\/ci900004a","volume":"49","author":"H Geppert","year":"2009","unstructured":"Geppert H, Humrich J, Stumpfe D, G\u00e4rtner T, Bajorath J (2009) Ligand prediction from protein sequence and small molecule information using support vector machines and fingerprint descriptors. J Chem Inf Model 49:767\u2013779","journal-title":"J Chem Inf Model"},{"key":"442_CR32","doi-asserted-by":"publisher","first-page":"2155","DOI":"10.1021\/ci9002624","volume":"49","author":"AM Wassermann","year":"2009","unstructured":"Wassermann AM, Geppert H, Bajorath J (2009) Ligand prediction for orphan targets using support vector machines and various target-ligands kernels is dominated by nearest neighbor effects. J Chem Inf Model 49:2155\u20132167","journal-title":"J Chem Inf Model"},{"key":"442_CR33","doi-asserted-by":"publisher","first-page":"1535","DOI":"10.1021\/ci060117s","volume":"46","author":"GM Maggiora","year":"2006","unstructured":"Maggiora GM (2006) On outliers and activity cliffs: Why QSAR often disappoints. J Chem Inf Model 46:1535\u20131535","journal-title":"J Chem Inf Model"},{"key":"442_CR34","doi-asserted-by":"publisher","first-page":"2932","DOI":"10.1021\/jm201706b","volume":"55","author":"D Stumpfe","year":"2012","unstructured":"Stumpfe D, Bajorath J (2012) Exploring activity cliffs in medicinal chemistry. J Med Chem 55:2932\u20132942","journal-title":"J Med Chem"},{"key":"442_CR35","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1021\/ci900450m","volume":"50","author":"J Hussain","year":"2010","unstructured":"Hussain J, Rea C (2010) Computationally efficient algorithm to identify matched molecular pairs (MMPs) in large data sets. J Chem Inf Model 50:339\u2013348","journal-title":"J Chem Inf Model"},{"key":"442_CR36","doi-asserted-by":"publisher","first-page":"2354","DOI":"10.1021\/ci300306a","volume":"52","author":"K Heikamp","year":"2012","unstructured":"Heikamp K, Hu X, Yan A, Bajorath J (2012) Prediction of activity cliffs using support vector machines. J Chem Inf Model 52:2354\u20132365","journal-title":"J Chem Inf Model"},{"key":"442_CR37","doi-asserted-by":"publisher","first-page":"2654","DOI":"10.1021\/ci5003944","volume":"54","author":"A de Le\u00f3n","year":"2014","unstructured":"De la Vega, de Le\u00f3n A, Bajorath J (2014) Prediction of Compound Potency Changes in Matched Molecular Pairs Using Support Vector Regression. J Chem Inf Model 54:2654\u20132663","journal-title":"J Chem Inf Model"},{"key":"442_CR38","doi-asserted-by":"crossref","unstructured":"Horvath D, Marcou G, Varnek A, de la Kayastha S, Bajorath J (2016) Prediction of activity cliffs using condensed graphs of reaction representations, descriptor recombination, support vector machine classification, and support vector regression. J Chem Inf Model 56:1631\u20131640","DOI":"10.1021\/acs.jcim.6b00359"},{"key":"442_CR39","volume-title":"Pattern recognition and machine learning","author":"C Bishop","year":"2006","unstructured":"Bishop C (2006) Pattern recognition and machine learning. Springer, New York"},{"key":"442_CR40","doi-asserted-by":"publisher","first-page":"710","DOI":"10.1021\/acs.jcim.7b00088","volume":"57","author":"R Rodr\u00edguez-P\u00e9rez","year":"2017","unstructured":"Rodr\u00edguez-P\u00e9rez R, Vogt M, Bajorath J (2017) Influence of varying training set composition and size on support vector machine-based prediction of active compounds. J Chem Inf Model 57:710\u2013716","journal-title":"J Chem Inf Model"},{"key":"442_CR41","doi-asserted-by":"publisher","first-page":"1021","DOI":"10.1021\/ci100091e","volume":"50","author":"L Peltason","year":"2010","unstructured":"Peltason L, Iyer P, Bajorath J (2010) Rationalizing three-dimensional activity landscapes and the influence of molecular representations on landscape topology and formation of activity cliffs. J Chem Inf Model 50:1021\u20131033","journal-title":"J Chem Inf Model"},{"key":"442_CR42","doi-asserted-by":"publisher","first-page":"2618","DOI":"10.1021\/acs.jcim.7b00274","volume":"57","author":"P Polishchuk","year":"2017","unstructured":"Polishchuk P (2017) Interpretation of quantitative structure-activity relationship models: Past, present, and future. J Chem Inf Model 57:2618\u20132639","journal-title":"J Chem Inf Model"},{"key":"442_CR43","doi-asserted-by":"publisher","first-page":"178","DOI":"10.1016\/j.neucom.2010.02.016","volume":"74","author":"N Barakat","year":"2010","unstructured":"Barakat N, Bradley AP (2010) Rule extraction from support vector machines: A review. Neurocomputing 74:178\u2013190","journal-title":"Neurocomputing"},{"key":"442_CR44","doi-asserted-by":"publisher","first-page":"817","DOI":"10.1002\/minf.201100059","volume":"30","author":"K Hansen","year":"2011","unstructured":"Hansen K, Baehrens D, Schroeter T, Rupp M, M\u00fcller KR (2011) Visual interpretation of kernel-based prediction models. Mol Inf 30:817\u2013826","journal-title":"Mol Inf"},{"key":"442_CR45","doi-asserted-by":"publisher","first-page":"1136","DOI":"10.1021\/acs.jcim.5b00175","volume":"55","author":"J Balfer","year":"2015","unstructured":"Balfer J, Bajorath J (2015) Visualization and interpretation of support vector machine activity predictions. J Chem Inf Model 55:1136\u20131147","journal-title":"J Chem Inf Model"},{"key":"442_CR46","doi-asserted-by":"publisher","first-page":"6371","DOI":"10.1021\/acsomega.7b01079","volume":"2","author":"R Rodr\u00edguez-P\u00e9rez","year":"2017","unstructured":"Rodr\u00edguez-P\u00e9rez R, Vogt M, Bajorath J (2017) Support vector machine classification and regression prioritize different structural features for binary compound activity and potency value prediction. ACS Omega 2:6371\u20136379","journal-title":"ACS Omega"},{"key":"442_CR47","first-page":"1","volume-title":"Introduction to sensitivity analysis. Handbook of Uncertainty Quantification. Ghanem","author":"B Iooss","year":"2016","unstructured":"Iooss B, Saltelli A (2016) In: Higdon R, Owhadi D (eds) Introduction to sensitivity analysis. Handbook of Uncertainty Quantification. Ghanem. Springer International Publishing, Cham, pp 1\u201320"},{"key":"442_CR48","unstructured":"Lundberg SM, Lee S (2017) A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems 30 (NIPS)"}],"container-title":["Journal of Computer-Aided Molecular Design"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10822-022-00442-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10822-022-00442-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10822-022-00442-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,26]],"date-time":"2022-07-26T09:24:53Z","timestamp":1658827493000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10822-022-00442-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,19]]},"references-count":48,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2022,5]]}},"alternative-id":["442"],"URL":"https:\/\/doi.org\/10.1007\/s10822-022-00442-9","relation":{},"ISSN":["0920-654X","1573-4951"],"issn-type":[{"value":"0920-654X","type":"print"},{"value":"1573-4951","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,19]]},"assertion":[{"value":"8 February 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"15 February 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 March 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}