{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T10:25:42Z","timestamp":1780395942305,"version":"3.54.1"},"reference-count":79,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T00:00:00Z","timestamp":1719532800000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,6,28]],"date-time":"2024-06-28T00:00:00Z","timestamp":1719532800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004359","name":"Vetenskapsr\u00e5det","doi-asserted-by":"publisher","award":["2020-03731"],"award-info":[{"award-number":["2020-03731"]}],"id":[{"id":"10.13039\/501100004359","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001862","name":"Svenska Forskningsr\u00e5det Formas","doi-asserted-by":"publisher","award":["2022-00940"],"award-info":[{"award-number":["2022-00940"]}],"id":[{"id":"10.13039\/501100001862","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002794","name":"Cancerfonden","doi-asserted-by":"publisher","award":["22 2412"],"award-info":[{"award-number":["22 2412"]}],"id":[{"id":"10.13039\/501100002794","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100018693","name":"HORIZON EUROPE Framework Programme","doi-asserted-by":"publisher","award":["101057014"],"award-info":[{"award-number":["101057014"]}],"id":[{"id":"10.13039\/100018693","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100007051","name":"Uppsala University","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100007051","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Conformal prediction has seen many applications in pharmaceutical science, being able to calibrate outputs of machine learning models and producing valid prediction intervals. We here present the open source software CPSign that is a complete implementation of conformal prediction for cheminformatics modeling. CPSign implements inductive and transductive conformal prediction for classification and regression, and probabilistic prediction with the Venn-ABERS methodology. The main chemical representation is signatures but other types of descriptors are also supported. The main modeling methodology is support vector machines (SVMs), but additional modeling methods are supported via an extension mechanism, e.g. DeepLearning4J models. We also describe features for visualizing results from conformal models including calibration and efficiency plots, as well as features to publish predictive models as REST services. We compare CPSign against other common cheminformatics modeling approaches including random forest, and a directed message-passing neural network. The results show that CPSign produces robust predictive performance with comparative predictive efficiency, with superior runtime and lower hardware requirements compared to neural network based models. CPSign has been used in several studies and is in production-use in multiple organizations. The ability to work directly with chemical input files, perform descriptor calculation and modeling with SVM in the conformal prediction framework, with a single software package having a low footprint and fast execution time makes CPSign a convenient and yet flexible package for training, deploying, and predicting on chemical data. CPSign can be downloaded from GitHub at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/arosbio\/cpsign\">https:\/\/github.com\/arosbio\/cpsign<\/jats:ext-link>.<\/jats:p><jats:p><jats:bold>Scientific contribution<\/jats:bold><\/jats:p><jats:p> CPSign provides a single software that allows users to perform data preprocessing, modeling and make predictions directly on chemical structures, using conformal and probabilistic prediction. Building and evaluating new models can be achieved at a high abstraction level, without sacrificing flexibility and predictive performance\u2014showcased with a method evaluation against contemporary modeling approaches, where CPSign performs on par with a state-of-the-art deep learning based model.<\/jats:p>","DOI":"10.1186\/s13321-024-00870-9","type":"journal-article","created":{"date-parts":[[2024,7,7]],"date-time":"2024-07-07T19:01:13Z","timestamp":1720378873000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["CPSign: conformal prediction for cheminformatics modeling"],"prefix":"10.1186","volume":"16","author":[{"given":"Staffan","family":"Arvidsson McShane","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ulf","family":"Norinder","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jonathan","family":"Alvarsson","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ernst","family":"Ahlberg","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Lars","family":"Carlsson","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ola","family":"Spjuth","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,6,28]]},"reference":[{"issue":"6","key":"870_CR1","doi-asserted-by":"publisher","first-page":"463","DOI":"10.1038\/s41573-019-0024-5","volume":"18","author":"J Vamathevan","year":"2019","unstructured":"Vamathevan J, Clark D, Czodrowski P, Dunham I, Ferran E, Lee G, Li B, Madabhushi A, Shah P, Spitzer M, Zhao S (2019) Applications of machine learning in drug discovery and development. Nat Rev Drug Discov 18(6):463\u2013477","journal-title":"Nat Rev Drug Discov"},{"issue":"9","key":"870_CR2","doi-asserted-by":"publisher","first-page":"624","DOI":"10.1016\/j.tips.2019.07.005","volume":"40","author":"AO Basile","year":"2019","unstructured":"Basile AO, Yahi A, Tatonetti NP (2019) Artificial intelligence for drug toxicity and safety. Trends Pharmacol Sci 40(9):624\u2013635","journal-title":"Trends Pharmacol Sci"},{"issue":"11","key":"870_CR3","doi-asserted-by":"publisher","first-page":"3525","DOI":"10.1039\/D0CS00098A","volume":"49","author":"EN Muratov","year":"2020","unstructured":"Muratov EN, Bajorath J, Sheridan RP, Tetko IV, Filimonov D, Poroikov V, Oprea TI, Baskin II, Varnek A, Roitberg A, Isayev O, Curtarolo S, Fourches D, Cohen Y, Aspuru-Guzik A, Winkler DA, Agrafiotis D, Cherkasov A, Tropsha A (2020) QSAR without borders. Chem Soc Rev 49(11):3525\u20133564","journal-title":"Chem Soc Rev"},{"issue":"9","key":"870_CR4","doi-asserted-by":"publisher","first-page":"949","DOI":"10.1080\/17460441.2021.1909567","volume":"16","author":"J Jim\u00e9nez-Luna","year":"2021","unstructured":"Jim\u00e9nez-Luna J, Grisoni F, Weskamp N, Schneider G (2021) Artificial intelligence in drug discovery: recent advances and future perspectives. Expert Opin Drug Discov 16(9):949\u2013959","journal-title":"Expert Opin Drug Discov"},{"issue":"1","key":"870_CR5","first-page":"45","volume":"1","author":"D Gadaleta","year":"2016","unstructured":"Gadaleta D, Mangiatordi GF, Catto M, Carotti A, Nicolotti O (2016) Applicability domain for QSAR models: where theory meets reality. Int J Quant Struct Prop Relatsh 1(1):45\u201363","journal-title":"Int J Quant Struct Prop Relatsh"},{"key":"870_CR6","doi-asserted-by":"publisher","DOI":"10.1007\/b106715","volume-title":"Algorithmic learning in a random world","author":"V Vovk","year":"2005","unstructured":"Vovk V, Gammerman A, Shafer G (2005) Algorithmic learning in a random world. Springer, New York. https:\/\/doi.org\/10.1007\/b106715"},{"issue":"6","key":"870_CR7","doi-asserted-by":"publisher","first-page":"1596","DOI":"10.1021\/ci5001168","volume":"54","author":"U Norinder","year":"2014","unstructured":"Norinder U, Carlsson L, Boyer S, Eklund M (2014) Introducing conformal prediction in predictive modeling: a transparent and flexible alternative to applicability domain determination. J Chem Inf Model 54(6):1596\u2013603. https:\/\/doi.org\/10.1021\/ci5001168","journal-title":"J Chem Inf Model"},{"issue":"4","key":"870_CR8","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1080\/1062936X.2016.1172665","volume":"27","author":"U Norinder","year":"2016","unstructured":"Norinder U, Rybacka A, Andersson PL (2016) Conformal prediction to define applicability domain\u2014a case study on predicting ER and AR binding. SAR QSAR Environ Res 27(4):303\u201316. https:\/\/doi.org\/10.1080\/1062936X.2016.1172665","journal-title":"SAR QSAR Environ Res"},{"issue":"1","key":"870_CR9","doi-asserted-by":"publisher","first-page":"42","DOI":"10.1016\/j.xphs.2020.09.055","volume":"110","author":"J Alvarsson","year":"2021","unstructured":"Alvarsson J, McShane SA, Norinder U, Spjuth O (2021) Predicting with confidence: using conformal prediction in drug discovery. J Pharm Sci 110(1):42\u201349","journal-title":"J Pharm Sci"},{"issue":"1","key":"870_CR10","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1186\/s13321-018-0260-4","volume":"10","author":"F Svensson","year":"2018","unstructured":"Svensson F, Afzal AM, Norinder U, Bender A (2018) Maximizing gain in high-throughput screening using conformal prediction. J Cheminform 10(1):7. https:\/\/doi.org\/10.1186\/s13321-018-0260-4","journal-title":"J Cheminform"},{"issue":"1","key":"870_CR11","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1039\/c6tx00252h","volume":"6","author":"F Svensson","year":"2017","unstructured":"Svensson F, Norinder U, Bender A (2017) Modelling compound cytotoxicity using conformal prediction and PubChem HTS data. Toxicol Res 6(1):73\u201380. https:\/\/doi.org\/10.1039\/c6tx00252h","journal-title":"Toxicol Res"},{"issue":"1","key":"870_CR12","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1186\/s13321-021-00511-5","volume":"13","author":"A Morger","year":"2021","unstructured":"Morger A, Svensson F, Arvidsson McShane S, Gauraha N, Norinder U, Spjuth O, Volkamer A (2021) Assessing the calibration in toxicological in vitro models with conformal prediction. J Cheminf 13(1):35","journal-title":"J Cheminf"},{"issue":"1","key":"870_CR13","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1186\/s13321-018-0271-1","volume":"10","author":"M Lapins","year":"2018","unstructured":"Lapins M, Arvidsson S, Lampa S, Berg A, Schaal W, Alvarsson J, Spjuth O (2018) A confidence predictor for logD using conformal regression and a support-vector machine. J Cheminform 10(1):17. https:\/\/doi.org\/10.1186\/s13321-018-0271-1","journal-title":"J Cheminform"},{"key":"870_CR14","doi-asserted-by":"publisher","first-page":"1256","DOI":"10.3389\/fphar.2018.01256","volume":"9","author":"S Lampa","year":"2018","unstructured":"Lampa S, Alvarsson J, Arvidsson Mc\u00a0Shane S, Berg A, Ahlberg E, Spjuth O (2018) Predicting off-target binding profiles with confidence using conformal prediction. Front Pharmacol 9:1256. https:\/\/doi.org\/10.3389\/fphar.2018.01256","journal-title":"Front Pharmacol"},{"issue":"2","key":"870_CR15","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1080\/00498254.2022.2049397","volume":"52","author":"U Fagerholm","year":"2022","unstructured":"Fagerholm U, Hellberg S, Alvarsson J, Spjuth O (2022) In silico predictions of the human pharmacokinetics\/toxicokinetics of 65 chemicals from various classes using conformal prediction methodology. Xenobiotica 52(2):113\u2013118. https:\/\/doi.org\/10.1080\/00498254.2022.2049397","journal-title":"Xenobiotica"},{"issue":"1","key":"870_CR16","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1177\/02611929221148447","volume":"51","author":"U Fagerholm","year":"2023","unstructured":"Fagerholm U, Hellberg S, Alvarsson J, Spjuth O (2023) In silico prediction of human clinical pharmacokinetics with andromeda by prosilico: predictions for an established benchmarking data set, a modern small drug data set, and a comparison with laboratory methods. Altern Lab Anim 51(1):39\u201354. https:\/\/doi.org\/10.1177\/02611929221148447","journal-title":"Altern Lab Anim"},{"issue":"3","key":"870_CR17","doi-asserted-by":"publisher","first-page":"1269","DOI":"10.1021\/acs.jcim.8b00542","volume":"59","author":"I Cort\u00e9s-Ciriano","year":"2019","unstructured":"Cort\u00e9s-Ciriano I, Bender A (2019) Deep confidence: a computationally efficient framework for calculating reliable prediction errors for deep neural networks. J Chem Inf Model 59(3):1269\u20131281. https:\/\/doi.org\/10.1021\/acs.jcim.8b00542","journal-title":"J Chem Inf Model"},{"key":"870_CR18","doi-asserted-by":"publisher","DOI":"10.3390\/molecules28010217","author":"U Norinder","year":"2022","unstructured":"Norinder U (2022) Traditional machine and deep learning for predicting toxicity endpoints. Molecules. https:\/\/doi.org\/10.3390\/molecules28010217","journal-title":"Molecules"},{"issue":"6","key":"870_CR19","doi-asserted-by":"publisher","first-page":"2648","DOI":"10.1021\/acs.jcim.1c00208","volume":"61","author":"J Zhang","year":"2021","unstructured":"Zhang J, Norinder U, Svensson F (2021) Deep learning-based conformal prediction of toxicity. J Chem Inf Model 61(6):2648\u20132657. https:\/\/doi.org\/10.1021\/acs.jcim.1c00208","journal-title":"J Chem Inf Model"},{"issue":"1","key":"870_CR20","doi-asserted-by":"publisher","first-page":"7761","DOI":"10.1038\/s41467-022-34945-8","volume":"13","author":"H Olsson","year":"2022","unstructured":"Olsson H, Kartasalo K, Mulliqi N, Capuccini M, Ruusuvuori P, Samaratunga H, Delahunt B, Lindskog C, Janssen EAM, Blilie A, Egevad L, Spjuth O, Eklund M, ISUP Prostate Imagebase Expert Panel (2022) Estimating diagnostic uncertainty in artificial intelligence assisted pathology using conformal prediction. Nat Commun 13(1):7761. https:\/\/doi.org\/10.1038\/s41467-022-34945-8","journal-title":"Nat Commun"},{"key":"870_CR21","unstructured":"Linusson H. Nonconformist. 2015. http:\/\/donlnz.github.io\/nonconformist\/. Accessed Aug 2023"},{"key":"870_CR22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-018-0323-6","volume":"11","author":"N Bosc","year":"2019","unstructured":"Bosc N, Atkinson F, Felix E, Gaulton A, Hersey A, Leach AR (2019) Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery. J Cheminf 11:1\u201316","journal-title":"J Cheminf"},{"issue":"3","key":"870_CR23","doi-asserted-by":"publisher","first-page":"439","DOI":"10.1021\/acs.jcim.6b00532","volume":"57","author":"F Svensson","year":"2017","unstructured":"Svensson F, Norinder U, Bender A (2017) Improving screening efficiency through iterative screening using docking and conformal prediction. J Chem Inf Model 57(3):439\u2013444","journal-title":"J Chem Inf Model"},{"issue":"4","key":"870_CR24","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1080\/1062936X.2019.1591503","volume":"30","author":"U Norinder","year":"2019","unstructured":"Norinder U, Naveja JJ, L\u00f3pez-L\u00f3pez E, Mucs D, Medina-Franco JL (2019) Conformal prediction of HDAC inhibitors. SAR QSAR Environ Res 30(4):265\u2013277. https:\/\/doi.org\/10.1080\/1062936X.2019.1591503","journal-title":"SAR QSAR Environ Res"},{"key":"870_CR25","unstructured":"Mendil M, Mossina L, Vigouroux D. PUNCC: a python library for predictive uncertainty calibration and conformalization. In: Conformal and Probabilistic Prediction with Applications, PMLR. 2023. p. 582\u2013601."},{"key":"870_CR26","unstructured":"Bostr\u00f6m H (2022) crepes: a python package for generating conformal regressors and predictive systems. In: Conformal and Probabilistic Prediction with Applications, pp. 24\u201341. PMLR"},{"key":"870_CR27","doi-asserted-by":"publisher","unstructured":"Manokhin V. Awesome conformal prediction. https:\/\/doi.org\/10.5281\/zenodo.6467205. https:\/\/doi.org\/10.5281\/zenodo.6467205. Accessed Nov 2023","DOI":"10.5281\/zenodo.6467205"},{"issue":"3","key":"870_CR28","doi-asserted-by":"publisher","first-page":"707","DOI":"10.1021\/ci020345w","volume":"43","author":"J-L Faulon","year":"2003","unstructured":"Faulon J-L, Visco DP, Pophale RS (2003) The signature molecular descriptor. 1. Using extended valence sequences in QSAR and QSPR studies. J Chem Inf Model 43(3):707\u2013720. https:\/\/doi.org\/10.1021\/ci020345w","journal-title":"J Chem Inf Model"},{"issue":"3","key":"870_CR29","doi-asserted-by":"publisher","first-page":"721","DOI":"10.1021\/ci020346o","volume":"43","author":"J-L Faulon","year":"2003","unstructured":"Faulon J-L, Churchwell CJ, Visco DP (2003) The signature molecular descriptor. 2. Enumerating molecules from their extended valence sequences. J Chem Inf Model 43(3):721\u2013734. https:\/\/doi.org\/10.1021\/ci020346o","journal-title":"J Chem Inf Model"},{"issue":"6","key":"870_CR30","doi-asserted-by":"publisher","first-page":"1596","DOI":"10.1021\/ci5001168","volume":"54","author":"U Norinder","year":"2014","unstructured":"Norinder U, Carlsson L, Boyer S, Eklund M (2014) Introducing conformal prediction in predictive modeling: a transparent and flexible alternative to applicability domain determination. J Chem Inf Model 54(6):1596\u20131603","journal-title":"J Chem Inf Model"},{"issue":"7","key":"870_CR31","doi-asserted-by":"publisher","first-page":"1591","DOI":"10.1021\/acs.jcim.7b00159","volume":"57","author":"J Sun","year":"2017","unstructured":"Sun J, Carlsson L, Ahlberg E, Norinder U, Engkvist O, Chen H (2017) Applying mondrian cross-conformal prediction to estimate prediction confidence on large imbalanced bioactivity data sets. J Chem Inf Model 57(7):1591\u20131598. https:\/\/doi.org\/10.1021\/acs.jcim.7b00159","journal-title":"J Chem Inf Model"},{"key":"870_CR32","doi-asserted-by":"publisher","first-page":"256","DOI":"10.1016\/j.jmgm.2017.01.008","volume":"72","author":"U Norinder","year":"2017","unstructured":"Norinder U, Boyer S (2017) Binary classification of imbalanced datasets using conformal prediction. J Mol Gr Modell 72:256\u2013265","journal-title":"J Mol Gr Modell"},{"key":"870_CR33","doi-asserted-by":"crossref","unstructured":"Vovk V, Fedorova V, Nouretdinov I, Gammerman A. Criteria of efficiency for conformal prediction. In: Symp. on Conformal and Probabilistic Prediction with Appl. Springer; 2016. p. 23\u201339.","DOI":"10.1007\/978-3-319-33395-3_2"},{"key":"870_CR34","unstructured":"Vovk V. Venn predictors and isotonic regression. CoRR abs\/1211.0025. 2012."},{"key":"870_CR35","unstructured":"Vovk V, Petej I, Fedorova V. Large-scale probabilistic prediction with and without validity guarantees. In: Proceedings of NIPS, vol. 2015. 2015."},{"key":"870_CR36","unstructured":"Sweidan D, Johansson U. Probabilistic prediction in scikit-learn. In: The 18th International Conference on Modeling Decisions for Artificial Intelligence, Sept 27\u201330, 2021. 2021."},{"issue":"3","key":"870_CR37","doi-asserted-by":"publisher","first-page":"1230","DOI":"10.1021\/acs.jcim.8b00724","volume":"59","author":"R Buendia","year":"2019","unstructured":"Buendia R, Kogej T, Engkvist O, Carlsson L, Linusson H, Johansson U, Toccaceli P, Ahlberg E (2019) Accurate hit estimation for iterative screening using venn\u2013abers predictors. J Chem Inf Model 59(3):1230\u20131237","journal-title":"J Chem Inf Model"},{"key":"870_CR38","unstructured":"Arvidsson S, Spjuth O, Carlsson L, Toccaceli P. Prediction of metabolic transformations using cross venn\u2013abers predictors. In: Conformal and Probabilistic Prediction and Applications, PMLR. 2017. p. 118\u201331."},{"key":"870_CR39","unstructured":"Ahlberg E, Buendia R, Carlsson L. Using venn\u2013abers predictors to assess cardio-vascular risk. In: Conformal and Probabilistic Prediction and Applications, PMLR. 2018. p. 132\u201346."},{"key":"870_CR40","volume-title":"Handbook of molecular descriptors","author":"R Todeschini","year":"2008","unstructured":"Todeschini R, Consonni V (2008) Handbook of molecular descriptors. John Wiley & Sons, Hoboken"},{"issue":"3","key":"870_CR41","first-page":"199","volume":"9","author":"RC Glen","year":"2006","unstructured":"Glen RC, Bender A, Arnby CH, Carlsson L, Boyer S, Smith J (2006) Circular fingerprints: flexible molecular descriptors with applications from physical chemistry to ADME. IDrugs 9(3):199","journal-title":"IDrugs"},{"issue":"2","key":"870_CR42","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1021\/c160017a018","volume":"5","author":"HL Morgan","year":"1965","unstructured":"Morgan HL (1965) The generation of a unique machine description for chemical structures-a technique developed at chemical abstracts service. J Chem Doc 5(2):107\u2013113","journal-title":"J Chem Doc"},{"issue":"5","key":"870_CR43","doi-asserted-by":"publisher","first-page":"742","DOI":"10.1021\/ci100050t","volume":"50","author":"D Rogers","year":"2010","unstructured":"Rogers D, Hahn M (2010) Extended-connectivity fingerprints. J Chem Inf Model 50(5):742\u2013754","journal-title":"J Chem Inf Model"},{"issue":"17","key":"870_CR44","doi-asserted-by":"publisher","first-page":"2111","DOI":"10.2174\/138161206777585274","volume":"12","author":"C Steinbeck","year":"2006","unstructured":"Steinbeck C, Hoppe C, Kuhn S, Floris M, Guha R, Willighagen EL (2006) Recent developments of the chemistry development kit (CDK)-an open-source java library for chemo-and bioinformatics. Curr Pharm Des 12(17):2111\u20132120","journal-title":"Curr Pharm Des"},{"key":"870_CR45","first-page":"1871","volume":"9","author":"R-E Fan","year":"2008","unstructured":"Fan R-E, Chang K-W, Hsieh C-J, Wang X-R, Lin C-J (2008) LIBLINEAR: a library for large linear classification. J Mach Learn Res 9:1871\u20131874","journal-title":"J Mach Learn Res"},{"key":"870_CR46","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1145\/1961189.1961199","volume":"2","author":"C-C Chang","year":"2011","unstructured":"Chang C-C, Lin C-J (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2:27\u201312727","journal-title":"ACM Trans Intell Syst Technol"},{"issue":"1","key":"870_CR47","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-016-0151-5","volume":"8","author":"J Alvarsson","year":"2016","unstructured":"Alvarsson J, Lampa S, Schaal W, Andersson C, Wikberg JE, Spjuth O (2016) Large-scale ligand-based predictive modelling using support vector machines. J Cheminf 8(1):1\u20139","journal-title":"J Cheminf"},{"issue":"11","key":"870_CR48","doi-asserted-by":"publisher","first-page":"3211","DOI":"10.1021\/ci500344v","volume":"54","author":"J Alvarsson","year":"2014","unstructured":"Alvarsson J, Eklund M, Andersson C, Carlsson L, Spjuth O, Wikberg JE (2014) Benchmarking study of parameter variation when using signature fingerprints together with support vector machines. J Chem Inf Model 54(11):3211\u20133217","journal-title":"J Chem Inf Model"},{"key":"870_CR49","unstructured":"Team EDD. Deeplearning4j: open-source distributed deep learning for the JVM. 2023. https:\/\/deeplearning4j.konduit.ai\/. Accessed Nov 2023"},{"key":"870_CR50","first-page":"231","volume-title":"Artificial intelligence applications and innovations IFIPAICT 14","author":"L Carlsson","year":"2014","unstructured":"Carlsson L, Eklund M, Norinder U (2014) Aggregated conformal prediction. In: Iliadis L, Maglogiannis I, Papadopoulos H, Sioutas S, Makris C (eds) Artificial intelligence applications and innovations IFIPAICT 14. Springer, Berlin, pp 231\u2013240"},{"issue":"1\u20132","key":"870_CR51","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1007\/s10472-013-9368-4","volume":"74","author":"V Vovk","year":"2015","unstructured":"Vovk V (2015) Cross-conformal predictors. Ann Math Artif Intell 74(1\u20132):9\u201328. https:\/\/doi.org\/10.1007\/s10472-013-9368-4","journal-title":"Ann Math Artif Intell"},{"issue":"7","key":"870_CR52","doi-asserted-by":"publisher","first-page":"3722","DOI":"10.1021\/acs.jcim.1c00293","volume":"61","author":"S Arvidsson McShane","year":"2021","unstructured":"Arvidsson McShane S, Ahlberg E, Noeske T, Spjuth O (2021) Machine learning strategies when transitioning between biological assays. J Chem Inf Model 61(7):3722\u20133733","journal-title":"J Chem Inf Model"},{"key":"870_CR53","doi-asserted-by":"crossref","unstructured":"Johansson U, Ahlberg E, Bostr\u00f6m H, Carlsson L, Linusson H, S\u00f6nstr\u00f6d C. Handling small calibration sets in mondrian inductive conformal regressors. In: Int Symp on Statistical Learning and Data Sci, Springer. 2015. p. 271\u201380.","DOI":"10.1007\/978-3-319-17091-6_22"},{"key":"870_CR54","doi-asserted-by":"crossref","unstructured":"Carlsson L, Ahlberg E, Bostr\u00f6m H, Johansson U, Linusson H. Modifications to p-values of conformal predictors. In: Int Symp on Statistical Learning and Data Sci. Springer. 2015. p. 251\u20139.","DOI":"10.1007\/978-3-319-17091-6_20"},{"key":"870_CR55","doi-asserted-by":"crossref","unstructured":"Ahlberg E, Spjuth O, Hasselgren C, Carlsson L. Interpretation of conformal prediction classification models. In: statistical learning and data sciences: third international symposium, SLDS 2015, Egham, UK, April 20\u201323, 2015, Proceedings 3, Springer. 2015. p. 323\u201334.","DOI":"10.1007\/978-3-319-17091-6_27"},{"key":"870_CR56","unstructured":"Deligianni M. Comparison of support vector machines and deep learning For QSAR with conformal prediction. 2022."},{"key":"870_CR57","doi-asserted-by":"publisher","DOI":"10.1177\/02611929221148447","author":"U Fagerholm","year":"2023","unstructured":"Fagerholm U, Hellberg S, Alvarsson J, Spjuth O (2023) In silico prediction of human clinical pharmacokinetics with andromeda by prosilico: predictions for an established benchmarking data set, a modern small drug data set, and a comparison with laboratory methods. Altern Lab Anim. https:\/\/doi.org\/10.1177\/02611929221148447","journal-title":"Altern Lab Anim"},{"key":"870_CR58","doi-asserted-by":"publisher","first-page":"1256","DOI":"10.3389\/fphar.2018.01256","volume":"9","author":"S Lampa","year":"2018","unstructured":"Lampa S, Alvarsson J, Arvidsson Mc Shane S, Berg A, Ahlberg E, Spjuth O (2018) Predicting off-target binding profiles with confidence using conformal prediction. Front Pharmacol 9:1256","journal-title":"Front Pharmacol"},{"key":"870_CR59","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-018-0271-1","volume":"10","author":"M Lapins","year":"2018","unstructured":"Lapins M, Arvidsson S, Lampa S, Berg A, Schaal W, Alvarsson J, Spjuth O (2018) A confidence predictor for logD using conformal regression and a support-vector machine. J Cheminf 10:1\u201310","journal-title":"J Cheminf"},{"key":"870_CR60","unstructured":"Software S. OpenAPI specification. 2023. https:\/\/swagger.io\/specification\/. Accessed Nov 2023"},{"issue":"3","key":"870_CR61","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1109\/MCSE.2007.55","volume":"9","author":"JD Hunter","year":"2007","unstructured":"Hunter JD (2007) Matplotlib: a 2d graphics environment. Comput Sci Eng 9(3):90\u201395. https:\/\/doi.org\/10.1109\/MCSE.2007.55","journal-title":"Comput Sci Eng"},{"key":"870_CR62","doi-asserted-by":"publisher","DOI":"10.1021\/acs.jcim.3c01250","author":"E Heid","year":"2023","unstructured":"Heid E, Greenman KP, Chung Y, Li S-C, Graff DE, Vermeire FH, Wu H, Green WH, McGill CJ (2023) Chemprop: machine learning package for chemical property prediction. J Chem Inf Model. https:\/\/doi.org\/10.1021\/acs.jcim.3c01250","journal-title":"J Chem Inf Model"},{"issue":"2","key":"870_CR63","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1039\/C7SC02664A","volume":"9","author":"Z Wu","year":"2018","unstructured":"Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS, Leswing K, Pande V (2018) MoleculeNet: a benchmark for molecular machine learning. Chemical Sci 9(2):513\u2013530","journal-title":"Chemical Sci"},{"issue":"1","key":"870_CR64","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-020-00443-6","volume":"12","author":"C \u0160kuta","year":"2020","unstructured":"\u0160kuta C, Cort\u00e9s-Ciriano I, Dehaen W, K\u0159\u00ed\u017e P, Westen GJ, Tetko IV, Bender A, Svozil D (2020) QSAR-derived affinity fingerprints (part 1): fingerprint construction and modeling performance for similarity searching, bioactivity classification and scaffold hopping. J Cheminf 12(1):1\u201316","journal-title":"J Cheminf"},{"issue":"1","key":"870_CR65","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13321-022-00672-x","volume":"15","author":"OJ B\u00e9quignon","year":"2023","unstructured":"B\u00e9quignon OJ, Bongers BJ, Jespers W, IJzerman AP, Water B, Westen GJ (2023) Papyrus: a large-scale curated dataset aimed at bioactivity predictions. J Cheminf 15(1):1\u201311","journal-title":"J Cheminf"},{"key":"870_CR66","volume-title":"Deep learning for the life sciences","author":"B Ramsundar","year":"2019","unstructured":"Ramsundar B, Eastman P, Walters P, Pande V, Leswing K, Wu Z (2019) Deep learning for the life sciences. O\u2019Reilly Media, Sebastopol"},{"issue":"7825","key":"870_CR67","doi-asserted-by":"publisher","first-page":"357","DOI":"10.1038\/s41586-020-2649-2","volume":"585","author":"CR Harris","year":"2020","unstructured":"Harris CR, Millman KJ, Walt SJ, Gommers R, Virtanen P, Cournapeau D, Wieser E, Taylor J, Berg S, Smith NJ, Kern R, Picus M, Hoyer S, Kerkwijk MH, Brett M, Haldane A, R\u00edo JF, Wiebe M, Peterson P, G\u00e9rard-Marchant P, Sheppard K, Reddy T, Weckesser W, Abbasi H, Gohlke C, Oliphant TE (2020) Array programming with NumPy. Nature 585(7825):357\u2013362. https:\/\/doi.org\/10.1038\/s41586-020-2649-2","journal-title":"Nature"},{"key":"870_CR68","unstructured":"RDKit: RDKit: Open-source cheminformatics software. https:\/\/zenodo.org\/record\/7671152#.ZFIV43ZBzao. Accessed Aug 2023"},{"issue":"8","key":"870_CR69","doi-asserted-by":"publisher","first-page":"3370","DOI":"10.1021\/acs.jcim.9b00237","volume":"59","author":"K Yang","year":"2019","unstructured":"Yang K, Swanson K, Jin W, Coley C, Eiden P, Gao H, Guzman-Perez A, Hopper T, Kelley B, Mathea M et al (2019) Analyzing learned molecular representations for property prediction. J Chem Inf Model 59(8):3370\u20133388","journal-title":"J Chem Inf Model"},{"key":"870_CR70","unstructured":"Bergstra J, Yamins D, Cox D. Making a science of model search: hyperparameter optimization in hundreds of dimensions for vision architectures. In: International Conference on Machine Learning, PMLR. 2013. p. 115\u201323."},{"issue":"1\u20132","key":"870_CR71","doi-asserted-by":"publisher","first-page":"117","DOI":"10.1007\/s10472-013-9378-2","volume":"74","author":"M Eklund","year":"2015","unstructured":"Eklund M, Norinder U, Boyer S, Carlsson L (2015) The application of conformal prediction to the drug discovery process. Ann Math Artif Intell 74(1\u20132):117\u2013132","journal-title":"Ann Math Artif Intell"},{"issue":"5","key":"870_CR72","doi-asserted-by":"publisher","first-page":"1132","DOI":"10.1021\/acs.jcim.8b00054","volume":"58","author":"F Svensson","year":"2018","unstructured":"Svensson F, Aniceto N, Norinder U, Cortes-Ciriano I, Spjuth O, Carlsson L, Bender A (2018) Conformal regression for quantitative structure-activity relationship modeling-quantifying prediction uncertainty. J Chem Inf Model 58(5):1132\u20131140","journal-title":"J Chem Inf Model"},{"key":"870_CR73","unstructured":"Krizhevsky A, Sutskever I, Hinton GE Imagenet classification with deep convolutional neural networks. Adv Neural Inf Process Syst. 2012;25."},{"issue":"4","key":"870_CR74","first-page":"466","volume":"24","author":"A Kensert","year":"2019","unstructured":"Kensert A, Harrison PJ, Spjuth O (2019) Transfer learning with deep convolutional neural networks for classifying cellular morphological changes. SLAS Discov Adv Life Sci R &D 24(4):466\u2013475","journal-title":"SLAS Discov Adv Life Sci R &D"},{"issue":"4","key":"870_CR75","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1093\/bib\/bbaa321","volume":"22","author":"Z Wu","year":"2021","unstructured":"Wu Z, Zhu M, Kang Y, Leung EL-H, Lei T, Shen C, Jiang D, Wang Z, Cao D, Hou T (2021) Do we need different machine learning algorithms for QSAR modeling? A comprehensive assessment of 16 machine learning algorithms on 14 qsar data sets. Briefin Bioinf 22(4):321","journal-title":"Briefin Bioinf"},{"issue":"12","key":"870_CR76","doi-asserted-by":"publisher","first-page":"4462","DOI":"10.1021\/acs.molpharmaceut.7b00578","volume":"14","author":"A Korotcov","year":"2017","unstructured":"Korotcov A, Tkachenko V, Russo DP, Ekins S (2017) Comparison of deep learning with multiple machine learning methods and metrics using diverse drug discovery data sets. Mol Pharma 14(12):4462\u20134475","journal-title":"Mol Pharma"},{"key":"870_CR77","unstructured":"Guo C, Pleiss G, Sun Y, Weinberger KQ. On calibration of modern neural networks. In: International Conference on Machine Learning, PMLR. 2017. p. 1321\u201330."},{"issue":"7","key":"870_CR78","doi-asserted-by":"publisher","first-page":"755","DOI":"10.1080\/17460441.2020.1745183","volume":"15","author":"II Baskin","year":"2020","unstructured":"Baskin II (2020) The power of deep learning to ligand-based novel drug discovery. Expert Opin Drug Discov 15(7):755\u2013764","journal-title":"Expert Opin Drug Discov"},{"issue":"9","key":"870_CR79","doi-asserted-by":"publisher","first-page":"1071","DOI":"10.1080\/17460441.2021.1932812","volume":"16","author":"O Spjuth","year":"2021","unstructured":"Spjuth O, Frid J, Hellander A (2021) The machine learning life cycle and the cloud: implications for drug discovery. Expert Opin Drug Discov 16(9):1071\u20131079","journal-title":"Expert Opin Drug Discov"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-024-00870-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-024-00870-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-024-00870-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,7,7]],"date-time":"2024-07-07T19:03:43Z","timestamp":1720379023000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-024-00870-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,6,28]]},"references-count":79,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["870"],"URL":"https:\/\/doi.org\/10.1186\/s13321-024-00870-9","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,6,28]]},"assertion":[{"value":"29 November 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"11 June 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 June 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"OS is the owner of Aros Bio AB, providing commercial licenses of CPSign. Both JA and SAM have previously been employed at Genetta Soft which at the time owned the rights to CPSign.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"75"}}