{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,26]],"date-time":"2026-06-26T04:50:05Z","timestamp":1782449405840,"version":"3.54.5"},"reference-count":32,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,8,9]],"date-time":"2021-08-09T00:00:00Z","timestamp":1628467200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,8,9]],"date-time":"2021-08-09T00:00:00Z","timestamp":1628467200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100010767","name":"Innovative Medicines Initiative","doi-asserted-by":"publisher","award":["821528"],"award-info":[{"award-number":["821528"]}],"id":[{"id":"10.13039\/501100010767","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>QSAR methods are widely applied in the drug discovery process, both in the hit\u2010to\u2010lead and lead optimization phase, as well as in the drug-approval process. Most QSAR algorithms are limited to using molecules as input and disregard pharmacophores or pharmacophoric features entirely. However, due to the high level of abstraction, pharmacophore representations provide some advantageous properties for building quantitative SAR models. The abstract depiction of molecular interactions avoids a bias towards overrepresented functional groups in small datasets. Furthermore, a well\u2010crafted quantitative pharmacophore model can generalise to underrepresented or even missing molecular features in the training set by using pharmacophoric interaction patterns only. This paper presents a novel method to construct quantitative pharmacophore models and demonstrates its applicability and robustness on more than 250 diverse datasets. fivefold cross-validation on these datasets with default settings yielded an average RMSE of 0.62, with an average standard deviation of 0.18. Additional cross-validation studies on datasets with 15\u201320 training samples showed that robust quantitative pharmacophore models could be obtained. These low requirements for dataset sizes render quantitative pharmacophores a viable go-tomethod for medicinal chemists, especially in the lead-optimisation stage of drug discovery projects.<\/jats:p>","DOI":"10.1186\/s13321-021-00537-9","type":"journal-article","created":{"date-parts":[[2021,8,9]],"date-time":"2021-08-09T04:03:24Z","timestamp":1628481804000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":24,"title":["QPHAR: quantitative pharmacophore activity relationship: method and validation"],"prefix":"10.1186","volume":"13","author":[{"given":"Stefan M.","family":"Kohlbacher","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Thierry","family":"Langer","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9815-6577","authenticated-orcid":false,"given":"Thomas","family":"Seidel","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2021,8,9]]},"reference":[{"key":"537_CR1","doi-asserted-by":"publisher","first-page":"178","DOI":"10.1038\/194178b0","volume":"194","author":"C Hansch","year":"1962","unstructured":"Hansch C, Maloney PP, Fujita T, Muir RM (1962) Correlation of biological activity of phenoxyacetic acids with hammett substituent constants and partition coefficients. Nature 194:178\u2013180. https:\/\/doi.org\/10.1038\/194178b0","journal-title":"Nature"},{"key":"537_CR2","doi-asserted-by":"publisher","first-page":"1538","DOI":"10.1016\/j.drudis.2018.05.010","volume":"23","author":"Y-C Lo","year":"2018","unstructured":"Lo Y-C, Rensi SE, Torng W, Altman RB (2018) Machine learning in chemoinformatics and drug discovery. Drug Discov Today 23:1538\u20131546. https:\/\/doi.org\/10.1016\/j.drudis.2018.05.010","journal-title":"Drug Discov Today"},{"key":"537_CR3","doi-asserted-by":"publisher","first-page":"74","DOI":"10.1186\/s13321-020-00475-y","volume":"12","author":"IV Tetko","year":"2020","unstructured":"Tetko IV, Engkvist O (2020) From big data to artificial intelligence: chemoinformatics meets new challenges. J Cheminformatics 12:74. https:\/\/doi.org\/10.1186\/s13321-020-00475-y","journal-title":"J Cheminformatics"},{"key":"537_CR4","doi-asserted-by":"crossref","unstructured":"Hong H, Chen M, Ng HW, Tong W QSAR Models at the US FDA\/NCTR. In: In Silico Methods for Predicting Drug Toxicity; Benfenati, E, Ed; Springer Science+Business Media New York. 2016","DOI":"10.1007\/978-1-4939-3609-0_18"},{"issue":"5","key":"537_CR5","doi-asserted-by":"publisher","first-page":"742","DOI":"10.1021\/ci100050t","volume":"50","author":"D Rogers","year":"2010","unstructured":"Rogers D, Hahn M. Extended-Connectivity Fingerprints. J Chem Inf Model. 2010; 50(5):742-754.","journal-title":"J Chem Inf Model"},{"key":"537_CR6","doi-asserted-by":"publisher","first-page":"144","DOI":"10.1021\/ci025516b","volume":"43","author":"A Golbraikh","year":"2003","unstructured":"Golbraikh A, Tropsha A (2003) QSAR modeling using chirality descriptors derived from molecular topology. J Chem Inf Comput Sci 43:144\u2013154. https:\/\/doi.org\/10.1021\/ci025516b","journal-title":"J Chem Inf Comput Sci"},{"key":"537_CR7","doi-asserted-by":"publisher","first-page":"5959","DOI":"10.1021\/ja00226a005","volume":"110","author":"RD Cramer","year":"1988","unstructured":"Cramer RD, Patterson DE, Bunce JD (1988) Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins. J Am Chem Soc 110:5959\u20135967. https:\/\/doi.org\/10.1021\/ja00226a005","journal-title":"J Am Chem Soc"},{"key":"537_CR8","doi-asserted-by":"publisher","first-page":"647","DOI":"10.1007\/s10822-006-9087-6","volume":"20","author":"SL Dixon","year":"2006","unstructured":"Dixon SL, Smondyrev AM, Knoll EH et al (2006) PHASE: a new engine for pharmacophore perception, 3D QSAR model development, and 3D database screening: 1. Methodology and preliminary results. J Comput Aided Mol Des 20:647\u2013671. https:\/\/doi.org\/10.1007\/s10822-006-9087-6","journal-title":"J Comput Aided Mol Des"},{"key":"537_CR9","unstructured":"Schr\u00f6dinger [Internet]. [cited 2021 Mar 25]. Available from: https:\/\/www.schrodinger.com\/"},{"key":"537_CR10","doi-asserted-by":"publisher","first-page":"1213","DOI":"10.1016\/j.drudis.2016.03.015","volume":"21","author":"C Kramer","year":"2016","unstructured":"Kramer C, Dahl G, Tyrchan C, Ulander J (2016) A comprehensive company database analysis of biological assay variability. Drug Discov Today 21:1213\u20131221. https:\/\/doi.org\/10.1016\/j.drudis.2016.03.015","journal-title":"Drug Discov Today"},{"key":"537_CR11","doi-asserted-by":"publisher","first-page":"1238","DOI":"10.1021\/acs.jmedchem.6b01437","volume":"60","author":"Y Hu","year":"2017","unstructured":"Hu Y, Stumpfe D, Bajorath J (2017) Recent Advances in Scaffold Hopping. J Med Chem 60:1238\u20131246. https:\/\/doi.org\/10.1021\/acs.jmedchem.6b01437","journal-title":"J Med Chem"},{"key":"537_CR12","unstructured":"Li H, Sutter J, Hoffman R. Hypogen: An Automated System for Generating 3D Predictive Pharmacophore Models. In: Pharmacophore Perception, Development and Use in Drug Design; Guner, O, Ed; International University Line: La Jolla, CA,. 2000"},{"key":"537_CR13","unstructured":"BIOVIA-Scientific Enterprise Software for Chemical Research, Material Science R&D [Internet]. [cited 2021 Mar 25]. Available from: https:\/\/www.3dsbiovia.com\/."},{"key":"537_CR14","unstructured":"3D Design & Engineering Software - Dassault Syst\u00e8mes\u00ae[Internet]. [cited 2021 Mar 25]. Available from: https:\/\/www.3ds.com\/"},{"key":"537_CR15","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1021\/jm010360c","volume":"45","author":"AK Debnath","year":"2002","unstructured":"Debnath AK (2002) Pharmacophore mapping of a Series of 2,4-diamino-5-deazapteridine inhibitors of Mycobacterium avium complex dihydrofolate reductase. J Med Chem 45:41\u201353. https:\/\/doi.org\/10.1021\/jm010360c","journal-title":"J Med Chem"},{"key":"537_CR16","doi-asserted-by":"publisher","unstructured":"Poli G, Seidel T, Langer T (2018) Conformational sampling of small molecules with iCon: performance assessment in comparison with OMEGA. Front Chem 6. https:\/\/doi.org\/10.3389\/fchem.2018.00229","DOI":"10.3389\/fchem.2018.00229"},{"key":"537_CR17","doi-asserted-by":"publisher","first-page":"160","DOI":"10.1021\/ci049885e","volume":"45","author":"G Wolber","year":"2005","unstructured":"Wolber G, Langer T (2005) LigandScout: 3-D pharmacophores derived from protein-bound ligands and their use as virtual screening filters. J Chem Inf Model 45:160\u2013169. https:\/\/doi.org\/10.1021\/ci049885e","journal-title":"J Chem Inf Model"},{"key":"537_CR18","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1186\/s13321-020-00444-5","volume":"12","author":"I Cort\u00e9s-Ciriano","year":"2020","unstructured":"Cort\u00e9s-Ciriano I, \u0160kuta C, Bender A, Svozil D (2020) QSAR-derived affinity fingerprints (part 2): modeling performance for potency prediction. J Cheminformatics 12:41. https:\/\/doi.org\/10.1186\/s13321-020-00444-5","journal-title":"J Cheminformatics"},{"issue":"8","key":"537_CR19","doi-asserted-by":"publisher","first-page":"757","DOI":"10.1080\/17460441.2017.1339032","volume":"12","author":"M Nowotka","year":"2017","unstructured":"Nowotka M, Gaulton A, Mnedez D, Bento A. P, Hersey A, Leach A. (2017) Using ChEMBL web services for building applications and data processing workflows relevant t. o drug discovery. Expert Opin Drug Discov. 12(8):757-767. https:\/\/doi.org\/10.1080\/17460441.2017.1339032","journal-title":"Expert Opin Drug Discov"},{"key":"537_CR20","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1016\/j.chemolab.2018.10.008","volume":"183","author":"L Xu","year":"2018","unstructured":"Xu L, Hu O, Guo Y et al (2018) Representative splitting cross validation. Chemom Intell Lab Syst 183:29\u201335. https:\/\/doi.org\/10.1016\/j.chemolab.2018.10.008","journal-title":"Chemom Intell Lab Syst"},{"key":"537_CR21","doi-asserted-by":"publisher","first-page":"163","DOI":"10.1021\/ci00063a006","volume":"29","author":"VN Viswanadhan","year":"1989","unstructured":"Viswanadhan VN, Ghose AK, Revankar GR, Robins RK (1989) Atomic physicochemical parameters for three dimensional structure directed quantitative structure-activity relationships. 4. Additional parameters for hydrophobic and dispersive interactions and their application for an automated superposition of certain naturally occurring nucleoside antibiotics. J Chem Inf Comput Sci 29:163\u2013172. https:\/\/doi.org\/10.1021\/ci00063a006","journal-title":"J Chem Inf Comput Sci"},{"key":"537_CR22","doi-asserted-by":"publisher","first-page":"21","DOI":"10.2174\/092986709787002817","volume":"16","author":"S Prasanna","year":"2009","unstructured":"Prasanna S, Doerksen R (2009) Topological polar surface area: a useful descriptor in 2D-QSAR. Curr Med Chem 16:21\u201341. https:\/\/doi.org\/10.2174\/092986709787002817","journal-title":"Curr Med Chem"},{"key":"537_CR23","doi-asserted-by":"publisher","first-page":"445","DOI":"10.1177\/026119290503300508","volume":"33","author":"J Jaworska","year":"2005","unstructured":"Jaworska J, Nikolova-Jeliazkova N, Aldenberg T (2005) QSAR applicability domain estimation by projection of the training set in descriptor space: a review. Altern Lab Anim 33:445\u2013459. https:\/\/doi.org\/10.1177\/026119290503300508","journal-title":"Altern Lab Anim"},{"key":"537_CR24","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1023\/A:1010933404324","volume":"45","author":"L Breiman","year":"2001","unstructured":"Breiman L (2001) Random forests. Mach Learn 45:5\u201332. https:\/\/doi.org\/10.1023\/A:1010933404324","journal-title":"Mach Learn"},{"key":"537_CR25","unstructured":"Seidel T (2021) Chemical data processing toolkit, GitHub repository [Internet]. [cited 2021 Mar 19] Available from: https:\/\/github.com\/aglanger\/CDPKit. \u00a0"},{"key":"537_CR26","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A et al (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825\u20132830","journal-title":"J Mach Learn Res"},{"key":"537_CR27","doi-asserted-by":"publisher","first-page":"3494","DOI":"10.2174\/138161207782794257","volume":"13","author":"A Tropsha","year":"2007","unstructured":"Tropsha A, Golbraikh A (2007) Predictive QSAR modeling workflow, model applicability domains, and virtual screening. Curr Pharm Des 13:3494\u20133504. https:\/\/doi.org\/10.2174\/138161207782794257","journal-title":"Curr Pharm Des"},{"key":"537_CR28","doi-asserted-by":"publisher","first-page":"476","DOI":"10.1002\/minf.201000061","volume":"29","author":"A Tropsha","year":"2010","unstructured":"Tropsha A (2010) Best practices for QSAR model development, validation, and exploitation. Mol Inform 29:476\u2013488. https:\/\/doi.org\/10.1002\/minf.201000061","journal-title":"Mol Inform"},{"key":"537_CR29","unstructured":"NeuroDeRisk (2019) [Internet]. [cited 2021 Mar 17]. Available from: https:\/\/neuroderisk.eu\/.\u00a0"},{"key":"537_CR30","unstructured":"IMI Innovative Medicines Initiative. [Internet]. [cited 2021 Apr 12]. http:\/\/www.imi.europa.eu\/.\u00a0"},{"key":"537_CR31","unstructured":"Horizon 2020. [Internet]. Horizon 202 - European Commision. [cited 2021 Mar 17]. Available from: https:\/\/ec.europa.eu\/programmes\/horizon2020\/en"},{"key":"537_CR32","unstructured":"EFPIA Homepage [Internet]. [cited 2021 Mar 17]. Available from: https:\/\/www.efpia.eu\/"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-021-00537-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-021-00537-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-021-00537-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,9]],"date-time":"2021-08-09T04:23:11Z","timestamp":1628482991000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-021-00537-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,9]]},"references-count":32,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["537"],"URL":"https:\/\/doi.org\/10.1186\/s13321-021-00537-9","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-426014\/v1","asserted-by":"object"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,8,9]]},"assertion":[{"value":"14 April 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 July 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 August 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"57"}}