{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,17]],"date-time":"2026-04-17T22:15:02Z","timestamp":1776464102670,"version":"3.51.2"},"reference-count":23,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2013,12,1]],"date-time":"2013-12-01T00:00:00Z","timestamp":1385856000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"published-print":{"date-parts":[[2013,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Virtual screening in the form of similarity rankings is often applied in the early drug discovery process to rank and prioritize compounds from a database. This similarity ranking can be achieved with structural similarity measures. However, their general nature can lead to insufficient performance in some application cases. In this paper, we provide a link between ranking-based virtual screening and fragment-based data mining methods. The inclusion of binding-relevant background knowledge into a structural similarity measure improves the quality of the similarity rankings. 
This background knowledge in the form of binding-relevant substructures can be derived either by hand selection or by automated fragment-based data mining methods.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>In virtual screening experiments, we show that both variants of our approach clearly improve enrichment factors: the extension of the structural similarity measure with background knowledge in the form of a hand-selected relevant substructure, and the extension of the similarity measure with background knowledge derived with data mining methods.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>Our study shows that adding binding-relevant background knowledge can lead to significantly improved similarity rankings in virtual screening and that even basic data mining approaches can lead to competitive results, making hand selection of the background knowledge less crucial. 
This is especially important in drug discovery and development projects where no receptor structure is available or more frequently no verified binding mode is known and mostly ligand based approaches can be applied to generate hit compounds.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1758-2946-5-50","type":"journal-article","created":{"date-parts":[[2013,12,16]],"date-time":"2013-12-16T14:01:50Z","timestamp":1387202510000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["Improving structural similarity based virtual screening using background knowledge"],"prefix":"10.1186","volume":"5","author":[{"given":"Tobias","family":"Girschick","sequence":"first","affiliation":[]},{"given":"Lucia","family":"Puchbauer","sequence":"additional","affiliation":[]},{"given":"Stefan","family":"Kramer","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2013,12,16]]},"reference":[{"key":"649_CR1","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1016\/S0165-6147(00)01584-4","volume":"22","author":"G Terstappen","year":"2001","unstructured":"Terstappen G, Reggiani A: In silico research in drug discovery. Trends Pharmacol Sci. 2001, 22: 23-26.","journal-title":"Trends Pharmacol Sci"},{"key":"649_CR2","doi-asserted-by":"publisher","first-page":"192","DOI":"10.1038\/nrd1032","volume":"2","author":"H van de Waterbeemed","year":"2003","unstructured":"van de Waterbeemed H, Gifford E: ADMET in silico modelling: towards prediction paradise?. Nat Rev Drug Discov. 2003, 2: 192-204. 10.1038\/nrd1032.","journal-title":"Nat Rev Drug Discov"},{"key":"649_CR3","doi-asserted-by":"crossref","first-page":"564","DOI":"10.1145\/967900.968018","volume-title":"Proceedings of the ACM SIG Symposium on Applied Computing (SAC\u201904)","author":"U R\u00fcckert","year":"2004","unstructured":"R\u00fcckert U, Kramer S: Frequent free tree discovery in graph data. 
Proceedings of the ACM SIG Symposium on Applied Computing (SAC\u201904). 2004, New York, NY, USA: ACM Press, 564-570."},{"issue":"6","key":"649_CR4","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1093\/comjnl\/45.6.631","volume":"45","author":"J Raymond","year":"2002","unstructured":"Raymond J, Gardiner E, Willett P: RASCAL: calculation of graph similarity using maximum common edge subgraphs. Comput J. 2002, 45 (6): 631-644. 10.1093\/comjnl\/45.6.631.","journal-title":"Comput J"},{"issue":"5","key":"649_CR5","doi-asserted-by":"publisher","first-page":"742","DOI":"10.1021\/ci100050t","volume":"50","author":"D Rogers","year":"2010","unstructured":"Rogers D, Hahn M: Extended-connectivity fingerprints. J Chem Inf Model. 2010, 50 (5): 742-754. 10.1021\/ci100050t.","journal-title":"J Chem Inf Model"},{"key":"649_CR6","doi-asserted-by":"publisher","first-page":"701","DOI":"10.1016\/S0167-8655(01)00022-8","volume":"22","author":"W Wallis","year":"2001","unstructured":"Wallis W, Shoubridge P, Kraetz M, Ray D: Graph distances using graph union. Pattern Recognit Lett. 2001, 22: 701-704. 10.1016\/S0167-8655(01)00022-8. [http:\/\/dx.doi.org\/10.1016\/S0167-8655(01)00022-8],","journal-title":"Pattern Recognit Lett"},{"issue":"2","key":"649_CR7","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1021\/ci00062a008","volume":"29","author":"D Weininger","year":"1989","unstructured":"Weininger D, Weininger A, Weininger J: SMILES. 2. algorithm for generation of unique SMILES notation. J Chem Inf Comput Sci. 1989, 29 (2): 97-101. 10.1021\/ci00062a008.","journal-title":"J Chem Inf Comput Sci"},{"key":"649_CR8","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1186\/1758-2946-3-28","volume":"3","author":"J Stalring","year":"2011","unstructured":"Stalring J, Carlsson L, Almeida P, Boyer S: AZOrange-High performance open source machine learning for QSAR modeling in a graphical programming environment. J Cheminformatics. 
2011, 3: 28-10.1186\/1758-2946-3-28.","journal-title":"J Cheminformatics"},{"issue":"suppl 1","key":"649_CR9","doi-asserted-by":"publisher","first-page":"D1035","DOI":"10.1093\/nar\/gkq1126","volume":"39","author":"C Knox","year":"2011","unstructured":"Knox C, Law V, Jewison T, Liu P, Ly S, Frolkis A, Pon A, Banco K, Mak C, Neveu V, Djoumbou Y, Eisner R, Guo AC, Wishart DS: DrugBank 3.0: a comprehensive resource for \u2018Omics\u2019 research on drugs. Nucl Acids Res. 2011, 39 (suppl 1): D1035-D1041.","journal-title":"Nucl Acids Res"},{"issue":"23","key":"649_CR10","doi-asserted-by":"publisher","first-page":"6789","DOI":"10.1021\/jm0608356","volume":"49","author":"N Huang","year":"2006","unstructured":"Huang N, Shoichet B, Irwin J: Benchmarking sets for molecular docking. J Med Chem. 2006, 49 (23): 6789-6801. 10.1021\/jm0608356.","journal-title":"J Med Chem"},{"issue":"8","key":"649_CR11","doi-asserted-by":"publisher","first-page":"1831","DOI":"10.1021\/ci200199u","volume":"51","author":"K Heikamp","year":"2011","unstructured":"Heikamp K, Bajorath J: Large-scale similarity search profiling of ChEMBL compound data sets. J Chem Inf Model. 2011, 51 (8): 1831-1839. 10.1021\/ci200199u.","journal-title":"J Chem Inf Model"},{"issue":"7","key":"649_CR12","doi-asserted-by":"publisher","first-page":"1757","DOI":"10.1021\/ci3001277","volume":"52","author":"JJ Irwin","year":"2012","unstructured":"Irwin JJ, Sterling T, Mysinger MM, Bolstad ES, Coleman RG: ZINC: a free tool to discover chemistry for biology. J Chem Inf Model. 2012, 52 (7): 1757-1768. 
10.1021\/ci3001277.","journal-title":"J Chem Inf Model"},{"issue":"9602","key":"649_CR13","doi-asserted-by":"publisher","first-page":"1829","DOI":"10.1016\/S0140-6736(07)61778-4","volume":"370","author":"S Lewington","year":"2007","unstructured":"Lewington S, Whitlock G, Clarke R, Sherliker P, Emberson J, Halsey J, Qizilbash N, Peto R, Collins R: Blood cholesterol and vascular mortality by age, sex, and blood pressure: a meta-analysis of individual data from 61 prospective studies with 55000 vascular deaths. The Lancet. 2007, 370 (9602): 1829-1839.","journal-title":"The Lancet"},{"issue":"2, Supplement 1","key":"649_CR14","doi-asserted-by":"publisher","first-page":"2S","DOI":"10.1016\/S0002-9343(98)00038-2","volume":"104","author":"D Eisenberg","year":"1998","unstructured":"Eisenberg D: Cholesterol lowering in the management of coronary artery disease: the clinical implications of recent trials. Am J Med. 1998, 104 (2, Supplement 1): 2S-5S. 10.1016\/S0002-9343(98)00038-2.","journal-title":"Am J Med"},{"issue":"2","key":"649_CR15","doi-asserted-by":"publisher","first-page":"323","DOI":"10.1016\/0014-5793(76)80996-9","volume":"72","author":"A Endo","year":"1976","unstructured":"Endo A, Kuroda M, Tanzawa K: Competitive inhibition of 3-hydroxy-3-methylglutaryl coenzyme A reductase by ML-236A and ML-236B fungal metabolites, having hypocholesterolemic activity. FEBS Lett. 1976, 72 (2): 323-326. 10.1016\/0014-5793(76)80996-9.","journal-title":"FEBS Lett"},{"key":"649_CR16","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"H Berman","year":"2000","unstructured":"Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H, Shindyalov I, Bourne P: The protein data bank. Nucl Acids Res. 2000, 28: 235-242. 
10.1093\/nar\/28.1.235.","journal-title":"Nucl Acids Res"},{"issue":"5519","key":"649_CR17","doi-asserted-by":"publisher","first-page":"1160","DOI":"10.1126\/science.1059344","volume":"292","author":"E Istvan","year":"2001","unstructured":"Istvan E, Deisenhofer J: Structural mechanism for statin inhibition of HMG-CoA reductase. Science. 2001, 292 (5519): 1160-1164. 10.1126\/science.1059344. [http:\/\/www.sciencemag.org\/content\/292\/5519\/1160.abstract],","journal-title":"Science"},{"issue":"2","key":"649_CR18","doi-asserted-by":"publisher","first-page":"398","DOI":"10.1124\/mol.106.024596","volume":"71","author":"M Scarsi","year":"2007","unstructured":"Scarsi M, Podvinec M, Roth A, Hug H, Kersten S, Albrecht H, Schwede T, Meyer UA, Ruecker C: Sulfonylureas and Glinides exhibit peroxisome proliferator-activated receptor gamma activity: A combined virtual screening and biological assay approach. Mol Pharmacol. 2007, 71 (2): 398-406.","journal-title":"Mol Pharmacol"},{"issue":"15","key":"649_CR19","doi-asserted-by":"publisher","first-page":"2887","DOI":"10.1021\/jm9602928","volume":"39","author":"GW Bemis","year":"1996","unstructured":"Bemis GW, Murcko MA: The properties of known drugs. 1. Molecular frameworks. J Med Chem. 1996, 39 (15): 2887-2893. 10.1021\/jm9602928.","journal-title":"J Med Chem"},{"issue":"4","key":"649_CR20","doi-asserted-by":"publisher","first-page":"1088","DOI":"10.1021\/jm0491804","volume":"48","author":"A Evers","year":"2005","unstructured":"Evers A, Klabunde T: Structure-based drug discovery using GPCR homology modeling: successful virtual screening for antagonists of the alpha1A adrenergic receptor. J Med Chem. 2005, 48 (4): 1088-1097. 
10.1021\/jm0491804.","journal-title":"J Med Chem"},{"issue":"5","key":"649_CR21","doi-asserted-by":"publisher","first-page":"e36297","DOI":"10.1371\/journal.pone.0036297","volume":"7","author":"MV Liberato","year":"2012","unstructured":"Liberato MV, Nascimento AS, Ayers SD, Lin JZ, Cvoro A, Silveira RL, Mart\u00ednez L, Souza PCT, Saidemberg D, Deng T, Amato AA, Togashi M, Hsueh WA, Phillips K, Palma MS, Neves FAR, Skaf MS, Webb P, Polikarpov I: Medium chain fatty acids are selective peroxisome proliferator activated receptor (PPAR) gamma activators and Pan-PPAR partial agonists. PLoS ONE. 2012, 7 (5): e36297-10.1371\/journal.pone.0036297.","journal-title":"PLoS ONE"},{"key":"649_CR22","first-page":"1","volume":"7","author":"J Dem\u0161ar","year":"2006","unstructured":"Dem\u0161ar J: Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res. 2006, 7: 1-30.","journal-title":"J Mach Learn Res"},{"key":"649_CR23","doi-asserted-by":"publisher","first-page":"3256","DOI":"10.1039\/b409865j","volume":"2","author":"J Hert","year":"2004","unstructured":"Hert J, Willett P, Wilton DJ, Acklin P, Azzaoui K, Jacoby E, Schuffenhauer A: Comparison of topological descriptors for similarity-based virtual screening using multiple bioactive reference structures. Org Biomol Chem. 2004, 2: 3256-3266. 
10.1039\/b409865j.","journal-title":"Org Biomol Chem"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1758-2946-5-50.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1758-2946-5-50\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1758-2946-5-50.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,2]],"date-time":"2021-09-02T19:54:34Z","timestamp":1630612474000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/1758-2946-5-50"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,12]]},"references-count":23,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2013,12]]}},"alternative-id":["649"],"URL":"https:\/\/doi.org\/10.1186\/1758-2946-5-50","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,12]]},"assertion":[{"value":"8 June 2013","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 November 2013","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 December 2013","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"50"}}