{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T22:24:14Z","timestamp":1776723854892,"version":"3.51.2"},"reference-count":19,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,3,23]],"date-time":"2021-03-23T00:00:00Z","timestamp":1616457600000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2021,3,23]],"date-time":"2021-03-23T00:00:00Z","timestamp":1616457600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"publisher","award":["BAS\/1\/1624-01"],"award-info":[{"award-number":["BAS\/1\/1624-01"]}],"id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"publisher","award":["URF\/1\/3412-01"],"award-info":[{"award-number":["URF\/1\/3412-01"]}],"id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"publisher","award":["URF\/1\/3450-01"],"award-info":[{"award-number":["URF\/1\/3450-01"]}],"id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"publisher","award":["FCC\/1\/1976-18"],"award-info":[{"award-number":["FCC\/1\/1976-18"]}],"id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"publisher","award":["FCC\/1\/1976-23"],"award-info":[{"award-number":["FCC\/1\/1976-23"]}],"id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"publisher","award":["FCC\/1\/1976-25"],"award-info":[{"award-number":["FCC\/1\/1976-25"]}],"id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"publisher","award":["FCC\/1\/1976-26"],"award-info":[{"award-number":["FCC\/1\/1976-26"]}],"id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004052","name":"King Abdullah University of Science and Technology","doi-asserted-by":"publisher","award":["FCS\/1\/4102-02"],"award-info":[{"award-number":["FCS\/1\/4102-02"]}],"id":[{"id":"10.13039\/501100004052","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"published-print":{"date-parts":[[2021,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Two-dimensional (2D) chemical fingerprints are widely used as binary features for the quantification of structural similarity of chemical compounds, which is an important step in similarity-based virtual screening (VS). Here, using an eigenvalue-based entropy approach, we identified 2D fingerprints with little to no contribution to shaping the eigenvalue distribution of the feature matrix as related ones and examined the degree to which these related 2D fingerprints influenced molecular similarity scores calculated with the Tanimoto coefficient. Our analysis identified many related fingerprints in publicly available fingerprint schemes and showed that their presence in the feature set could have substantial effects on the similarity scores and bias the outcome of molecular similarity analysis. Our results have implication in the optimal selection of 2D fingerprints for compound similarity analysis and the identification of potential hits for compounds with target biological activity in VS.<\/jats:p>","DOI":"10.1186\/s13321-021-00506-2","type":"journal-article","created":{"date-parts":[[2021,3,23]],"date-time":"2021-03-23T14:03:58Z","timestamp":1616508238000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":58,"title":["Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach"],"prefix":"10.1186","volume":"13","author":[{"given":"Hiroyuki","family":"Kuwahara","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7108-3574","authenticated-orcid":false,"given":"Xin","family":"Gao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,3,23]]},"reference":[{"key":"506_CR1","first-page":"453","volume":"418","author":"A Smith","year":"2002","unstructured":"Smith A (2002) Screening for drug discovery: the leading question. Nature 418:453\u2013459","journal-title":"Nature"},{"key":"506_CR2","doi-asserted-by":"publisher","first-page":"1047","DOI":"10.1016\/S1359-6446(02)02483-2","volume":"7","author":"PD Lyne","year":"2002","unstructured":"Lyne PD (2002) Structure-based virtual screening: an overview. Drug Discovery Today 7:1047\u20131055","journal-title":"Drug Discovery Today"},{"key":"506_CR3","doi-asserted-by":"publisher","first-page":"1046","DOI":"10.1016\/j.drudis.2006.10.005","volume":"11","author":"P Willett","year":"2006","unstructured":"Willett P (2006) Similarity-based virtual screening using 2D fingerprints. Drug Discovery Today 11:1046\u20131053","journal-title":"Drug Discovery Today"},{"key":"506_CR4","doi-asserted-by":"publisher","first-page":"867","DOI":"10.1021\/ci200528d","volume":"52","author":"T Scior","year":"2012","unstructured":"Scior T, Bender A, Tresadern G, Medina-Franco JL, Mart\u00ednez-Mayorga K et al (2012) Recognizing pitfalls in virtual screening: a critical review. J Chemical Information Modeling 52:867\u2013881","journal-title":"J Chemical Information Modeling"},{"key":"506_CR5","doi-asserted-by":"publisher","first-page":"58","DOI":"10.1016\/j.ymeth.2014.08.005","volume":"71","author":"A Cereto-Massagu\u00e9","year":"2015","unstructured":"Cereto-Massagu\u00e9 A, Ojeda MJ, Valls C, Mulero M, Garcia-Vallv\u00e9 S et al (2015) Molecular fingerprint similarity search in virtual screening. Methods 71:58\u201363","journal-title":"Methods"},{"key":"506_CR6","doi-asserted-by":"publisher","first-page":"1273","DOI":"10.1021\/ci010132r","volume":"42","author":"JL Durant","year":"2002","unstructured":"Durant JL, Leland BA, Henry DR, Nourse JG (2002) Reoptimization of MDL keys for use in drug discovery. J Chemical Information Computer Sci 42:1273\u20131280","journal-title":"J Chemical Information Computer Sci"},{"key":"506_CR7","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1016\/j.yrtph.2018.11.002","volume":"101","author":"CL Mellor","year":"2019","unstructured":"Mellor CL, Marchese Robinson RL, Benigni R, Ebbrell D, Enoch SJ et al (2019) Molecular fingerprint-derived similarity measures for toxicological read-across: Recommendations for optimal use. Regulatory Toxicol Pharmacol 101:121\u2013134","journal-title":"Regulatory Toxicol Pharmacol"},{"key":"506_CR8","doi-asserted-by":"publisher","first-page":"170","DOI":"10.1021\/ci034207y","volume":"44","author":"A Bender","year":"2004","unstructured":"Bender A, Mussa HY, Glen RC, Reiling S (2004) Molecular similarity searching using atom environments, information-based feature selection, and a na\u00efve bayesian classifier. J Chemical Information Computer Sci 44:170\u2013178","journal-title":"J Chemical Information Computer Sci"},{"key":"506_CR9","doi-asserted-by":"publisher","first-page":"205","DOI":"10.1021\/ci900419k","volume":"50","author":"H Geppert","year":"2010","unstructured":"Geppert H, Vogt M, Bajorath J (2010) Current trends in ligand-based virtual screening: molecular representations, data mining methods, new application areas, and performance evaluation. J Chemical Information Modeling 50:205\u2013216","journal-title":"J Chemical Information Modeling"},{"key":"506_CR10","doi-asserted-by":"publisher","first-page":"2254","DOI":"10.1021\/ci200275m","volume":"51","author":"K Heikamp","year":"2011","unstructured":"Heikamp K, Bajorath J (2011) How do 2D fingerprints detect structurally diverse active compounds? Revealing compound subset-specific fingerprint features through systematic selection. J Chemical Information Modeling 51:2254\u20132265","journal-title":"J Chemical Information Modeling"},{"key":"506_CR11","doi-asserted-by":"publisher","first-page":"1347","DOI":"10.1021\/ci900087y","volume":"49","author":"B Nisius","year":"2009","unstructured":"Nisius B, Vogt M, Bajorath J (2009) Development of a fingerprint reduction approach for Bayesian similarity searching based on Kullback-Leibler divergence analysis. J Chemical Information Modeling 49:1347\u20131358","journal-title":"J Chemical Information Modeling"},{"key":"506_CR12","doi-asserted-by":"publisher","first-page":"10101","DOI":"10.1073\/pnas.97.18.10101","volume":"97","author":"O Alter","year":"2000","unstructured":"Alter O, Brown PO, Botstein D (2000) Singular value decomposition for genome-wide expression data processing and modeling. Proceedings of the National Academy of Sciences of the United States of America 97:10101\u201310106","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"506_CR13","doi-asserted-by":"publisher","first-page":"e507","DOI":"10.1093\/bioinformatics\/btl214","volume":"22","author":"R Varshavsky","year":"2006","unstructured":"Varshavsky R, Gottlieb A, Linial M, Horn D (2006) Novel unsupervised feature filtering of biological data. Bioinformatics (Oxford, England) 22:e507\u2013e513","journal-title":"Bioinformatics (Oxford, England)"},{"key":"506_CR14","doi-asserted-by":"publisher","first-page":"D608","DOI":"10.1093\/nar\/gkx1089","volume":"46","author":"DS Wishart","year":"2018","unstructured":"Wishart DS, Feunang YD, Marcu A, Guo AC, Liang K et al (2018) HMDB 4.0: the human metabolome database for 2018. Nucleic Acids Res 46:D608\u2013D617","journal-title":"Nucleic Acids Res"},{"key":"506_CR15","doi-asserted-by":"publisher","first-page":"D1074","DOI":"10.1093\/nar\/gkx1037","volume":"46","author":"DS Wishart","year":"2018","unstructured":"Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A et al (2018) Drugbank 5.0: a major update to the drugbank database for 2018. Nucleic Acids Res 46:D1074\u2013D1082","journal-title":"Nucleic Acids Res"},{"key":"506_CR16","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1186\/s13321-017-0220-4","volume":"9","author":"EL Willighagen","year":"2017","unstructured":"Willighagen EL, Mayfield JW, Alvarsson J, Berg A, Carlsson L et al (2017) The chemistry development kit (cdk) v2.0: atom typing, depiction, molecular formulas, and substructure searching. J Cheminformatics 9:33","journal-title":"J Cheminformatics"},{"key":"506_CR17","doi-asserted-by":"publisher","first-page":"D1035","DOI":"10.1093\/nar\/gkq1126","volume":"39","author":"C Knox","year":"2011","unstructured":"Knox C, Law V, Jewison T, Liu P, Ly S et al (2011) DrugBank 3.0: a comprehensive resource for \u2018omics\u2019 research on drugs. Nucleic Acids Res 39:D1035\u2013D1041","journal-title":"Nucleic Acids Res"},{"key":"506_CR18","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1186\/1758-2946-6-5","volume":"6","author":"P Franco","year":"2014","unstructured":"Franco P, Porta N, Holliday JD, Willett P (2014) The use of 2d fingerprint methods to support the assessment of structural similarity in orphan drug legislation. J Cheminformatics 6:5","journal-title":"J Cheminformatics"},{"key":"506_CR19","doi-asserted-by":"publisher","first-page":"2634","DOI":"10.1093\/bioinformatics\/bty1035","volume":"35","author":"M Alazmi","year":"2019","unstructured":"Alazmi M, Kuwahara H, Soufan O, Ding L, Gao X (2019) Systematic selection of chemical fingerprint features improves the Gibbs energy prediction of biochemical reactions. Bioinformatics 35:2634\u20132643","journal-title":"Bioinformatics"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-021-00506-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/s13321-021-00506-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-021-00506-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,3,23]],"date-time":"2021-03-23T14:07:31Z","timestamp":1616508451000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-021-00506-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,23]]},"references-count":19,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,12]]}},"alternative-id":["506"],"URL":"https:\/\/doi.org\/10.1186\/s13321-021-00506-2","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/853762","asserted-by":"object"}]},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,3,23]]},"assertion":[{"value":"8 May 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 March 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 March 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"We declare that we have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"27"}}