{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,19]],"date-time":"2026-06-19T05:49:03Z","timestamp":1781848143811,"version":"3.54.5"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"22","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2005,11,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The development of chemoinformatics has been hampered by the lack of large, publicly available, comprehensive repositories of molecules, in particular of small molecules. Small molecules play a fundamental role in organic chemistry and biology. They can be used as combinatorial building blocks for chemical synthesis, as molecular probes in chemical genomics and systems biology, and for the screening and discovery of new drugs and other useful compounds.<\/jats:p>\n               <jats:p>Results: We describe ChemDB, a public database of small molecules available on the Web. ChemDB is built using the digital catalogs of over a hundred vendors and other public sources and is annotated with information derived from these sources as well as from computational methods, such as predicted solubility and three-dimensional structure. It supports multiple molecular formats and is periodically updated, automatically whenever possible. The current version of the database contains approximately 4.1 million commercially available compounds and 8.2 million counting isomers. The database includes a user-friendly graphical interface, chemical reactions capabilities, as well as unique search capabilities.<\/jats:p>\n               <jats:p>Availability: Database and datasets are available on<\/jats:p>\n               <jats:p>Contact: \u00a0pfbaldi@ics.uci.edu<\/jats:p>\n               <jats:p>Supplementary information: Supplementary materials are available on<\/jats:p>","DOI":"10.1093\/bioinformatics\/bti683","type":"journal-article","created":{"date-parts":[[2005,9,21]],"date-time":"2005-09-21T03:03:34Z","timestamp":1127271814000},"page":"4133-4139","source":"Crossref","is-referenced-by-count":148,"title":["ChemDB: a public database of small molecules and related chemoinformatics resources"],"prefix":"10.1093","volume":"21","author":[{"given":"Jonathan","family":"Chen","sequence":"first","affiliation":[{"name":"Institute for Genomics and Bioinformatics, School of Information and Computer Sciences, University of California 1 \u00a0 1 \u00a0 \u00a0 Irvine, CA, USA"},{"name":"Department of Computer Science, School of Information and Computer Sciences, University of California 2 \u00a0 2 \u00a0 \u00a0 Irvine, CA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"S. Joshua","family":"Swamidass","sequence":"additional","affiliation":[{"name":"Institute for Genomics and Bioinformatics, School of Information and Computer Sciences, University of California 1 \u00a0 1 \u00a0 \u00a0 Irvine, CA, USA"},{"name":"Department of Computer Science, School of Information and Computer Sciences, University of California 2 \u00a0 2 \u00a0 \u00a0 Irvine, CA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yimeng","family":"Dou","sequence":"additional","affiliation":[{"name":"Institute for Genomics and Bioinformatics, School of Information and Computer Sciences, University of California 1 \u00a0 1 \u00a0 \u00a0 Irvine, CA, USA"},{"name":"Department of Computer Science, School of Information and Computer Sciences, University of California 2 \u00a0 2 \u00a0 \u00a0 Irvine, CA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jocelyne","family":"Bruand","sequence":"additional","affiliation":[{"name":"Institute for Genomics and Bioinformatics, School of Information and Computer Sciences, University of California 1 \u00a0 1 \u00a0 \u00a0 Irvine, CA, USA"},{"name":"Department of Computer Science, School of Information and Computer Sciences, University of California 2 \u00a0 2 \u00a0 \u00a0 Irvine, CA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Pierre","family":"Baldi","sequence":"additional","affiliation":[{"name":"Institute for Genomics and Bioinformatics, School of Information and Computer Sciences, University of California 1 \u00a0 1 \u00a0 \u00a0 Irvine, CA, USA"},{"name":"Department of Computer Science, School of Information and Computer Sciences, University of California 2 \u00a0 2 \u00a0 \u00a0 Irvine, CA, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2005,9,20]]},"reference":[{"key":"2023061007105751700_b1","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1038\/nrd791","article-title":"Combinatorial informatics in the post-genomics era","volume":"1","author":"Agrafiotis","year":"2002","journal-title":"Nat. Rev. Drug Discov."},{"key":"2023061007105751700_b2","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The Protein Data Bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023061007105751700_b3","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1038\/nature03192","article-title":"Chemical space and biology","volume":"432","author":"Dobson","year":"2004","journal-title":"Nature"},{"key":"2023061007105751700_b4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1198\/004017002317375064","article-title":"A modification of the Jaccard\/Tanimoto similarity index for diverse selection of chemical compounds using binary strings","volume":"44","author":"Fligner","year":"2002","journal-title":"Technometrics"},{"key":"2023061007105751700_b5","doi-asserted-by":"crossref","first-page":"378","DOI":"10.1021\/ci970437z","article-title":"On the properties of bit string-based measures of chemical similarity","volume":"38","author":"Flower","year":"1998","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023061007105751700_b6","doi-asserted-by":"crossref","first-page":"1315","DOI":"10.1021\/ci0003810","article-title":"Improving the odds in discriminating \u2018drug-like\u2019 from \u2018non drug-like\u2019 compounds","volume":"40","author":"Frimurer","year":"2000","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023061007105751700_b7","doi-asserted-by":"crossref","first-page":"1030","DOI":"10.1021\/ci960343+","article-title":"Chemical information in 3D-space","volume":"36","author":"Gasteiger","year":"1996","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023061007105751700_b8","doi-asserted-by":"crossref","first-page":"1108","DOI":"10.1038\/nature03658","article-title":"An endocannabinoid mechanism for stress-induced analgesia","volume":"435","author":"Hohmann","year":"2005","journal-title":"Nature"},{"key":"2023061007105751700_b9","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1146\/annurev.pharmtox.40.1.273","article-title":"Parallel array and mixture-based synthetic combinatorial chemistry: tools for the next millennium","volume":"40","author":"Houghten","year":"2000","journal-title":"Ann. Rev. Pharmacol. Toxicol."},{"key":"2023061007105751700_b10","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1021\/ci049714+","article-title":"ZINC\u2014a free database of commercially available compounds for virtual screening","volume":"45","author":"Irwin","year":"2005","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023061007105751700_b11","volume-title":"Daylight Theory Manual","author":"James","year":"2004"},{"key":"2023061007105751700_b12","doi-asserted-by":"crossref","first-page":"2145","DOI":"10.1093\/bioinformatics\/bti314","article-title":"Prediction methods and databases within chemoinformatics: emphasis on drugs and drug candidates","volume":"21","author":"Jonsdottir","year":"2005","journal-title":"Bioinformatics"},{"key":"2023061007105751700_b13","doi-asserted-by":"crossref","first-page":"774","DOI":"10.1126\/science.308.5723.774a","article-title":"Chemists want NIH to curtail database","volume":"308","author":"Kaiser","year":"2005","journal-title":"Science"},{"key":"2023061007105751700_b14","doi-asserted-by":"crossref","first-page":"1729","DOI":"10.1126\/science.308.5729.1729b","article-title":"House approves 0.5% raise for NIH, comments on database","volume":"308","author":"Kaiser","year":"2005","journal-title":"Science"},{"key":"2023061007105751700_b15","doi-asserted-by":"crossref","first-page":"855","DOI":"10.1038\/nature03193","article-title":"Navigating chemical space for biology and medicine","volume":"432","author":"Lipinski","year":"2004","journal-title":"Nature"},{"key":"2023061007105751700_b16","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1016\/S0169-409X(96)00423-1","article-title":"Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings","volume":"23","author":"Lipinski","year":"1997","journal-title":"Adv. Drug Deliv. Rev."},{"key":"2023061007105751700_b17","doi-asserted-by":"crossref","first-page":"718","DOI":"10.1038\/435718a","article-title":"Chemistry society goes head to head with NIH in fight over public database","volume":"435","author":"Marris","year":"2005","journal-title":"Nature"},{"key":"2023061007105751700_b18","first-page":"265","article-title":"A novel approach to QSPR\/QSAR based on neural networks for structures","volume-title":"Soft Computing Approaches in Chemistry","author":"Micheli","year":"2003"},{"key":"2023061007105751700_b19","doi-asserted-by":"crossref","DOI":"10.1016\/j.neunet.2005.07.009","article-title":"Graph kernels for chemical informatics","author":"Ralaivola","year":"2005","journal-title":"Neural Netw."},{"key":"2023061007105751700_b20","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1021\/ci00010a002","article-title":"Definition and role of similarity concepts in the chemical and physical sciences","volume":"32","author":"Rouvray","year":"1992","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023061007105751700_b21","doi-asserted-by":"crossref","first-page":"1000","DOI":"10.1021\/ci00020a039","article-title":"Comparison of automatic three-dimensional model builders using 639 X-ray structures","volume":"34","author":"Sadowski","year":"1994","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023061007105751700_b22","volume-title":"Learning with Kernels, Support Vector Machines, Regularization, Optimization and Beyond","author":"Sch\u00f6lkopf","year":"2002"},{"key":"2023061007105751700_b23","doi-asserted-by":"crossref","first-page":"1964","DOI":"10.1126\/science.287.5460.1964","article-title":"Target-oriented and diversity-oriented organic synthesis in drug discovery","volume":"287","author":"Schreiber","year":"2000","journal-title":"Science"},{"key":"2023061007105751700_b24","first-page":"51","article-title":"The small-molecule approach to biology: chemical genetics and diversity-oriented organic synthesis make possible the systematic exploration of biology","volume":"81","author":"Schreiber","year":"2003","journal-title":"Chem. Eng. News"},{"key":"2023061007105751700_b25","doi-asserted-by":"crossref","first-page":"846","DOI":"10.1038\/nature03196","article-title":"Exploring biology with small organic molecules","volume":"432","author":"Stockwell","year":"2004","journal-title":"Nature"},{"key":"2023061007105751700_b26","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1126\/science.1083395","article-title":"From knowing to controlling: a path from genomics to drugs using small molecule probes","volume":"300","author":"Strauseberg","year":"2003","journal-title":"Science"},{"key":"2023061007105751700_b27","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1093\/bioinformatics\/bti1055","article-title":"Kernels for small molecules and the prediction of mutagenicity, toxicity, and anti-cancer activity","volume":"21","author":"Swamidass","year":"2005","journal-title":"Bioinformatics"},{"key":"2023061007105751700_b28","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1037\/0033-295X.84.4.327","article-title":"Features of similarity","volume":"84","author":"Tversky","year":"1977","journal-title":"Psychol. Rev."},{"key":"2023061007105751700_b29","doi-asserted-by":"crossref","first-page":"2615","DOI":"10.1021\/jm020017n","article-title":"Molecular properties that influence the oral bioavailability of drug candidates","volume":"45","author":"Veber","year":"2002","journal-title":"J. Med. Chem."},{"key":"2023061007105751700_b30","doi-asserted-by":"crossref","first-page":"702","DOI":"10.1021\/ci000150t","article-title":"Comparison of the NCI open database with seven large chemical structural databases","volume":"41","author":"Voigt","year":"2001","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"2023061007105751700_b31","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1021\/ci00062a008","article-title":"SMILES. 2. Algorithm for generation of uniques SMILES notation","volume":"29","author":"Weininger","year":"1989","journal-title":"J. Chem. Inf. Comput. Sci."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/22\/4133\/50566332\/bioinformatics_21_22_4133.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/21\/22\/4133\/50566332\/bioinformatics_21_22_4133.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,10]],"date-time":"2023-06-10T07:12:02Z","timestamp":1686381122000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/21\/22\/4133\/195041"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,9,20]]},"references-count":31,"journal-issue":{"issue":"22","published-print":{"date-parts":[[2005,11,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bti683","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2005,11,15]]},"published":{"date-parts":[[2005,9,20]]}}}