{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T08:18:36Z","timestamp":1777623516977,"version":"3.51.4"},"reference-count":37,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T00:00:00Z","timestamp":1672185600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T00:00:00Z","timestamp":1672185600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This article demonstrates how to create Chemical Space Networks (CSNs) using a Python RDKit and NetworkX workflow. CSNs are a type of network visualization that depict compounds as nodes connected by edges, defined as a pairwise relationship such as a 2D fingerprint similarity value. A step by step approach is presented for creating two different CSNs in this manuscript, one based on RDKit 2D fingerprint Tanimoto similarity values, and another based on maximum common substructure similarity values. Several different CSN visualization features are included in the tutorial including methods to represent nodes with color based on bioactivity attribute value, edges with different line styles based on similarity value, as well as replacing the circle nodes\u00a0with 2D structure depictions. Finally, some common network property and analysis calculations are presented including the clustering coefficient, degree assortativity, and modularity. All code is provided in the form of Jupyter Notebooks and is available on GitHub with a permissive BSD-3 open-source license:<jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/vfscalfani\/CSN_tutorial\">https:\/\/github.com\/vfscalfani\/CSN_tutorial<\/jats:ext-link><\/jats:p><jats:p><jats:bold>Graphical Abstract<\/jats:bold><\/jats:p>","DOI":"10.1186\/s13321-022-00664-x","type":"journal-article","created":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T15:03:49Z","timestamp":1672239829000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":84,"title":["Visualizing chemical space networks with RDKit and NetworkX"],"prefix":"10.1186","volume":"14","author":[{"given":"Vincent F.","family":"Scalfani","sequence":"first","affiliation":[]},{"given":"Vishank D.","family":"Patel","sequence":"additional","affiliation":[]},{"given":"Avery M.","family":"Fernandez","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,12,28]]},"reference":[{"key":"664_CR1","doi-asserted-by":"publisher","first-page":"795","DOI":"10.1007\/s10822-014-9760-0","volume":"28","author":"GM Maggiora","year":"2014","unstructured":"Maggiora GM, Bajorath J (2014) Chemical space networks: a powerful new paradigm for the description of chemical space. J Comput Aided Mol Des 28:795\u2013802. https:\/\/doi.org\/10.1007\/s10822-014-9760-0","journal-title":"J Comput Aided Mol Des"},{"key":"664_CR2","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1007\/s10822-016-9906-3","volume":"30","author":"M Vogt","year":"2016","unstructured":"Vogt M, Stumpfe D, Maggiora GM, Bajorath J (2016) Lessons learned from the design of chemical space networks and opportunities for new applications. J Comput Aided Mol Des 30:191\u2013208. https:\/\/doi.org\/10.1007\/s10822-016-9906-3","journal-title":"J Comput Aided Mol Des"},{"key":"664_CR3","doi-asserted-by":"publisher","first-page":"8653","DOI":"10.1021\/acs.jmedchem.9b01989","volume":"63","author":"M Recanatini","year":"2020","unstructured":"Recanatini M, Cabrelle C (2020) Drug research meets network science: where are we? J Med Chem 63:8653\u20138666. https:\/\/doi.org\/10.1021\/acs.jmedchem.9b01989","journal-title":"J Med Chem"},{"key":"664_CR4","doi-asserted-by":"publisher","first-page":"3768","DOI":"10.1021\/acsomega.8b00344","volume":"3","author":"R Kunimoto","year":"2018","unstructured":"Kunimoto R, Bajorath J (2018) Combining similarity searching and network analysis for the identification of active compounds. ACS Omega 3:3768\u20133777. https:\/\/doi.org\/10.1021\/acsomega.8b00344","journal-title":"ACS Omega"},{"key":"664_CR5","doi-asserted-by":"publisher","first-page":"595","DOI":"10.1007\/s10822-015-9852-5","volume":"29","author":"B Zhang","year":"2015","unstructured":"Zhang B, Vogt M, Maggiora GM, Bajorath J (2015) Comparison of bioactive chemical space networks generated using substructure- and fingerprint-based measures of molecular similarity. J Comput Aided Mol Des 29:595\u2013608. https:\/\/doi.org\/10.1007\/s10822-015-9852-5","journal-title":"J Comput Aided Mol Des"},{"key":"664_CR6","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1007\/s10822-014-9821-4","volume":"29","author":"M Zwierzyna","year":"2015","unstructured":"Zwierzyna M, Vogt M, Maggiora GM, Bajorath J (2015) Design and characterization of chemical space networks for different compound data sets. J Comput Aided Mol Des 29:113\u2013125. https:\/\/doi.org\/10.1007\/s10822-014-9821-4","journal-title":"J Comput Aided Mol Des"},{"key":"664_CR7","doi-asserted-by":"publisher","first-page":"937","DOI":"10.1007\/s10822-015-9872-1","volume":"29","author":"B Zhang","year":"2015","unstructured":"Zhang B, Vogt M, Maggiora GM, Bajorath J (2015) Design of chemical space networks using a Tanimoto similarity variant based upon maximum common substructures. J Comput Aided Mol Des 29:937\u2013950. https:\/\/doi.org\/10.1007\/s10822-015-9872-1","journal-title":"J Comput Aided Mol Des"},{"key":"664_CR8","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1007\/s10822-017-0061-2","volume":"31","author":"R Kunimoto","year":"2017","unstructured":"Kunimoto R, Bajorath J (2017) Exploring sets of molecules from patents and relationships to other active compounds in chemical space networks. J Comput Aided Mol Des 31:779\u2013788. https:\/\/doi.org\/10.1007\/s10822-017-0061-2","journal-title":"J Comput Aided Mol Des"},{"key":"664_CR9","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10822-015-9891-y","volume":"30","author":"M Wu","year":"2016","unstructured":"Wu M, Vogt M, Maggiora GM, Bajorath J (2016) Design of chemical space networks on the basis of Tversky similarity. J Comput Aided Mol Des 30:1\u201312. https:\/\/doi.org\/10.1007\/s10822-015-9891-y","journal-title":"J Comput Aided Mol Des"},{"key":"664_CR10","doi-asserted-by":"publisher","first-page":"376","DOI":"10.1039\/C6MD00628K","volume":"8","author":"R Kunimoto","year":"2017","unstructured":"Kunimoto R, Vogt M, Bajorath J (2017) Tracing compound pathways using chemical space networks. Med Chem Commun 8:376\u2013384. https:\/\/doi.org\/10.1039\/C6MD00628K","journal-title":"Med Chem Commun"},{"key":"664_CR11","doi-asserted-by":"publisher","first-page":"1769","DOI":"10.4155\/fmc-2016-0023","volume":"8","author":"A de la VegadeLe\u00f3n","year":"2016","unstructured":"de la VegadeLe\u00f3n A, Bajorath J (2016) Chemical space visualization: transforming multidimensional chemical spaces into similarity-based molecular networks. Future Med Chem 8:1769\u20131778. https:\/\/doi.org\/10.4155\/fmc-2016-0023","journal-title":"Future Med Chem"},{"key":"664_CR12","doi-asserted-by":"publisher","first-page":"2429","DOI":"10.1021\/ci9001102","volume":"49","author":"Z Lepp","year":"2009","unstructured":"Lepp Z, Huang C, Okada T (2009) Finding key members in compound libraries by analyzing networks of molecules assembled by structural similarity. J Chem Inf Model 49:2429\u20132443. https:\/\/doi.org\/10.1021\/ci9001102","journal-title":"J Chem Inf Model"},{"key":"664_CR13","doi-asserted-by":"publisher","first-page":"1578","DOI":"10.1021\/ci4007547","volume":"54","author":"S Wollenhaupt","year":"2014","unstructured":"Wollenhaupt S, Baumann K (2014) inSARa: intuitive and interactive SAR interpretation by reduced graphs and hierarchical MCS-based network navigation. J Chem Inf Model 54:1578\u20131595. https:\/\/doi.org\/10.1021\/ci4007547","journal-title":"J Chem Inf Model"},{"key":"664_CR14","doi-asserted-by":"publisher","first-page":"2186","DOI":"10.1021\/acs.jcim.1c01013","volume":"62","author":"TB Dunn","year":"2022","unstructured":"Dunn TB, Seabra GM, Kim TD et al (2022) Diversity and chemical library networks of large data sets. J Chem Inf Model 62:2186\u20132201. https:\/\/doi.org\/10.1021\/acs.jcim.1c01013","journal-title":"J Chem Inf Model"},{"key":"664_CR15","unstructured":"Landrum G, contributors Rdk (2022) rdkit\/rdkit: 2022_03_5 (Q1 2022) Release"},{"key":"664_CR16","doi-asserted-by":"crossref","unstructured":"Hagberg AA, Schult DA, Swart PJ (2008) Exploring network structure, dynamics, and function using NetworkX. In: Varoquaux G, Vaught T, Millman J (eds) Proceedings of the 7th Python in science conference. Pasadena, CA USA, pp 11\u201315","DOI":"10.25080\/TCWV9851"},{"key":"664_CR17","unstructured":"ChEMBL Data Licensing. https:\/\/chembl.gitbook.io\/chembl-interface-documentation\/about#data-licensing"},{"key":"664_CR18","unstructured":"Creative Commons\u2014CC BY-SA 3.0 License. https:\/\/creativecommons.org\/licenses\/by-sa\/3.0\/"},{"key":"664_CR19","doi-asserted-by":"publisher","first-page":"D945","DOI":"10.1093\/nar\/gkw1074","volume":"45","author":"A Gaulton","year":"2017","unstructured":"Gaulton A, Hersey A, Nowotka M et al (2017) The ChEMBL database in 2017. Nucleic Acids Res 45:D945\u2013D954. https:\/\/doi.org\/10.1093\/nar\/gkw1074","journal-title":"Nucleic Acids Res"},{"key":"664_CR20","doi-asserted-by":"crossref","unstructured":"McKinney W (2010) Data structures for statistical computing in Python. In: Walt S van der, Millman J (eds) Proceedings of the 9th Python in science conference, pp 56\u201361","DOI":"10.25080\/Majora-92bf1922-00a"},{"key":"664_CR21","doi-asserted-by":"publisher","unstructured":"The Pandas development team (2022) pandas-dev\/pandas: Pandas 1.4.4. https:\/\/doi.org\/10.5281\/zenodo.3509134","DOI":"10.5281\/zenodo.3509134"},{"key":"664_CR22","unstructured":"Dalke, Andrew Combinitorial Library Generation with SMILES. In: Combinitorial Library Generation with SMILES. http:\/\/www.dalkescientific.com\/writings\/diary\/archive\/2004\/12\/12\/library_generation_with_smiles.html. Accessed 20 Oct 2022"},{"key":"664_CR23","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1186\/s13321-020-00456-1","volume":"12","author":"AP Bento","year":"2020","unstructured":"Bento AP, Hersey A, F\u00e9lix E et al (2020) An open source chemical structure curation pipeline using RDKit. J Cheminform 12:51. https:\/\/doi.org\/10.1186\/s13321-020-00456-1","journal-title":"J Cheminform"},{"key":"664_CR24","doi-asserted-by":"publisher","first-page":"1189","DOI":"10.1021\/ci100176x","volume":"50","author":"D Fourches","year":"2010","unstructured":"Fourches D, Muratov E, Tropsha A (2010) Trust, but verify: on the importance of chemical structure curation in cheminformatics and QSAR modeling research. J Chem Inf Model 50:1189\u20131204. https:\/\/doi.org\/10.1021\/ci100176x","journal-title":"J Chem Inf Model"},{"key":"664_CR25","doi-asserted-by":"publisher","first-page":"1243","DOI":"10.1021\/acs.jcim.6b00129","volume":"56","author":"D Fourches","year":"2016","unstructured":"Fourches D, Muratov E, Tropsha A (2016) Trust, but verify II: a practical guide to chemogenomics data curation. J Chem Inf Model 56:1243\u20131252. https:\/\/doi.org\/10.1021\/acs.jcim.6b00129","journal-title":"J Chem Inf Model"},{"key":"664_CR26","unstructured":"Landrum, Greg RSC_OpenScience_Standardization. In: RSC_OpenScience_Standardization. https:\/\/github.com\/greglandrum\/RSC_OpenScience_Standardization_202104. Accessed 20 Oct 2022"},{"key":"664_CR27","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1109\/MCSE.2007.55","volume":"9","author":"JD Hunter","year":"2007","unstructured":"Hunter JD (2007) Matplotlib: a 2D graphics environment. Comput Sci Eng 9:90\u201395. https:\/\/doi.org\/10.1109\/MCSE.2007.55","journal-title":"Comput Sci Eng"},{"key":"664_CR28","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.79.026102","volume":"79","author":"A Noack","year":"2009","unstructured":"Noack A (2009) Modularity clustering is force-directed layout. Phys Rev E 79:026102. https:\/\/doi.org\/10.1103\/PhysRevE.79.026102","journal-title":"Phys Rev E"},{"key":"664_CR29","unstructured":"NetworkX Weighted Graph Drawing. In: NetworkX weighted graph drawing. https:\/\/networkx.org\/documentation\/stable\/auto_examples\/drawing\/plot_weighted_graph.html. Accessed 20 Oct 2022"},{"key":"664_CR30","unstructured":"NetworkX Custom Node Icons Drawing. In: NetworkX custom node icons drawing. https:\/\/networkx.org\/documentation\/stable\/auto_examples\/drawing\/plot_custom_node_icons.html. Accessed 20 Oct 2022"},{"key":"664_CR31","unstructured":"ipycytoscape. In: ipycytoscape. https:\/\/github.com\/cytoscape\/ipycytoscape. Accessed 20 Oct 2022"},{"key":"664_CR32","unstructured":"Landrum G (2020) RDKit UGM what\u2019s new. https:\/\/github.com\/rdkit\/UGM_2020\/blob\/master\/Notebooks\/Landrum_WhatsNew.ipynb. Accessed 20 Oct 2022"},{"key":"664_CR33","doi-asserted-by":"publisher","first-page":"3331","DOI":"10.1021\/acs.jcim.0c00296","volume":"60","author":"F Kruger","year":"2020","unstructured":"Kruger F, Stiefl N, Landrum GA (2020) rdScaffoldNetwork: the scaffold network implementation in RDKit. J Chem Inf Model 60:3331\u20133335. https:\/\/doi.org\/10.1021\/acs.jcim.0c00296","journal-title":"J Chem Inf Model"},{"key":"664_CR34","doi-asserted-by":"crossref","unstructured":"Newman M (2018) Networks. Oxford University Press","DOI":"10.1093\/oso\/9780198805090.001.0001"},{"key":"664_CR35","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.71.065103","volume":"71","author":"J-P Onnela","year":"2005","unstructured":"Onnela J-P, Saram\u00e4ki J, Kert\u00e9sz J, Kaski K (2005) Intensity and coherence of motifs in weighted complex networks. Phys Rev E 71:065103. https:\/\/doi.org\/10.1103\/PhysRevE.71.065103","journal-title":"Phys Rev E"},{"key":"664_CR36","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.75.027105","volume":"75","author":"J Saram\u00e4ki","year":"2007","unstructured":"Saram\u00e4ki J, Kivel\u00e4 M, Onnela J-P et al (2007) Generalizations of the clustering coefficient to weighted complex networks. Phys Rev E 75:027105. https:\/\/doi.org\/10.1103\/PhysRevE.75.027105","journal-title":"Phys Rev E"},{"key":"664_CR37","doi-asserted-by":"publisher","unstructured":"Ito MI, Ohnishi T (2019) Weighted network analysis of biologically relevant chemical spaces. https:\/\/doi.org\/10.48550\/ARXIV.1911.05259","DOI":"10.48550\/ARXIV.1911.05259"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00664-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-022-00664-x\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-022-00664-x.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,11]],"date-time":"2024-10-11T03:29:31Z","timestamp":1728617371000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-022-00664-x"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,28]]},"references-count":37,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,12]]}},"alternative-id":["664"],"URL":"https:\/\/doi.org\/10.1186\/s13321-022-00664-x","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,28]]},"assertion":[{"value":"21 October 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 November 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 December 2022","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare they have no competing interests.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"87"}}