{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,1]],"date-time":"2026-04-01T02:38:35Z","timestamp":1775011115375,"version":"3.50.1"},"reference-count":78,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,9,19]],"date-time":"2023-09-19T00:00:00Z","timestamp":1695081600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,9,19]],"date-time":"2023-09-19T00:00:00Z","timestamp":1695081600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004281","name":"Narodowe Centrum Nauki","doi-asserted-by":"publisher","award":["2019\/35\/N\/ST6\/02125"],"award-info":[{"award-number":["2019\/35\/N\/ST6\/02125"]}],"id":[{"id":"10.13039\/501100004281","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004281","name":"Narodowe Centrum Nauki","doi-asserted-by":"publisher","award":["2020\/37\/N\/ST6\/02728"],"award-info":[{"award-number":["2020\/37\/N\/ST6\/02728"]}],"id":[{"id":"10.13039\/501100004281","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Graph neural networks have recently become a standard method for analyzing chemical compounds. In the field of molecular property prediction, the emphasis is now on designing new model architectures, and the importance of atom featurization is oftentimes belittled. When contrasting two graph neural networks, the use of different representations possibly leads to incorrect attribution of the results solely to the network architecture. To better understand this issue, we compare multiple atom representations by evaluating them on the prediction of free energy, solubility, and metabolic stability using graph convolutional networks. We discover that the choice of atom representation has a significant impact on model performance and that the optimal subset of features is task-specific. Additional experiments involving more sophisticated architectures, including graph transformers, support these findings. Moreover, we demonstrate that some commonly used atom features, such as the number of neighbors or the number of hydrogens, can be easily predicted using only information about bonds and atom type, yet their explicit inclusion in the representation has a positive impact on model performance. Finally, we explain the predictions of the best-performing models to better understand how they utilize the available atomic features.<\/jats:p>","DOI":"10.1186\/s13321-023-00751-7","type":"journal-article","created":{"date-parts":[[2023,9,19]],"date-time":"2023-09-19T05:02:13Z","timestamp":1695099733000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["Extended study on atomic featurization in graph neural networks for molecular property prediction"],"prefix":"10.1186","volume":"15","author":[{"given":"Agnieszka","family":"Wojtuch","sequence":"first","affiliation":[]},{"given":"Tomasz","family":"Danel","sequence":"additional","affiliation":[]},{"given":"Sabina","family":"Podlewska","sequence":"additional","affiliation":[]},{"given":"\u0141ukasz","family":"Maziarka","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,9,19]]},"reference":[{"key":"751_CR1","first-page":"2224","volume":"25","author":"DK Duvenaud","year":"2015","unstructured":"Duvenaud DK, Maclaurin D, Iparraguirre J, Bombarelli R, Hirzel T, Aspuru-Guzik A, Adams RP (2015) Convolutional networks on graphs for learning molecular fingerprints. Adv Neural Inf Process Syst 25:2224\u20132232","journal-title":"Adv Neural Inf Process Syst"},{"issue":"1","key":"751_CR2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-020-00479-8","volume":"13","author":"D Jiang","year":"2021","unstructured":"Jiang D, Wu Z, Hsieh C-Y, Chen G, Liao B, Wang Z, Shen C, Cao D, Wu J, Hou T (2021) Could graph neural networks learn better molecular representation for drug discovery? A comparison study of descriptor-based and graph-based models. J Cheminf 13(1):1\u201323","journal-title":"J Cheminf"},{"issue":"10","key":"751_CR3","doi-asserted-by":"crossref","first-page":"4371","DOI":"10.1021\/acs.molpharmaceut.7b01144","volume":"15","author":"P Hop","year":"2018","unstructured":"Hop P, Allgood B, Yu J (2018) Geometric deep learning autonomously learns chemical features that outperform those engineered by domain experts. Mol Pharm 15(10):4371\u20134377","journal-title":"Mol Pharm"},{"issue":"24","key":"751_CR4","doi-asserted-by":"crossref","first-page":"5441","DOI":"10.1039\/C8SC00148K","volume":"9","author":"A Mayr","year":"2018","unstructured":"Mayr A, Klambauer G, Unterthiner T, Steijaert M, Wegner JK, Ceulemans H, Clevert D-A, Hochreiter S (2018) Large-scale comparison of machine learning methods for drug target prediction on ChEMBL. Chem Sci 9(24):5441\u20135451","journal-title":"Chem Sci"},{"issue":"8","key":"751_CR5","doi-asserted-by":"crossref","first-page":"1757","DOI":"10.1021\/acs.jcim.6b00601","volume":"57","author":"CW Coley","year":"2017","unstructured":"Coley CW, Barzilay R, Green WH, Jaakkola TS, Jensen KF (2017) Convolutional embedding of attributed molecular graphs for physical property prediction. J Chem Inf Model 57(8):1757\u20131772","journal-title":"J Chem Inf Model"},{"key":"751_CR6","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1704.01212","author":"J Gilmer","year":"2017","unstructured":"Gilmer J, Schoenholz SS, Riley PF, Vinyals O, Dahl GE (2017) Neural message passing for quantum chemistry. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1704.01212","journal-title":"arXiv"},{"issue":"8","key":"751_CR7","doi-asserted-by":"crossref","first-page":"3370","DOI":"10.1021\/acs.jcim.9b00237","volume":"59","author":"K Yang","year":"2019","unstructured":"Yang K, Swanson K, Jin W, Coley C, Eiden P, Gao H, Guzman-Perez A, Hopper T, Kelley B, Mathea M et al (2019) Analyzing learned molecular representations for property prediction. J Chem Inf Model 59(8):3370\u20133388","journal-title":"J Chem Inf Model"},{"key":"751_CR8","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2003.03123","author":"J Klicpera","year":"2020","unstructured":"Klicpera J, Gro\u00df J, G\u00fcnnemann S (2020) Directional message passing for molecular graphs. arXiv. https:\/\/doi.org\/10.48550\/arXiv.2003.03123","journal-title":"arXiv"},{"key":"751_CR9","doi-asserted-by":"crossref","unstructured":"Danel T, Spurek P, Tabor J, \u015amieja M, Struski \u0141, S\u0142owik A, Maziarka \u0141 (2020) Spatial graph convolutional networks. In: International Conference on Neural Information Processing, Springer. pp. 668\u2013675","DOI":"10.1007\/978-3-030-63823-8_76"},{"key":"751_CR10","doi-asserted-by":"crossref","unstructured":"Song Y, Zheng S, Niu Z, Fu Z-H, Lu Y, Yang Y (2020) Communicative representation learning on attributed molecular graphs. In: IJCAI","DOI":"10.24963\/ijcai.2020\/392"},{"key":"751_CR11","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1710.10903","author":"P Veli\u010dkovi\u0107","year":"2017","unstructured":"Veli\u010dkovi\u0107 P, Cucurull G, Casanova A, Romero A, Lio P, Bengio Y (2017) Graph attention networks. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1710.10903","journal-title":"arXiv"},{"key":"751_CR12","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1709.03741","author":"J Li","year":"2017","unstructured":"Li J, Cai D, He X (2017) Learning graph-level representation for drug discovery. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1709.03741","journal-title":"arXiv"},{"key":"751_CR13","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1805.10988","author":"S Ryu","year":"2018","unstructured":"Ryu S, Lim J, Hong SH, Kim WY (2018) Deeply learning molecular structure-property relationships using attention-and gate-augmented graph convolutional network. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1805.10988","journal-title":"arXiv"},{"key":"751_CR14","volume-title":"Advances in neural information processing systems","author":"A Vaswani","year":"2017","unstructured":"Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Lu, Polosukhin I (2017) Attention is all you need. In: Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R (eds) Advances in neural information processing systems, vol 30. Curran Associates Inc., Red Hook"},{"key":"751_CR15","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1810.04805","author":"J Devlin","year":"2018","unstructured":"Devlin J, Chang M-W, Lee K, Toutanova K (2018) BERT: pre-training of deep bidirectional transformers for language understanding. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1810.04805","journal-title":"arXiv"},{"key":"751_CR16","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2010.11929","author":"A Dosovitskiy","year":"2020","unstructured":"Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G, Gelly S et al (2020) An image is worth 16x16 words: transformers for image recognition at scale. arXiv. https:\/\/doi.org\/10.48550\/arXiv.2010.11929","journal-title":"arXiv"},{"key":"751_CR17","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2002.08264","author":"\u0141 Maziarka","year":"2020","unstructured":"Maziarka \u0141, Danel T, Mucha S, Rataj K, Tabor J, Jastrz\u0119bski S (2020) Molecule attention transformer. arXiv. https:\/\/doi.org\/10.48550\/arXiv.2002.08264","journal-title":"arXiv"},{"key":"751_CR18","first-page":"12559","volume":"33","author":"Y Rong","year":"2020","unstructured":"Rong Y, Bian Y, Xu T, Xie W, Wei Y, Huang W, Huang J (2020) Self-supervised graph transformer on large-scale molecular data. Adv Neural Inf Process Syst 33:12559\u201312571","journal-title":"Adv Neural Inf Process Syst"},{"issue":"5","key":"751_CR19","doi-asserted-by":"crossref","first-page":"742","DOI":"10.1021\/ci100050t","volume":"50","author":"D Rogers","year":"2010","unstructured":"Rogers D, Hahn M (2010) Extended-connectivity fingerprints. J Chem Inf Model 50(5):742\u2013754","journal-title":"J Chem Inf Model"},{"issue":"21","key":"751_CR20","doi-asserted-by":"crossref","first-page":"2518","DOI":"10.1093\/bioinformatics\/btn479","volume":"24","author":"J Klekota","year":"2008","unstructured":"Klekota J, Roth FP (2008) Chemical substructures that enrich for biological activity. Bioinformatics 24(21):2518\u20132525","journal-title":"Bioinformatics"},{"issue":"2","key":"751_CR21","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1039\/C7SC02664A","volume":"9","author":"Z Wu","year":"2018","unstructured":"Wu Z, Ramsundar B, Feinberg EN, Gomes J, Geniesse C, Pappu AS, Leswing K, Pande V (2018) Moleculenet: a benchmark for molecular machine learning. Chem Sci 9(2):513\u2013530","journal-title":"Chem Sci"},{"issue":"1","key":"751_CR22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-021-00557-5","volume":"13","author":"V Venkatraman","year":"2021","unstructured":"Venkatraman V (2021) FP-ADMET: a compendium of fingerprint-based ADMET prediction models. J Cheminf 13(1):1\u201312","journal-title":"J Cheminf"},{"issue":"11","key":"751_CR23","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1016\/S1359-6446(97)01079-9","volume":"2","author":"H Kubinyi","year":"1997","unstructured":"Kubinyi H (1997) QSAR and 3D QSAR in drug design part 1: methodology. Drug Discov Today 2(11):457\u2013467","journal-title":"Drug Discov Today"},{"issue":"4","key":"751_CR24","doi-asserted-by":"crossref","first-page":"1040","DOI":"10.3390\/ijms19041040","volume":"19","author":"S Podlewska","year":"2018","unstructured":"Podlewska S, Kafel R (2018) MetSTabOn\u2014online platform for metabolic stability predictions. Int J Mol Sci 19(4):1040","journal-title":"Int J Mol Sci"},{"key":"751_CR25","unstructured":"Unterthiner T, Mayr A, Klambauer G, Steijaert M, Wegner JK, Ceulemans H, Hochreiter S (2014) Deep learning as an opportunity in virtual screening. In: Proceedings of the Deep Learning Workshop at NIPS, vol. 27, pp. 1\u20139"},{"key":"751_CR26","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1502.02072","author":"B Ramsundar","year":"2015","unstructured":"Ramsundar B, Kearnes S, Riley P, Webster D, Konerding D, Pande V (2015) Massively multitask networks for drug discovery. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1502.02072","journal-title":"arXiv"},{"issue":"39","key":"751_CR27","doi-asserted-by":"crossref","first-page":"14814","DOI":"10.1021\/ja407397z","volume":"135","author":"D Zamora-Olivares","year":"2013","unstructured":"Zamora-Olivares D, Kaoud TS, Dalby KN, Anslyn EV (2013) In-situ generation of differential sensors that fingerprint kinases and the cellular response to their expression. J Am Chem Soc 135(39):14814\u201314820","journal-title":"J Am Chem Soc"},{"issue":"5","key":"751_CR28","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1016\/j.chembiol.2018.01.015","volume":"25","author":"J Simm","year":"2018","unstructured":"Simm J, Klambauer G, Arany A, Steijaert M, Wegner JK, Gustin E, Chupakhin V, Chong YT, Vialard J, Buijnsters P et al (2018) Repurposing high-throughput image assays enables biological activity prediction for drug discovery. Cell Chem Biol 25(5):611\u2013618","journal-title":"Cell Chem Biol"},{"issue":"2","key":"751_CR29","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1007\/s11095-015-1800-5","volume":"33","author":"AL Perryman","year":"2016","unstructured":"Perryman AL, Stratton TP, Ekins S, Freundlich JS (2016) Predicting mouse liver microsomal stability with \u201cpruned\u2019\u2019 machine learning models and public data. Pharm Res 33(2):433\u2013449","journal-title":"Pharm Res"},{"issue":"1","key":"751_CR30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-019-0376-1","volume":"11","author":"O Laufk\u00f6tter","year":"2019","unstructured":"Laufk\u00f6tter O, Sturm N, Bajorath J, Chen H, Engkvist O (2019) Combining structural and bioactivity-based fingerprints improves prediction performance and scaffold hopping capability. J Cheminf 11(1):1\u201314","journal-title":"J Cheminf"},{"issue":"1","key":"751_CR31","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1186\/s13321-022-00630-7","volume":"14","author":"N Kumar","year":"2022","unstructured":"Kumar N, Acharya V (2022) Machine intelligence-driven framework for optimized hit selection in virtual screening. J Cheminf 14(1):48","journal-title":"J Cheminf"},{"issue":"10","key":"751_CR32","doi-asserted-by":"publisher","first-page":"5318","DOI":"10.1021\/acsomega.9b04339","volume":"5","author":"T Tuan-Anh","year":"2020","unstructured":"Tuan-Anh T, Zale\u015bny R (2020) Predictions of high-order electric properties of molecules: can we benefit from machine learning? ACS Omega 5(10):5318\u20135325. https:\/\/doi.org\/10.1021\/acsomega.9b04339","journal-title":"ACS Omega"},{"issue":"1","key":"751_CR33","doi-asserted-by":"publisher","first-page":"31","DOI":"10.1021\/ci00057a005","volume":"28","author":"D Weininger","year":"1988","unstructured":"Weininger D (1988) SMILES, a chemical language and information system. 1. introduction to methodology and encoding rules. J Chem Inf Comput Sci 28(1):31\u201336. https:\/\/doi.org\/10.1021\/ci00057a005","journal-title":"J Chem Inf Comput Sci"},{"key":"751_CR34","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1602.06289","author":"S Jastrz\u0119bski","year":"2016","unstructured":"Jastrz\u0119bski S, Le\u015bniak D, Czarnecki WM (2016) Learning to smile (s). arXiv. https:\/\/doi.org\/10.48550\/arXiv.1602.06289","journal-title":"arXiv"},{"key":"751_CR35","doi-asserted-by":"crossref","unstructured":"Wang S, Guo Y, Wang Y, Sun H, Huang J (2019) Smiles-bert: large scale unsupervised pre-training for molecular property prediction. In: Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, pp. 429\u2013436","DOI":"10.1145\/3307339.3342186"},{"issue":"47","key":"751_CR36","doi-asserted-by":"crossref","first-page":"9854","DOI":"10.1021\/acs.jpca.0c05969","volume":"124","author":"GA Pinheiro","year":"2020","unstructured":"Pinheiro GA, Mucelini J, Soares MD, Prati RC, Da Silva JL, Quiles MG (2020) Machine learning prediction of nine molecular properties based on the smiles representation of the QM9 quantum-chemistry dataset. J Phys Chem A 124(47):9854\u20139866","journal-title":"J Phys Chem A"},{"key":"751_CR37","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1911.04738","author":"S Honda","year":"2019","unstructured":"Honda S, Shi S, Ueda HR (2019) Smiles transformer: pre-trained molecular fingerprint for low data drug discovery. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1911.04738","journal-title":"arXiv"},{"key":"751_CR38","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2010.09885","author":"S Chithrananda","year":"2020","unstructured":"Chithrananda S, Grand G, Ramsundar B (2020) ChemBERTa: large-scale self-supervised pretraining for molecular property prediction. arXiv. https:\/\/doi.org\/10.48550\/arXiv.2010.09885","journal-title":"arXiv"},{"key":"751_CR39","unstructured":"Kusner MJ, Paige B, Hern\u00e1ndez-Lobato JM (2017) Grammar variational autoencoder. In: International Conference on Machine Learning, PMLR. pp. 1945\u20131954"},{"issue":"1","key":"751_CR40","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-017-0235-x","volume":"9","author":"M Olivecrona","year":"2017","unstructured":"Olivecrona M, Blaschke T, Engkvist O, Chen H (2017) Molecular de-novo design through deep reinforcement learning. J Cheminf 9(1):1\u201314","journal-title":"J Cheminf"},{"issue":"7","key":"751_CR41","doi-asserted-by":"crossref","first-page":"7885","DOI":"10.1126\/sciadv.aap7885","volume":"4","author":"M Popova","year":"2018","unstructured":"Popova M, Isayev O, Tropsha A (2018) Deep reinforcement learning for de novo drug design. Sci Adv 4(7):7885","journal-title":"Sci Adv"},{"issue":"2","key":"751_CR42","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1021\/acscentsci.7b00572","volume":"4","author":"R G\u00f3mez-Bombarelli","year":"2018","unstructured":"G\u00f3mez-Bombarelli R, Wei JN, Duvenaud D, Hern\u00e1ndez-Lobato JM, S\u00e1nchez-Lengeling B, Sheberla D, Aguilera-Iparraguirre J, Hirzel TD, Adams RP, Aspuru-Guzik A (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent Sci 4(2):268\u2013276","journal-title":"ACS Cent Sci"},{"key":"751_CR43","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1758-2946-4-22","volume":"4","author":"NM O\u2019Boyle","year":"2012","unstructured":"O\u2019Boyle NM (2012) Towards a universal smiles representation-a standard method to generate canonical smiles based on the InCHI. J Cheminf 4:1\u201314","journal-title":"J Cheminf"},{"key":"751_CR44","doi-asserted-by":"publisher","DOI":"10.1007\/s10822-021-00421-6","author":"JK Weber","year":"2022","unstructured":"Weber JK, Morrone JA, Bagchi S, Pabon JDE, Kang S-g, Zhang L, Cornell WD (2022) Simplified, interpretable graph convolutional neural networks for small molecule activity prediction. J Comput-Aided Mol Des. https:\/\/doi.org\/10.1007\/s10822-021-00421-6","journal-title":"J Comput-Aided Mol Des"},{"issue":"8","key":"751_CR45","doi-asserted-by":"crossref","DOI":"10.1063\/5.0079574","volume":"156","author":"T Stuyver","year":"2022","unstructured":"Stuyver T, Coley CW (2022) Quantum chemistry-augmented neural networks for reactivity prediction: performance, generalizability, and explainability. J Chem Phys 156(8):084104","journal-title":"J Chem Phys"},{"issue":"14","key":"751_CR46","doi-asserted-by":"crossref","first-page":"12268","DOI":"10.1021\/acsomega.2c00697","volume":"7","author":"S Lee","year":"2022","unstructured":"Lee S, Lee M, Gyak K-W, Kim SD, Kim M-J, Min K (2022) Novel solubility prediction models: Molecular fingerprints and physicochemical features vs graph convolutional neural networks. ACS Omega 7(14):12268\u201312277","journal-title":"ACS Omega"},{"issue":"1","key":"751_CR47","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1021\/acs.molpharmaceut.2c00680","volume":"20","author":"R Rodr\u00edguez-P\u00e9rez","year":"2022","unstructured":"Rodr\u00edguez-P\u00e9rez R, Trunzer M, Schneider N, Faller B, Gerebtzoff G (2022) Multispecies machine learning predictions of in vitro intrinsic clearance with uncertainty quantification analyses. Mol Pharm 20(1):383\u2013394","journal-title":"Mol Pharm"},{"issue":"1","key":"751_CR48","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-022-00626-3","volume":"14","author":"M Huang","year":"2022","unstructured":"Huang M, Lou C, Wu Z, Li W, Lee PW, Tang Y, Liu G (2022) In silico prediction of UGT-mediated metabolism in drug-like molecules via graph neural network. J Cheminf 14(1):1\u201316","journal-title":"J Cheminf"},{"key":"751_CR49","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.119055","volume":"213","author":"M Wiercioch","year":"2023","unstructured":"Wiercioch M, Kirchmair J (2023) DNN-PP: a novel deep neural network approach and its applicability in drug-related property prediction. Exp Syst Appl 213:119055. https:\/\/doi.org\/10.1016\/j.eswa.2022.119055","journal-title":"Exp Syst Appl"},{"key":"751_CR50","doi-asserted-by":"crossref","unstructured":"Wu F, Radev D, Li SZ (2023) Molformer: motif-based transformer on 3d heterogeneous molecular graphs. Rn 1, 1","DOI":"10.1609\/aaai.v37i4.25662"},{"issue":"1","key":"751_CR51","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1038\/s42004-023-00857-x","volume":"6","author":"Y Jiang","year":"2023","unstructured":"Jiang Y, Jin S, Jin X, Xiao X, Wu W, Liu X, Zhang Q, Zeng X, Yang G, Niu Z (2023) Pharmacophoric-constrained heterogeneous graph transformer model for molecular property prediction. Commun Chem 6(1):60","journal-title":"Commun Chem"},{"issue":"16","key":"751_CR52","doi-asserted-by":"crossref","first-page":"8705","DOI":"10.1021\/acs.jmedchem.0c00385","volume":"63","author":"KV Chuang","year":"2020","unstructured":"Chuang KV, Gunsalus LM, Keiser MJ (2020) Learning molecular representations for medicinal chemistry: miniperspective. J Med Chem 63(16):8705\u20138722","journal-title":"J Med Chem"},{"issue":"8","key":"751_CR53","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1007\/s10822-016-9938-8","volume":"30","author":"S Kearnes","year":"2016","unstructured":"Kearnes S, McCloskey K, Berndl M, Pande V, Riley P (2016) Molecular graph convolutions: moving beyond fingerprints. J Comput Aid Mol Des 30(8):595\u2013608","journal-title":"J Comput Aid Mol Des"},{"issue":"14","key":"751_CR54","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.3390\/ijms20143389","volume":"20","author":"K Liu","year":"2019","unstructured":"Liu K, Sun X, Jia L, Ma J, Xing H, Wu J, Gao H, Sun Y, Boulnois F, Fan J (2019) Chemi-net: a molecular graph convolutional network for accurate drug property prediction. Int J Mol Sci 20(14):3389","journal-title":"Int J Mol Sci"},{"key":"751_CR55","doi-asserted-by":"crossref","unstructured":"Pocha A, Danel T, Podlewska S, Tabor J, Maziarka \u0141 (2021) Comparison of atom representations in graph neural networks for molecular property prediction. In: 2021 International Joint Conference on Neural Networks (IJCNN), IEEE. pp. 1\u20138","DOI":"10.1109\/IJCNN52387.2021.9533698"},{"issue":"3","key":"751_CR56","doi-asserted-by":"crossref","first-page":"1000","DOI":"10.1021\/ci034243x","volume":"44","author":"JS Delaney","year":"2004","unstructured":"Delaney JS (2004) ESOL: estimating aqueous solubility directly from molecular structure. J Chem Inf Comput Sci 44(3):1000\u20131005","journal-title":"J Chem Inf Comput Sci"},{"issue":"11","key":"751_CR57","doi-asserted-by":"crossref","first-page":"2864","DOI":"10.1021\/ci300415d","volume":"52","author":"L Ruddigkeit","year":"2012","unstructured":"Ruddigkeit L, Van Deursen R, Blum LC, Reymond J-L (2012) Enumeration of 166 billion organic small molecules in the chemical universe database GDB-17. J Chem Inf Model 52(11):2864\u20132875","journal-title":"J Chem Inf Model"},{"key":"751_CR58","first-page":"9240","volume":"32","author":"R Ying","year":"2019","unstructured":"Ying R, Bourgeois D, You J, Zitnik M, Leskovec J (2019) Gnnexplainer: generating explanations for graph neural networks. Adv Neural Inf Process Syst 32:9240","journal-title":"Adv Neural Inf Process Syst"},{"key":"751_CR59","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1912.02757","author":"S Fort","year":"2019","unstructured":"Fort S, Hu H, Lakshminarayanan B (2019) Deep ensembles: a loss landscape perspective. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1912.02757","journal-title":"arXiv"},{"issue":"8","key":"751_CR60","doi-asserted-by":"publisher","first-page":"1225","DOI":"10.1021\/acs.chemrestox.6b00135","volume":"29","author":"AM Richard","year":"2016","unstructured":"Richard AM, Judson RS, Houck KA, Grulke CM, Volarath P, Thillainadarajah I, Yang C, Rathman J, Martin MT, Wambaugh JF, Knudsen TB, Kancherla J, Mansouri K, Patlewicz G, Williams AJ, Little SB, Crofton KM, Thomas RS (2016) ToxCast chemical landscape: paving the road to 21st century toxicology. Chem Res Toxicol 29(8):1225\u20131251. https:\/\/doi.org\/10.1021\/acs.chemrestox.6b00135","journal-title":"Chem Res Toxicol"},{"issue":"2","key":"751_CR61","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1021\/ci8002649","volume":"49","author":"SG Rohrer","year":"2009","unstructured":"Rohrer SG, Baumann K (2009) Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data. J Chem Inf Model 49(2):169\u2013184","journal-title":"J Chem Inf Model"},{"key":"751_CR62","unstructured":"Maziarz K, Jackson-Flux HR, Cameron P, Sirockin F, Schneider N, Stiefl N, Segler M, Brockschmidt M (2022) Learning to extend molecular scaffolds with structural motifs. In: International Conference on Learning Representations. https:\/\/openreview.net\/forum?id=ZTsoE8G3GG. Accessed 20 Feb 2023"},{"issue":"1","key":"751_CR63","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-019-0404-1","volume":"12","author":"\u0141 Maziarka","year":"2020","unstructured":"Maziarka \u0141, Pocha A, Kaczmarczyk J, Rataj K, Danel T, Warcho\u0142 M (2020) Mol-CycleGAN: a generative model for molecular optimization. J Cheminf 12(1):1\u201318","journal-title":"J Cheminf"},{"issue":"9","key":"751_CR64","doi-asserted-by":"crossref","first-page":"4246","DOI":"10.1021\/acs.jcim.9b01202","volume":"60","author":"S Jastrzebski","year":"2020","unstructured":"Jastrzebski S, Szymczak M, Pocha A, Mordalski S, Tabor J, Bojarski AJ, Podlewska S (2020) Emulating docking results using a deep neural network: a new perspective for virtual screening. J Chem Inf Model 60(9):4246\u20134262","journal-title":"J Chem Inf Model"},{"issue":"8","key":"751_CR65","doi-asserted-by":"crossref","first-page":"2000028","DOI":"10.1002\/minf.202000028","volume":"39","author":"A-T Ton","year":"2020","unstructured":"Ton A-T, Gentile F, Hsing M, Ban F, Cherkasov A (2020) Rapid identification of potential inhibitors of SARS-CoV-2 main protease by deep docking of 1.3 billion compounds. Mol inf 39(8):2000028","journal-title":"Mol inf"},{"issue":"15","key":"751_CR66","doi-asserted-by":"crossref","first-page":"2887","DOI":"10.1021\/jm9602928","volume":"39","author":"GW Bemis","year":"1996","unstructured":"Bemis GW, Murcko MA (1996) The properties of known drugs. 1. molecular frameworks. J Med Chem 39(15):2887\u20132893","journal-title":"J Med Chem"},{"key":"751_CR67","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1609.02907","author":"TN Kipf","year":"2016","unstructured":"Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1609.02907","journal-title":"arXiv"},{"key":"751_CR68","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1207.0580","author":"GE Hinton","year":"2012","unstructured":"Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR (2012) Improving neural networks by preventing co-adaptation of feature detectors. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1207.0580","journal-title":"arXiv"},{"key":"751_CR69","unstructured":"Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. In: International Conference on Machine Learning, PMLR. pp. 448\u2013456"},{"key":"751_CR70","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1412.6980","author":"DP Kingma","year":"2014","unstructured":"Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv. https:\/\/doi.org\/10.48550\/arXiv.1412.6980","journal-title":"arXiv"},{"key":"751_CR71","doi-asserted-by":"crossref","unstructured":"Chen T, Guestrin C (2016) Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining, pp. 785\u2013794","DOI":"10.1145\/2939672.2939785"},{"key":"751_CR72","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-021-00542-y","volume":"13","author":"A Wojtuch","year":"2021","unstructured":"Wojtuch A, Jankowski R, Podlewska S (2021) How can SHAP values help to shape metabolic stability of chemical compounds? J Cheminf 13:1\u201320","journal-title":"J Cheminf"},{"key":"751_CR73","unstructured":"Fey M, Lenssen JE (2019) Fast graph representation learning with PyTorch Geometric. In: ICLR Workshop on Representation Learning on Graphs and Manifolds"},{"issue":"11","key":"751_CR74","first-page":"2579","volume":"9","author":"L Van der Maaten","year":"2008","unstructured":"Van der Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9(11):2579\u20132605","journal-title":"J Mach Learn Res"},{"key":"751_CR75","unstructured":"Ester M, Kriegel H-P, Sander J, Xu X et al (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Kdd, vol. 96, pp. 226\u2013231"},{"key":"751_CR76","unstructured":"Dai H, Dai B, Song L (2016) Discriminative embeddings of latent variable models for structured data. In: International Conference on Machine Learning, PMLR. pp. 2702\u20132711"},{"issue":"3","key":"751_CR77","doi-asserted-by":"crossref","first-page":"1044","DOI":"10.1021\/acs.jcim.8b00672","volume":"59","author":"X Li","year":"2019","unstructured":"Li X, Yan X, Gu Q, Zhou H, Wu D, Xu J (2019) Deepchemstable: chemical stability prediction with an attention-based graph convolution network. J Chem Inf Model 59(3):1044\u20131049","journal-title":"J Chem Inf Model"},{"issue":"4","key":"751_CR78","doi-asserted-by":"crossref","first-page":"1238","DOI":"10.1021\/acs.jmedchem.6b01437","volume":"60","author":"Y Hu","year":"2017","unstructured":"Hu Y, Stumpfe D, Bajorath J (2017) Recent advances in scaffold hopping: miniperspective. J Med Chem 60(4):1238\u20131246","journal-title":"J Med Chem"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00751-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-023-00751-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00751-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,18]],"date-time":"2023-11-18T22:25:13Z","timestamp":1700346313000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-023-00751-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,19]]},"references-count":78,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["751"],"URL":"https:\/\/doi.org\/10.1186\/s13321-023-00751-7","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,19]]},"assertion":[{"value":"27 March 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"23 August 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 September 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"81"}}