{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,26]],"date-time":"2026-02-26T15:04:31Z","timestamp":1772118271147,"version":"3.50.1"},"reference-count":43,"publisher":"Oxford University Press (OUP)","issue":"8","license":[{"start":{"date-parts":[[2023,8,12]],"date-time":"2023-08-12T00:00:00Z","timestamp":1691798400000},"content-version":"vor","delay-in-days":11,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Nature Science Foundation of China","doi-asserted-by":"publisher","award":["61872297"],"award-info":[{"award-number":["61872297"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100015401","name":"Shaanxi Province Key R&D Program","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100015401","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Metabolic stability plays a crucial role in the early stages of drug discovery and development. Accurately modeling and predicting molecular metabolic stability has great potential for the efficient screening of drug candidates as well as the optimization of lead compounds. Considering wet-lab experiment is time-consuming, laborious, and expensive, in silico prediction of metabolic stability is an alternative choice. However, few computational methods have been developed to address this task. 
In addition, it remains a significant challenge to explain key functional groups determining metabolic stability.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>To address these issues, we develop a novel cross-modality graph contrastive learning model named CMMS-GCL for predicting the metabolic stability of drug candidates. In our framework, we design deep learning methods to extract features for molecules from two modality data, i.e. SMILES sequence and molecule graph. In particular, for the sequence data, we design a multihead attention BiGRU-based encoder to preserve the context of symbols to learn sequence representations of molecules. For the graph data, we propose a graph contrastive learning-based encoder to learn structure representations by effectively capturing the consistencies between local and global structures. We further exploit fully connected neural networks to combine the sequence and structure representations for model training. Extensive experimental results on two datasets demonstrate that our CMMS-GCL consistently outperforms seven state-of-the-art methods. Furthermore, a collection of case studies on sequence data and statistical analyses of the graph structure module strengthens the validation of the interpretability of crucial functional groups recognized by CMMS-GCL. 
Overall, CMMS-GCL can serve as an effective and interpretable tool for predicting metabolic stability, identifying critical functional groups, and thus facilitating the drug discovery process and lead compound optimization.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The code and data underlying this article are freely available at https:\/\/github.com\/dubingxue\/CMMS-GCL.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad503","type":"journal-article","created":{"date-parts":[[2023,8,11]],"date-time":"2023-08-11T20:15:10Z","timestamp":1691784910000},"source":"Crossref","is-referenced-by-count":12,"title":["CMMS-GCL: cross-modality metabolic stability prediction with graph contrastive learning"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0201-2713","authenticated-orcid":false,"given":"Bing-Xue","family":"Du","sequence":"first","affiliation":[{"name":"School of Life Sciences, Northwestern Polytechnical University , Xi\u2019an 710072, China"},{"name":"Institute for Infocomm Research (I2R), Agency for Science, Technology and Research (A*STAR) , Singapore 138632, Singapore"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2765-3007","authenticated-orcid":false,"given":"Yahui","family":"Long","sequence":"additional","affiliation":[{"name":"Singapore Immunology Network (SIgN), Agency for Science, Technology and Research (A*STAR) , Singapore 138648, Singapore"}]},{"given":"Xiaoli","family":"Li","sequence":"additional","affiliation":[{"name":"Institute for Infocomm Research (I2R), Agency for Science, Technology and Research (A*STAR) , Singapore 138632, Singapore"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0977-3600","authenticated-orcid":false,"given":"Min","family":"Wu","sequence":"additional","affiliation":[{"name":"Institute for Infocomm Research (I2R), Agency for Science, Technology 
and Research (A*STAR) , Singapore 138632, Singapore"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2303-273X","authenticated-orcid":false,"given":"Jian-Yu","family":"Shi","sequence":"additional","affiliation":[{"name":"School of Life Sciences, Northwestern Polytechnical University , Xi\u2019an 710072, China"}]}],"member":"286","published-online":{"date-parts":[[2023,8,12]]},"reference":[{"key":"2023082607512544200_btad503-B1","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/B978-0-12-820007-0.00006-4","volume-title":"Remington","author":"Abelian","year":"2021"},{"key":"2023082607512544200_btad503-B2","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1186\/s13321-015-0069-3","article-title":"Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?","volume":"7","author":"Bajusz","year":"2015","journal-title":"J Cheminform"},{"key":"2023082607512544200_btad503-B3","doi-asserted-by":"crossref","first-page":"3521","DOI":"10.1038\/s41467-021-23720-w","article-title":"Algebraic graph-assisted bidirectional transformers for molecular property prediction","volume":"12","author":"Chen","year":"2021","journal-title":"Nat Commun"},{"key":"2023082607512544200_btad503-B4","first-page":"1597","volume-title":"Proceedings of the 37th International Conference on Machine Learning (ICML)","author":"Chen","year":"2020"},{"key":"2023082607512544200_btad503-B6","doi-asserted-by":"crossref","first-page":"390","DOI":"10.1016\/j.tips.2020.03.004","article-title":"Improving the accuracy of predicted human pharmacokinetics: lessons learned from the AstraZeneca drug pipeline over two decades","volume":"41","author":"Davies","year":"2020","journal-title":"Trends Pharmacol Sci"},{"key":"2023082607512544200_btad503-B7","doi-asserted-by":"crossref","first-page":"i325","DOI":"10.1093\/bioinformatics\/btac222","article-title":"MLGL-MP: a multi-label graph learning framework enhanced by pathway interdependence for metabolic pathway 
prediction","volume":"38","author":"Du","year":"2022","journal-title":"Bioinformatics"},{"key":"2023082607512544200_btad503-B8","first-page":"85","author":"Du","year":"2023"},{"key":"2023082607512544200_btad503-B9","doi-asserted-by":"crossref","first-page":"8835","DOI":"10.1021\/acs.jmedchem.9b02187","article-title":"Improvement in ADMET prediction with multitask deep featurization","volume":"63","author":"Feinberg","year":"2020","journal-title":"J Med Chem"},{"key":"2023082607512544200_btad503-B10","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1080\/03602532.2021.1970178","article-title":"Drug metabolic stability in early drug discovery to develop potential lead compounds","volume":"53","author":"Gajula","year":"2021","journal-title":"Drug Metab Rev"},{"key":"2023082607512544200_btad503-B11","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1186\/s13321-022-00599-3","article-title":"Transformer-based molecular optimization beyond matched molecular pairs","volume":"14","author":"He","year":"2022","journal-title":"J Cheminform"},{"key":"2023082607512544200_btad503-B12","doi-asserted-by":"crossref","first-page":"1344","DOI":"10.1021\/tx500134g","article-title":"Bioactivation potential of thiophene-containing drugs","volume":"27","author":"Gramec","year":"2014","journal-title":"Chem Res Toxicol"},{"key":"2023082607512544200_btad503-B13","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1038\/nrd4581","article-title":"Predicting drug metabolism: experiment and\/or computation?","volume":"14","author":"Kirchmair","year":"2015","journal-title":"Nat Rev Drug Discov"},{"key":"2023082607512544200_btad503-B14","author":"Kingma","year":"2015"},{"key":"2023082607512544200_btad503-B15","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1002\/phar.1194","article-title":"The role of human carboxylesterases in drug metabolism: have we overlooked their 
importance?","volume":"33","author":"Laizure","year":"2013","journal-title":"Pharmacotherapy"},{"key":"2023082607512544200_btad503-B16","doi-asserted-by":"crossref","first-page":"1614","DOI":"10.1021\/acs.chemrestox.2c00207","article-title":"In silico prediction of human and rat liver microsomal stability via machine learning methods","volume":"35","author":"Li","year":"2022","journal-title":"Chem Res Toxicol"},{"key":"2023082607512544200_btad503-B17","doi-asserted-by":"crossref","first-page":"8214","DOI":"10.1002\/anie.201206566","article-title":"Introduction of fluorine and fluorine-containing functional groups","volume":"52","author":"Liang","year":"2013","journal-title":"Angew Chem Int Ed Engl"},{"key":"2023082607512544200_btad503-B18","doi-asserted-by":"crossref","first-page":"2099","DOI":"10.1093\/bib\/bbz125","article-title":"A novel molecular representation with BiGRU neural networks for learning atom","volume":"21","author":"Lin","year":"2020","journal-title":"Brief Bioinform"},{"key":"2023082607512544200_btad503-B19","first-page":"1301","author":"Lin","year":"2020"},{"key":"2023082607512544200_btad503-B20","doi-asserted-by":"crossref","first-page":"2254","DOI":"10.1093\/bioinformatics\/btac100","article-title":"Pre-training graph neural networks for link prediction in biomedical networks","volume":"38","author":"Long","year":"2022","journal-title":"Bioinformatics"},{"key":"2023082607512544200_btad503-B21","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1016\/j.drudis.2021.11.005","article-title":"The role of DMPK science in improving pharmaceutical research and development efficiency","volume":"27","author":"Mak","year":"2022","journal-title":"Drug Discov Today"},{"key":"2023082607512544200_btad503-B22","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1070\/RCR4914","article-title":"Progress in the chemistry of nitrogen-, oxygen- and sulfur-containing heterocyclic 
systems","volume":"89","author":"Makhova","year":"2020","journal-title":"Russ Chem Rev"},{"key":"2023082607512544200_btad503-B23","doi-asserted-by":"crossref","first-page":"105043","DOI":"10.1016\/j.isci.2022.105043","article-title":"EdgeSHAPer: bond-centric Shapley value-based explanation method for graph neural networks","volume":"25","author":"Mastropietro","year":"2022","journal-title":"Iscience"},{"key":"2023082607512544200_btad503-B24","doi-asserted-by":"crossref","first-page":"433","DOI":"10.1007\/s11095-015-1800-5","article-title":"Predicting mouse liver microsomal stability with \u201cpruned\u201d machine learning models and public data","volume":"33","author":"Perryman","year":"2016","journal-title":"Pharm Res"},{"key":"2023082607512544200_btad503-B25","doi-asserted-by":"crossref","first-page":"542","DOI":"10.1038\/nrd1131","article-title":"Making better drugs: decision gates in non-clinical drug development","volume":"2","author":"Pritchard","year":"2003","journal-title":"Nat Rev Drug Discov"},{"key":"2023082607512544200_btad503-B26","doi-asserted-by":"crossref","first-page":"1040","DOI":"10.3390\/ijms19041040","article-title":"MetStabOn\u2014online platform for metabolic stability predictions","volume":"19","author":"Podlewska","year":"2018","journal-title":"Int J Mol Sci"},{"key":"2023082607512544200_btad503-B27","doi-asserted-by":"crossref","first-page":"e1479","DOI":"10.1002\/wcms.1479","article-title":"Advances in the prediction of mouse liver microsomal studies: from machine learning to deep learning","volume":"11","author":"Renn","year":"2021","journal-title":"WIREs Comput Mol Sci"},{"key":"2023082607512544200_btad503-B28","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1021\/acs.molpharmaceut.2c00680","article-title":"Multispecies machine learning predictions of in vitro intrinsic clearance with uncertainty quantification analyses","volume":"20","author":"Rodr\u00edguez-P\u00e9rez","year":"2023","journal-title":"Mol 
Pharm"},{"key":"2023082607512544200_btad503-B29","doi-asserted-by":"crossref","first-page":"364","DOI":"10.1093\/bioinformatics\/btab547","article-title":"PredMS: a random forest model for predicting metabolic stability of drug candidates in human liver microsomes","volume":"38","author":"Ryu","year":"2022","journal-title":"Bioinformatics"},{"key":"2023082607512544200_btad503-B30","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1186\/s13321-020-00426-7","article-title":"Predicting liver cytosol stability of small molecules","volume":"12","author":"Shah","year":"2020","journal-title":"J Cheminform"},{"key":"2023082607512544200_btad503-B31","doi-asserted-by":"crossref","first-page":"e23110","DOI":"10.1002\/bip.23110","article-title":"N-methylation in amino acids and peptides: scope and limitations","volume":"109","author":"Sharma","year":"2018","journal-title":"Biopolymers"},{"key":"2023082607512544200_btad503-B32","doi-asserted-by":"crossref","first-page":"345","DOI":"10.2478\/acph-2019-0024","article-title":"Metabolic stability and its role in the discovery of new chemical entities","volume":"69","author":"S\u0142oczy\u0144ska","year":"2019","journal-title":"Acta Pharm"},{"key":"2023082607512544200_btad503-B33","doi-asserted-by":"crossref","first-page":"3546","DOI":"10.1021\/acs.jmedchem.0c01930","article-title":"Successful and unsuccessful prediction of human hepatic clearance for lead optimization","volume":"64","author":"Sodhi","year":"2021","journal-title":"J Med Chem"},{"key":"2023082607512544200_btad503-B34","doi-asserted-by":"crossref","first-page":"5625","DOI":"10.1021\/acs.jmedchem.9b01617","article-title":"Acetylene group, friend or foe in medicinal chemistry","volume":"63","author":"Talele","year":"2020","journal-title":"J Med Chem"},{"key":"2023082607512544200_btad503-B35","doi-asserted-by":"crossref","first-page":"4","DOI":"10.1186\/s13321-020-00482-z","article-title":"Substructure-based neural machine translation for retrosynthetic 
prediction","volume":"13","author":"Ucak","year":"2021","journal-title":"J Cheminform"},{"key":"2023082607512544200_btad503-B36","author":"Veli\u010dkovi\u0107"},{"key":"2023082607512544200_btad503-B37","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1038\/s42256-022-00447-x","article-title":"Molecular contrastive learning of representations via graph neural networks","volume":"4","author":"Wang","year":"2022","journal-title":"Nat Mach Intell"},{"key":"2023082607512544200_btad503-B38","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1186\/s13321-021-00542-y","article-title":"How can SHAP values help to shape metabolic stability of chemical compounds?","volume":"13","author":"Wojtuch","year":"2021","journal-title":"J Cheminform"},{"key":"2023082607512544200_btad503-B39","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1039\/C7SC02664A","article-title":"MoleculeNet: a benchmark for molecular machine learning","volume":"9","author":"Wu","year":"2018","journal-title":"Chem Sci"},{"key":"2023082607512544200_btad503-B40","doi-asserted-by":"crossref","first-page":"8749","DOI":"10.1021\/acs.jmedchem.9b00959","article-title":"Pushing the boundaries of molecular representation for drug discovery with the graph attention mechanism","volume":"63","author":"Xiong","year":"2020","journal-title":"J Med Chem"},{"key":"2023082607512544200_btad503-B41","doi-asserted-by":"crossref","first-page":"W5","DOI":"10.1093\/nar\/gkab255","article-title":"ADMETlab 2.0: an integrated online platform for accurate and comprehensive predictions of ADMET properties","volume":"49","author":"Xiong","year":"2021","journal-title":"Nucleic Acids Res"},{"key":"2023082607512544200_btad503-B43","first-page":"5812","article-title":"Graph contrastive learning with augmentations","volume":"33","author":"You","year":"2020","journal-title":"Adv Neural Inf Process 
Syst"},{"key":"2023082607512544200_btad503-B44","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1016\/j.apsb.2018.04.003","article-title":"Drug metabolism in drug discovery and development","volume":"8","author":"Zhang","year":"2018","journal-title":"Acta Pharm Sin B"},{"key":"2023082607512544200_btad503-B45","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1038\/s42256-020-0152-y","article-title":"Predicting drug\u2013protein interaction using quasi-visual question answering system","volume":"2","author":"Zheng","year":"2020","journal-title":"Nat Mach Intell"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad503\/51102645\/btad503.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/8\/btad503\/51269397\/btad503.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/8\/btad503\/51269397\/btad503.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,8,26]],"date-time":"2023-08-26T07:51:56Z","timestamp":1693036316000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad503\/7241687"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,1]]},"references-count":43,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2023,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad503","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,8,1]]},"published":{"date-parts":[[2023,8,1]]},"article-number":"btad503"}}