{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T15:53:18Z","timestamp":1765295598914,"version":"3.45.0"},"reference-count":25,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T00:00:00Z","timestamp":1763337600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:sec>\n                    <jats:title>Introduction<\/jats:title>\n                    <jats:p>KRAS mutations are key oncogenic drivers in lung cancer, yet effective pharmacological targeting has remained a major challenge due to the protein's elusive and dynamic binding pockets. Computational modeling offers a promising route to identify novel inhibitors with improved potency and selectivity.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Methods<\/jats:title>\n                    <jats:p>\n                      A quantitative structure\u2013activity relationship (QSAR) modeling approach was developed to predict the inhibitory potency (pIC\n                      <jats:sub>50<\/jats:sub>\n                      ) of KRAS inhibitors and support\n                      <jats:italic>de novo<\/jats:italic>\n                      drug design. Molecular descriptors for 62 inhibitors retrieved from the ChEMBL database (CHEMBL4354832) were computed using Chemopy. Following descriptor normalization and dimensionality reduction, five machine learning algorithm spartial least squares (PLS), random forest (RF), stepwise multiple linear regression (MLR), genetic algorithm optimized MLR (GA-MLR), and XGBoost were applied. Model performance was evaluated using\n                      <jats:italic>R<\/jats:italic>\n                      <jats:sup>2<\/jats:sup>\n                      , RMSE, and MAE, while permutation-based importance and SHAP analyses provided feature interpretability.\n                    <\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>\n                      Among the models tested, PLS exhibited the best predictive performance (\n                      <jats:italic>R<\/jats:italic>\n                      <jats:sup>2<\/jats:sup>\n                      = 0.851; RMSE = 0.292), followed by RF (\n                      <jats:italic>R<\/jats:italic>\n                      <jats:sup>2<\/jats:sup>\n                      = 0.796). The GA-MLR model, based on eight optimized molecular descriptors, achieved good interpretability and robust internal validation (\n                      <jats:italic>R<\/jats:italic>\n                      <jats:sup>2<\/jats:sup>\n                      = 0.677). Virtual screening of 56\n                      <jats:italic>de novo<\/jats:italic>\n                      designed compounds within the model's applicability domain identified compound C9 with a predicted pIC\n                      <jats:sub>50<\/jats:sub>\n                      ) of 8.11 as the most promising hit.\n                    <\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Discussion<\/jats:title>\n                    <jats:p>\n                      This integrative QSAR modeling and\n                      <jats:italic>de novo<\/jats:italic>\n                      design framework effectively predicted the bioactivity of KRAS inhibitors and facilitated the identification of novel candidate molecules. The findings demonstrate the utility of combining interpretable machine learning models with virtual screening to accelerate the discovery of potent KRAS inhibitors for lung cancer therapy.\n                    <\/jats:p>\n                  <\/jats:sec>","DOI":"10.3389\/fbinf.2025.1663846","type":"journal-article","created":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T06:28:02Z","timestamp":1763360882000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["QSAR-guided discovery of novel KRAS inhibitors for lung cancer therapy"],"prefix":"10.3389","volume":"5","author":[{"given":"Osasan","family":"Stephen Adebayo","sequence":"first","affiliation":[]},{"given":"George","family":"Oche Ambrose","sequence":"additional","affiliation":[]},{"given":"Daramola","family":"Olusola","sequence":"additional","affiliation":[]},{"given":"Adefolalu","family":"Oluwafemi","sequence":"additional","affiliation":[]},{"given":"Hind A.","family":"Alzahrani","sequence":"additional","affiliation":[]},{"given":"Abdulkarim","family":"Hasan","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2025,11,17]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"927","DOI":"10.1021\/ci010247v","article-title":"Genetic algorithm guided selection: variable selection and subset selection","volume":"42","author":"Cho","year":"2002","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"B2","doi-asserted-by":"publisher","first-page":"217","DOI":"10.1038\/s41586-019-1694-1","article-title":"The clinical KRAS (G12C) inhibitor AMG 510 drives anti-tumour immunity","volume":"575","author":"Canon","year":"2019","journal-title":"Nature"},{"key":"B3","doi-asserted-by":"publisher","first-page":"1092","DOI":"10.1093\/bioinformatics\/btt105","article-title":"ChemoPy: freely available python package for computational biology and chemoinformatics","volume":"29","author":"Cao","year":"2013","journal-title":"Bioinformatics"},{"key":"B4","doi-asserted-by":"publisher","first-page":"4977","DOI":"10.1021\/jm4004285","article-title":"QSAR modeling: where have you been? Where are you going to?","volume":"57","author":"Cherkasov","year":"2014","journal-title":"J. Med. Chem."},{"key":"B5","doi-asserted-by":"publisher","first-page":"828","DOI":"10.1038\/nrd4389","article-title":"Drugging the undruggable RAS: mission possible?","volume":"13","author":"Cox","year":"2014","journal-title":"Nat. Rev. Drug Discov."},{"key":"B6","doi-asserted-by":"publisher","first-page":"3714","DOI":"10.1021\/jm000942e","article-title":"Fast calculation of molecular polar surface area as a sum of fragment-based contributions and its application to the prediction of drug transport properties","volume":"43","author":"Ertl","year":"2000","journal-title":"J. Med. Chem."},{"key":"B7","doi-asserted-by":"publisher","first-page":"6679","DOI":"10.1021\/acs.jmedchem.9b02052","article-title":"Identification of the clinical development candidate MRTX849, a covalent KRASG12C inhibitor for the treatment of cancer","volume":"63","author":"Fell","year":"2020","journal-title":"J. Med. Chem."},{"key":"B8","doi-asserted-by":"publisher","first-page":"549","DOI":"10.1016\/j.sbspro.2010.12.076","article-title":"Stepwise multiple regression method to forecast fish landing","volume":"8","author":"Ghani","year":"2010","journal-title":"Procedia-Social Behav. Sci."},{"key":"B9","doi-asserted-by":"publisher","first-page":"61","DOI":"10.4018\/ijqspr.20200701.oa1","article-title":"Principles of QSAR modeling: comments and suggestions from personal experience","volume":"5","author":"Gramatica","year":"2020","journal-title":"Int. J. Quantitative Structure-Property Relat. (IJQSPR)"},{"key":"B10","doi-asserted-by":"publisher","first-page":"151","DOI":"10.1016\/s0924-2031(99)00014-4","article-title":"Deriving the 3D structure of organic molecules from their infrared spectra","volume":"19","author":"Hemmer","year":"1999","journal-title":"Vib. Spectrosc."},{"key":"B11","doi-asserted-by":"publisher","first-page":"3016","DOI":"10.3390\/ijms11083016","article-title":"Advances and challenges in protein-ligand docking","volume":"11","author":"Huang","year":"2010","journal-title":"Int. J. Mol. Sci."},{"key":"B12","doi-asserted-by":"publisher","first-page":"52","DOI":"10.1021\/acs.jmedchem.9b01180","article-title":"Discovery of a covalent inhibitor of KRASG12C (AMG 510) for the treatment of solid tumors","volume":"63","author":"Lanman","year":"2019","journal-title":"J. Med. Chem."},{"key":"B13","doi-asserted-by":"publisher","first-page":"335","DOI":"10.1080\/17460441.2019.1581170","article-title":"DataWarrior: an evaluation of the open-source drug discovery tool","volume":"14","author":"L\u00f3pez-L\u00f3pez","year":"2019","journal-title":"Expert Opin. Drug Discov."},{"key":"B14","doi-asserted-by":"publisher","first-page":"786","DOI":"10.21105\/joss.00786","article-title":"iml: an R package for interpretable machine learning","volume":"3","author":"Molnar","year":"2018","journal-title":"J. Open Source Softw."},{"key":"B15","doi-asserted-by":"publisher","first-page":"1818","DOI":"10.1002\/med.70001","article-title":"The discovery of cryptic pockets increases the druggability of \u201cundruggable\u201d proteins","volume":"45","author":"Mou","year":"2025","journal-title":"Med. Res. Rev."},{"key":"B16","doi-asserted-by":"publisher","first-page":"548","DOI":"10.1038\/nature12796","article-title":"K-Ras (G12C) inhibitors allosterically control GTP affinity and effector interactions","volume":"503","author":"Ostrem","year":"2013","journal-title":"Nature"},{"key":"B17","doi-asserted-by":"publisher","first-page":"761","DOI":"10.1038\/nrc3106","article-title":"RAS oncogenes: weaving a tumorigenic web","volume":"11","author":"Pylayeva-Gupta","year":"2011","journal-title":"Nat. Rev. Cancer"},{"key":"B18","volume-title":"Understanding the basics of QSAR for applications in pharmaceutical sciences and risk assessment","author":"Roy","year":"2015"},{"key":"B19","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1021\/ci950164c","article-title":"The coding of the three-dimensional structure of molecules by molecular transforms and its application to structure-spectra correlations and studies of biological activity","volume":"36","author":"Schuur","year":"1996","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"B20","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1080\/09715010.2019.1653799","article-title":"Assessing and solving multicollinearity in sediment transport prediction models using principal component analysis","volume":"27","author":"Sulaiman","year":"2021","journal-title":"ISH J. Hydraulic Eng."},{"key":"B21","doi-asserted-by":"publisher","first-page":"209","DOI":"10.3322\/caac.21660","article-title":"Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries","volume":"71","author":"Sung","year":"2021","journal-title":"CA a Cancer J. Clin."},{"key":"B22","doi-asserted-by":"crossref","DOI":"10.1002\/9783527628766","volume-title":"Molecular descriptors for chemoinformatics","author":"Todeschini","year":"2009"},{"key":"B23","doi-asserted-by":"publisher","first-page":"476","DOI":"10.1002\/minf.201000061","article-title":"Best practices for QSAR model development, validation, and exploitation","volume":"29","author":"Tropsha","year":"2010","journal-title":"Mol. Inf."},{"key":"B24","doi-asserted-by":"publisher","first-page":"5912","DOI":"10.1021\/jm050362n","article-title":"A critical assessment of docking programs and scoring functions","volume":"49","author":"Warren","year":"2006","journal-title":"J. Med. Chem."},{"key":"B25","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1158\/2159-8290.cd-16-0092","article-title":"Progress on covalent inhibition of KRASG12C","volume":"6","author":"Westover","year":"2016","journal-title":"Cancer Discov."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1663846\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,17]],"date-time":"2025-11-17T06:28:08Z","timestamp":1763360888000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1663846\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,11,17]]},"references-count":25,"alternative-id":["10.3389\/fbinf.2025.1663846"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2025.1663846","relation":{},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,11,17]]},"article-number":"1663846"}}