{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,13]],"date-time":"2026-03-13T18:04:24Z","timestamp":1773425064624,"version":"3.50.1"},"reference-count":90,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,5,6]],"date-time":"2023-05-06T00:00:00Z","timestamp":1683331200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,5,6]],"date-time":"2023-05-06T00:00:00Z","timestamp":1683331200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Specific League Funds from Mahidol University"},{"name":"College of Arts, Media and Technology, Chiang Mai University"},{"DOI":"10.13039\/501100004704","name":"National Research Council of Thailand","doi-asserted-by":"publisher","award":["N42A660380"],"award-info":[{"award-number":["N42A660380"]}],"id":[{"id":"10.13039\/501100004704","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cheminform"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Drug resistance represents a major obstacle to therapeutic innovations and is a prevalent feature in prostate cancer (PCa). Androgen receptors (ARs) are the hallmark therapeutic target for prostate cancer modulation and AR antagonists have achieved great success. However, rapid emergence of resistance contributing to PCa progression is the ultimate burden of their long-term usage. Hence, the discovery and development of AR antagonists with capability to combat the resistance, remains an avenue for further exploration. Therefore, this study proposes a novel deep learning (DL)-based hybrid framework, named DeepAR, to accurately and rapidly identify AR antagonists by using only the SMILES notation. Specifically, DeepAR is capable of extracting and learning the key information embedded in AR antagonists. Firstly, we established a benchmark dataset by collecting active and inactive compounds against AR from the ChEMBL database. Based on this dataset, we developed and optimized a collection of baseline models by using a comprehensive set of well-known molecular descriptors and machine learning algorithms. Then, these baseline models were utilized for creating probabilistic features. Finally, these probabilistic features were combined and used for the construction of a meta-model based on a one-dimensional convolutional neural network. Experimental results indicated that DeepAR is a more accurate and stable approach for identifying AR antagonists in terms of the independent test dataset, by achieving an accuracy of 0.911 and MCC of 0.823. In addition, our proposed framework is able to provide feature importance information by leveraging a popular computational approach, named SHapley Additive exPlanations (SHAP). In the meanwhile, the characterization and analysis of potential AR antagonist candidates were achieved through the SHAP waterfall plot and molecular docking. The analysis inferred that <jats:italic>N<\/jats:italic>-heterocyclic moieties, halogenated substituents, and a cyano functional group were significant determinants of potential AR antagonists. Lastly, we implemented an online web server by using DeepAR (at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"http:\/\/pmlabstack.pythonanywhere.com\/DeepAR\">http:\/\/pmlabstack.pythonanywhere.com\/DeepAR<\/jats:ext-link>). We anticipate that DeepAR could be a useful computational tool for community-wide facilitation of AR candidates from a large number of uncharacterized compounds.<\/jats:p>","DOI":"10.1186\/s13321-023-00721-z","type":"journal-article","created":{"date-parts":[[2023,5,6]],"date-time":"2023-05-06T09:02:19Z","timestamp":1683363739000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":47,"title":["DeepAR: a novel deep learning-based hybrid framework for the interpretable prediction of androgen receptor antagonists"],"prefix":"10.1186","volume":"15","author":[{"given":"Nalini","family":"Schaduangrat","sequence":"first","affiliation":[]},{"given":"Nuttapat","family":"Anuwongcharoen","sequence":"additional","affiliation":[]},{"given":"Phasit","family":"Charoenkwan","sequence":"additional","affiliation":[]},{"given":"Watshara","family":"Shoombuatong","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,5,6]]},"reference":[{"issue":"6","key":"721_CR1","doi-asserted-by":"publisher","first-page":"1067","DOI":"10.1016\/j.molonc.2014.05.004","volume":"8","author":"FH Groenendijk","year":"2014","unstructured":"Groenendijk FH, Bernards R (2014) Drug resistance to targeted therapies: deja vu all over again. Mol Oncol 8(6):1067\u20131083","journal-title":"Mol Oncol"},{"issue":"1","key":"721_CR2","doi-asserted-by":"publisher","first-page":"7","DOI":"10.3322\/caac.21708","volume":"72","author":"RL Siegel","year":"2022","unstructured":"Siegel RL, Miller KD, Fuchs HE, Jemal A (2022) Cancer statistics, 2022. CA Cancer J Clin 72(1):7\u201333","journal-title":"CA Cancer J Clin"},{"key":"721_CR3","unstructured":"International agency for research on cancer. (2022). Cancer tomorrow. https:\/\/gco.iarc.fr\/tomorrow\/en\/dataviz\/bars?types=0&sexes=0&mode=population&group_populations=0&multiple_populations=1&multiple_cancers=1&cancers=39_27&populations=903_904_905_908_909_935&apc=cat_ca20v1.5_ca23v-1.5&group_cancers=1&bar_mode=stacked."},{"issue":"2","key":"721_CR4","doi-asserted-by":"publisher","first-page":"463","DOI":"10.1111\/j.1365-2605.2009.01047.x","volume":"33","author":"A Kortenkamp","year":"2010","unstructured":"Kortenkamp A, Faust M (2010) Combined exposures to anti-androgenic chemicals: steps towards cumulative risk assessment. Int J Androl 33(2):463\u2013474","journal-title":"Int J Androl"},{"issue":"2","key":"721_CR5","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1016\/S0012-1606(02)00031-3","volume":"253","author":"PC Marker","year":"2003","unstructured":"Marker PC, Donjacour AA, Dahiya R, Cunha GR (2003) Hormonal, cellular, and molecular control of prostatic development. Dev Biol 253(2):165\u2013174","journal-title":"Dev Biol"},{"issue":"3","key":"721_CR6","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1111\/j.1432-0436.2006.00135.x","volume":"75","author":"XD Wang","year":"2007","unstructured":"Wang XD et al (2007) Expression profiling of the mouse prostate after castration and hormone replacement: implication of H-cadherin in prostate tumorigenesis. Differentiation 75(3):219\u2013234","journal-title":"Differentiation"},{"issue":"1","key":"721_CR7","first-page":"3","volume":"37","author":"RA Davey","year":"2016","unstructured":"Davey RA, Grossmann M (2016) Androgen receptor structure, function and biology: from bench to bedside. Clin Biochem Rev 37(1):3\u201315","journal-title":"Clin Biochem Rev"},{"issue":"1","key":"721_CR8","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1038\/aps.2014.18","volume":"36","author":"MH Tan","year":"2015","unstructured":"Tan MH, Li J, Xu HE, Melcher K, Yong EL (2015) Androgen receptor: structure, role in prostate cancer and drug discovery. Acta Pharmacol Sin 36(1):3\u201323","journal-title":"Acta Pharmacol Sin"},{"issue":"10","key":"721_CR9","doi-asserted-by":"publisher","first-page":"983","DOI":"10.1016\/S1470-2045(12)70379-0","volume":"13","author":"K Fizazi","year":"2012","unstructured":"Fizazi K et al (2012) Abiraterone acetate for treatment of metastatic castration-resistant prostate cancer: final overall survival analysis of the COU-AA-301 randomised, double-blind, placebo-controlled phase 3 study. Lancet Oncol 13(10):983\u2013992","journal-title":"Lancet Oncol"},{"key":"721_CR10","doi-asserted-by":"publisher","DOI":"10.3390\/biom11040492","author":"TJ Kim","year":"2021","unstructured":"Kim TJ, Lee YH, Koo KC (2021) Current status and future perspectives of androgen receptor inhibition therapy for prostate cancer: a comprehensive review. Biomolecules. https:\/\/doi.org\/10.3390\/biom11040492","journal-title":"Biomolecules"},{"issue":"2","key":"721_CR11","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1080\/14737140.2022.2020651","volume":"22","author":"MC Nigro","year":"2022","unstructured":"Nigro MC et al (2022) Current androgen receptor antagonists under investigation for resistant prostate cancer. Expert Rev Anticancer Ther 22(2):191\u2013202","journal-title":"Expert Rev Anticancer Ther"},{"key":"721_CR12","doi-asserted-by":"publisher","DOI":"10.1016\/j.compbiolchem.2021.107490","volume":"92","author":"O Sercinoglu","year":"2021","unstructured":"Sercinoglu O, Bereketoglu C, Olsson PE, Pradhan A (2021) In silico and in vitro assessment of androgen receptor antagonists. Comput Biol Chem 92:107490","journal-title":"Comput Biol Chem"},{"issue":"1","key":"721_CR13","doi-asserted-by":"publisher","first-page":"4","DOI":"10.1016\/j.eururo.2017.08.012","volume":"73","author":"H Borgmann","year":"2018","unstructured":"Borgmann H et al (2018) Moving towards precision urologic oncology: targeting enzalutamide-resistant prostate cancer and mutated forms of the androgen receptor using the novel inhibitor darolutamide (ODM-201). Eur Urol 73(1):4\u20138","journal-title":"Eur Urol"},{"issue":"9","key":"721_CR14","doi-asserted-by":"publisher","first-page":"1020","DOI":"10.1158\/2159-8290.CD-13-0226","volume":"3","author":"JD Joseph","year":"2013","unstructured":"Joseph JD et al (2013) A clinically relevant androgen receptor mutation confers resistance to second-generation antiandrogens enzalutamide and ARN-509. Cancer Discov 3(9):1020\u20131029","journal-title":"Cancer Discov"},{"issue":"10","key":"721_CR15","doi-asserted-by":"publisher","first-page":"1267","DOI":"10.1080\/14728222.2016.1192131","volume":"20","author":"S Sakkiah","year":"2016","unstructured":"Sakkiah S, Ng HW, Tong W, Hong H (2016) Structures of androgen receptor bound with ligands: advancing understanding of biological functions and drug discovery. Expert Opin Ther Targets 20(10):1267\u20131282","journal-title":"Expert Opin Ther Targets"},{"key":"721_CR16","doi-asserted-by":"publisher","first-page":"1419","DOI":"10.3389\/fphar.2018.01419","volume":"9","author":"Y Liu","year":"2018","unstructured":"Liu Y et al (2018) Structural based screening of antiandrogen targeting activation function-2 binding site. Front Pharmacol 9:1419","journal-title":"Front Pharmacol"},{"issue":"4","key":"721_CR17","doi-asserted-by":"publisher","first-page":"1635","DOI":"10.1021\/jm201438f","volume":"55","author":"L Caboni","year":"2012","unstructured":"Caboni L et al (2012) \u201cTrue\u201d antiandrogens-selective non-ligand-binding pocket disruptors of androgen receptor-coactivator interactions: novel tools for prostate cancer. J Med Chem 55(4):1635\u20131644","journal-title":"J Med Chem"},{"issue":"41","key":"721_CR18","doi-asserted-by":"publisher","first-page":"16074","DOI":"10.1073\/pnas.0708036104","volume":"104","author":"E Estebanez-Perpina","year":"2007","unstructured":"Estebanez-Perpina E et al (2007) A surface on the androgen receptor that allosterically regulates coactivator binding. Proc Natl Acad Sci U S A 104(41):16074\u201316079","journal-title":"Proc Natl Acad Sci U S A"},{"issue":"18","key":"721_CR19","doi-asserted-by":"publisher","first-page":"6197","DOI":"10.1021\/jm200532b","volume":"54","author":"P Axerio-Cilies","year":"2011","unstructured":"Axerio-Cilies P et al (2011) Inhibitors of androgen receptor activation function-2 (AF2) site identified through virtual screening. J Med Chem 54(18):6197\u20136205","journal-title":"J Med Chem"},{"issue":"29","key":"721_CR20","doi-asserted-by":"publisher","first-page":"12178","DOI":"10.1073\/pnas.0900185106","volume":"106","author":"JD Joseph","year":"2009","unstructured":"Joseph JD et al (2009) Inhibition of prostate cancer cell growth by second-site androgen receptor antagonists. Proc Natl Acad Sci USA 106(29):12178\u201312183","journal-title":"Proc Natl Acad Sci USA"},{"issue":"6","key":"721_CR21","doi-asserted-by":"publisher","first-page":"435","DOI":"10.1021\/cb900043e","volume":"4","author":"JR Gunther","year":"2009","unstructured":"Gunther JR, Parent AA, Katzenellenbogen JA (2009) Alternative inhibition of androgen receptor signaling: peptidomimetic pyrimidines as direct androgen receptor\/coactivator disruptors. ACS Chem Biol 4(6):435\u2013440","journal-title":"ACS Chem Biol"},{"issue":"5","key":"721_CR22","doi-asserted-by":"publisher","first-page":"373","DOI":"10.1080\/10629360310001623962","volume":"14","author":"H Hong","year":"2003","unstructured":"Hong H, Fang H, Xie Q, Perkins R, Sheehan DM, Tong W (2003) Comparative molecular field analysis (CoMFA) model using a large diverse set of natural, synthetic and environmental chemicals for binding to the androgen receptor. SAR QSAR Environ Res 14(5):373\u201388","journal-title":"SAR QSAR Environ Res"},{"key":"721_CR23","doi-asserted-by":"publisher","DOI":"10.1016\/j.jmgm.2021.108081","volume":"111","author":"A Kocak","year":"2022","unstructured":"Kocak A, Yildiz M (2022) Molecular dynamics simulations reveal the plausible agonism\/antagonism mechanism by steroids on androgen receptor mutations. J Mol Graph Model 111:108081","journal-title":"J Mol Graph Model"},{"key":"721_CR24","doi-asserted-by":"publisher","first-page":"1784","DOI":"10.3390\/ijms19061784","volume":"6","author":"J Wahl","year":"2018","unstructured":"Wahl J, Smiesko M (2018) Endocrine disruption at the androgen receptor: employing molecular dynamics and docking for improved virtual screening and toxicity prediction. Int J Mol Sci 6:1784","journal-title":"Int J Mol Sci"},{"issue":"1","key":"721_CR25","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1021\/ci300514v","volume":"53","author":"H Li","year":"2013","unstructured":"Li H, Ren X, Leblanc E, Frewin K, Rennie PS, Cherkasov A (2013) Identification of novel androgen receptor antagonists using structure- and ligand-based methods. J Chem Inf Model 53(1):123\u2013130","journal-title":"J Chem Inf Model"},{"key":"721_CR26","doi-asserted-by":"publisher","first-page":"143","DOI":"10.1016\/j.ecoenv.2012.11.020","volume":"89","author":"X Wang","year":"2013","unstructured":"Wang X et al (2013) Docking and CoMSIA studies on steroids and non-steroidal chemicals as androgen receptor ligands. Ecotoxicol Environ Saf 89:143\u2013149","journal-title":"Ecotoxicol Environ Saf"},{"issue":"5","key":"721_CR27","doi-asserted-by":"publisher","first-page":"1839","DOI":"10.1021\/acs.jcim.8b00794","volume":"59","author":"F Grisoni","year":"2019","unstructured":"Grisoni F, Consonni V, Ballabio D (2019) Machine learning consensus to predict the binding to the androgen receptor within the CoMPARA project. J Chem Inf Model 59(5):1839\u20131848","journal-title":"J Chem Inf Model"},{"issue":"1","key":"721_CR28","doi-asserted-by":"publisher","first-page":"15887","DOI":"10.1038\/s41598-021-94707-2","volume":"11","author":"HJ Gim","year":"2021","unstructured":"Gim HJ, Park J, Jung ME, Houk KN (2021) Conformational dynamics of androgen receptors bound to agonists and antagonists. Sci Rep 11(1):15887","journal-title":"Sci Rep"},{"issue":"16","key":"721_CR29","doi-asserted-by":"publisher","first-page":"5847","DOI":"10.3390\/ijms21165847","volume":"21","author":"O Snow","year":"2020","unstructured":"Snow O, Lallous N, Ester M, Cherkasov A (2020) Deep learning modeling of androgen receptor responses to prostate cancer therapies. Int J Mol Sci 21(16):5847","journal-title":"Int J Mol Sci"},{"key":"721_CR30","doi-asserted-by":"publisher","DOI":"10.1039\/D0NJ04204H","author":"M Yuan","year":"2021","unstructured":"Yuan M, Cheng P, Zhang S (2021) Structure\u2013activity relationship analysis of a series of nonsteroidal analogues as androgen receptor antagonists. New J Chem. https:\/\/doi.org\/10.1039\/D0NJ04204H","journal-title":"New J Chem"},{"issue":"7880","key":"721_CR31","doi-asserted-by":"publisher","first-page":"348","DOI":"10.1038\/s41586-021-03922-4","volume":"598","author":"HA Elmarakeby","year":"2021","unstructured":"Elmarakeby HA et al (2021) Biologically informed deep neural network for prostate cancer discovery. Nature 598(7880):348\u2013352","journal-title":"Nature"},{"key":"721_CR32","doi-asserted-by":"publisher","first-page":"1044","DOI":"10.3389\/fphys.2019.01044","volume":"10","author":"G Idakwo","year":"2009","unstructured":"Idakwo G, Thangapandian S, Luttrell J, Zhou Z, Zhang C, Gong P (2009) Deep Learning-based structure-activity relationship modeling for multi-category toxicity classification: a case study of 10K Tox21 chemicals with high-throughput cell-based androgen receptor bioassay data. Front Physiol 10:1044","journal-title":"Front Physiol"},{"issue":"5","key":"721_CR33","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1186\/s12859-020-03588-1","volume":"21","author":"MS Yu","year":"2020","unstructured":"Yu MS, Lee J, Lee Y, Na D (2020) 2-D chemical structure image-based in silico model to predict agonist activity for androgen receptor. BMC Bioinform 21(5):245","journal-title":"BMC Bioinform"},{"issue":"D1","key":"721_CR34","doi-asserted-by":"publisher","first-page":"D930","DOI":"10.1093\/nar\/gky1075","volume":"47","author":"D Mendez","year":"2019","unstructured":"Mendez D et al (2019) ChEMBL: towards direct deposition of bioassay data. Nucleic Acids Res 47(D1):D930\u2013D940","journal-title":"Nucleic Acids Res"},{"key":"721_CR35","first-page":"237","volume":"56","author":"A Mauri","year":"2006","unstructured":"Mauri A, Consonni V, Pavan M, Todeschini R, Chemometrics M (2006) Dragon software: an easy approach to molecular descriptor calculations. Match Commun Mathemat Comput Chem 56:237\u2013248","journal-title":"Match Commun Mathemat Comput Chem"},{"key":"721_CR36","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-387-98141-3","volume-title":"Elegant Graphics for Data Analysis (Use R)","author":"H Wickham","year":"2009","unstructured":"Wickham H (2009) Elegant Graphics for Data Analysis (Use R), 1st edn. Springer, New York","edition":"1"},{"key":"721_CR37","volume-title":"RA Language and Environment for Statistical Computing: R Foundation for Statistical Computing","author":"RC. Team","year":"2021","unstructured":"RC. Team (2021) RA Language and Environment for Statistical Computing: R Foundation for Statistical Computing. Springer, Vienna"},{"issue":"7","key":"721_CR38","doi-asserted-by":"publisher","first-page":"1466","DOI":"10.1002\/jcc.21707","volume":"32","author":"CW Yap","year":"2011","unstructured":"Yap CW (2011) PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J Comput Chem 32(7):1466\u20131474","journal-title":"J Comput Chem"},{"issue":"1","key":"721_CR39","first-page":"57","volume":"17","author":"MF Sanner","year":"1999","unstructured":"Sanner MF (1999) Python: a programming language for software integration and development. J Mol Graph Model 17(1):57\u201361","journal-title":"J Mol Graph Model"},{"key":"721_CR40","doi-asserted-by":"publisher","first-page":"105704","DOI":"10.1016\/j.compbiomed.2022.105704","volume":"11","author":"P Charoenkwan","year":"2022","unstructured":"Charoenkwan P, Schaduangrat N, Moni MA, Manavalan B, Shoombuatong W (2022) SAPPHIRE: a stacking-based ensemble learning framework for accurate prediction of thermophilic proteins. Comput Biol Med 11:105704","journal-title":"Comput Biol Med"},{"issue":"1","key":"721_CR41","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-022-20143-5","volume":"12","author":"N Schaduangrat","year":"2022","unstructured":"Schaduangrat N, Anuwongcharoen N, Moni MA, Lio P, Charoenkwan P, Shoombuatong W (2022) StackPR is a new computational approach for large-scale identification of progesterone receptor antagonists using the stacking strategy. Sci Rep 12(1):1\u201316","journal-title":"Sci Rep"},{"key":"721_CR42","doi-asserted-by":"publisher","DOI":"10.1016\/j.compbiomed.2022.105700","volume":"148","author":"P Charoenkwan","year":"2022","unstructured":"Charoenkwan P, Schaduangrat N, Moni MA, Manavalan B, Shoombuatong W (2022) NEPTUNE: a novel computational approach for accurate and large-scale identification of tumor homing peptides. Comput Biol Med 148:105700","journal-title":"Comput Biol Med"},{"issue":"10","key":"721_CR43","doi-asserted-by":"publisher","first-page":"3028","DOI":"10.1093\/bioinformatics\/btaa131","volume":"36","author":"X Fu","year":"2020","unstructured":"Fu X, Cai L, Zeng X, Zou Q (2020) StackCPPred: a stacking and pairwise energy content-based prediction of cell-penetrating peptides and their uptake efficiency. Bioinformatics 36(10):3028\u20133034","journal-title":"Bioinformatics"},{"issue":"3","key":"721_CR44","doi-asserted-by":"publisher","first-page":"433","DOI":"10.1093\/bioinformatics\/bty653","volume":"35","author":"A Mishra","year":"2019","unstructured":"Mishra A, Pokhrel P, Hoque MT (2019) StackDPPred: a stacking based prediction of DNA-binding protein from sequence. Bioinformatics 35(3):433\u2013441","journal-title":"Bioinformatics"},{"key":"721_CR45","doi-asserted-by":"publisher","DOI":"10.7717\/peerj.1979","volume":"4","author":"S Simeon","year":"2016","unstructured":"Simeon S et al (2016) Origin of aromatase inhibitory activity via proteochemometric modeling. PeerJ 4:e1979","journal-title":"PeerJ"},{"key":"721_CR46","doi-asserted-by":"publisher","first-page":"51","DOI":"10.1016\/j.chemolab.2015.12.002","volume":"151","author":"S Simeon","year":"2016","unstructured":"Simeon S et al (2016) Unraveling the origin of splice switching activity of hemoglobin \u03b2-globin gene modulators via QSAR modeling. Chemom Intell Lab Syst 151:51\u201360","journal-title":"Chemom Intell Lab Syst"},{"issue":"45","key":"721_CR47","doi-asserted-by":"publisher","first-page":"28056","DOI":"10.1039\/C7RA02332D","volume":"7","author":"B Rasti","year":"2017","unstructured":"Rasti B, Schaduangrat N, Shahangian SS, Nantasenamat C (2017) Exploring the origin of phosphodiesterase inhibition via proteochemometric modeling. RSC Adv 7(45):28056\u201328068","journal-title":"RSC Adv"},{"issue":"10","key":"721_CR48","doi-asserted-by":"publisher","first-page":"1037","DOI":"10.1007\/s10822-021-00418-1","volume":"35","author":"AA Malik","year":"2021","unstructured":"Malik AA, Chotpatiwetchkul W, Phanus-Umporn C, Nantasenamat C, Charoenkwan P, Shoombuatong W (2021) StackHCV: a web-based integrative machine-learning framework for large-scale identification of hepatitis C virus NS5B inhibitors. J Comput Aided Mol Des 35(10):1037\u20131053","journal-title":"J Comput Aided Mol Des"},{"issue":"5","key":"721_CR49","doi-asserted-by":"publisher","first-page":"2749","DOI":"10.1109\/TCBB.2021.3102133","volume":"19","author":"M Arif","year":"2021","unstructured":"Arif M et al (2021) DeepCPPred: a deep learning framework for the discrimination of cell-penetrating peptides and their uptake efficiencies. IEEE\/ACM Trans Comput Biol Bioinf 19(5):2749\u20132759","journal-title":"IEEE\/ACM Trans Comput Biol Bioinf"},{"key":"721_CR50","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btab133","author":"P Charoenkwan","year":"2021","unstructured":"Charoenkwan P, Nantasenamat C, Hasan MM, Manavalan B, Shoombuatong W (2021) BERT4Bitter: a bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides. Bioinformatics. https:\/\/doi.org\/10.1093\/bioinformatics\/btab133","journal-title":"Bioinformatics"},{"key":"721_CR51","doi-asserted-by":"publisher","DOI":"10.1016\/j.chemolab.2021.104326","volume":"215","author":"F Ge","year":"2021","unstructured":"Ge F, Muhammad A, Yu D-J (2021) DeepnsSNPs: accurate prediction of non-synonymous single-nucleotide polymorphisms by combining multi-scale convolutional neural network and residue environment information. Chemom Intell Lab Syst 215:104326","journal-title":"Chemom Intell Lab Syst"},{"issue":"14","key":"721_CR52","doi-asserted-by":"publisher","first-page":"i37","DOI":"10.1093\/bioinformatics\/btx228","volume":"33","author":"M Habibi","year":"2017","unstructured":"Habibi M, Weber L, Neves M, Wiegandt DL, Leser U (2017) Deep learning with word embeddings improves biomedical named entity recognition. Bioinformatics 33(14):i37\u2013i48","journal-title":"Bioinformatics"},{"issue":"4","key":"721_CR53","doi-asserted-by":"publisher","first-page":"1057","DOI":"10.1093\/bioinformatics\/btz721","volume":"36","author":"F Li","year":"2020","unstructured":"Li F et al (2020) DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites. Bioinformatics 36(4):1057\u20131065","journal-title":"Bioinformatics"},{"issue":"24","key":"721_CR54","doi-asserted-by":"publisher","first-page":"5128","DOI":"10.1093\/bioinformatics\/btz464","volume":"35","author":"Q Shi","year":"2019","unstructured":"Shi Q et al (2019) DNN-Dom: predicting protein domain boundary from sequence alone by deep neural network. Bioinformatics 35(24):5128\u20135136","journal-title":"Bioinformatics"},{"key":"721_CR55","first-page":"124","volume":"14","author":"Y Zhang","year":"2015","unstructured":"Zhang Y, Wallace B (2015) A sensitivity analysis of (and practitioners\u2019 guide to) convolutional neural networks for sentence classification. arXiv preprint 14:124","journal-title":"arXiv preprint"},{"issue":"3","key":"721_CR56","doi-asserted-by":"publisher","first-page":"140","DOI":"10.1121\/1.4865840","volume":"135","author":"M Azadpour","year":"2014","unstructured":"Azadpour M, McKay CM, Smith RL (2014) Estimating confidence intervals for information transfer analysis of confusion matrices. J Acoustical Soc Am 135(3):140\u2013146","journal-title":"J Acoustical Soc Am"},{"issue":"1","key":"721_CR57","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41598-022-11897-z","volume":"12","author":"P Charoenkwan","year":"2022","unstructured":"Charoenkwan P et al (2022) AMYPred-FRL is a novel approach for accurate prediction of amyloid proteins by using feature representation learning. Sci Rep 12(1):1\u201314","journal-title":"Sci Rep"},{"key":"721_CR58","first-page":"401","volume":"1","author":"MF Sanner","year":"1999","unstructured":"Sanner MF, Duncan BS, Carrillo CJ, Olson AJ (1999) Integrating computation and visualization for biomolecular analysis: an example using python and AVS. Pac Symp Biocomput 1:401\u201312","journal-title":"Pac Symp Biocomput"},{"key":"721_CR59","doi-asserted-by":"publisher","first-page":"33","DOI":"10.1186\/1758-2946-3-33","volume":"3","author":"NM O'Boyle","year":"2011","unstructured":"O\u2019Boyle NM, Banck M, James CA, Morley C, Vandermeersch T, Hutchison GR (2011) Open babel: an open chemical toolbox. J Cheminform 3:33","journal-title":"J Cheminform"},{"issue":"8","key":"721_CR60","doi-asserted-by":"publisher","first-page":"3891","DOI":"10.1021\/acs.jcim.1c00203","volume":"61","author":"J Eberhardt","year":"2021","unstructured":"Eberhardt J, Santos-Martins D, Tillack AF, Forli S (2021) AutoDock vina 1.2.0: new docking methods, expanded force field, and python bindings. J Chem Inf Model 61(8):3891\u20133898","journal-title":"J Chem Inf Model"},{"issue":"W1","key":"721_CR61","doi-asserted-by":"publisher","first-page":"W443","DOI":"10.1093\/nar\/gkv315","volume":"43","author":"S Salentin","year":"2015","unstructured":"Salentin S, Schreiber S, Haupt VJ, Adasme MF, Schroeder M (2015) PLIP: fully automated protein-ligand interaction profiler. Nucleic Acids Res 43(W1):W443\u2013W447","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"721_CR62","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1016\/S0169-409X(00)00129-0","volume":"46","author":"CA Lipinski","year":"2001","unstructured":"Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev 46(1):3\u201326","journal-title":"Adv Drug Deliv Rev"},{"key":"721_CR63","unstructured":"LSI Lundberg SM, A unified approach to interpreting model predictions. presented at the 31st Conference on neural information processing systems, long beach, CA, USA, 2017. https:\/\/proceedings.neurips.cc\/paper\/2017\/hash\/8a20a8621978632d76c43dfd28b67767-Abstract.html"},{"issue":"9","key":"721_CR64","doi-asserted-by":"publisher","first-page":"16852","DOI":"10.3390\/molecules200916852","volume":"20","author":"P Martins","year":"2015","unstructured":"Martins P et al (2015) Heterocyclic anticancer compounds: recent advances and the paradigm shift towards the use of nanomedicine\u2019s tool box. Molecules 20(9):16852\u201316891","journal-title":"Molecules"},{"key":"721_CR65","doi-asserted-by":"publisher","DOI":"10.1016\/j.ejmech.2016.09.023","author":"J Akhtar","year":"2017","unstructured":"Akhtar J, Khan AA, Ali Z, Haider R, Shahar Yar M (2017) Structure-activity relationship (SAR) study and design strategies of nitrogen-containing heterocyclic moieties for their anticancer activities. Eur J Med Chem. https:\/\/doi.org\/10.1016\/j.ejmech.2016.09.023","journal-title":"Eur J Med Chem"},{"issue":"2","key":"721_CR66","doi-asserted-by":"publisher","first-page":"320","DOI":"10.1039\/B610213C","volume":"37","author":"S Purser","year":"2008","unstructured":"Purser S, Moore PR, Swallow S, Gouverneur V (2008) Fluorine in medicinal chemistry. Chem Soc Rev 37(2):320\u2013330","journal-title":"Chem Soc Rev"},{"issue":"18","key":"721_CR67","doi-asserted-by":"publisher","first-page":"1481","DOI":"10.1002\/cmdc.201700356","volume":"12","author":"MF Sowaileh","year":"2017","unstructured":"Sowaileh MF, Hazlitt RA, Colby DA (2017) Application of the pentafluorosulfanyl group as a bioisosteric replacement. ChemMedChem 12(18):1481\u20131490","journal-title":"ChemMedChem"},{"key":"721_CR68","unstructured":"Tucker N. (2022). FDA grants fast track designation to novel AR inhibitor for AR+\/ER+\/HER- mBC. https:\/\/www.targetedonc.com\/view\/fda-grants-fast-track-designation-to-novel-ar-inhibitor-for-ar-er-her--mbc"},{"key":"721_CR69","doi-asserted-by":"publisher","first-page":"230","DOI":"10.1016\/j.ejmech.2016.04.052","volume":"118","author":"M Bassetto","year":"2016","unstructured":"Bassetto M et al (2016) Design and synthesis of novel bicalutamide and enzalutamide derivatives as antiproliferative agents for the treatment of prostate cancer. Eur J Med Chem 118:230\u2013243","journal-title":"Eur J Med Chem"},{"issue":"7","key":"721_CR70","doi-asserted-by":"publisher","first-page":"2779","DOI":"10.1021\/jm901488g","volume":"53","author":"ME Jung","year":"2010","unstructured":"Jung ME et al (2010) Structure-activity relationship for thiohydantoin androgen receptor antagonists for castration-resistant prostate cancer (CRPC). J Med Chem 53(7):2779\u20132796","journal-title":"J Med Chem"},{"issue":"15","key":"721_CR71","doi-asserted-by":"publisher","first-page":"3636","DOI":"10.1016\/j.bmcl.2016.06.001","volume":"26","author":"S Ferla","year":"2016","unstructured":"Ferla S et al (2016) Rational design and synthesis of novel anti-prostate cancer agents bearing a 3,5-bis-trifluoromethylphenyl moiety. Bioorg Med Chem Lett 26(15):3636\u20133640","journal-title":"Bioorg Med Chem Lett"},{"key":"721_CR72","doi-asserted-by":"publisher","first-page":"24","DOI":"10.18433\/jpps32417","volume":"25","author":"E Mateev","year":"2022","unstructured":"Mateev E, Georgieva M, Zlatkov A (2022) Pyrrole as an important scaffold of anticancer drugs: recent advances. J Pharm Pharm Sci 25:24\u201340","journal-title":"J Pharm Pharm Sci"},{"key":"721_CR73","doi-asserted-by":"publisher","DOI":"10.3390\/ph14090893","author":"M Bianco","year":"2021","unstructured":"Bianco M, Marinho D, Hoelz LVB, Bastos MM, Boechat N (2021) Pyrroles as privileged scaffolds in the search for new potential HIV inhibitors. Pharmaceuticals. https:\/\/doi.org\/10.3390\/ph14090893","journal-title":"Pharmaceuticals"},{"key":"721_CR74","doi-asserted-by":"publisher","first-page":"172943","DOI":"10.1016\/j.ejphar.2020.172943","volume":"871","author":"P Olszewska","year":"2020","unstructured":"Olszewska P, Cal D, Zagorski P, Mikiciuk-Olasik E (2020) A novel trifluoromethyl 2-phosphonopyrrole analogue inhibits human cancer cell migration and growth by cell cycle arrest at G1 phase and apoptosis. Eur J Pharmacol 871:172943","journal-title":"Eur J Pharmacol"},{"issue":"5","key":"721_CR75","doi-asserted-by":"publisher","first-page":"547","DOI":"10.1080\/1061186X.2019.1703189","volume":"28","author":"H Kuznietsova","year":"2020","unstructured":"Kuznietsova H et al (2020) Pyrrole derivatives as potential anti-cancer therapeutics: synthesis, mechanisms of action, safety. J Drug Target 28(5):547\u2013563","journal-title":"J Drug Target"},{"key":"721_CR76","doi-asserted-by":"publisher","first-page":"9","DOI":"10.1016\/j.bbrc.2022.07.042","volume":"623","author":"D Funakoshi","year":"2022","unstructured":"Funakoshi D et al (2022) Antitumor effects of pyrrole-imidazole polyamide modified with alkylating agent on prostate cancer cells. Biochem Biophys Res Commun 623:9\u201316","journal-title":"Biochem Biophys Res Commun"},{"issue":"9","key":"721_CR77","doi-asserted-by":"publisher","first-page":"2207","DOI":"10.1158\/0008-5472.CAN-16-2503","volume":"77","author":"AA Kurmis","year":"2017","unstructured":"Kurmis AA, Yang F, Welch TR, Nickols NG, Dervan PB (2017) A pyrrole-imidazole polyamide is active against enzalutamide-resistant prostate cancer. Cancer Res 77(9):2207\u20132212","journal-title":"Cancer Res"},{"key":"721_CR78","doi-asserted-by":"publisher","first-page":"113685","DOI":"10.1016\/j.ejmech.2021.113685","volume":"224","author":"MK Tiwari","year":"2021","unstructured":"Tiwari MK et al (2021) Novel halogenated arylvinyl-1,2,4 trioxanes as potent antiplasmodial as well as anticancer agents: synthesis, bioevaluation, structure-activity relationship and in-silico studies. Eur J Med Chem. 224:113685","journal-title":"Eur J Med Chem."},{"issue":"9","key":"721_CR79","first-page":"257","volume":"7","author":"VF Zohra Benfodda","year":"2017","unstructured":"Zohra Benfodda VF, Henriquet C, Fattorusso C, Cebri\u00e1n-Torrej\u00f3n G, Persico M, Di Dato A, Menna M, Blancou H, Fajas L (2017) Synthesis, anticancer activity and computational sar analysis of acylsulfonylpiperazines derivatives. Med Chem 7(9):257\u2013267","journal-title":"Med Chem"},{"key":"721_CR80","doi-asserted-by":"publisher","DOI":"10.3390\/ijms23158271","author":"P Gonzalez-Berdullas","year":"2022","unstructured":"Gonzalez-Berdullas P et al (2022) Discovery of the anticancer activity for lung and gastric cancer of a brominated coelenteramine analog. Int J Mol Sci. https:\/\/doi.org\/10.3390\/ijms23158271","journal-title":"Int J Mol Sci"},{"key":"721_CR81","doi-asserted-by":"publisher","first-page":"9","DOI":"10.3390\/biomedicines9091199","volume":"9","author":"CM Magalhaes","year":"2021","unstructured":"Magalhaes CM et al (2021) Target-oriented synthesis of marine coelenterazine derivatives with anticancer activity by applying the heavy-atom effect. Biomedicines 9:9","journal-title":"Biomedicines"},{"key":"721_CR82","doi-asserted-by":"publisher","first-page":"8","DOI":"10.3390\/biom9080384","volume":"9","author":"L Pinto da Silva","year":"2019","unstructured":"Pinto da Silva L et al (2019) Study of the combination of self-activating photodynamic therapy and chemotherapy for cancer treatment. Biomolecules 9:8","journal-title":"Biomolecules"},{"issue":"2","key":"721_CR83","doi-asserted-by":"publisher","first-page":"64","DOI":"10.1021\/ci00046a002","volume":"25","author":"RE Carhart","year":"1985","unstructured":"Carhart RE, Smith DH, Venkataraghavan R (1985) Atom pairs as molecular features in structure-activity studies: definition and applications. J Chem Inform Comput Sci 25(2):64\u201373","journal-title":"J Chem Inform Comput Sci"},{"issue":"2","key":"721_CR84","doi-asserted-by":"publisher","first-page":"493","DOI":"10.1021\/ci025584y","volume":"43","author":"C Steinbeck","year":"2003","unstructured":"Steinbeck C, Han Y, Kuhn S, Horlacher O, Luttmann E, Willighagen E (2003) The chemistry development kit (CDK): an open-source java library for chemo- and bioinformatics. J Chem Inform Comput Sci 43(2):493\u2013500","journal-title":"J Chem Inform Comput Sci"},{"issue":"1","key":"721_CR85","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13321-016-0187-6","volume":"9","author":"EL Willighagen","year":"2017","unstructured":"Willighagen EL et al (2017) The chemistry development kit (CDK) v2. 0: atom typing, depiction, molecular formulas, and substructure searching. J Cheminform 9(1):1\u201319","journal-title":"J Cheminform"},{"issue":"6","key":"721_CR86","doi-asserted-by":"publisher","first-page":"1039","DOI":"10.1021\/ci00028a014","volume":"35","author":"LH Hall","year":"1995","unstructured":"Hall LH, Kier LB (1995) Electrotopological state indices for atom types: a novel combination of electronic, topological, and valence state information. J Chem Inform Comput Sci 35(6):1039\u20131045","journal-title":"J Chem Inform Comput Sci"},{"issue":"21","key":"721_CR87","doi-asserted-by":"publisher","first-page":"2518","DOI":"10.1093\/bioinformatics\/btn479","volume":"24","author":"J Klekota","year":"2008","unstructured":"Klekota J, Roth FP (2008) Chemical substructures that enrich for biological activity. Bioinformatics 24(21):2518\u20132525","journal-title":"Bioinformatics"},{"issue":"6","key":"721_CR88","doi-asserted-by":"publisher","first-page":"1273","DOI":"10.1021\/ci010132r","volume":"42","author":"JL Durant","year":"2002","unstructured":"Durant JL, Leland BA, Henry DR, Nourse JG (2002) Reoptimization of MDL Keys for use in drug discovery. J Chem Inform Comput Sci 42(6):1273\u20131280","journal-title":"J Chem Inform Comput Sci"},{"issue":"D1","key":"721_CR89","doi-asserted-by":"publisher","first-page":"D1202","DOI":"10.1093\/nar\/gkv951","volume":"44","author":"S Kim","year":"2016","unstructured":"Kim S et al (2016) PubChem substance and compound databases. Nucleic Acids Res 44(D1):D1202\u2013D1213","journal-title":"Nucleic Acids Res"},{"key":"721_CR90","unstructured":"Laggner C, SMARTS patterns for functional group classification. 2005"}],"container-title":["Journal of Cheminformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00721-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13321-023-00721-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13321-023-00721-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,6]],"date-time":"2023-05-06T10:07:16Z","timestamp":1683367636000},"score":1,"resource":{"primary":{"URL":"https:\/\/jcheminf.biomedcentral.com\/articles\/10.1186\/s13321-023-00721-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,5,6]]},"references-count":90,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["721"],"URL":"https:\/\/doi.org\/10.1186\/s13321-023-00721-z","relation":{},"ISSN":["1758-2946"],"issn-type":[{"value":"1758-2946","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,5,6]]},"assertion":[{"value":"26 December 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 April 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 May 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical approval and consent to participate"}},{"value":"The authors declare no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"50"}}