{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,2]],"date-time":"2025-08-02T14:53:13Z","timestamp":1754146393229,"version":"3.41.2"},"reference-count":67,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2025,7,18]],"date-time":"2025-07-18T00:00:00Z","timestamp":1752796800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Bioinform."],"abstract":"<jats:sec><jats:title>Introduction<\/jats:title><jats:p>Intrinsically disordered regions (IDRs) of proteins have traditionally been overlooked as drug targets. However, with growing recognition of their crucial role in biological activity and their involvement in various diseases, IDRs have emerged as promising targets for drug discovery. Despite this potential, rational methodologies for IDR-targeted drug discovery remain underdeveloped, primarily due to a lack of reference experimental data.<\/jats:p><\/jats:sec><jats:sec><jats:title>Methods<\/jats:title><jats:p>This study explores a machine learning approach to predict IDR functions, drug interaction sites, and interacting molecular substructures within IDR sequences. To address the data gap, stepwise transfer learning was employed. IDRdecoder sequentially generate predictions for IDR classification, interaction sites, and interacting ligand substructures. In the first step, the neural net was trained as autoencoder by using 26,480,862 predicted IDR sequences. Then it was trained against 57,692 ligand-binding PDB sequences with higher IDR tendency via transfer learning for predict ligand interacting sites and ligand types.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>IDRdecoder was evaluated against 9 IDR sequences, which were experimentally detailed as drug targets. In the encoding space, specific GO terms related to the hypothesized functions of the evaluation IDR sequences were highly enriched. The model\u2019s prediction performance for drug interacting sites and ligand types demonstrated the area under the curve (AUC) of 0.616 and 0.702, respectively. The performance was compared with existing methods including ProteinBERT, and IDRdecoder demonstrated moderately improved performance.<\/jats:p><\/jats:sec><jats:sec><jats:title>Discussion<\/jats:title><jats:p>IDRdecoder is the first application for predicting drug interaction sites and ligands in IDR sequences. Analysis of the prediction results revealed characteristics beneficial for IDR-drug design; for instance, Tyr and Ala are preferred target sites, while flexible substructures, such as alkyl groups, are favored in ligand molecules.<\/jats:p><\/jats:sec>","DOI":"10.3389\/fbinf.2025.1627836","type":"journal-article","created":{"date-parts":[[2025,7,18]],"date-time":"2025-07-18T05:30:43Z","timestamp":1752816643000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["IDRdecoder: a machine learning approach for rational drug discovery toward intrinsically disordered regions"],"prefix":"10.3389","volume":"5","author":[{"given":"Clara","family":"Shionyu-Mitusyama","sequence":"first","affiliation":[]},{"given":"Satoshi","family":"Ohmori","sequence":"additional","affiliation":[]},{"given":"Subaru","family":"Hirata","sequence":"additional","affiliation":[]},{"given":"Hirokazu","family":"Ishida","sequence":"additional","affiliation":[]},{"given":"Tsuyoshi","family":"Shirai","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2025,7,18]]},"reference":[{"key":"B1","article-title":"Tensorflow: a system for large-scale machine learning","author":"Abadi","year":"2016"},{"key":"B2","doi-asserted-by":"publisher","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol."},{"key":"B3","doi-asserted-by":"publisher","first-page":"1487","DOI":"10.1016\/j.csbj.2023.02.018","article-title":"Computational prediction of disordered binding regions","volume":"21","author":"Basu","year":"2023","journal-title":"Comput. Struct. Biotechnol. J."},{"key":"B4","doi-asserted-by":"publisher","first-page":"90","DOI":"10.1016\/j.cbpa.2021.02.009","article-title":"Intrinsically disordered proteins and biomolecular condensates as drug targets","volume":"62","author":"Biesaga","year":"2021","journal-title":"Curr. Opin. Chem. Biol."},{"key":"B5","doi-asserted-by":"publisher","first-page":"2102","DOI":"10.1093\/bioinformatics\/btac020","article-title":"ProteinBERT: a universal deep-learning model of protein sequence and function","volume":"38","author":"Brandes","year":"2022","journal-title":"Bioinformatics"},{"key":"B6","doi-asserted-by":"publisher","first-page":"956","DOI":"10.2174\/092986608785849164","article-title":"TOP-IDP-scale: a new amino acid scale measuring propensity for intrinsic disorder","volume":"15","author":"Campen","year":"2008","journal-title":"Protein Pept. Lett."},{"key":"B7","doi-asserted-by":"publisher","first-page":"3438","DOI":"10.1016\/j.molcel.2023.08.027","article-title":"Hormone-induced enhancer assembly requires an optimal level of hormone receptor multivalent interactions","volume":"83","author":"Chen","year":"2023","journal-title":"Mol. Cell"},{"key":"B8","doi-asserted-by":"publisher","first-page":"985022","DOI":"10.3389\/fmolb.2022.985022","article-title":"Prediction of protein-protein interaction sites in intrinsically disordered proteins","volume":"9","author":"Chen","year":"2022","journal-title":"Front. Mol. Biosci."},{"key":"B9","doi-asserted-by":"publisher","first-page":"2437","DOI":"10.1073\/pnas.74.6.2437","article-title":"Immunoflourescent staining of cytoplasmic and spindle microtubules in mouse fibroblasts with antibody to tau protein","volume":"74","author":"Connolly","year":"1977","journal-title":"Proc. Natl. Acad. Sci. U. S. A."},{"key":"B10","doi-asserted-by":"publisher","first-page":"41578","DOI":"10.1074\/jbc.M111.285957","article-title":"Disordered binding of small molecules to a\u03b2(12\u201328)","volume":"286","author":"Convertino","year":"2011","journal-title":"J. Biol. Chem."},{"key":"B11","doi-asserted-by":"publisher","first-page":"2027","DOI":"10.3390\/molecules22122027","article-title":"Intrinsic disorder in proteins with pathogenic repeat expansions","volume":"22","author":"Darling","year":"2017","journal-title":"Molecules"},{"key":"B12","doi-asserted-by":"publisher","first-page":"9964","DOI":"10.1038\/srep09964","article-title":"Tau co-organizes dynamic microtubule and actin networks","volume":"5","author":"Elie","year":"2015","journal-title":"Sci. Rep."},{"key":"B13","doi-asserted-by":"publisher","first-page":"1149","DOI":"10.1016\/j.chembiol.2008.09.011","article-title":"Structural rationale for the coupled binding and unfolding of the c-Myc oncoprotein by small molecules","volume":"15","author":"Follis","year":"2008","journal-title":"Chem. Biol."},{"key":"B14","doi-asserted-by":"publisher","first-page":"11929","DOI":"10.1093\/nar\/gkaa912","article-title":"The role of FACT in managing chromatin: disruption, assembly, or repair?","volume":"48","author":"Formosa","year":"2020","journal-title":"Nucleic Acids Res."},{"key":"B15","doi-asserted-by":"publisher","first-page":"iyad031","DOI":"10.1093\/genetics\/iyad031","article-title":"The gene ontology knowledgebase in 2023","volume":"224","author":"Gene Ontology","year":"2023","journal-title":"Genetics"},{"key":"B16","doi-asserted-by":"publisher","first-page":"E5498","DOI":"10.1073\/pnas.1412070111","article-title":"Structural basis and dynamics of multidrug recognition in a minimal bacterial multidrug resistance system","volume":"111","author":"Habazettl","year":"2014","journal-title":"Proc. Natl. Acad. Sci. U. S. A."},{"key":"B17","doi-asserted-by":"publisher","first-page":"127","DOI":"10.1038\/s41573-022-00571-8","article-title":"Drugging p53 in cancer: one protein, many targets","volume":"22","author":"Hassin","year":"2023","journal-title":"Nat. Rev. Drug Discov."},{"key":"B18","doi-asserted-by":"publisher","first-page":"216","DOI":"10.3389\/fnins.2019.00216","article-title":"Alpha-synuclein continues to enhance SNARE-dependent vesicle docking at exorbitant concentrations","volume":"13","author":"Hawk","year":"2019","journal-title":"Front. Neurosci."},{"key":"B19","doi-asserted-by":"publisher","first-page":"8541","DOI":"10.1038\/s41598-017-08902-1","article-title":"Decoding disease-causing mechanisms of missense mutations from supramolecular structures","volume":"7","author":"Hijikata","year":"2017","journal-title":"Sci. Rep."},{"key":"B20","doi-asserted-by":"publisher","first-page":"504","DOI":"10.1126\/science.1127647","article-title":"Reducing the dimensionality of data with neural networks","volume":"313","author":"Hinton","year":"2006","journal-title":"Science"},{"key":"B21","doi-asserted-by":"publisher","first-page":"15686","DOI":"10.1038\/srep15686","article-title":"Discovery of small molecules that inhibit the disordered protein, p27(kip1)","volume":"5","author":"Iconaru","year":"2015","journal-title":"Sci. Rep."},{"key":"B22","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1186\/s43094-020-00047-9","article-title":"Exploring different approaches to improve the success of drug discovery and development projects: a review","volume":"6","author":"Kiriiri","year":"2020","journal-title":"Future J. Pharm. Sci."},{"key":"B23","doi-asserted-by":"publisher","first-page":"558","DOI":"10.1038\/nchembio.1528","article-title":"Targeting the disordered C terminus of PTP1B with an allosteric inhibitor","volume":"10","author":"Krishnan","year":"2014","journal-title":"Nat. Chem. Biol."},{"key":"B24","doi-asserted-by":"publisher","first-page":"1149","DOI":"10.1126\/science.abn7229","article-title":"Bacteria require phase separation for fitness in the mammalian gut","volume":"379","author":"Krypotou","year":"2023","journal-title":"Science"},{"key":"B25","doi-asserted-by":"publisher","first-page":"237","DOI":"10.3389\/fnmol.2019.00237","article-title":"Voltage-gated calcium channels and alpha-synuclein: implications in Parkinson's disease","volume":"12","author":"Leandrou","year":"2019","journal-title":"Front. Mol. Neurosci."},{"key":"B26","doi-asserted-by":"publisher","first-page":"1522","DOI":"10.3390\/molecules25071522","article-title":"Disordered residues and patterns in the protein Data Bank","volume":"25","author":"Lobanov","year":"2020","journal-title":"Molecules"},{"key":"B27","doi-asserted-by":"publisher","first-page":"1250","DOI":"10.1021\/jacsau.3c00579","article-title":"Interplay of alpha-synuclein with lipid membranes: cooperative adsorption, membrane remodeling and coaggregation","volume":"4","author":"Makasewicz","year":"2024","journal-title":"JACS Au"},{"key":"B28","doi-asserted-by":"publisher","first-page":"1738","DOI":"10.1093\/bioinformatics\/btv060","article-title":"Computational identification of MoRFs in protein sequences","volume":"31","author":"Malhis","year":"2015","journal-title":"Bioinformatics"},{"key":"B29","doi-asserted-by":"publisher","first-page":"1027","DOI":"10.1111\/j.1348-0421.1999.tb01232.x","article-title":"Identification of a novel DNA-binding protein from Mycobacterium bovis bacillus Calmette-Guerin","volume":"43","author":"Matsumoto","year":"1999","journal-title":"Microbiol. Immunol."},{"key":"B30","doi-asserted-by":"publisher","first-page":"10183","DOI":"10.1038\/s41598-019-46617-7","article-title":"Structural visualization of key steps in nucleosome reorganization by human FACT","volume":"9","author":"Mayanagi","year":"2019","journal-title":"Sci. Rep."},{"key":"B31","doi-asserted-by":"publisher","first-page":"W329","DOI":"10.1093\/nar\/gky384","article-title":"IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding","volume":"46","author":"Meszaros","year":"2018","journal-title":"Nucleic Acids Res."},{"key":"B32","doi-asserted-by":"publisher","first-page":"39732","DOI":"10.1038\/srep39732","article-title":"Identification of a drug targeting an intrinsically disordered protein involved in pancreatic adenocarcinoma","volume":"7","author":"Neira","year":"2017","journal-title":"Sci. Rep."},{"key":"B33","doi-asserted-by":"publisher","first-page":"816","DOI":"10.1093\/nar\/gkad1149","article-title":"Dynamic action of an intrinsically disordered protein in DNA compaction that induces mycobacterial dormancy","volume":"52","author":"Nishiyama","year":"2024","journal-title":"Nucleic Acids Res."},{"key":"B34","doi-asserted-by":"publisher","first-page":"553","DOI":"10.1146\/annurev-biochem-072711-164947","article-title":"Intrinsically disordered proteins and intrinsically disordered protein regions","volume":"83","author":"Oldfield","year":"2014","journal-title":"Annu. Rev. Biochem."},{"key":"B35","doi-asserted-by":"publisher","first-page":"D733","DOI":"10.1093\/nar\/gkv1189","article-title":"Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation","volume":"44","author":"O'Leary","year":"2016","journal-title":"Nucleic Acids Res."},{"key":"B36","doi-asserted-by":"publisher","first-page":"428","DOI":"10.1038\/nrd3405","article-title":"The productivity crisis in pharmaceutical R&D","volume":"10","author":"Pammolli","year":"2011","journal-title":"Nat. Rev. Drug Discov."},{"key":"B37","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1186\/s13059-021-02322-1","article-title":"Mechanisms of enhancer action: the known and the unknown","volume":"22","author":"Panigrahi","year":"2021","journal-title":"Genome Biol."},{"key":"B38","doi-asserted-by":"publisher","first-page":"10165","DOI":"10.1523\/JNEUROSCI.1146-13.2013","article-title":"Amyloid precursor proteins interact with the heterotrimeric G protein Go in the control of neuronal migration","volume":"33","author":"Ramaker","year":"2013","journal-title":"J. Neurosci."},{"key":"B39","doi-asserted-by":"publisher","first-page":"217","DOI":"10.1016\/j.drudis.2018.09.017","article-title":"Targeting intrinsically disordered proteins at the edge of chaos","volume":"24","author":"Ruan","year":"2019","journal-title":"Drug Discov. Today"},{"key":"B40","doi-asserted-by":"publisher","first-page":"3004","DOI":"10.1039\/d0sc04670a","article-title":"Computational strategy for intrinsically disordered protein ligand design leads to the discovery of p53 transactivation domain I binding compounds that activate the p53 pathway","volume":"12","author":"Ruan","year":"2020","journal-title":"Chem. Sci."},{"key":"B41","doi-asserted-by":"publisher","first-page":"551","DOI":"10.1080\/17460441.2020.1732920","article-title":"Discovery of drugs that directly target the intrinsically disordered region of the androgen receptor","volume":"15","author":"Sadar","year":"2020","journal-title":"Expert Opin. Drug Discov."},{"key":"B42","doi-asserted-by":"publisher","first-page":"379","DOI":"10.1016\/j.jmb.2012.10.001","article-title":"Classification of ligand molecules in PDB with fast heuristic graph match algorithm COMPLIG","volume":"424","author":"Saito","year":"2012","journal-title":"J. Mol. Biol."},{"key":"B43","doi-asserted-by":"publisher","first-page":"1695","DOI":"10.1007\/s00018-019-03347-3","article-title":"Targeting intrinsically disordered proteins involved in cancer","volume":"77","author":"Santofimia-Castano","year":"2020","journal-title":"Cell Mol. Life Sci."},{"key":"B44","doi-asserted-by":"publisher","first-page":"9729","DOI":"10.1021\/acsomega.2c07708","article-title":"Fuzzy drug targets: disordered proteins in the drug-discovery realm","volume":"8","author":"Saurabh","year":"2023","journal-title":"ACS Omega"},{"key":"B45","doi-asserted-by":"publisher","first-page":"D20","DOI":"10.1093\/nar\/gkab1112","article-title":"Database resources of the national center for biotechnology information","volume":"50","author":"Sayers","year":"2022","journal-title":"Nucleic Acids Res."},{"key":"B46","doi-asserted-by":"publisher","first-page":"e11101","DOI":"10.1371\/journal.pone.0011101","article-title":"Complete phenotypic recovery of an Alzheimer's disease model by a quinone-tryptophan hybrid aggregation inhibitor","volume":"5","author":"Scherzer-Attali","year":"2010","journal-title":"PLoS One"},{"key":"B47","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1093\/jb\/mvx056","article-title":"Common molecular pathogenesis of disease-related intrinsically disordered proteins revealed by NMR analysis","volume":"163","author":"Shigemitsu","year":"2018","journal-title":"J. Biochem."},{"key":"B48","doi-asserted-by":"publisher","first-page":"400","DOI":"10.1021\/ci6002202","article-title":"Use of amino acid composition to predict ligand-binding sites","volume":"47","author":"Soga","year":"2007","journal-title":"J. Chem. Inf. Model"},{"key":"B49","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1186\/s40478-016-0310-y","article-title":"Fasudil attenuates aggregation of alpha-synuclein in models of Parkinson's disease","volume":"4","author":"Tatenhorst","year":"2016","journal-title":"Acta Neuropathol. Commun."},{"key":"B50","volume-title":"R: a language and environment for statistical computing","author":"Team","year":"2007"},{"key":"B51","doi-asserted-by":"publisher","first-page":"781635","DOI":"10.3389\/fmolb.2021.781635","article-title":"Explore protein conformational space with variational autoencoder","volume":"8","author":"Tian","year":"2021","journal-title":"Front. Mol. Biosci."},{"key":"B52","doi-asserted-by":"publisher","first-page":"419","DOI":"10.1016\/j.sbi.2011.03.012","article-title":"Unstructural biology coming of age","volume":"21","author":"Tompa","year":"2011","journal-title":"Curr. Opin. Struct. Biol."},{"key":"B53","doi-asserted-by":"publisher","first-page":"e87133","DOI":"10.1371\/journal.pone.0087133","article-title":"Targeting the intrinsically disordered structural ensemble of alpha-synuclein by small molecules as a potential therapeutic strategy for Parkinson's disease","volume":"9","author":"Toth","year":"2014","journal-title":"PLoS One"},{"key":"B54","doi-asserted-by":"publisher","first-page":"256","DOI":"10.3389\/fnagi.2019.00256","article-title":"The evolution of tau phosphorylation and interactions","volume":"11","author":"Trushina","year":"2019","journal-title":"Front. Aging Neurosci."},{"key":"B55","doi-asserted-by":"publisher","first-page":"3380","DOI":"10.3390\/ijms24043380","article-title":"Aggregation of disordered proteins associated with neurodegeneration","volume":"24","author":"Tsoi","year":"2023","journal-title":"Int. J. Mol. Sci."},{"key":"B56","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1146\/annurev.biophys.37.032807.125924","article-title":"Intrinsically disordered proteins in human diseases: introducing the D2 concept","volume":"37","author":"Uversky","year":"2008","journal-title":"Annu. Rev. Biophys."},{"key":"B57","doi-asserted-by":"publisher","first-page":"e1002709","DOI":"10.1371\/journal.pcbi.1002709","article-title":"Disease-associated mutations disrupt functionally important regions of intrinsic protein disorder","volume":"8","author":"Vacic","year":"2012","journal-title":"PLoS Comput. Biol."},{"key":"B58","doi-asserted-by":"publisher","first-page":"1111","DOI":"10.1038\/nsmb.3321","article-title":"Large-scale ATP-independent nucleosome unfolding by a histone chaperone","volume":"23","author":"Valieva","year":"2016","journal-title":"Nat. Struct. Mol. Biol."},{"key":"B59","article-title":"Python 3 reference manual","author":"Van Rossum","year":"2009"},{"key":"B60","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1038\/s41592-019-0686-2","article-title":"SciPy 1.0: fundamental algorithms for scientific computing in Python","volume":"17","author":"Virtanen","year":"2020","journal-title":"Nat. Methods"},{"key":"B61","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1124\/jpet.110.171751","article-title":"Productivity shortfalls in drug discovery: contributions from the preclinical sciences?","volume":"336","author":"Williams","year":"2011","journal-title":"J. Pharmacol. Exp. Ther."},{"key":"B62","doi-asserted-by":"publisher","first-page":"D520","DOI":"10.1093\/nar\/gky949","article-title":"Protein Data Bank: the single global archive for 3D macromolecular structure data","volume":"47","author":"Berman","year":"2019","journal-title":"Nucleic Acids Res."},{"key":"B63","doi-asserted-by":"publisher","first-page":"2642","DOI":"10.1093\/bioinformatics\/bty178","article-title":"Learned protein embeddings for machine learning","volume":"34","author":"Yang","year":"2018","journal-title":"Bioinformatics"},{"key":"B64","doi-asserted-by":"publisher","first-page":"22298","DOI":"10.1038\/srep22298","article-title":"Structure-based inhibitor design for the intrinsically disordered protein c-myc","volume":"6","author":"Yu","year":"2016","journal-title":"Sci. Rep."},{"key":"B65","doi-asserted-by":"publisher","first-page":"bbab521","DOI":"10.1093\/bib\/bbab521","article-title":"DeepDISOBind: accurate prediction of RNA-DNA- and protein-binding intrinsically disordered residues with deep multi-task learning","volume":"23","author":"Zhang","year":"2022","journal-title":"Brief. Bioinform"},{"key":"B66","doi-asserted-by":"publisher","first-page":"1983","DOI":"10.1104\/pp.16.00532","article-title":"VIPP1 has a disordered C-terminal tail necessary for protecting photosynthetic membranes against stress","volume":"171","author":"Zhang","year":"2016","journal-title":"Plant Physiol."},{"key":"B67","doi-asserted-by":"publisher","first-page":"824146","DOI":"10.3389\/fmolb.2022.824146","article-title":"Characterizing heparin tetrasaccharides binding to amyloid-beta peptide","volume":"9","author":"Zhou","year":"2022","journal-title":"Front. Mol. Biosci."}],"container-title":["Frontiers in Bioinformatics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1627836\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,18]],"date-time":"2025-07-18T05:30:46Z","timestamp":1752816646000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fbinf.2025.1627836\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,7,18]]},"references-count":67,"alternative-id":["10.3389\/fbinf.2025.1627836"],"URL":"https:\/\/doi.org\/10.3389\/fbinf.2025.1627836","relation":{},"ISSN":["2673-7647"],"issn-type":[{"value":"2673-7647","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,7,18]]},"article-number":"1627836"}}