{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T07:23:42Z","timestamp":1778311422438,"version":"3.51.4"},"reference-count":32,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2019,2,20]],"date-time":"2019-02-20T00:00:00Z","timestamp":1550620800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61771262"],"award-info":[{"award-number":["61771262"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Intrinsically disordered proteins perform a variety of important biological functions, which makes their accurate prediction useful for a wide range of applications. We develop a scheme for predicting intrinsically disordered proteins by employing 35 features including eight structural properties, seven physicochemical properties and 20 pieces of evolutionary information. In particular, the scheme includes a preprocessing procedure which greatly reduces the input features. Using two different windows, the preprocessed data containing not only the properties of the surroundings of the target residue but also the properties related to the specific target residue are fed into a multi-layer perceptron neural network as its inputs. The Adam algorithm for the back propagation together with the dropout algorithm to avoid overfitting are introduced during the training process. The training as well as testing our procedure is performed on the dataset DIS803 from a DisProt database. The simulation results show that the performance of our scheme is competitive in comparison with ESpritz and IsUnstruct.<\/jats:p>","DOI":"10.3390\/a12020046","type":"journal-article","created":{"date-parts":[[2019,2,20]],"date-time":"2019-02-20T11:45:39Z","timestamp":1550663139000},"page":"46","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["The Prediction of Intrinsically Disordered Proteins Based on Feature Selection"],"prefix":"10.3390","volume":"12","author":[{"given":"Hao","family":"He","sequence":"first","affiliation":[{"name":"College of Electronic Information and Optical Engineering, Nankai University, Tianjin 300350, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2287-0811","authenticated-orcid":false,"given":"Jiaxiang","family":"Zhao","sequence":"additional","affiliation":[{"name":"College of Electronic Information and Optical Engineering, Nankai University, Tianjin 300350, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guiling","family":"Sun","sequence":"additional","affiliation":[{"name":"College of Electronic Information and Optical Engineering, Nankai University, Tianjin 300350, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2019,2,20]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"568068","DOI":"10.1155\/2010\/568068","article-title":"The mysterious unfoldome: structureless, underappreciated, yet vital part of any given proteome","volume":"2010","author":"Uversky","year":"2010","journal-title":"J. Biomed. Biotechnol."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/scientificamerican0411-68","article-title":"The orderly chaos of proteins","volume":"304","author":"Dunker","year":"2011","journal-title":"Sci. Am."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1146\/annurev-biochem-072711-164947","article-title":"Intrinsically Disordered Proteins and Intrinsically Disordered Protein Regions","volume":"83","author":"Oldfield","year":"2014","journal-title":"Annu. Rev. Biochem."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1182","DOI":"10.1111\/febs.13202","article-title":"Functional roles of transiently and intrinsically disordered regions within proteins","volume":"282","author":"Uversky","year":"2015","journal-title":"FEBS J."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1006\/jmbi.1999.3110","article-title":"Intrinsically unstructured proteins: Re-assessing the protein structure-function paradigm","volume":"293","author":"Wright","year":"1999","journal-title":"J. Mol. Biol."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"14451","DOI":"10.1016\/j.eswa.2011.04.160","article-title":"Prediction of disorder with new computational tool: BVDEA","volume":"38","author":"Kaya","year":"2011","journal-title":"Expert Syst. Appl."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1002\/prot.20446","article-title":"Addressing the intrinsic disorder bottleneck in structural proteomics","volume":"59","author":"Oldfield","year":"2005","journal-title":"Proteins"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"3435","DOI":"10.1093\/bioinformatics\/bti537","article-title":"FoldIndex: A simple tool to predict whether a given protein sequence is intrinsically unfolded","volume":"21","author":"Prilusky","year":"2005","journal-title":"Bioinformatics"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3701","DOI":"10.1093\/nar\/gkg519","article-title":"Globplot: Exploring Protein Sequences for Globularity and Disorder","volume":"31","author":"Linding","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"3433","DOI":"10.1093\/bioinformatics\/bti541","article-title":"IUPred: Web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content","volume":"21","author":"Dosztanyi","year":"2005","journal-title":"Bioinformatics"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"2948","DOI":"10.1093\/bioinformatics\/btl504","article-title":"FoldUnfold: web server for the prediction of disordered regions in protein chain","volume":"22","author":"Galzitskaya","year":"2006","journal-title":"Bioinformatics"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1088\/1478-3975\/8\/3\/035004","article-title":"The Ising model for prediction of disordered residues from protein sequence alone","volume":"8","author":"Lobanov","year":"2011","journal-title":"Phys. Biol."},{"key":"ref_13","unstructured":"(2019, February 20). PONDR: Predictors of Natural Disordered Regions. Available online: http:\/\/www.pondr.com\/."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"3369","DOI":"10.1093\/bioinformatics\/bti534","article-title":"RONN: The bio-basis function neural network technique applied to the detection of natively disordered regions in proteins","volume":"21","author":"Yang","year":"2005","journal-title":"Bioinformatics"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1016\/j.jmb.2004.02.002","article-title":"Prediction and Functional Analysis of Native Disorder in Proteins from the Three Kingdoms of Life","volume":"337","author":"Ward","year":"2004","journal-title":"J. Mol. Biol."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Su, C.T., Chen, C.Y., and Ou, Y.Y. (2006). Protein disorder prediction by condensed pssm considering propensity for order or disorder. BMC Bioinform., 7.","DOI":"10.1186\/1471-2105-7-319"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1080\/073911012010525022","article-title":"SPINE-D: Accurate prediction of short and long disordered regions by a single neural-network based method","volume":"29","author":"Zhang","year":"2012","journal-title":"J. Biomol. Struct. Dyn."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"503","DOI":"10.1093\/bioinformatics\/btr682","article-title":"ESpritz: Accurate and fast prediction of protein disorder","volume":"28","author":"Walsh","year":"2012","journal-title":"Bioinformatics"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"i489","DOI":"10.1093\/bioinformatics\/btq373","article-title":"Improved sequence-based prediction of disordered regions with multilayer fusion of multiple information sources","volume":"26","author":"Mizianty","year":"2010","journal-title":"Bioinformatics"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1344","DOI":"10.1093\/bioinformatics\/btn195","article-title":"Prediction of disordered regions in proteins based on the meta approach","volume":"24","author":"Ishida","year":"2008","journal-title":"Bioinformatics"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Schlessinger, A., Punta, M., Yachdav, G., Kajan, L., and Rost, B. (2009). Improved disorder prediction by combination of orthogonal approaches. PLoS ONE, 4.","DOI":"10.1371\/journal.pone.0004433"},{"key":"ref_22","unstructured":"Kingma, D.P., and Ba, J.L. (2015, January 7). Adam: A method for stochastic optimization. Proceedings of the 3rd International Conference on Learning Representations (ICLR), San Diego, CA, USA."},{"key":"ref_23","first-page":"1929","article-title":"Dropout: A Simple Way to Prevent Neural Networks from Overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J. Mach. Learn. Res."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"786","DOI":"10.1093\/nar\/gkl893","article-title":"DisProt: the database of disordered proteins","volume":"35","author":"Sickmeier","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"121","DOI":"10.1023\/A:1009715923555","article-title":"A tutorial on support vector machines for pattern recognition","volume":"2","author":"Burges","year":"1998","journal-title":"Data Min. Knowl. Discov."},{"key":"ref_26","unstructured":"Mika, S., Ratsch, G., Weston, J., Scholkopf, B., and Mullers, K.R. (1999, January 25). Fisher discriminant analysis with kernels. Proceedings of the Neural Networks for Signal Processing IX: Proceedings of the 1999 IEEE Signal Processing Society Workshop, Madison, WI, USA."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"8087391","DOI":"10.1155\/2018\/8087391","article-title":"A Low Computational Complexity Scheme for the Prediction of Intrinsically Disordered Protein Regions","volume":"2018","author":"He","year":"2018","journal-title":"Math. Probl. Eng."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Shimizu, K., Muraoka, Y., Hirose, S., and Noguchi, T. (2005, January 15). Feature selection based on physicochemical properties of redefined n-term region and c-term regions for predicting disorder. Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, La Jolla, CA, USA.","DOI":"10.1109\/CIBCB.2005.1594927"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"360","DOI":"10.1007\/s008940100038","article-title":"Generation and evaluation of dimension-reduced amino acid parameter representations by artificial neural networks","volume":"7","author":"Meiler","year":"2001","journal-title":"J. Mol. Model."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1002\/prot.10528","article-title":"Prediction of Disordered Regions in Proteins from Position Specific Score Matrices","volume":"3","author":"Jones","year":"2003","journal-title":"Proteins"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1093\/nar\/gkn721","article-title":"NCBI Reference Sequences: current status, policy and new initiatives","volume":"37","author":"Pruitt","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1002\/prot.23161","article-title":"Evaluation of disorder predictions in CASP9","volume":"79","author":"Monastyrskyy","year":"2011","journal-title":"Proteins"}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/12\/2\/46\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:33:34Z","timestamp":1760186014000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/12\/2\/46"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,2,20]]},"references-count":32,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2019,2]]}},"alternative-id":["a12020046"],"URL":"https:\/\/doi.org\/10.3390\/a12020046","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,2,20]]}}}