{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,14]],"date-time":"2026-02-14T12:13:22Z","timestamp":1771071202022,"version":"3.50.1"},"reference-count":50,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2220,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Intrinsically disordered proteins play a crucial role in numerous regulatory processes. Their abundance and ubiquity combined with a relatively low quantity of their annotations motivate research toward the development of computational models that predict disordered regions from protein sequences. Although the prediction quality of these methods continues to rise, novel and improved predictors are urgently needed.<\/jats:p>\n               <jats:p>Results: We propose a novel method, named MFDp (Multilayered Fusion-based Disorder predictor), that aims to improve over the current disorder predictors. MFDp is as an ensemble of 3 Support Vector Machines specialized for the prediction of short, long and generic disordered regions. It combines three complementary disorder predictors, sequence, sequence profiles, predicted secondary structure, solvent accessibility, backbone dihedral torsion angles, residue flexibility and B-factors. Our method utilizes a custom-designed set of features that are based on raw predictions and aggregated raw values and recognizes various types of disorder. The MFDp is compared at the residue level on two datasets against eight recent disorder predictors and top-performing methods from the most recent CASP8 experiment. In spite of using training chains with \u226425% similarity to the test sequences, our method consistently and significantly outperforms the other methods based on the MCC index. The MFDp outperforms modern disorder predictors for the binary disorder assignment and provides competitive real-valued predictions. The MFDp's outputs are also shown to outperform the other methods in the identification of proteins with long disordered regions.<\/jats:p>\n               <jats:p>Availability: \u00a0http:\/\/biomine.ece.ualberta.ca\/MFDp.html<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <jats:p>Contact: \u00a0lkurgan@ece.ualberta.ca<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq373","type":"journal-article","created":{"date-parts":[[2010,9,7]],"date-time":"2010-09-07T17:41:46Z","timestamp":1283881306000},"page":"i489-i496","source":"Crossref","is-referenced-by-count":160,"title":["Improved sequence-based prediction of disordered regions with multilayer fusion of multiple information sources"],"prefix":"10.1093","volume":"26","author":[{"given":"Marcin J.","family":"Mizianty","sequence":"first","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada T6G 2V4"}]},{"given":"Wojciech","family":"Stach","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada T6G 2V4"}]},{"given":"Ke","family":"Chen","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada T6G 2V4"}]},{"given":"Kanaka Durga","family":"Kedarisetti","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada T6G 2V4"}]},{"given":"Fatemeh Miri","family":"Disfani","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada T6G 2V4"}]},{"given":"Lukasz","family":"Kurgan","sequence":"additional","affiliation":[{"name":"Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada T6G 2V4"}]}],"member":"286","published-online":{"date-parts":[[2010,9,4]]},"reference":[{"key":"2023012508281325300_B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012508281325300_B2","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/nar\/28.1.235","article-title":"The Protein Data Bank","volume":"28","author":"Berman","year":"2000","journal-title":"Nucleic Acids Res."},{"issue":"Suppl. 8","key":"2023012508281325300_B3","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1002\/prot.21671","article-title":"Assessment of disorder predictions in CASP7","volume":"69","author":"Bordoli","year":"2007","journal-title":"Proteins"},{"key":"2023012508281325300_B4","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1007\/s10618-005-0001-y","article-title":"Accurate prediction of protein disordered regions by mining protein structure data","volume":"11","author":"Cheng","year":"2005","journal-title":"Data Mining Knowl. Disc."},{"key":"2023012508281325300_B5","doi-asserted-by":"crossref","first-page":"3433","DOI":"10.1093\/bioinformatics\/bti541","article-title":"IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content","volume":"21","author":"Doszt\u00e1nyi","year":"2005","journal-title":"Bioinformatics"},{"issue":"Suppl. 2","key":"2023012508281325300_B6","doi-asserted-by":"crossref","first-page":"S1","DOI":"10.1186\/1471-2164-9-S2-S1","article-title":"The unfoldomics decade: an update on intrinsically disordered proteins","volume":"9","author":"Dunker","year":"2008","journal-title":"BMC Genomics"},{"key":"2023012508281325300_B7","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1038\/nrm1589","article-title":"Intrinsically unstructured proteins and their functions","volume":"6","author":"Dyson","year":"2005","journal-title":"Nat. Rev. Mol. Cell. Biol."},{"key":"2023012508281325300_B8","first-page":"1871","article-title":"LIBLINEAR: a library for large linear classification","volume":"9","author":"Fan","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"2023012508281325300_B9","doi-asserted-by":"crossref","first-page":"857","DOI":"10.1002\/prot.22193","article-title":"Improving the prediction accuracy of residue solvent accessibility and real-value backbone torsion angles of proteins by fast guided-learning through a two-layer neural network","volume":"74","author":"Faraggi","year":"2009","journal-title":"Proteins"},{"key":"2023012508281325300_B10","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1186\/1471-2105-10-8","article-title":"Large-scale prediction of long disordered regions in proteins using random forests","volume":"10","author":"Han","year":"2009","journal-title":"BMC Bioinformatics"},{"issue":"Suppl. 1","key":"2023012508281325300_B11","doi-asserted-by":"crossref","first-page":"S9","DOI":"10.1186\/1471-2164-9-S1-S9","article-title":"Protein disorder prediction at multiple levels of sensitivity and specificity","volume":"9","author":"Hecker","year":"2008","journal-title":"BMC Genomics"},{"key":"2023012508281325300_B12","doi-asserted-by":"crossref","first-page":"2046","DOI":"10.1093\/bioinformatics\/btm302","article-title":"POODLE-L: a two-level SVM prediction system for reliably predicting long disordered regions","volume":"23","author":"Hirose","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B13","doi-asserted-by":"crossref","first-page":"W460","DOI":"10.1093\/nar\/gkm363","article-title":"PrDOS: prediction of disordered protein regions from amino acid sequence","volume":"35","author":"Ishida","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012508281325300_B14","doi-asserted-by":"crossref","first-page":"1344","DOI":"10.1093\/bioinformatics\/btn195","article-title":"Prediction of disordered regions in proteins based on the meta approach","volume":"24","author":"Ishida","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B15","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/S0968-0004(01)02039-4","article-title":"Getting the most from PSI-BLAST","volume":"27","author":"Jones","year":"2002","journal-title":"Trends Biochem. Sci."},{"issue":"Suppl. 6","key":"2023012508281325300_B16","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1002\/prot.10528","article-title":"Prediction of disordered regions in proteins from position specific score matrices","volume":"53","author":"Jones","year":"2003","journal-title":"Proteins"},{"key":"2023012508281325300_B17","doi-asserted-by":"crossref","first-page":"3701","DOI":"10.1093\/nar\/gkg519","article-title":"GlobPlot: exploring protein sequences for globularity and disorder","volume":"31","author":"Linding","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012508281325300_B18","doi-asserted-by":"crossref","first-page":"1798","DOI":"10.1093\/bioinformatics\/btn326","article-title":"Intrinsic disorder prediction from the analysis of multiple protein fold recognition models","volume":"24","author":"McGuffin","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B19","doi-asserted-by":"crossref","first-page":"404","DOI":"10.1093\/bioinformatics\/16.4.404","article-title":"The PSIPRED protein structure prediction server","volume":"16","author":"McGuffin","year":"2000","journal-title":"Bioinformatics"},{"issue":"Suppl. 9","key":"2023012508281325300_B20","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1002\/prot.22586","article-title":"Assessment of disorder predictions in CASP8","volume":"77","author":"Noivirt-Brik","year":"2009","journal-title":"Proteins"},{"key":"2023012508281325300_B21","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1002\/prot.20735","article-title":"Exploiting heterogeneous sequence properties improves prediction of protein disorder","volume":"61","author":"Obradovic","year":"2005","journal-title":"Proteins"},{"key":"2023012508281325300_B22","doi-asserted-by":"crossref","first-page":"1989","DOI":"10.1021\/bi047993o","article-title":"Comparing and combining predictors of mostly disordered proteins","volume":"44","author":"Oldfield","year":"2005","journal-title":"Biochemistry"},{"key":"2023012508281325300_B23","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1142\/S0219720005000886","article-title":"Optimizing intrinsic disorder predictors with protein evolutionary information","volume":"3","author":"Peng","year":"2005","journal-title":"J. Bioinform. Comput. Biol."},{"key":"2023012508281325300_B24","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1186\/1471-2105-7-208","article-title":"Length-dependent prediction of protein intrinsic disorder","volume":"7","author":"Peng","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508281325300_B25","doi-asserted-by":"crossref","first-page":"261","DOI":"10.18388\/abp.2008_3073","article-title":"Prediction of signal peptides in protein sequences by neural networks","volume":"55","author":"Plewczynski","year":"2008","journal-title":"Acta Biochim. Pol."},{"key":"2023012508281325300_B26","doi-asserted-by":"crossref","first-page":"3435","DOI":"10.1093\/bioinformatics\/bti537","article-title":"FoldIndex: a simple tool to predict whether a given protein sequence is intrinsically unfolded","volume":"21","author":"Prilusky","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B27","first-page":"255","article-title":"Structural genomics target selection for the New York consortium on membrane protein structure","volume-title":"J. Struct. Funct. Genomics.","author":"Punta","year":"2009"},{"key":"2023012508281325300_B28","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1110\/ps.03128904","article-title":"Protein flexibility and intrinsic disorder","volume":"13","author":"Radivojac","year":"2004","journal-title":"Prot. Sci."},{"key":"2023012508281325300_B29","doi-asserted-by":"crossref","first-page":"1439","DOI":"10.1529\/biophysj.106.094045","article-title":"Intrinsic disorder and functional proteomics","volume":"92","author":"Radivojac","year":"2007","journal-title":"Biophys. J."},{"key":"2023012508281325300_B30","doi-asserted-by":"crossref","first-page":"e5566","DOI":"10.1371\/journal.pone.0005566","article-title":"The role of intrinsically unstructured proteins in neurodegenerative diseases","volume":"4","author":"Raychaudhuri","year":"2009","journal-title":"PLoS One"},{"key":"2023012508281325300_B31","doi-asserted-by":"crossref","first-page":"891","DOI":"10.1093\/bioinformatics\/btl032","article-title":"PROFbval: predict flexible and rigid residues in proteins","volume":"22","author":"Schlessinger","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B32","doi-asserted-by":"crossref","first-page":"2376","DOI":"10.1093\/bioinformatics\/btm349","article-title":"Natively unstructured regions in proteins identified from contact predictions","volume":"23","author":"Schlessinger","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B33","doi-asserted-by":"crossref","first-page":"e140","DOI":"10.1371\/journal.pcbi.0030140","article-title":"Natively unstructured loops differ from other loops","volume":"3","author":"Schlessinger","year":"2007","journal-title":"PLoS Comput. Biol."},{"key":"2023012508281325300_B34","doi-asserted-by":"crossref","first-page":"e4433","DOI":"10.1371\/journal.pone.0004433","article-title":"Improved disorder prediction by combination of orthogonal approaches","volume":"4","author":"Schlessinger","year":"2009","journal-title":"PLoS One"},{"key":"2023012508281325300_B35","doi-asserted-by":"crossref","first-page":"2337","DOI":"10.1093\/bioinformatics\/btm330","article-title":"POODLE-S: web application for predicting protein disorder by using physicochemical features and reduced amino acid set of a position-specific scoring matrix","volume":"23","author":"Shimizu","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B36","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1186\/1471-2105-8-78","article-title":"Predicting mostly disordered proteins by using structure-unknown protein data","volume":"8","author":"Shimizu","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012508281325300_B37","doi-asserted-by":"crossref","first-page":"D786","DOI":"10.1093\/nar\/gkl893","article-title":"DisProt: the Database of Disordered Proteins","volume":"35","author":"Sickmeier","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012508281325300_B38","doi-asserted-by":"crossref","first-page":"2472","DOI":"10.1110\/ps.073037907","article-title":"The challenge of protein structure determination - lessons from structural genomics","volume":"16","author":"Slabinski","year":"2007","journal-title":"Prot. Sci."},{"key":"2023012508281325300_B39","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1093\/nar\/gkm353","article-title":"iPDA: integrated protein disorder analyzer","volume":"35","author":"Su","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012508281325300_B40","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1186\/1471-2105-7-319","article-title":"Protein disorder prediction by condensed PSSM considering propensity for order or disorder","volume":"7","author":"Su","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508281325300_B41","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1002\/bies.200800151","article-title":"Close encounters of the third kind: disordered domains and the interactions of proteins","volume":"31","author":"Tompa","year":"2009","journal-title":"Bioessays"},{"key":"2023012508281325300_B42","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1002\/1097-0134(20001115)41:3<415::AID-PROT130>3.0.CO;2-7","article-title":"Why are \u201cnatively unfolded\u201d proteins unstructured under physiologic conditions?","volume":"41","author":"Uversky","year":"2000","journal-title":"Proteins"},{"key":"2023012508281325300_B43","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1002\/prot.10437","article-title":"Flavors of protein disorder","volume":"52","author":"Vucetic","year":"2003","journal-title":"Proteins"},{"key":"2023012508281325300_B44","doi-asserted-by":"crossref","first-page":"W164","DOI":"10.1093\/nar\/gkl166","article-title":"Spritz server for the prediction of intrinsically disordered regions in protein sequences using kernel machines","volume":"34","author":"Vullo","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012508281325300_B45","doi-asserted-by":"crossref","first-page":"1589","DOI":"10.1093\/bioinformatics\/btg224","article-title":"PISCES: a protein sequence culling server","volume":"19","author":"Wang","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B46","doi-asserted-by":"crossref","first-page":"1401","DOI":"10.1093\/bioinformatics\/btn132","article-title":"OnD-CRF: predicting order and disorder in proteins using conditional random fields","volume":"24","author":"Wang","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B47","doi-asserted-by":"crossref","first-page":"2138","DOI":"10.1093\/bioinformatics\/bth195","article-title":"The DISOPRED server for the prediction of protein disorder","volume":"20","author":"Ward","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508281325300_B48","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1002\/prot.21945","article-title":"MUSTER: improving protein sequence profile-profile alignments by using multiple sources of structure information","volume":"72","author":"Wu","year":"2008","journal-title":"Proteins"},{"key":"2023012508281325300_B49","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1109\/BIBE.2006.253309","article-title":"IUP: intrinsically unstructured protein predictor \u2013 a software tool for analyzing polypeptide sequences","volume-title":"Sixth IEEE Symposium on BioInformatics and BioEngineering.","author":"Yang","year":"2006"},{"key":"2023012508281325300_B50","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1002\/prot.22375","article-title":"On the relation between residue flexibility and local solvent accessibility in proteins","volume":"76","author":"Zhang","year":"2009","journal-title":"Proteins"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/18\/i489\/48858886\/bioinformatics_26_18_i489.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/18\/i489\/48858886\/bioinformatics_26_18_i489.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:28:51Z","timestamp":1674635331000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/18\/i489\/205459"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,9,4]]},"references-count":50,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2010,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq373","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,9,15]]},"published":{"date-parts":[[2010,9,4]]}}}