{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,19]],"date-time":"2026-05-19T18:48:44Z","timestamp":1779216524326,"version":"3.51.4"},"reference-count":45,"publisher":"Oxford University Press (OUP)","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,2,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Intrinsically disordered regions are key for the function of numerous proteins, and the scant available experimental annotations suggest the existence of different disorder flavors. While efficient predictions are required to annotate entire genomes, most existing methods require sequence profiles for disorder prediction, making them cumbersome for high-throughput applications.<\/jats:p>\n               <jats:p>Results: In this work, we present an ensemble of protein disorder predictors called ESpritz. These are based on bidirectional recursive neural networks and trained on three different flavors of disorder, including a novel NMR flexibility predictor. ESpritz can produce fast and accurate sequence-only predictions, annotating entire genomes in the order of hours on a single processor core. Alternatively, a slower but slightly more accurate ESpritz variant using sequence profiles can be used for applications requiring maximum performance. Two levels of prediction confidence allow either to maximize reasonable disorder detection or to limit expected false positives to 5%. ESpritz performs consistently well on the recent CASP9 data, reaching a Sw measure of 54.82 and area under the receiver operator curve of 0.856. The fast predictor is four orders of magnitude faster and remains better than most publicly available CASP9 methods, making it ideal for genomic scale predictions.<\/jats:p>\n               <jats:p>Conclusions: ESpritz predicts three flavors of disorder at two distinct false positive rates, either with a fast or slower and slightly more accurate approach. Given its state-of-the-art performance, it can be especially useful for high-throughput applications.<\/jats:p>\n               <jats:p>Availability: Both a web server for high-throughput analysis and a Linux executable version of ESpritz are available from: http:\/\/protein.bio.unipd.it\/espritz\/<\/jats:p>\n               <jats:p>Contact: \u00a0silvio.tosatto@unipd.it<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr682","type":"journal-article","created":{"date-parts":[[2011,12,22]],"date-time":"2011-12-22T04:26:39Z","timestamp":1324527999000},"page":"503-509","source":"Crossref","is-referenced-by-count":510,"title":["ESpritz: accurate and fast prediction of protein disorder"],"prefix":"10.1093","volume":"28","author":[{"given":"Ian","family":"Walsh","sequence":"first","affiliation":[{"name":"Department of Biology, University of Padua, Viale G. Colombo 3, I-35131 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alberto J. M.","family":"Martin","sequence":"additional","affiliation":[{"name":"Department of Biology, University of Padua, Viale G. Colombo 3, I-35131 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tom\u00e0s","family":"Di Domenico","sequence":"additional","affiliation":[{"name":"Department of Biology, University of Padua, Viale G. Colombo 3, I-35131 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Silvio C. E.","family":"Tosatto","sequence":"additional","affiliation":[{"name":"Department of Biology, University of Padua, Viale G. Colombo 3, I-35131 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2011,12,20]]},"reference":[{"key":"2023012512182720000_B1","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012512182720000_B2","doi-asserted-by":"crossref","first-page":"6395","DOI":"10.1073\/pnas.0408677102","article-title":"Solving the protein sequence metric problem","volume":"102","author":"Atchley","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012512182720000_B3","doi-asserted-by":"crossref","first-page":"937","DOI":"10.1093\/bioinformatics\/15.11.937","article-title":"Exploiting the past and the future in protein secondary structure prediction","volume":"15","author":"Baldi","year":"1999","journal-title":"Bioinformatics"},{"key":"2023012512182720000_B4","doi-asserted-by":"crossref","first-page":"D301","DOI":"10.1093\/nar\/gkl971","article-title":"The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data","volume":"35","author":"Berman","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012512182720000_B5","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1007\/s10618-005-0001-y","article-title":"Accurate prediction of protein disordered regions by mining protein structure data","volume":"11","author":"Cheng","year":"2005","journal-title":"Data Min. Knowl. Discov."},{"key":"2023012512182720000_B6","doi-asserted-by":"crossref","first-page":"827","DOI":"10.1016\/j.jmb.2005.01.071","article-title":"The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins","volume":"347","author":"Dosztanyi","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012512182720000_B7","doi-asserted-by":"crossref","first-page":"2985","DOI":"10.1021\/pr060171o","article-title":"Disorder and sequence repeats in hub proteins and their implications for network evolution","volume":"5","author":"Dosztanyi","year":"2006","journal-title":"J. Proteome Res."},{"key":"2023012512182720000_B8","doi-asserted-by":"crossref","first-page":"805","DOI":"10.1038\/nbt0901-805","article-title":"The protein trinity\u2013linking function and disorder","volume":"19","author":"Dunker","year":"2001","journal-title":"Nat. Biotechnol."},{"key":"2023012512182720000_B9","first-page":"161","article-title":"Intrinsic protein disorder in complete genomes","volume":"11","author":"Dunker","year":"2000","journal-title":"Genome Inform. Ser. Workshop Genome Inform."},{"key":"2023012512182720000_B10","doi-asserted-by":"crossref","first-page":"756","DOI":"10.1016\/j.sbi.2008.10.002","article-title":"Function and structure of inherently disordered proteins","volume":"18","author":"Dunker","year":"2008","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023012512182720000_B11","doi-asserted-by":"crossref","first-page":"e177","DOI":"10.1371\/journal.pcbi.0020177","article-title":"Prediction of amyloidogenic and disordered regions in protein chains","volume":"2","author":"Galzitskaya","year":"2006","journal-title":"PLoS Comput. Biol."},{"key":"2023012512182720000_B12","doi-asserted-by":"crossref","first-page":"2046","DOI":"10.1093\/bioinformatics\/btm302","article-title":"POODLE-L: a two-level SVM prediction system for reliably predicting long disordered regions","volume":"23","author":"Hirose","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012512182720000_B13","doi-asserted-by":"crossref","first-page":"W460","DOI":"10.1093\/nar\/gkm363","article-title":"PrDOS: prediction of disordered protein regions from amino acid sequence","volume":"35","author":"Ishida","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012512182720000_B14","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/S0968-0004(01)02039-4","article-title":"Getting the most from PSI-BLAST","volume":"27","author":"Jones","year":"2002","journal-title":"Trends Biochem. Sci."},{"key":"2023012512182720000_B15","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1093\/nar\/27.1.368","article-title":"AAindex: Amino Acid Index Database","volume":"27","author":"Kawashima","year":"1999","journal-title":"Nucleic Acids Res."},{"key":"2023012512182720000_B16","doi-asserted-by":"crossref","first-page":"1453","DOI":"10.1016\/j.str.2003.10.002","article-title":"Protein disorder prediction: implications for structural proteomics","volume":"11","author":"Linding","year":"2003","journal-title":"Structure"},{"key":"2023012512182720000_B17","doi-asserted-by":"crossref","first-page":"035004","DOI":"10.1088\/1478-3975\/8\/3\/035004","article-title":"The Ising model for prediction of disordered residues from protein sequence alone","volume":"8","author":"Lobanov","year":"2011","journal-title":"Phys. Biol."},{"key":"2023012512182720000_B18","doi-asserted-by":"crossref","first-page":"e1000958","DOI":"10.1371\/journal.pcbi.1000958","article-title":"Library of disordered patterns in 3D protein structures","volume":"6","author":"Lobanov","year":"2010","journal-title":"PLoS Comput. Biol."},{"key":"2023012512182720000_B19","doi-asserted-by":"crossref","first-page":"i289","DOI":"10.1093\/bioinformatics\/btp232","article-title":"REPETITA: detection and discrimination of the periodicity of protein solenoid repeats by discrete Fourier transform","volume":"25","author":"Marsella","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012512182720000_B20","doi-asserted-by":"crossref","first-page":"2916","DOI":"10.1093\/bioinformatics\/btq537","article-title":"MOBI: a web server to define and visualize structural mobility in NMR protein ensembles","volume":"26","author":"Martin","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012512182720000_B21","doi-asserted-by":"crossref","first-page":"1798","DOI":"10.1093\/bioinformatics\/btn326","article-title":"Intrinsic disorder prediction from the analysis of multiple protein fold recognition models","volume":"24","author":"McGuffin","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012512182720000_B22","doi-asserted-by":"crossref","first-page":"3789","DOI":"10.1093\/nar\/gkg620","article-title":"UniqueProt: Creating representative protein sequence sets","volume":"31","author":"Mika","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012512182720000_B23","doi-asserted-by":"crossref","first-page":"i489","DOI":"10.1093\/bioinformatics\/btq373","article-title":"Improved sequence-based prediction of disordered regions with multilayer fusion of multiple information sources","volume":"26","author":"Mizianty","year":"2010","journal-title":"Bioinformatics"},{"issue":"Suppl. 9","key":"2023012512182720000_B24","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1002\/prot.22586","article-title":"Assessment of disorder predictions in CASP8","volume":"77","author":"Noivirt-Brik","year":"2009","journal-title":"Proteins"},{"issue":"Suppl. 7","key":"2023012512182720000_B25","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1002\/prot.20735","article-title":"Exploiting heterogeneous sequence properties improves prediction of protein disorder","volume":"61","author":"Obradovic","year":"2005","journal-title":"Proteins"},{"key":"2023012512182720000_B26","doi-asserted-by":"crossref","first-page":"1719","DOI":"10.1093\/bioinformatics\/bti203","article-title":"Porter: a new, accurate server for protein secondary structure prediction","volume":"21","author":"Pollastri","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012512182720000_B27","doi-asserted-by":"crossref","first-page":"3435","DOI":"10.1093\/bioinformatics\/bti537","article-title":"FoldIndex: a simple tool to predict whether a given protein sequence is intrinsically unfolded","volume":"21","author":"Prilusky","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012512182720000_B28","doi-asserted-by":"crossref","first-page":"e4433","DOI":"10.1371\/journal.pone.0004433","article-title":"Improved disorder prediction by combination of orthogonal approaches","volume":"4","author":"Schlessinger","year":"2009","journal-title":"PLoS One"},{"key":"2023012512182720000_B29","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1016\/j.sbi.2011.03.014","article-title":"Protein disorder\u2013a breakthrough invention of evolution?","volume":"21","author":"Schlessinger","year":"2011","journal-title":"Curr. Opin. Struct. Biol."},{"key":"2023012512182720000_B30","doi-asserted-by":"crossref","first-page":"D786","DOI":"10.1093\/nar\/gkl893","article-title":"DisProt: the Database of Disordered Proteins","volume":"35","author":"Sickmeier","year":"2007","journal-title":"Nucleic Acids Res."},{"issue":"Suppl. 1","key":"2023012512182720000_B31","doi-asserted-by":"crossref","first-page":"S15","DOI":"10.1186\/1471-2164-11-S1-S15","article-title":"Parameterization of disorder predictors for large-scale applications requiring high specificity by using an extended benchmark dataset","volume":"11","author":"Sirota","year":"2010","journal-title":"BMC Genomics"},{"key":"2023012512182720000_B32","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1016\/S0968-0004(02)02169-2","article-title":"Intrinsically unstructured proteins","volume":"27","author":"Tompa","year":"2002","journal-title":"Trends Biochem. Sci."},{"key":"2023012512182720000_B33","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1016\/j.tibs.2007.10.003","article-title":"Fuzzy complexes: polymorphism and structural disorder in protein-protein interactions","volume":"33","author":"Tompa","year":"2008","journal-title":"Trends Biochem. Sci."},{"key":"2023012512182720000_B34","doi-asserted-by":"crossref","first-page":"328","DOI":"10.1002\/bies.200800151","article-title":"Close encounters of the third kind: disordered domains and the interactions of proteins","volume":"31","author":"Tompa","year":"2009","journal-title":"Bioessays"},{"key":"2023012512182720000_B35","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1046\/j.0014-2956.2001.02649.x","article-title":"What does it mean to be natively unfolded?","volume":"269","author":"Uversky","year":"2002","journal-title":"Eur. J. Biochem."},{"key":"2023012512182720000_B36","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1002\/1097-0134(20001115)41:3<415::AID-PROT130>3.0.CO;2-7","article-title":"Why are \u201cnatively unfolded\u201d proteins unstructured under physiologic conditions?","volume":"41","author":"Uversky","year":"2000","journal-title":"Proteins"},{"key":"2023012512182720000_B37","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1146\/annurev.biophys.37.032807.125924","article-title":"Intrinsically disordered proteins in human diseases: introducing the D2 concept","volume":"37","author":"Uversky","year":"2008","journal-title":"Annu. Rev. Biophys."},{"key":"2023012512182720000_B38","doi-asserted-by":"crossref","first-page":"D262","DOI":"10.1093\/nar\/gki058","article-title":"E-MSD: an integrated data resource for bioinformatics","volume":"33","author":"Velankar","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012512182720000_B39","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1002\/prot.10437","article-title":"Flavors of protein disorder","volume":"52","author":"Vucetic","year":"2003","journal-title":"Proteins"},{"key":"2023012512182720000_B40","doi-asserted-by":"crossref","first-page":"W164","DOI":"10.1093\/nar\/gkl166","article-title":"Spritz: a server for the prediction of intrinsically disordered regions in protein sequences using kernel machines","volume":"34","author":"Vullo","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012512182720000_B41","doi-asserted-by":"crossref","first-page":"W190","DOI":"10.1093\/nar\/gkr411","article-title":"CSpritz: accurate prediction of protein disorder segments with annotation for homology, secondary structure and linear motifs","volume":"39","author":"Walsh","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023012512182720000_B42","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1016\/j.jmb.2004.02.002","article-title":"Prediction and functional analysis of native disorder in proteins from the three kingdoms of life","volume":"337","author":"Ward","year":"2004","journal-title":"J. Mol. Biol."},{"key":"2023012512182720000_B43","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1006\/jmbi.1999.3110","article-title":"Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm","volume":"293","author":"Wright","year":"1999","journal-title":"J. Mol. Biol."},{"key":"2023012512182720000_B44","doi-asserted-by":"crossref","first-page":"996","DOI":"10.1016\/j.bbapap.2010.01.011","article-title":"PONDR-FIT: a meta-predictor of intrinsically disordered amino acids","volume":"1804","author":"Xue","year":"2010","journal-title":"Biochim. Biophys. Acta"},{"key":"2023012512182720000_B45","doi-asserted-by":"crossref","first-page":"3369","DOI":"10.1093\/bioinformatics\/bti534","article-title":"RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins","volume":"21","author":"Yang","year":"2005","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/4\/503\/48874766\/bioinformatics_28_4_503.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/4\/503\/48874766\/bioinformatics_28_4_503.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T14:59:50Z","timestamp":1674658790000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/4\/503\/211681"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,12,20]]},"references-count":45,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2012,2,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr682","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,2,15]]},"published":{"date-parts":[[2011,12,20]]}}}