{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T20:33:15Z","timestamp":1769632395055,"version":"3.49.0"},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>We describe Distill, a suite of servers for the prediction of protein structural features: secondary structure; relative solvent accessibility; contact density; backbone structural motifs; residue contact maps at 6, 8 and 12 Angstrom; coarse protein topology. The servers are based on large-scale ensembles of recursive neural networks and trained on large, up-to-date, non-redundant subsets of the Protein Data Bank. Together with structural feature predictions, Distill includes a server for prediction of C<jats:sub>\n                <jats:italic>\u03b1<\/jats:italic>\n              <\/jats:sub>traces for short proteins (up to 200 amino acids).<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>The servers are state-of-the-art, with secondary structure predicted correctly for nearly 80% of residues (currently the top performance on EVA), 2-class solvent accessibility nearly 80% correct, and contact maps exceeding 50% precision on the top non-diagonal contacts. A preliminary implementation of the predictor of protein C<jats:sub>\n                <jats:italic>\u03b1<\/jats:italic>\n              <\/jats:sub>traces featured among the top 20 Novel Fold predictors at the last CASP6 experiment as group Distill (ID 0348). The majority of the servers, including the C<jats:sub>\n                <jats:italic>\u03b1<\/jats:italic>\n              <\/jats:sub>trace predictor, now take into account homology information from the PDB, when available, resulting in greatly improved reliability.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>All predictions are freely available through a simple joint web interface and the results are returned by email. In a single submission the user can send protein sequences for a total of up to 32k residues to all or a selection of the servers. Distill is accessible at the address: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/distill.ucd.ie\/distill\/\" ext-link-type=\"uri\">http:\/\/distill.ucd.ie\/distill\/<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-7-402","type":"journal-article","created":{"date-parts":[[2006,9,5]],"date-time":"2006-09-05T18:20:52Z","timestamp":1157480452000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":79,"title":["Distill: a suite of web servers for the prediction of one-, two- and three-dimensional structural features of proteins"],"prefix":"10.1186","volume":"7","author":[{"given":"Davide","family":"Ba\u00fa","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alberto JM","family":"Martin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Catherine","family":"Mooney","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alessandro","family":"Vullo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ian","family":"Walsh","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gianluca","family":"Pollastri","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2006,9,5]]},"reference":[{"key":"1141_CR1","doi-asserted-by":"publisher","first-page":"797","DOI":"10.1006\/jmbi.1999.2583","volume":"287","author":"D Jones","year":"1999","unstructured":"Jones D: GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. J Mol Biol 1999, 287: 797\u2013815. 10.1006\/jmbi.1999.2583","journal-title":"J Mol Biol"},{"key":"1141_CR2","doi-asserted-by":"publisher","first-page":"255","DOI":"10.1016\/j.jmb.2004.05.041","volume":"341","author":"D Przybylski","year":"2004","unstructured":"Przybylski D, Rost B: Improving fold recognition without folds. J Mol Biol 2004, 341: 255\u2013269. 10.1016\/j.jmb.2004.05.041","journal-title":"J Mol Biol"},{"issue":"S6","key":"1141_CR3","doi-asserted-by":"publisher","first-page":"457","DOI":"10.1002\/prot.10552","volume":"53","author":"P Bradley","year":"2003","unstructured":"Bradley P, Chivian D, Meiler J, Misura K, Rohl C, Schief W, Wedemeyer W, Schueler-Furman O, Murphy P, Schonbrun J, Strauss C, Baker D: Rosetta predictions in CASP5: Successes, failures, and prospects for complete automation. Proteins 2003, 53(S6):457\u201368. 10.1002\/prot.10552","journal-title":"Proteins"},{"key":"1141_CR4","volume-title":"BMC Bioinformatics","author":"A Vullo","year":"2006","unstructured":"Vullo A, Walsh I, Pollastri G: A two-stage approach for improved prediction of residue contact maps. BMC Bioinformatics 2006., 7(180):"},{"key":"1141_CR5","unstructured":"Mooney C, Vullo A, Pollastri G: Protein Structural Motif Prediction in Multidimensional \u03c6 \u2013 \u03c8 Space leads to improved Secondary Structure Prediction. Journal of Computational Biologyin, in press."},{"issue":"3","key":"1141_CR6","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1089\/cmb.2006.13.631","volume":"13","author":"G Pollastri","year":"2006","unstructured":"Pollastri G, Vullo A, Frasconi P, Baldi P: Modular DAG-RNN Architectures for Assembling Coarse Protein Structures. Journal of Computational Biology 2006, 13(3):631\u2013650. 10.1089\/cmb.2006.13.631","journal-title":"Journal of Computational Biology"},{"issue":"8","key":"1141_CR7","doi-asserted-by":"publisher","first-page":"1719","DOI":"10.1093\/bioinformatics\/bti203","volume":"21","author":"G Pollastri","year":"2005","unstructured":"Pollastri G, McLysaght A: Porter: a new, accurate server for protein secondary structure prediction. Bioinformatics 2005, 21(8):1719\u201320. 10.1093\/bioinformatics\/bti203","journal-title":"Bioinformatics"},{"key":"1141_CR8","doi-asserted-by":"publisher","first-page":"575","DOI":"10.1162\/153244304773936054","volume":"4","author":"P Baldi","year":"2003","unstructured":"Baldi P, Pollastri G: The Principled Design of Large-Scale Recursive Neural Network Architectures \u2013 DAG-RNNs and the Protein Structure Prediction Problem. Journal of Machine Learning Research 2003, 4: 575\u2013602. 10.1162\/153244304773936054","journal-title":"Journal of Machine Learning Research"},{"key":"1141_CR9","doi-asserted-by":"publisher","first-page":"937","DOI":"10.1093\/bioinformatics\/15.11.937","volume":"15","author":"P Baldi","year":"1999","unstructured":"Baldi P, Brunak S, Frasconi P, Soda G, Pollastri G: Exploiting the past and the future in protein secondary structure prediction. Bioinformatics 1999, 15: 937\u2013946. 10.1093\/bioinformatics\/15.11.937","journal-title":"Bioinformatics"},{"issue":"Suppl 1","key":"1141_CR10","doi-asserted-by":"publisher","first-page":"S62","DOI":"10.1093\/bioinformatics\/18.suppl_1.S62","volume":"18","author":"G Pollastri","year":"2002","unstructured":"Pollastri G, Baldi P: Prediction of Contact Maps by Recurrent Neural Network Architectures and Hidden Context Propagation from All Four Cardinal Corners. Bioinformatics 2002, 18(Suppl 1):S62-S70.","journal-title":"Bioinformatics"},{"key":"1141_CR11","unstructured":"Pollastri G, Ba\u00fa D, Vullo A: DISTILL: A Machine Learning Approach to Ab Initio Protein Structure Prediction. In Analysis of Biological Data: A Soft Computing Approach. Edited by: Bandyopadhyay S, Maulik U, Wang J. World Scientific; in press."},{"key":"1141_CR12","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1093\/nar\/28.1.235","volume":"28","author":"H Berman","year":"2000","unstructured":"Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H, Shindyalov I, Bourne P: The Protein Data Bank. Nucl Acids Res 2000, 28: 235\u2013242. [http:\/\/pdbbeta.rcsb.org\/pdb\/Welcome.do] 10.1093\/nar\/28.1.235","journal-title":"Nucl Acids Res"},{"key":"1141_CR13","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1016\/S1359-0278(97)00041-2","volume":"2","author":"M Vendruscolo","year":"1997","unstructured":"Vendruscolo M, Kussell E, Domany E: Recovery of protein structure from contact maps. Folding and Design 1997, 2: 295\u2013306. 10.1016\/S1359-0278(97)00041-2","journal-title":"Folding and Design"},{"key":"1141_CR14","doi-asserted-by":"publisher","first-page":"522","DOI":"10.1002\/pro.5560030317","volume":"3","author":"U Hobohm","year":"1994","unstructured":"Hobohm U, Sander C: Enlarged representative set of protein structures. Protein Sci 1994, 3: 522\u201324. [http:\/\/bioinfo.tg.fh-giessen.de\/pdbselect\/]","journal-title":"Protein Sci"},{"key":"1141_CR15","doi-asserted-by":"publisher","first-page":"2577","DOI":"10.1002\/bip.360221211","volume":"22","author":"W Kabsch","year":"1983","unstructured":"Kabsch W, Sander C: Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 1983, 22: 2577\u20132637. 10.1002\/bip.360221211","journal-title":"Biopolymers"},{"key":"1141_CR16","doi-asserted-by":"publisher","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","volume":"25","author":"S Altschul","year":"1997","unstructured":"Altschul S, Madden T, Schaffer A: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl Acids Res 1997, 25: 3389\u20133402. 10.1093\/nar\/25.17.3389","journal-title":"Nucl Acids Res"},{"key":"1141_CR17","doi-asserted-by":"publisher","first-page":"228","DOI":"10.1002\/prot.10082","volume":"47","author":"G Pollastri","year":"2002","unstructured":"Pollastri G, Przybylski D, Rost B, Baldi P: Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles. Proteins 2002, 47: 228\u2013235. 10.1002\/prot.10082","journal-title":"Proteins"},{"key":"1141_CR18","unstructured":"Pollastri G, Martin AJM, Mooney C, Vullo A: High-throughput comparative modelling of protein secondary structure and solvent accessibility. submitted submitted"},{"key":"1141_CR19","doi-asserted-by":"publisher","first-page":"618","DOI":"10.1073\/pnas.0408746102","volume":"18","author":"GE Sims","year":"2005","unstructured":"Sims GE, Choi I, Kim S: Protein conformational space in higher order \u03c8 - \u03c6 maps. PNAS 2005, 18: 618\u2013621. 10.1073\/pnas.0408746102","journal-title":"PNAS"},{"issue":"S7","key":"1141_CR20","doi-asserted-by":"publisher","first-page":"67","DOI":"10.1002\/prot.20722","volume":"61","author":"J Vincent","year":"2006","unstructured":"Vincent J, Tai C, Sathyanarayana B, Lee B: Assessment of CASP6 Predictions for New and Nearly New Fold Targets. Proteins 2006, 61(S7):67\u201383. 10.1002\/prot.20722","journal-title":"Proteins"},{"key":"1141_CR21","unstructured":"Distill[http:\/\/distill.ucd.ie\/distill\/]"},{"issue":"9","key":"1141_CR22","first-page":"374","volume":"20","author":"R Sayle","year":"1995","unstructured":"Sayle R, Milner-White E: RasMol: Biomolecular graphics for all. TIBS 1995, 20(9):374.","journal-title":"TIBS"},{"issue":"4","key":"1141_CR23","doi-asserted-by":"publisher","first-page":"702","DOI":"10.1002\/prot.20264","volume":"57","author":"Y Zhang","year":"2004","unstructured":"Zhang Y, Skolnik J: Scoring function for automated assessment of protein structure template quality. Proteins 2004, 57(4):702\u2013710. 10.1002\/prot.20264","journal-title":"Proteins"},{"key":"1141_CR24","doi-asserted-by":"publisher","first-page":"1242","DOI":"10.1093\/bioinformatics\/17.12.1242","volume":"17","author":"V Eyrich","year":"2001","unstructured":"Eyrich V, Marti-Renom M, Przybylski D, Madhusudan M, Fiser A, Pazos F, Valencia A, Sali A, Rost B: EVA: continuous automatic evaluation od protein structure prediction servers. Bioinformatics 2001, 17: 1242\u20131251. 10.1093\/bioinformatics\/17.12.1242","journal-title":"Bioinformatics"},{"issue":"8","key":"1141_CR25","doi-asserted-by":"publisher","first-page":"1029","DOI":"10.1016\/j.neunet.2005.07.001","volume":"18","author":"A Ceroni","year":"2005","unstructured":"Ceroni A, Frasconi P, Pollastri G: Learning Protein Secondary Structure from Sequential and Relational Data. Neural Networks 2005, 18(8):1029\u201339. 10.1016\/j.neunet.2005.07.001","journal-title":"Neural Networks"},{"issue":"S6","key":"1141_CR26","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1002\/prot.10556","volume":"53","author":"J Moult","year":"2003","unstructured":"Moult J, Fidelis K, Zemla A, Hubbard T: Critical assessment of methods of protein structure prediction (CASP)-round V. Proteins 2003, 53(S6):334\u20139. 10.1002\/prot.10556","journal-title":"Proteins"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-402.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T03:19:02Z","timestamp":1630466342000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-402"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,9,5]]},"references-count":26,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["1141"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-402","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,9,5]]},"assertion":[{"value":"6 July 2006","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 September 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 September 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"402"}}