{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T20:03:04Z","timestamp":1776369784286,"version":"3.51.2"},"reference-count":32,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2019,12,9]],"date-time":"2019-12-09T00:00:00Z","timestamp":1575849600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100003130","name":"Research Foundation Flanders","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100003130","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003130","name":"FWO","doi-asserted-by":"publisher","award":["G.0328.16N"],"award-info":[{"award-number":["G.0328.16N"]}],"id":[{"id":"10.13039\/501100003130","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008530","name":"European Regional Development Fund","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008530","name":"ERDF","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100008530","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Brussels-Capital Region-Innoviris"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,4,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Protein beta-aggregation is an important but poorly understood phenomena involved in diseases as well as in beneficial physiological processes. However, while this task has been investigated for over 50\u2009years, very little is known about its mechanisms of action. Moreover, the identification of regions involved in aggregation is still an open problem and the state-of-the-art methods are often inadequate in real case applications.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this article we present AgMata, an unsupervised tool for the identification of such regions from amino acidic sequence based on a generalized definition of statistical potentials that includes biophysical information. The tool outperforms the state-of-the-art methods on two different benchmarks. As case-study, we applied our tool to human ataxin-3, a protein involved in Machado\u2013Joseph disease. Interestingly, AgMata identifies aggregation-prone residues that share the very same structural environment. Additionally, it successfully predicts the outcome of in vitro mutagenesis experiments, identifying point mutations that lead to an alteration of the aggregation propensity of the wild-type ataxin-3.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>A python implementation of the tool is available at https:\/\/bitbucket.org\/bio2byte\/agmata.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btz912","type":"journal-article","created":{"date-parts":[[2019,12,3]],"date-time":"2019-12-03T12:14:01Z","timestamp":1575375241000},"page":"2076-2081","source":"Crossref","is-referenced-by-count":34,"title":["Accurate prediction of protein beta-aggregation with generalized statistical potentials"],"prefix":"10.1093","volume":"36","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5935-5258","authenticated-orcid":false,"given":"Gabriele","family":"Orlando","sequence":"first","affiliation":[{"name":"Interuniversity Institute of Bioinformatics in Brussels , ULB\/VUB, Triomflaan, Brussels 1050, Belgium"},{"name":"Structural Biology, Vrije Universiteit Brussel , Brussels 1050, Belgium"}]},{"given":"Alexandra","family":"Silva","sequence":"additional","affiliation":[{"name":"IBMC-Instituto de Biologia Molecular e Celular"},{"name":"Instituto de Investiga\u00e7\u00e3o e Inova\u00e7\u00e3o em Sa\u00fade, Universidade do Porto , Porto 4200-135, Portugal"}]},{"given":"Sandra","family":"Macedo-Ribeiro","sequence":"additional","affiliation":[{"name":"IBMC-Instituto de Biologia Molecular e Celular"},{"name":"Instituto de Investiga\u00e7\u00e3o e Inova\u00e7\u00e3o em Sa\u00fade, Universidade do Porto , Porto 4200-135, Portugal"}]},{"given":"Daniele","family":"Raimondi","sequence":"additional","affiliation":[{"name":"ESAT-STADIUS , KU Leuven, Leuven 3001, Belgium"}]},{"given":"Wim","family":"Vranken","sequence":"additional","affiliation":[{"name":"Interuniversity Institute of Bioinformatics in Brussels , ULB\/VUB, Triomflaan, Brussels 1050, Belgium"},{"name":"Structural Biology, Vrije Universiteit Brussel , Brussels 1050, Belgium"},{"name":"Centre for Structural Biology , VIB, Brussels 1050, Belgium"}]}],"member":"286","published-online":{"date-parts":[[2019,12,9]]},"reference":[{"key":"2023062312013926600_btz912-B1","doi-asserted-by":"crossref","first-page":"567","DOI":"10.1073\/pnas.90.2.567","article-title":"Alzheimer disease amyloid beta protein forms calcium channels in bilayer membranes: blockade by tromethamine and aluminum","volume":"90","author":"Arispe","year":"1993","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023062312013926600_btz912-B2","doi-asserted-by":"crossref","first-page":"2741","DOI":"10.1038\/ncomms3741","article-title":"From protein sequence to dynamics and disorder with dynamine","volume":"4","author":"Cilia","year":"2013","journal-title":"Nat. Commun"},{"key":"2023062312013926600_btz912-B3","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1186\/1471-2105-8-65","article-title":"AGGRESCAN: a server for the prediction and evaluation of \u201chot spots\u201d of aggregation in polypeptides","volume":"8","author":"Conchillo-Sol\u00e9","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023062312013926600_btz912-B4","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1016\/j.jmb.2007.02.058","article-title":"Mechanisms of ataxin-3 misfolding and fibril formation: kinetic analysis of a disease-associated polyglutamine protein","volume":"368","author":"Ellisdon","year":"2007","journal-title":"J. Mol. Biol"},{"key":"2023062312013926600_btz912-B5","doi-asserted-by":"crossref","first-page":"e79722","DOI":"10.1371\/journal.pone.0079722","article-title":"MetAmyl: a METa-predictor for AMYLoid proteins","volume":"8","author":"Emily","year":"2013","journal-title":"PLoS One"},{"key":"2023062312013926600_btz912-B6","doi-asserted-by":"crossref","first-page":"1302","DOI":"10.1038\/nbt1012","article-title":"Prediction of sequence-dependent and mutational effects on the aggregation of peptides and proteins","volume":"22","author":"Fernandez-Escamilla","year":"2004","journal-title":"Nat. Biotechnol"},{"key":"2023062312013926600_btz912-B7","doi-asserted-by":"crossref","first-page":"642","DOI":"10.1016\/j.jmb.2005.08.061","article-title":"Towards a structural understanding of the fibrillization pathway in Machado-Joseph\u2019s disease: trapping early oligomers of non-expanded ataxin-3","volume":"353","author":"Gales","year":"2005","journal-title":"J. Mol. Biol"},{"key":"2023062312013926600_btz912-B8","doi-asserted-by":"crossref","first-page":"326","DOI":"10.1093\/bioinformatics\/btp691","article-title":"FoldAmyloid: a method of prediction of amyloidogenic regions from protein sequence","volume":"26","author":"Garbuzynskiy","year":"2010","journal-title":"Bioinformatics"},{"key":"2023062312013926600_btz912-B9","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1186\/1471-2105-15-54","article-title":"FISH Amyloid - a new method for finding amyloidogenic segments in proteins based on site specific co-occurrence of aminoacids","volume":"15","author":"Gasior","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"2023062312013926600_btz912-B10","doi-asserted-by":"crossref","first-page":"2577","DOI":"10.1002\/bip.360221211","article-title":"Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features","volume":"22","author":"Kabsch","year":"1983","journal-title":"Biopolymers"},{"key":"2023062312013926600_btz912-B11","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1016\/S0959-440X(98)80016-X","article-title":"The alternative conformations of amyloidogenic proteins and their multi-step assembly pathways","volume":"8","author":"Kelly","year":"1998","journal-title":"Curr. Opin. Struct. Biol"},{"key":"2023062312013926600_btz912-B12","doi-asserted-by":"crossref","first-page":"1658","DOI":"10.1093\/bioinformatics\/btl158","article-title":"Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences","volume":"22","author":"Li","year":"2006","journal-title":"Bioinformatics"},{"key":"2023062312013926600_btz912-B13","doi-asserted-by":"crossref","first-page":"437","DOI":"10.1002\/prot.10286","article-title":"Structure validation by c\u03b1 geometry: \u03d5, \u03c8 and c\u03b2 deviation","volume":"50","author":"Lovell","year":"2003","journal-title":"Proteins"},{"key":"2023062312013926600_btz912-B14","doi-asserted-by":"crossref","first-page":"24190","DOI":"10.1074\/jbc.M115.659532","article-title":"Enhanced molecular mobility of ordinarily structured regions drives polyglutamine disease","volume":"290","author":"Lupton","year":"2015","journal-title":"J. Biol. Chem"},{"key":"2023062312013926600_btz912-B15","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/S0014-5793(03)00748-8","article-title":"Domain architecture of the polyglutamine protein ataxin-3: a globular domain followed by a flexible tail","volume":"549","author":"Masino","year":"2003","journal-title":"FEBS Lett"},{"key":"2023062312013926600_btz912-B16","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1096\/fj.10-161208","article-title":"Functional interactions as a survival strategy against abnormal aggregation","volume":"25","author":"Masino","year":"2011","journal-title":"FASEB J"},{"key":"2023062312013926600_btz912-B17","doi-asserted-by":"crossref","first-page":"1021","DOI":"10.1016\/j.jmb.2004.09.065","article-title":"Characterization of the structure and the amyloidogenic properties of the Josephin domain of the polyglutamine-containing protein ataxin-3","volume":"344","author":"Masino","year":"2004","journal-title":"J. Mol. Biol"},{"key":"2023062312013926600_btz912-B18","doi-asserted-by":"crossref","first-page":"36679","DOI":"10.1038\/srep36679","article-title":"Observation selection bias in contact prediction and its implications for structural bioinformatics","volume":"6","author":"Orlando","year":"2016","journal-title":"Sci. Rep"},{"key":"2023062312013926600_btz912-B19","first-page":"2825","article-title":"Scikit-learn: machine learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J. Mach. Learn. Res"},{"key":"2023062312013926600_btz912-B20","first-page":"37","article-title":"Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation","volume":"2","author":"Powers","year":"2011","journal-title":"J. Mach. Learn. Technol"},{"key":"2023062312013926600_btz912-B21","doi-asserted-by":"crossref","first-page":"8826","DOI":"10.1038\/s41598-017-08366-3","article-title":"Exploring the sequence-based prediction of folding initiation sites in proteins","volume":"7","author":"Raimondi","year":"2017","journal-title":"Sci. Rep"},{"key":"2023062312013926600_btz912-B22","doi-asserted-by":"crossref","first-page":"1219","DOI":"10.1093\/bioinformatics\/btu794","article-title":"Clustering-based model of cysteine co-evolution improves disulfide bond connectivity prediction and reduces homologous sequence requirements","volume":"31","author":"Raimondi","year":"2015","journal-title":"Bioinformatics"},{"key":"2023062312013926600_btz912-B23","doi-asserted-by":"crossref","first-page":"2932","DOI":"10.1016\/j.bpj.2014.10.008","article-title":"Characterization of the conformational fluctuations in the Josephin domain of ataxin-3","volume":"107","author":"Sanfelice","year":"2014","journal-title":"Biophys. J"},{"key":"2023062312013926600_btz912-B24","doi-asserted-by":"crossref","first-page":"1675","DOI":"10.1002\/pro.698","article-title":"Flanking domain stability modulates the aggregation kinetics of a polyglutamine disease protein","volume":"20","author":"Saunders","year":"2011","journal-title":"Protein Sci"},{"key":"2023062312013926600_btz912-B25","doi-asserted-by":"crossref","first-page":"1241","DOI":"10.1074\/mcp.M114.044610","article-title":"Examination of ataxin-3 (atx-3) aggregation by structural mass spectrometry techniques: a rationale for expedited aggregation upon polyglutamine (polyQ) expansion","volume":"14","author":"Scarff","year":"2015","journal-title":"Mol. Cell. Proteomics"},{"key":"2023062312013926600_btz912-B26","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1016\/j.ijms.2012.08.032","article-title":"A tale of a tail: structural insights into the conformational properties of the polyglutamine protein ataxin-3","volume":"345\u2013347","author":"Scarff","year":"2013","journal-title":"Int. J. Mass Spectrom"},{"key":"2023062312013926600_btz912-B27","doi-asserted-by":"crossref","first-page":"847","DOI":"10.1016\/0092-8674(89)90608-9","article-title":"Transgenic mice expressing hamster prion protein produce species-specific scrapie infectivity and amyloid plaques","volume":"59","author":"Scott","year":"1989","journal-title":"Cell"},{"key":"2023062312013926600_btz912-B28","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/j.jsb.2017.09.006","article-title":"Polyglutamine expansion diseases: more than simple repeats","volume":"201","author":"Silva","year":"2018","journal-title":"J. Struct. Biol"},{"key":"2023062312013926600_btz912-B29","doi-asserted-by":"crossref","first-page":"e170","DOI":"10.1371\/journal.pcbi.0020170","article-title":"Insight into the structure of amyloid fibrils from the analysis of globular proteins","volume":"2","author":"Trovato","year":"2006","journal-title":"PLoS Comput. Biol"},{"key":"2023062312013926600_btz912-B30","doi-asserted-by":"crossref","first-page":"e54175","DOI":"10.1371\/journal.pone.0054175","article-title":"A consensus method for the prediction of \u2018aggregation-prone\u2019peptides in globular proteins","volume":"8","author":"Tsolis","year":"2013","journal-title":"PLoS One"},{"key":"2023062312013926600_btz912-B31","doi-asserted-by":"crossref","first-page":"D387","DOI":"10.1093\/nar\/gkx950","article-title":"AmyPro: a database of proteins with validated amyloidogenic regions","volume":"46","author":"Varadi","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2023062312013926600_btz912-B32","doi-asserted-by":"crossref","first-page":"W301","DOI":"10.1093\/nar\/gku399","article-title":"PASTA 2.0: an improved server for protein aggregation prediction","volume":"42","author":"Walsh","year":"2014","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btz912\/31720997\/btz912.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/7\/2076\/50670159\/bioinformatics_36_7_2076.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/7\/2076\/50670159\/bioinformatics_36_7_2076.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,24]],"date-time":"2023-06-24T21:03:09Z","timestamp":1687640589000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/7\/2076\/5670527"}},"subtitle":[],"editor":[{"given":"Yann","family":"Ponty","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2019,12,9]]},"references-count":32,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2020,4,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btz912","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,4,1]]},"published":{"date-parts":[[2019,12,9]]}}}