{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:45:36Z","timestamp":1740185136668,"version":"3.37.3"},"reference-count":29,"publisher":"Oxford University Press (OUP)","issue":"18","funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["GM101457"],"award-info":[{"award-number":["GM101457"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2016,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Repeat proteins, which contain multiple repeats of short sequence motifs, form a large but seldom-studied group of proteins. Methods focusing on the analysis of 3D structures of such proteins identified many subtle effects in length distribution of individual motifs that are important for their functions. However, similar analysis was yet not applied to the vast majority of repeat proteins with unknown 3D structures, mostly because of the extreme diversity of the underlying motifs and the resulting difficulty to detect those.<\/jats:p>\n               <jats:p>Results: We developed FAIT, a sequence-based algorithm for the precise assignment of individual repeats in repeat proteins and introduced a framework to classify and compare aperiodicity patterns for large protein families. FAIT extracts repeat positions by post-processing FFAS alignment matrices with image processing methods. On examples of proteins with Leucine Rich Repeat (LRR) domains and other solenoids like proteins, we show that the automated analysis with FAIT correctly identifies exact lengths of individual repeats based entirely on sequence information.<\/jats:p>\n               <jats:p>Availability and Implementation: \u00a0https:\/\/github.com\/GodzikLab\/FAIT.<\/jats:p>\n               <jats:p>Contact: \u00a0adam@godziklab.org<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btw319","type":"journal-article","created":{"date-parts":[[2016,6,24]],"date-time":"2016-06-24T05:19:32Z","timestamp":1466745572000},"page":"2776-2782","source":"Crossref","is-referenced-by-count":3,"title":["Revealing aperiodic aspects of solenoid proteins from sequence information"],"prefix":"10.1093","volume":"32","author":[{"given":"Thomas","family":"Hrabe","sequence":"first","affiliation":[{"name":"Department of Bioinformatics and Systems Biology, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA"}]},{"given":"Lukasz","family":"Jaroszewski","sequence":"additional","affiliation":[{"name":"Department of Bioinformatics and Systems Biology, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA"}]},{"given":"Adam","family":"Godzik","sequence":"additional","affiliation":[{"name":"Department of Bioinformatics and Systems Biology, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA"}]}],"member":"286","published-online":{"date-parts":[[2016,6,9]]},"reference":[{"key":"2023020113393863200_btw319-B1","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1006\/jsbi.2001.4392","article-title":"Protein repeats: structures, functions, and evolution","volume":"134","author":"Andrade","year":"2001","journal-title":"J. Struct. Biol"},{"key":"2023020113393863200_btw319-B2","doi-asserted-by":"crossref","first-page":"7657","DOI":"10.1038\/srep07657","article-title":"Insights into the species-specific TLR4 signaling mechanism in response to Rhodobacter sphaeroides lipid A detection","volume":"5","author":"Anwar","year":"2015","journal-title":"Sci. Rep"},{"key":"2023020113393863200_btw319-B3","first-page":"103","article-title":"Designs on a curve","volume":"22","author":"Bazan","year":"2015","journal-title":"Nat. Publ. Gr"},{"key":"2023020113393863200_btw319-B4","doi-asserted-by":"crossref","first-page":"807","DOI":"10.1093\/bioinformatics\/btn039","article-title":"De novo identification of highly diverged protein repeats by probabilistic consistency","volume":"24","author":"Biegert","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020113393863200_btw319-B5","doi-asserted-by":"crossref","first-page":"D352","DOI":"10.1093\/nar\/gkt1175","article-title":"RepeatsDB: a database of tandem repeat protein structures","volume":"42","author":"Di Domenico","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023020113393863200_btw319-B6","volume-title":"Computer Vision","author":"Forsyth","year":"2003","edition":"1st edn"},{"key":"2023020113393863200_btw319-B7","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1002\/1097-0134(20001101)41:2<224::AID-PROT70>3.0.CO;2-Z","article-title":"Rapid automatic detection and alignment of repeats in protein sequences","volume":"41","author":"Heger","year":"2000","journal-title":"Proteins"},{"key":"2023020113393863200_btw319-B8","doi-asserted-by":"crossref","first-page":"10915","DOI":"10.1073\/pnas.89.22.10915","article-title":"Amino acid substitution matrices from protein blocks","volume":"89","author":"Henikoff","year":"1992","journal-title":"Proc. Natl. Acad. Sci. U. S. A"},{"key":"2023020113393863200_btw319-B9","doi-asserted-by":"crossref","first-page":"D423","DOI":"10.1093\/nar\/gkv1316","article-title":"PDBFlex: exploring flexibility in protein structures","volume":"44","author":"Hrabe","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023020113393863200_btw319-B10","doi-asserted-by":"crossref","DOI":"10.1002\/9780470015902.a0023175","article-title":"Structure determination by single particle tomography","author":"Hrabe","year":"2011","journal-title":"Encycl. Life Sci"},{"key":"2023020113393863200_btw319-B11","doi-asserted-by":"crossref","first-page":"119.","DOI":"10.1186\/1471-2105-15-119","article-title":"ConSole: using modularity of contact maps to locate Solenoid domains in protein structures","volume":"15","author":"Hrabe","year":"2014","journal-title":"BMC Bioinformatics"},{"first-page":"2194","year":"2001","author":"Jacobson","key":"2023020113393863200_btw319-B12"},{"key":"2023020113393863200_btw319-B13","doi-asserted-by":"crossref","first-page":"W38","DOI":"10.1093\/nar\/gkr441","article-title":"FFAS server: novel features and applications","volume":"39","author":"Jaroszewski","year":"2011","journal-title":"Nucleic Acids Res"},{"key":"2023020113393863200_btw319-B14","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1006\/jmbi.1998.1643","article-title":"Structural diversity of leucine-rich repeat proteins","volume":"277","author":"Kajava","year":"1998","journal-title":"J. Mol. Biol"},{"key":"2023020113393863200_btw319-B15","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1016\/j.jsb.2011.08.009","article-title":"Tandem repeats in proteins: from sequence to structure","volume":"179","author":"Kajava","year":"2012","journal-title":"J. Struct. Biol"},{"key":"2023020113393863200_btw319-B16","doi-asserted-by":"crossref","first-page":"725","DOI":"10.1016\/S0959-440X(01)00266-4","article-title":"The leucine-rich repeat as a protein recognition motif","volume":"11","author":"Kobe","year":"2001","journal-title":"Curr. Opin. Struct. Biol"},{"key":"2023020113393863200_btw319-B17","doi-asserted-by":"crossref","first-page":"15168","DOI":"10.1021\/bi062188q","article-title":"Ankyrin repeat: a unique motif mediating protein-protein interactions","volume":"45","author":"Li","year":"2006","journal-title":"Biochemistry"},{"key":"2023020113393863200_btw319-B18","doi-asserted-by":"crossref","first-page":"582","DOI":"10.1093\/bib\/bbt003","article-title":"Understanding and identifying amino acid repeats","volume":"15","author":"Luo","year":"2014","journal-title":"Brief. Bioinf"},{"key":"2023020113393863200_btw319-B19","doi-asserted-by":"crossref","first-page":"i289","DOI":"10.1093\/bioinformatics\/btp232","article-title":"REPETITA: detection and discrimination of the periodicity of protein solenoid repeats by discrete Fourier transform","volume":"25","author":"Marsella","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020113393863200_btw319-B20","doi-asserted-by":"crossref","first-page":"2771","DOI":"10.1007\/s00018-005-5187-z","article-title":"Structural analysis of leucine-rich-repeat variants in proteins associated with human diseases","volume":"62","author":"Matsushima","year":"2005","journal-title":"Cell. Mol. Life Sci"},{"key":"2023020113393863200_btw319-B21","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1038\/nsmb.2938","article-title":"Control of repeat-protein curvature by computational protein design","volume":"22","author":"Park","year":"2015","journal-title":"Nat. Struct. Mol. Biol. Mol. Biol"},{"key":"2023020113393863200_btw319-B22","doi-asserted-by":"crossref","first-page":"12887","DOI":"10.1021\/jp402105j","article-title":"Detecting repetitions and periodicities in proteins by tiling the structural space","volume":"117","author":"Parra","year":"2013","journal-title":"J. Phys. Chem. B"},{"key":"2023020113393863200_btw319-B23","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1110\/ps.9.2.232","article-title":"Comparison of sequence profiles. Strategies for structural predictions using sequence information","volume":"9","author":"Rychlewski","year":"2000","journal-title":"Protein Sci"},{"key":"2023020113393863200_btw319-B24","first-page":"588","article-title":"An efficient algorithm for automatic peak detection in noisy periodic and quasi-periodic signals","volume-title":"Algorithms","author":"Scholkmann","year":"2012"},{"key":"2023020113393863200_btw319-B25","doi-asserted-by":"crossref","first-page":"470","DOI":"10.1016\/j.tcb.2010.05.003","article-title":"Armadillo-repeat protein functions: questions for little creatures","volume":"20","author":"Tewari","year":"2010","journal-title":"Trends Cell Biol"},{"key":"2023020113393863200_btw319-B26","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/0022-2836(91)90871-3","article-title":"Motif recognition and alignment for many sequences by comparison of dot-matrices","volume":"218","author":"Vingron","year":"1991","journal-title":"J. Mol. Biol"},{"key":"2023020113393863200_btw319-B27","doi-asserted-by":"crossref","first-page":"3257","DOI":"10.1093\/bioinformatics\/bts550","article-title":"RAPHAEL: recognition, periodicity and insertion assignment of solenoid protein structures","volume":"28","author":"Walsh","year":"2012","journal-title":"Bioinformatics"},{"key":"2023020113393863200_btw319-B28","doi-asserted-by":"crossref","first-page":"2264.","DOI":"10.1002\/(SICI)1097-0258(19961030)15:20<2264::AID-SIM386>3.0.CO;2-7","article-title":"Introduction to computational biology: maps, sequences and genomes","volume":"15","author":"Wilson","year":"1996","journal-title":"Stat. Med"},{"key":"2023020113393863200_btw319-B29","doi-asserted-by":"crossref","first-page":"660","DOI":"10.1093\/bioinformatics\/btt578","article-title":"FFAS-3D: Improving fold recognition by including optimized structural features and template re-ranking","volume":"30","author":"Xu","year":"2014","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/18\/2776\/49021430\/bioinformatics_32_18_2776.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/32\/18\/2776\/49021430\/bioinformatics_32_18_2776.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,1]],"date-time":"2023-02-01T23:45:16Z","timestamp":1675295116000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/32\/18\/2776\/1743965"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,6,9]]},"references-count":29,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2016,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btw319","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2016,9,15]]},"published":{"date-parts":[[2016,6,9]]}}}