{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T01:58:56Z","timestamp":1775613536849,"version":"3.50.1"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2024,11,19]],"date-time":"2024-11-19T00:00:00Z","timestamp":1731974400000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"European Union\u2019s Horizon 2020 research and innovation programme","award":["823886"],"award-info":[{"award-number":["823886"]}]},{"DOI":"10.13039\/501100000921","name":"European Cooperation in Science and Technology","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100000921","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2024,11,28]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>Structured Tandem Repeats Proteins (STRPs) constitute a subclass of tandem repeats characterized by repetitive structural motifs. These proteins exhibit distinct secondary structures that form repetitive tertiary arrangements, often resulting in large molecular assemblies. Despite highly variable sequences, STRPs can perform important and diverse biological functions, maintaining a consistent structure with a variable number of repeat units. With the advent of protein structure prediction methods, millions of 3D models of proteins are now publicly available. However, automatic detection of STRPs remains challenging with current state-of-the-art tools due to their lack of accuracy and long execution times, hindering their application on large datasets. In most cases, manual curation remains the most accurate method for detecting and classifying STRPs, making it impracticable to annotate millions of structures.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We introduce STRPsearch, a novel tool for the rapid identification, classification, and mapping of STRPs. Leveraging manually curated entries from RepeatsDB as the known conformational space of STRPs, STRPsearch uses the latest advances in structural alignment for a fast and accurate detection of repeated structural motifs in proteins, followed by an innovative approach to map units and insertions through the generation of TM-score profiles. STRPsearch is highly scalable, efficiently processing large datasets, and can be applied to both experimental structures and predicted models. In addition, it demonstrates superior performance compared to existing tools, offering researchers a reliable and comprehensive solution for STRP analysis across diverse proteomes.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>STRPsearch is coded in Python. All scripts and associated documentation are available from: https:\/\/github.com\/BioComputingUP\/STRPsearch.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btae690","type":"journal-article","created":{"date-parts":[[2024,11,14]],"date-time":"2024-11-14T15:26:05Z","timestamp":1731597965000},"source":"Crossref","is-referenced-by-count":7,"title":["STRPsearch: fast detection of structured tandem repeat proteins"],"prefix":"10.1093","volume":"40","author":[{"given":"Soroush","family":"Mozaffari","sequence":"first","affiliation":[{"name":"Department of Biomedical Sciences, University of Padova , Padova 35121,","place":["Italy"]}]},{"given":"Paula Nazarena","family":"Arr\u00edas","sequence":"additional","affiliation":[{"name":"Department of Biomedical Sciences, University of Padova , Padova 35121,","place":["Italy"]},{"name":"Department of Protein Science, KTH Royal Institute of Technology , Stockholm SE-10691,","place":["Sweden"]}]},{"given":"Damiano","family":"Clementel","sequence":"additional","affiliation":[{"name":"Department of Biomedical Sciences, University of Padova , Padova 35121,","place":["Italy"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8210-2390","authenticated-orcid":false,"given":"Damiano","family":"Piovesan","sequence":"additional","affiliation":[{"name":"Department of Biomedical Sciences, University of Padova , Padova 35121,","place":["Italy"]}]},{"given":"Carlo","family":"Ferrari","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, University of Padua , Padova 35121,","place":["Italy"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4525-7793","authenticated-orcid":false,"given":"Silvio C E","family":"Tosatto","sequence":"additional","affiliation":[{"name":"Department of Biomedical Sciences, University of Padova , Padova 35121,","place":["Italy"]},{"name":"Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM) , Bari 70126,","place":["Italy"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0362-8218","authenticated-orcid":false,"given":"Alexander Miguel","family":"Monzon","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, University of Padua , Padova 35121,","place":["Italy"]}]}],"member":"286","published-online":{"date-parts":[[2024,11,18]]},"reference":[{"key":"2024121404151416200_btae690-B1","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J Mol Biol"},{"key":"2024121404151416200_btae690-B2","doi-asserted-by":"crossref","first-page":"108001","DOI":"10.1016\/j.jsb.2023.108001","article-title":"The repetitive structure of DNA clamps: an overlooked protein tandem repeat","volume":"215","author":"Arr\u00edas","year":"2023","journal-title":"J Struct Biol"},{"key":"2024121404151416200_btae690-B3","doi-asserted-by":"crossref","first-page":"871","DOI":"10.1126\/science.abj8754","article-title":"Accurate prediction of protein structures and interactions using a three-track neural network","volume":"373","author":"Baek","year":"2021","journal-title":"Science"},{"key":"2024121404151416200_btae690-B4","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1038\/nature16162","article-title":"Exploring the repeat protein universe through computational protein design","volume":"528","author":"Brunette","year":"2015","journal-title":"Nature"},{"key":"2024121404151416200_btae690-B19","doi-asserted-by":"publisher","DOI":"10.1093\/nar\/gkae965","article-title":"RepeatsDB in 2025: expanding annotations of structured tandem repeats proteins on AlphaFoldDB","author":"Clementel","year":"2024","journal-title":"Nucleic Acids Res"},{"key":"2024121404151416200_btae690-B5","doi-asserted-by":"crossref","first-page":"697","DOI":"10.1146\/annurev-cellbio-092910-154111","article-title":"Role of leucine-rich repeat proteins in the development and function of neural circuits","volume":"27","author":"de Wit","year":"2011","journal-title":"Annu Rev Cell Dev Biol"},{"key":"2024121404151416200_btae690-B6","doi-asserted-by":"crossref","first-page":"691865","DOI":"10.3389\/fbinf.2021.691865","article-title":"TRAL 2.0: tandem repeat detection with circular profile hidden Markov models and evolutionary aligner","volume":"1","author":"Delucchi","year":"2021","journal-title":"Front Bioinform"},{"key":"2024121404151416200_btae690-B7","doi-asserted-by":"publisher","first-page":"407","DOI":"10.3390\/genes11040407","article-title":"A new census of protein tandem repeats and their relationship with intrinsic disorder","volume":"11","author":"Delucchi","year":"2020","journal-title":"Genes (Basel)"},{"key":"2024121404151416200_btae690-B8","doi-asserted-by":"crossref","first-page":"D352","DOI":"10.1093\/nar\/gkt1175","article-title":"RepeatsDB: a database of tandem repeat protein structures","volume":"42","author":"Di Domenico","year":"2014","journal-title":"Nucleic Acids Res"},{"key":"2024121404151416200_btae690-B9","doi-asserted-by":"crossref","first-page":"2611","DOI":"10.1016\/j.febslet.2015.08.025","article-title":"TAPO: a combined method for the identification of tandem repeats in protein structures","volume":"589","author":"Do Viet","year":"2015","journal-title":"FEBS Lett"},{"key":"2024121404151416200_btae690-B10","doi-asserted-by":"crossref","first-page":"e79894","DOI":"10.1371\/journal.pone.0079894","article-title":"Functional and genomic analyses of alpha-solenoid proteins","volume":"8","author":"Fournier","year":"2013","journal-title":"PLoS One"},{"key":"2024121404151416200_btae690-B11","doi-asserted-by":"crossref","first-page":"W402","DOI":"10.1093\/nar\/gky360","article-title":"RepeatsDB-lite: a web server for unit annotation of tandem repeat proteins","volume":"46","author":"Hirsh","year":"2018","journal-title":"Nucleic Acids Res"},{"key":"2024121404151416200_btae690-B12","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1016\/j.sbi.2014.04.007","article-title":"Design of proteins from smaller fragments\u2014learning from evolution","volume":"27","author":"H\u00f6cker","year":"2014","journal-title":"Curr Opin Struct Biol"},{"key":"2024121404151416200_btae690-B13","doi-asserted-by":"crossref","first-page":"583","DOI":"10.1038\/s41586-021-03819-2","article-title":"Highly accurate protein structure prediction with AlphaFold","volume":"596","author":"Jumper","year":"2021","journal-title":"Nature"},{"key":"2024121404151416200_btae690-B14","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1016\/j.jsb.2011.08.009","article-title":"Tandem repeats in proteins: from sequence to structure","volume":"179","author":"Kajava","year":"2012","journal-title":"J Struct Biol"},{"key":"2024121404151416200_btae690-B15","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1016\/j.jsb.2017.12.011","article-title":"Editorial for special issue \u201cproteins with tandem repeats: sequences, structures and functions\u201d","volume":"201","author":"Kajava","year":"2018","journal-title":"J Struct Biol"},{"key":"2024121404151416200_btae690-B16","doi-asserted-by":"crossref","first-page":"166895","DOI":"10.1016\/j.jmb.2021.166895","article-title":"REP2: a web server to detect common tandem repeats in protein sequences","volume":"433","author":"Kamel","year":"2021","journal-title":"J Mol Biol"},{"key":"2024121404151416200_btae690-B17","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1038\/nrg2303","article-title":"Toll-like receptors\u2013taking an evolutionary approach","volume":"9","author":"Leulier","year":"2008","journal-title":"Nat Rev Genet"},{"key":"2024121404151416200_btae690-B18","doi-asserted-by":"crossref","first-page":"108023","DOI":"10.1016\/j.jsb.2023.108023","article-title":"A STRP-ed definition of structured tandem repeats in proteins","volume":"215","author":"Monzon","year":"2023","journal-title":"J Struct Biol"},{"key":"2024121404151416200_btae690-B20","author":"Schr\u00f6dinger","year":"2015"},{"key":"2024121404151416200_btae690-B21","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1038\/s41587-023-01773-0","article-title":"Fast and accurate protein structure search with foldseek","volume":"42","author":"van Kempen","year":"2024","journal-title":"Nat Biotechnol"},{"key":"2024121404151416200_btae690-B22","doi-asserted-by":"crossref","first-page":"D439","DOI":"10.1093\/nar\/gkab1061","article-title":"AlphaFold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models","volume":"50","author":"Varadi","year":"2022","journal-title":"Nucleic Acids Res"},{"key":"2024121404151416200_btae690-B23","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1038\/s41586-023-05909-9","article-title":"De novo design of modular peptide-binding proteins by superhelical matching","volume":"616","author":"Wu","year":"2023","journal-title":"Nature"},{"key":"2024121404151416200_btae690-B24","doi-asserted-by":"crossref","first-page":"2302","DOI":"10.1093\/nar\/gki524","article-title":"TM-align: a protein structure alignment algorithm based on the TM-score","volume":"33","author":"Zhang","year":"2005","journal-title":"Nucleic Acids Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btae690\/60744097\/btae690.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/12\/btae690\/60924516\/btae690.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/40\/12\/btae690\/60924516\/btae690.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,13]],"date-time":"2024-12-13T23:15:33Z","timestamp":1734131733000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btae690\/7903285"}},"subtitle":[],"editor":[{"given":"Xin","family":"Gao","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2024,11,18]]},"references-count":24,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2024,11,28]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btae690","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2024.07.10.602726","asserted-by":"object"}]},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2024,12]]},"published":{"date-parts":[[2024,11,18]]},"article-number":"btae690"}}