{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T11:13:09Z","timestamp":1773659589704,"version":"3.50.1"},"reference-count":59,"publisher":"Oxford University Press (OUP)","issue":"11","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: An increasing amount of evidence from experimental and computational analysis suggests that rare codon clusters are functionally important for protein activity. Most of the studies on rare codon clusters were performed on a limited number of proteins or protein families. In the present study, we present the Sherlocc program and how it can be used for large scale protein family analysis of evolutionarily conserved rare codon clusters and their relation to protein function and structure. This large-scale analysis was performed using the whole Pfam database covering over 70% of the known protein sequence universe. Our program Sherlocc, detects statistically relevant conserved rare codon clusters and produces a user-friendly HTML output.<\/jats:p>\n               <jats:p>Results: Statistically significant rare codon clusters were detected in a multitude of Pfam protein families. The most statistically significant rare codon clusters were predominantly identified in N-terminal Pfam families. Many of the longest rare codon clusters are found in membrane-related proteins which are required to interact with other proteins as part of their function, for example in targeting or insertion. We identified some cases where rare codon clusters can play a regulating role in the folding of catalytically important domains. Our results support the existence of a widespread functional role for rare codon clusters across species. Finally, we developed an online filter-based search interface that provides access to Sherlocc results for all Pfam families.<\/jats:p>\n               <jats:p>Availability: The Sherlocc program and search interface are open access and are available at http:\/\/bcb.med.usherbrooke.ca<\/jats:p>\n               <jats:p>Contact: \u00a0rafael.najmanovich@usherbrooke.ca<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts149","type":"journal-article","created":{"date-parts":[[2012,3,31]],"date-time":"2012-03-31T00:24:47Z","timestamp":1333153487000},"page":"1438-1445","source":"Crossref","is-referenced-by-count":40,"title":["Large-scale analysis of conserved rare codon clusters suggests an involvement in co-translational molecular recognition events"],"prefix":"10.1093","volume":"28","author":[{"given":"Matthieu","family":"Chartier","sequence":"first","affiliation":[{"name":"Department of Biochemistry, Faculty of Medicine and Health Sciences, Universit\u00e9 de Sherbrooke, 12e Avenue Nord, Sherbrooke, J1H 5N4, Qu\u00e9bec, Canada"}]},{"given":"Francis","family":"Gaudreault","sequence":"additional","affiliation":[{"name":"Department of Biochemistry, Faculty of Medicine and Health Sciences, Universit\u00e9 de Sherbrooke, 12e Avenue Nord, Sherbrooke, J1H 5N4, Qu\u00e9bec, Canada"}]},{"given":"Rafael","family":"Najmanovich","sequence":"additional","affiliation":[{"name":"Department of Biochemistry, Faculty of Medicine and Health Sciences, Universit\u00e9 de Sherbrooke, 12e Avenue Nord, Sherbrooke, J1H 5N4, Qu\u00e9bec, Canada"}]}],"member":"286","published-online":{"date-parts":[[2012,3,30]]},"reference":[{"key":"2023012512313898800_B1","doi-asserted-by":"crossref","first-page":"D32","DOI":"10.1093\/nar\/gkq1079","article-title":"GenBank","volume":"39","author":"Benson","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023012512313898800_B2","doi-asserted-by":"crossref","first-page":"313","DOI":"10.1038\/nsmb.1756","article-title":"alpha-Helical nascent polypeptide chains visualized within distinct regions of the ribosomal exit tunnel","volume":"17","author":"Bhushan","year":"2010","journal-title":"Nat. Struct. Mol. Biol."},{"key":"2023012512313898800_B3","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1016\/0014-5793(85)81048-6","article-title":"Rare codons in E. coli and S. typhimurium signal sequences","volume":"189","author":"Burns","year":"1985","journal-title":"FEBS Lett."},{"key":"2023012512313898800_B4","doi-asserted-by":"crossref","first-page":"2005","DOI":"10.1093\/bioinformatics\/btg272","article-title":"Codon adaptation index as a measure of dominating codon bias","volume":"19","author":"Carbone","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012512313898800_B5","doi-asserted-by":"crossref","first-page":"e3412","DOI":"10.1371\/journal.pone.0003412","article-title":"Rare codons cluster","volume":"3","author":"Clarke","year":"2008","journal-title":"PLoS ONE"},{"key":"2023012512313898800_B6","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1186\/1471-2164-11-118","article-title":"Increased incidence of rare codon clusters at 5\u2032and 3\u2032gene termini: implications for function","volume":"11","author":"Clarke","year":"2010","journal-title":"BMC genomics"},{"key":"2023012512313898800_B7","doi-asserted-by":"crossref","first-page":"537","DOI":"10.1016\/S0006-291X(02)00226-7","article-title":"Silent mutations affect in vivo protein folding in Escherichia coli","volume":"293","author":"Cortazzo","year":"2002","journal-title":"Biochem. Biophys. Res. Commun."},{"key":"2023012512313898800_B8","doi-asserted-by":"crossref","first-page":"641","DOI":"10.1002\/biot.201000329","article-title":"The imprint of codons on protein structure","volume":"6","author":"Deane","year":"2011","journal-title":"Biotechnol. J."},{"key":"2023012512313898800_B9","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1016\/S0378-1119(00)00002-0","article-title":"The PAUSE software for analysis of translational control over protein targeting: application to E. nidulans membrane proteins","volume":"244","author":"Dessen","year":"2000","journal-title":"Gene"},{"key":"2023012512313898800_B10","doi-asserted-by":"crossref","first-page":"649","DOI":"10.1006\/jmbi.1996.0428","article-title":"Co-variation of tRNA abundance and codon usage in Escherichia coli at different growth rates","volume":"260","author":"Dong","year":"1996","journal-title":"J. Mol. Biol."},{"key":"2023012512313898800_B11","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1016\/S0168-9525(00)02041-2","article-title":"tRNA gene number and codon usage in the C. elegans genome are co-adapted for optimal translation of highly expressed genes","volume":"16","author":"Duret","year":"2000","journal-title":"Trends Genet."},{"key":"2023012512313898800_B12","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1038\/nsmb0504-391","article-title":"The dynamic tunnel","volume":"11","author":"Etchells","year":"2004","journal-title":"Nat. Struct. Mol. Biol."},{"key":"2023012512313898800_B13","doi-asserted-by":"crossref","first-page":"D211","DOI":"10.1093\/nar\/gkp985","article-title":"The Pfam protein families database","volume":"38","author":"Finn","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012512313898800_B14","doi-asserted-by":"crossref","first-page":"572","DOI":"10.1038\/msb.2012.3","article-title":"Genes adopt non-optimal codon usage to generate cell cycle-dependent oscillations in protein levels","volume":"8","author":"Frenkel-Morgenstern","year":"2012","journal-title":"Mol. Syst. Biol."},{"key":"2023012512313898800_B15","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1016\/S0022-2836(03)00345-0","article-title":"Crystal structure of Proteus vulgaris chondroitin sulfate ABC lyase I at 1.9A resolution","volume":"328","author":"Huang","year":"2003","journal-title":"J. Mol. Biol."},{"key":"2023012512313898800_B16","first-page":"13","article-title":"Codon usage and tRNA content in unicellular and multicellular organisms","volume":"2","author":"Ikemura","year":"1985","journal-title":"Mol. Biol. Evol."},{"key":"2023012512313898800_B17","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1002\/biot.201000327","article-title":"Birth, life and death of nascent polypeptide chains","volume":"6","author":"Jha","year":"2011","journal-title":"Biotechnol. J."},{"key":"2023012512313898800_B18","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1006\/abbi.1996.9869","article-title":"Changes of tRNA population during compensatory cell proliferation: differential expression of methionine-tRNA species","volume":"342","author":"Kanduc","year":"1997","journal-title":"Arch. Biochem. Biophys."},{"key":"2023012512313898800_B19","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1006\/jmbi.1996.0500","article-title":"The \u201c+70 pause\u201d: hypothesis of a translational control of membrane protein assembly","volume":"262","author":"K\u00e9p\u00e8s","year":"1996","journal-title":"J.Mol.Biol."},{"key":"2023012512313898800_B20","doi-asserted-by":"crossref","first-page":"14931","DOI":"10.1016\/S0021-9258(18)98567-4","article-title":"Ribosomes pause at specific sites during synthesis of membrane-bound chloroplast reaction center protein D1","volume":"266","author":"Kim","year":"1991","journal-title":"J. Biol. Chem."},{"key":"2023012512313898800_B21","doi-asserted-by":"crossref","first-page":"525","DOI":"10.1126\/science.1135308","article-title":"A \u201csilent\u201d polymorphism in the MDR1 gene changes substrate specificity","volume":"315","author":"Kimchi-Sarfaty","year":"2007","journal-title":"Science"},{"key":"2023012512313898800_B22","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1016\/j.tibs.2008.10.002","article-title":"A pause for thought along the co-translational folding pathway","volume":"34","author":"Komar","year":"2009","journal-title":"Trends Biochem. Sci."},{"key":"2023012512313898800_B23","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/0014-5793(95)01275-0","article-title":"Kinetics of translation of gamma B crystallin and its circularly permutated variant in an in vitro cell-free system: possible relations to codon distribution and protein folding","volume":"376","author":"Komar","year":"1995","journal-title":"FEBS Lett."},{"key":"2023012512313898800_B24","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1016\/S0014-5793(99)01566-5","article-title":"Synonymous codon substitutions affect ribosome traffic and protein folding during in vitro translation","volume":"462","author":"Komar","year":"1999","journal-title":"FEBS Lett."},{"key":"2023012512313898800_B25","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1007\/BF01025472","article-title":"Nonuniform size distribution of nascent globin peptides, evidence for pause localization sites, and a contranslational protein-folding model","volume":"10","author":"Krasheninnikov","year":"1991","journal-title":"J. Protein Chem."},{"key":"2023012512313898800_B26","doi-asserted-by":"crossref","first-page":"614","DOI":"10.1016\/j.jmb.2005.05.067","article-title":"Protein function prediction using local 3D templates","volume":"351","author":"Laskowski","year":"2005","journal-title":"J. Mol. Biol."},{"key":"2023012512313898800_B27","doi-asserted-by":"crossref","first-page":"D28","DOI":"10.1093\/nar\/gkq967","article-title":"The European nucleotide archive","volume":"39","author":"Leinonen","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023012512313898800_B28","doi-asserted-by":"crossref","first-page":"175","DOI":"10.1006\/jtbi.1999.1047","article-title":"Ribosome traffic in E. coli and regulation of gene expression","volume":"202","author":"Lesnik","year":"2000","journal-title":"J. Theor. Biol."},{"key":"2023012512313898800_B29","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1016\/j.jmb.2008.08.089","article-title":"Electrostatics in the ribosomal tunnel modulate chain elongation rates","volume":"384","author":"Lu","year":"2008","journal-title":"J. Mol. Biol."},{"key":"2023012512313898800_B30","doi-asserted-by":"crossref","first-page":"bar009","DOI":"10.1093\/database\/bar009","article-title":"UniProt Knowledgebase: a hub of integrated protein data","volume":"2011","author":"Magrane","year":"2011","journal-title":"Database"},{"key":"2023012512313898800_B31","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1080\/07391102.2002.10506859","article-title":"Distribution of rare triplets along mRNA and their relation to protein folding","volume":"20","author":"Makhoul","year":"2002","journal-title":"J. Biomol. Struct. Dyn."},{"key":"2023012512313898800_B32","doi-asserted-by":"crossref","first-page":"514","DOI":"10.1007\/PL00006256","article-title":"Codon usage bias and tRNA abundance in Drosophila","volume":"45","author":"Moriyama","year":"1997","journal-title":"J. Mol. Evol."},{"key":"2023012512313898800_B33","first-page":"2365","article-title":"Assembly of the D1 precursor in monomeric photosystem II reaction center precomplexes precedes chlorophyll a-triggered accumulation of reaction center II in barley etioplasts","volume":"11","author":"M\u00fcller","year":"1999","journal-title":"Plant Cell"},{"key":"2023012512313898800_B34","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1093\/nar\/28.1.292","article-title":"Codon usage tabulated from international DNA sequence databases: status for the year 2000","volume":"28","author":"Nakamura","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023012512313898800_B35","doi-asserted-by":"crossref","first-page":"1093","DOI":"10.1016\/S0969-2126(97)00260-8","article-title":"CATH\u2014a hierarchic classification of protein domain structures","volume":"5","author":"Orengo","year":"1997","journal-title":"Structure"},{"key":"2023012512313898800_B36","doi-asserted-by":"crossref","first-page":"e1000548","DOI":"10.1371\/journal.pgen.1000548","article-title":"Clustering of codons with rare cognate tRNAs in human genes suggests an extra level of expression regulation","volume":"5","author":"Parmley","year":"2009","journal-title":"PLoS Genetics"},{"key":"2023012512313898800_B37","doi-asserted-by":"crossref","first-page":"2444","DOI":"10.1073\/pnas.85.8.2444","article-title":"Improved tools for biological sequence comparison","volume":"85","author":"Pearson","year":"1988","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012512313898800_B38","doi-asserted-by":"crossref","first-page":"322","DOI":"10.1006\/jmbi.1997.0942","article-title":"Transfer RNA gene redundancy and translational selection in Saccharomyces cerevisiae","volume":"268","author":"Percudani","year":"1997","journal-title":"J. Mol. Biol."},{"key":"2023012512313898800_B39","doi-asserted-by":"crossref","first-page":"1038","DOI":"10.1016\/j.bbrc.2004.08.022","article-title":"Whole genome analysis reveals a high incidence of non-optimal codons in secretory signal sequences of Escherichia coli","volume":"322","author":"Power","year":"2004","journal-title":"Biochem. Biophys. Res. Commun."},{"key":"2023012512313898800_B40","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1016\/0022-2836(87)90230-0","article-title":"The efficiency of folding of some proteins is increased by controlled rates of translation in vivo. A hypothesis","volume":"193","author":"Purvis","year":"1987","journal-title":"J. Mol. Biol."},{"key":"2023012512313898800_B41","doi-asserted-by":"crossref","first-page":"535","DOI":"10.1111\/j.1600-0854.2011.01171.x","article-title":"Molecular mechanism of co-translational protein targeting by the signal recognition particle","volume":"12","author":"Saraogi","year":"2011","journal-title":"Traffic"},{"key":"2023012512313898800_B42","doi-asserted-by":"crossref","first-page":"6719","DOI":"10.1093\/nar\/gkq495","article-title":"Synonymous codon usage influences the local protein structure observed","volume":"38","author":"Saunders","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012512313898800_B43","author":"Schr\u00f6dinger, LLC","journal-title":"The PyMol Molecular Graphics System"},{"key":"2023012512313898800_B44","doi-asserted-by":"crossref","first-page":"1412","DOI":"10.1126\/science.1177662","article-title":"Structural insight into nascent polypeptide chain-mediated translational stalling","volume":"326","author":"Seidelt","year":"2009","journal-title":"Science"},{"key":"2023012512313898800_B45","doi-asserted-by":"crossref","first-page":"1281","DOI":"10.1093\/nar\/15.3.1281","article-title":"The codon Adaptation Index\u2014a measure of directional synonymous codon usage bias, and its potential applications","volume":"15","author":"Sharp","year":"1987","journal-title":"Nucleic Acids Res."},{"key":"2023012512313898800_B46","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1093\/nar\/13.1.275","article-title":"The secondary structure of mRNAs from Escherichia coli: its possible role in increasing the accuracy of translation","volume":"13","author":"Shpaer","year":"1985","journal-title":"Nucleic Acids Res."},{"key":"2023012512313898800_B47","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1016\/0022-2836(89)90260-X","article-title":"Codon usage determines translation rate in Escherichia coli","volume":"207","author":"S\u00f8rensen","year":"1989","journal-title":"J. Mol. Biol."},{"key":"2023012512313898800_B48","doi-asserted-by":"crossref","first-page":"1973","DOI":"10.1002\/pro.5560051003","article-title":"Protein secondary structural types are differentially coded on messenger RNA","volume":"5","author":"Thanaraj","year":"1996","journal-title":"Protein Sci."},{"key":"2023012512313898800_B49","doi-asserted-by":"crossref","first-page":"1594","DOI":"10.1002\/pro.5560050814","article-title":"Ribosome-mediated translational pause and protein domain organization","volume":"5","author":"Thanaraj","year":"1996","journal-title":"Protein Sci."},{"key":"2023012512313898800_B50","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1016\/j.jmb.2008.08.012","article-title":"Synonymous mutations and ribosome stalling can lead to altered folding pathways and distinct minima","volume":"383","author":"Tsai","year":"2008","journal-title":"J. Mol. Biol."},{"key":"2023012512313898800_B51","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1002\/1097-0134(20001115)41:3<415::AID-PROT130>3.0.CO;2-7","article-title":"Why are \u201cnatively unfolded\u201d proteins unstructured under physiologic conditions?","volume":"41","author":"Uversky","year":"2000","journal-title":"Proteins"},{"key":"2023012512313898800_B52","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1016\/0022-2836(84)90027-5","article-title":"Translation is a non-uniform process. Effect of tRNA availability on the rate of elongation of nascent polypeptide chains","volume":"180","author":"Varenne","year":"1984","journal-title":"J. Mol. Biol."},{"key":"2023012512313898800_B53","doi-asserted-by":"crossref","first-page":"D262","DOI":"10.1093\/nar\/gki058","article-title":"E-MSD: an integrated data resource for bioinformatics","volume":"33","author":"Velankar","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012512313898800_B54","doi-asserted-by":"crossref","first-page":"866","DOI":"10.1016\/j.bbamem.2010.08.014","article-title":"Inserting membrane proteins: the YidC\/Oxa1\/Alb3 machinery in bacteria, mitochondria, and chloroplasts","volume":"1808","author":"Wang","year":"2011","journal-title":"Biochim. Biophys. Acta."},{"key":"2023012512313898800_B55","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1186\/1471-2164-9-207","article-title":"Analysis of the distribution of functionally relevant rare codons","volume":"9","author":"Widmann","year":"2008","journal-title":"BMC Genomics"},{"key":"2023012512313898800_B56","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1016\/j.bbrc.2007.01.126","article-title":"Experimental confirmation of a key role for non-optimal codons in protein export","volume":"355","author":"Zalucki","year":"2007","journal-title":"Biochem. Biophys. Res. Commun."},{"key":"2023012512313898800_B57","doi-asserted-by":"crossref","first-page":"660","DOI":"10.1002\/biot.201000334","article-title":"Coupling between codon usage, translation and protein export in Escherichia coli","volume":"6","author":"Zalucki","year":"2011","journal-title":"Biotechnol. J."},{"key":"2023012512313898800_B58","first-page":"97","article-title":"Discontinuous translation and mRNA secondary structure","volume":"34","author":"Zama","year":"1995","journal-title":"Nucleic Acids Symp. Ser."},{"key":"2023012512313898800_B59","doi-asserted-by":"crossref","first-page":"16062","DOI":"10.1074\/jbc.274.23.16062","article-title":"Co-translational assembly of the D1 protein into photosystem II","volume":"274","author":"Zhang","year":"1999","journal-title":"J. Biol. Chem."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/11\/1438\/48869486\/bioinformatics_28_11_1438.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/11\/1438\/48869486\/bioinformatics_28_11_1438.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T16:02:20Z","timestamp":1674662540000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/11\/1438\/265476"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,3,30]]},"references-count":59,"journal-issue":{"issue":"11","published-print":{"date-parts":[[2012,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts149","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,6,1]]},"published":{"date-parts":[[2012,3,30]]}}}