{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,24]],"date-time":"2026-03-24T19:21:16Z","timestamp":1774380076812,"version":"3.50.1"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"21","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2203,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Selenoproteins are a group of proteins that contain selenocysteine (Sec), a rare amino acid inserted co-translationally into the protein chain. The Sec codon is UGA, which is normally a stop codon. In selenoproteins, UGA is recoded to Sec in presence of specific features on selenoprotein gene transcripts. Due to the dual role of the UGA codon, selenoprotein prediction and annotation are difficult tasks, and even known selenoproteins are often misannotated in genome databases.<\/jats:p>\n               <jats:p>Results: We present an homology-based in silico method to scan genomes for members of the known eukaryotic selenoprotein families: selenoprofiles. The core of the method is a set of manually curated highly reliable multiple sequence alignments of selenoprotein families, which are used as queries to scan genomic sequences. Results of the scan are processed through a number of steps, to produce highly accurate predictions of selenoprotein genes with little or no human intervention. Selenoprofiles is a valuable tool for bioinformatic characterization of eukaryotic selenoproteomes, and can complement genome annotation pipelines.<\/jats:p>\n               <jats:p>Availability and Implementation: Selenoprofiles is a python-built pipeline that internally runs psitblastn, exonerate, genewise, SECISearch and a number of custom-made scripts and programs. The program is available at http:\/\/big.crg.cat\/services\/selenoprofiles. The predictions presented in this article are available through DAS at http:\/\/genome.crg.cat:9000\/das\/Selenoprofiles_ensembl.<\/jats:p>\n               <jats:p>Contact: \u00a0marco.mariotti@crg.es<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq516","type":"journal-article","created":{"date-parts":[[2010,9,23]],"date-time":"2010-09-23T00:31:47Z","timestamp":1285201907000},"page":"2656-2663","source":"Crossref","is-referenced-by-count":47,"title":["Selenoprofiles: profile-based scanning of eukaryotic genome sequences for selenoprotein genes"],"prefix":"10.1093","volume":"26","author":[{"given":"M.","family":"Mariotti","sequence":"first","affiliation":[{"name":"Bioinformatics and genomics group, Center for Genomic Regulation and Universitat Pompeu Fabra, Barcelona, Catalonia, Spain"}]},{"given":"R.","family":"Guig\u00f3","sequence":"additional","affiliation":[{"name":"Bioinformatics and genomics group, Center for Genomic Regulation and Universitat Pompeu Fabra, Barcelona, Catalonia, Spain"}]}],"member":"286","published-online":{"date-parts":[[2010,9,21]]},"reference":[{"key":"2023012507542871000_B1","doi-asserted-by":"crossref","first-page":"1415","DOI":"10.1016\/j.bbagen.2009.03.003","article-title":"The selenium to selenoprotein pathway in eukaryotes: more molecular partners than anticipated","volume":"1790","author":"Allmang","year":"2009","journal-title":"Biochim. Biophys. Acta (BBA) Gen. Subj."},{"key":"2023012507542871000_B2","doi-asserted-by":"crossref","first-page":"3389","DOI":"10.1093\/nar\/25.17.3389","article-title":"Gapped BLAST and PSI-BLAST: a new generation of protein database search programs","volume":"25","author":"Altschul","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012507542871000_B3","doi-asserted-by":"crossref","first-page":"988","DOI":"10.1101\/gr.1865504","article-title":"GeneWise and Genomewise","volume":"14","author":"Birney","year":"2004","journal-title":"Genome Res."},{"key":"2023012507542871000_B4","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1006\/geno.1996.0298","article-title":"Evaluation of gene structure prediction programs","volume":"34","author":"Burset","year":"1996","journal-title":"Genomics"},{"key":"2023012507542871000_B5","doi-asserted-by":"crossref","first-page":"128","DOI":"10.1016\/j.molbiopara.2006.05.002","article-title":"Identification of Leishmania selenoproteins and SECIS element","volume":"149","author":"Cassago","year":"2006","journal-title":"Mol. Biochem. Parasitol."},{"key":"2023012507542871000_B6","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1038\/sj.embor.7400036","article-title":"Reconsidering the evolution of eukaryotic selenoproteins: a novel nonmammalian family with scattered phylogenetic distribution","volume":"5","author":"Castellano","year":"2004","journal-title":"EMBO Rep."},{"key":"2023012507542871000_B7","doi-asserted-by":"crossref","first-page":"16188","DOI":"10.1073\/pnas.0505146102","article-title":"Diversity and functional plasticity of eukaryotic selenoproteins: identification and characterization of the SelJ family","volume":"102","author":"Castellano","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012507542871000_B8","doi-asserted-by":"crossref","first-page":"D332","DOI":"10.1093\/nar\/gkm731","article-title":"SelenoDB 1.0: a database of selenoprotein genes, proteins and SECIS elements","volume":"36","author":"Castellano","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012507542871000_B9","doi-asserted-by":"crossref","first-page":"2031","DOI":"10.1093\/molbev\/msp109","article-title":"Low exchangeability of selenocysteine, the 21st amino acid, in vertebrate proteins","volume":"26","author":"Castellano","year":"2009","journal-title":"Mol. Biol. Evol."},{"key":"2023012507542871000_B10","article-title":"Relaxation of selective constraints causes independent selenoprotein extinction in insect genomes","volume":"8","author":"Chapple","year":"2008","journal-title":"PLoS ONE"},{"key":"2023012507542871000_B11","doi-asserted-by":"crossref","first-page":"674","DOI":"10.1093\/bioinformatics\/btp020","article-title":"SECISaln, a web-based tool for the creation of structure-based alignments of eukaryotic SECIS elements","volume":"25","author":"Chapple","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012507542871000_B12","doi-asserted-by":"crossref","first-page":"1491","DOI":"10.1128\/MCB.21.5.1491-1498.2001","article-title":"Insight into mammalian selenocysteine insertion: domain structure and ribosome binding properties of Sec insertion sequence binding protein 2","volume":"21","author":"Copeland","year":"2001","journal-title":"Mol. Cell Biol."},{"key":"2023012507542871000_B13","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1038\/sj.embor.7400080","article-title":"Finding needles in a haystack. In silico identification of eukaryotic selenoprotein genes","volume":"5","author":"Driscoll","year":"2004","journal-title":"EMBO Rep."},{"key":"2023012507542871000_B14","doi-asserted-by":"crossref","first-page":"2414","DOI":"10.1007\/s00018-005-5143-y","article-title":"Human selenoproteins at a glance","volume":"62","author":"Gromer","year":"2005","journal-title":"Cell Mol. Life Sci."},{"key":"2023012507542871000_B15","doi-asserted-by":"crossref","first-page":"625","DOI":"10.1017\/S1355838299981542","article-title":"Two distinct SECIS structures capable of directing selenocysteine incorporation in eukaryotes","volume":"5","author":"Grundner-Culemann","year":"1999","journal-title":"RNA"},{"issue":"Suppl. 1","key":"2023012507542871000_B16","first-page":"S2.1","article-title":"EGASP: the human ENCODE genome annotation assessment project","volume":"7","author":"Guig\u00f3","year":"2006","journal-title":"Genome Biol."},{"key":"2023012507542871000_B17","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1186\/gb-2009-10-1-201","article-title":"Identifying protein-coding genes in genomic sequences","volume":"10","author":"Harrow","year":"2009","journal-title":"Genome Biol."},{"key":"2023012507542871000_B18","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1016\/S0079-6603(06)81003-2","article-title":"Selenocysteine incorporation machinery and the role of selenoproteins in development and health","volume":"81","author":"Hatfield","year":"2006","journal-title":"Prog. Nucleic Acid Res. Mol. Biol."},{"key":"2023012507542871000_B19","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1186\/1471-2164-11-289","article-title":"In silico identification of the sea squirt selenoproteome","volume":"11","author":"Jiang","year":"2010","journal-title":"BMC Genomics"},{"key":"2023012507542871000_B20","doi-asserted-by":"crossref","first-page":"765","DOI":"10.1016\/S0300-9084(02)01405-0","article-title":"Evolutionarily different RNA motifs and RNA-protein complexes to achieve selenoprotein synthesis","volume":"84","author":"Krol","year":"2002","journal-title":"Biochimie"},{"key":"2023012507542871000_B21","doi-asserted-by":"crossref","first-page":"33888","DOI":"10.1074\/jbc.274.48.33888","article-title":"New mammalian selenocysteine-containing proteins identified with an algorithm that searches for selenocysteine insertion sequence elements","volume":"274","author":"Kryukov","year":"1999","journal-title":"J. Biol. Chem."},{"key":"2023012507542871000_B22","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1016\/S1672-0229(08)60034-0","article-title":"A method for identification of selenoprotein genes in archaeal genomes","volume":"7","author":"Li","year":"2009","journal-title":"Genomics Proteomics Bioinformatics"},{"key":"2023012507542871000_B23","doi-asserted-by":"crossref","first-page":"1424","DOI":"10.1016\/j.bbagen.2009.05.014","article-title":"Eukaryotic selenoproteins and selenoproteomes","volume":"1790","author":"Lobanov","year":"2009","journal-title":"Biochim. Biophys. Acta"},{"key":"2023012507542871000_B24","doi-asserted-by":"crossref","first-page":"4012","DOI":"10.1093\/nar\/gkl541","article-title":"Selenium metabolism in Trypanosoma: characterization of selenoproteomes and identification of a Kinetoplastida-specific selenoprotein","volume":"34","author":"Lobanov","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012507542871000_B25","doi-asserted-by":"crossref","first-page":"496","DOI":"10.1093\/nar\/gkj450","article-title":"The plasmodium selenoproteome","volume":"34","author":"Lobanov","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012507542871000_B26","doi-asserted-by":"crossref","first-page":"R16","DOI":"10.1186\/gb-2010-11-2-r16","article-title":"2x genomes - depth does matter","volume":"11","author":"Milinkovitch","year":"2010","journal-title":"Genome Biol."},{"key":"2023012507542871000_B27","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1006\/jmbi.2000.4042","article-title":"T-Coffee: a novel method for fast and accurate multiple sequence alignment","volume":"302","author":"Notredame","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023012507542871000_B28","doi-asserted-by":"crossref","first-page":"3681","DOI":"10.1093\/emboj\/cdf372","article-title":"Selenoproteins and selenocysteine insertion system in the model plant cell system, Chlamydomonas reinhardtii","volume":"21","author":"Novoselov","year":"2002","journal-title":"EMBO J."},{"key":"2023012507542871000_B29","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1042\/BJ20051569","article-title":"Identification and characterization of Fep15, a new selenocysteine-containing member of the Sep15 protein family","volume":"394","author":"Novoselov","year":"2006","journal-title":"Biochem. J."},{"key":"2023012507542871000_B30","doi-asserted-by":"crossref","first-page":"7857","DOI":"10.1073\/pnas.0610683104","article-title":"A highly efficient form of the selenocysteine insertion sequence element in protozoan parasites and its use in mammalian cells","volume":"104","author":"Novoselov","year":"2007","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012507542871000_B31","doi-asserted-by":"crossref","first-page":"18462","DOI":"10.1074\/jbc.M501517200","article-title":"A novel eukaryotic selenoprotein in the haptophyte alga Emiliania huxleyi","volume":"280","author":"Obata","year":"2005","journal-title":"J. Biol. Chem."},{"key":"2023012507542871000_B32","doi-asserted-by":"crossref","first-page":"7705","DOI":"10.1073\/pnas.0611046104","article-title":"The tiny eukaryote Ostreococcus provides genomic insights into the paradox of plankton speciation","volume":"104","author":"Palenik","year":"2007","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012507542871000_B33","doi-asserted-by":"crossref","first-page":"D5","DOI":"10.1093\/nar\/gkp967","article-title":"Database resources of the National Center for Biotechnology Information","volume":"38","author":"Sayers","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012507542871000_B34","doi-asserted-by":"crossref","first-page":"13919","DOI":"10.1073\/pnas.0703448104","article-title":"Identification and characterization of a selenoprotein family containing a diselenide bond in a redox motif","volume":"104","author":"Shchedrina","year":"2007","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012507542871000_B35","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/1471-2105-6-31","article-title":"Automated generation of heuristics for biological sequence comparison","volume":"6","author":"Slater","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012507542871000_B36","doi-asserted-by":"crossref","first-page":"115","DOI":"10.1042\/BJ20070165","article-title":"Selenophosphate synthetase 2 is essential for selenoprotein biosynthesis","volume":"404","author":"Xu","year":"2007","journal-title":"Biochem. J."},{"key":"2023012507542871000_B37","doi-asserted-by":"crossref","first-page":"2580","DOI":"10.1093\/bioinformatics\/bti400","article-title":"An algorithm for identification of bacterial selenocysteine insertion sequence elements and selenoprotein genes","volume":"21","author":"Zhang","year":"2005","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/21\/2656\/48852702\/bioinformatics_26_21_2656.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/21\/2656\/48852702\/bioinformatics_26_21_2656.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T07:55:00Z","timestamp":1674633300000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/21\/2656\/214067"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,9,21]]},"references-count":37,"journal-issue":{"issue":"21","published-print":{"date-parts":[[2010,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq516","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,11,1]]},"published":{"date-parts":[[2010,9,21]]}}}