{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T08:19:42Z","timestamp":1760170782309},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"24","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Assessing the false positive rate of function prediction methods is difficult, as it is hard to establish that a protein does not have a certain function. To determine to what extent proteins with similar sequences have a common function, we focused on photosynthesis-related proteins. A protein that comes from a non-photosynthetic organism is, undoubtedly, not involved in photosynthesis.<\/jats:p>\n               <jats:p>Results: We show that function diverges very rapidly: 70% of the close homologs of photosynthetic proteins come from non-photosynthetic organisms. Therefore, high sequence similarity, in most cases, is not tantamount to similar function. However, we found that many functionally similar proteins often share short sequence elements, which may correspond to a functional site and could reveal functional similarities more accurately than sequence similarity.<\/jats:p>\n               <jats:p>Conclusions: These results shed light on the way biological function is conserved in evolution and may help improve large-scale analysis of protein function.<\/jats:p>\n               <jats:p>Contact: \u00a0yanay@ofranlab.org<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts608","type":"journal-article","created":{"date-parts":[[2012,10,19]],"date-time":"2012-10-19T01:31:14Z","timestamp":1350610274000},"page":"3203-3210","source":"Crossref","is-referenced-by-count":16,"title":["Assessing the relationship between conservation of function and conservation of sequence using photosynthetic proteins"],"prefix":"10.1093","volume":"28","author":[{"given":"Shaul","family":"Ashkenazi","sequence":"first","affiliation":[{"name":"The Goodman faculty of life sciences, Bar Ilan University, Ramat Gan 52900, Israel"}]},{"given":"Rotem","family":"Snir","sequence":"additional","affiliation":[{"name":"The Goodman faculty of life sciences, Bar Ilan University, Ramat Gan 52900, Israel"}]},{"given":"Yanay","family":"Ofran","sequence":"additional","affiliation":[{"name":"The Goodman faculty of life sciences, Bar Ilan University, Ramat Gan 52900, Israel"}]}],"member":"286","published-online":{"date-parts":[[2012,10,18]]},"reference":[{"key":"2023012513244146500_bts608-B1","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1385\/MB:12:3:241","article-title":"Protein consensus sequence motifs","volume":"12","author":"Aitken","year":"1999","journal-title":"Mol. Biotechnol."},{"key":"2023012513244146500_bts608-B2","doi-asserted-by":"crossref","first-page":"W202","DOI":"10.1093\/nar\/gkp335","article-title":"Meme suite: tools for motif discovery and searching","volume":"37","author":"Bailey","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012513244146500_bts608-B3","doi-asserted-by":"crossref","first-page":"W369","DOI":"10.1093\/nar\/gkl198","article-title":"Meme: discovering and analyzing dna and protein sequence motifs","volume":"34","author":"Bailey","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012513244146500_bts608-B4","doi-asserted-by":"crossref","first-page":"S16","DOI":"10.1186\/1471-2105-6-S1-S16","article-title":"Evaluation of biocreative assessment of task 2","volume":"6","author":"Blaschke","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012513244146500_bts608-B5","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1093\/nar\/gkg095","article-title":"The swiss-prot protein knowledgebase and its supplement trembl in 2003","volume":"31","author":"Boeckmann","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012513244146500_bts608-B6","doi-asserted-by":"crossref","first-page":"313","DOI":"10.1038\/ng0498-313","article-title":"Predicting functions from protein sequences\u2013where are the bottlenecks?","volume":"18","author":"Bork","year":"1998","journal-title":"Nat. Genet."},{"key":"2023012513244146500_bts608-B7","doi-asserted-by":"crossref","first-page":"132","DOI":"10.1016\/S0168-9525(99)01706-0","article-title":"Errors in genome annotation","volume":"15","author":"Brenner","year":"1999","journal-title":"Trends Genet."},{"key":"2023012513244146500_bts608-B8","doi-asserted-by":"crossref","first-page":"D169","DOI":"10.1093\/nar\/gkn664","article-title":"The universal protein resource (uniprot) 2009","volume":"37","author":"Consortium","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012513244146500_bts608-B9","doi-asserted-by":"crossref","first-page":"14679","DOI":"10.1073\/pnas.1001665107","article-title":"Targeted metagenomics and ecology of globally important uncultured eukaryotic phytoplankton","volume":"107","author":"Cuvelier","year":"2010","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012513244146500_bts608-B10","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1016\/S0168-9525(01)02348-4","article-title":"Intrinsic errors in genome annotation","volume":"17","author":"Devos","year":"2001","journal-title":"Trends Genet."},{"key":"2023012513244146500_bts608-B11","doi-asserted-by":"crossref","first-page":"E1000798","DOI":"10.1371\/journal.pcbi.1000798","article-title":"Expansion of the protein repertoire in newly explored environments: human gut microbiome specific protein families","volume":"6","author":"Ellrott","year":"2010","journal-title":"PLoS Comput. Biol."},{"key":"2023012513244146500_bts608-B12","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/S0168-9525(00)02005-9","article-title":"Homology a personal view on some of the problems","volume":"16","author":"Fitch","year":"2000","journal-title":"Trends Genet."},{"key":"2023012513244146500_bts608-B13","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1016\/j.jash.2009.05.001","article-title":"A HMGCR polymorphism is associated with relations between blood pressure and urinary sodium and potassium ratio in the Epic-Norfolk study","volume":"3","author":"Freitas","year":"2009","journal-title":"J. Am. Soc. Hypertens."},{"key":"2023012513244146500_bts608-B14","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1110\/ps.062158406","article-title":"New avenues in protein function prediction","volume":"15","author":"Friedberg","year":"2006","journal-title":"Protein Sci."},{"key":"2023012513244146500_bts608-B15","doi-asserted-by":"crossref","first-page":"REVIEWS0005","DOI":"10.1186\/gb-2000-1-5-reviews0005","article-title":"Can sequence determine function?","volume":"1","author":"Gerlt","year":"2000","journal-title":"Genome Biol."},{"key":"2023012513244146500_bts608-B16","doi-asserted-by":"crossref","first-page":"1067","DOI":"10.1073\/pnas.0335769100","article-title":"Plant-like traits associated with metabolism of trypanosoma parasites","volume":"100","author":"Hannaert","year":"2003","journal-title":"Proc Natl Acad. Sci. USA"},{"key":"2023012513244146500_bts608-B17","doi-asserted-by":"crossref","first-page":"13913","DOI":"10.1073\/pnas.0702636104","article-title":"Quantitative assessment of protein function prediction from metagenomics shotgun sequences","volume":"104","author":"Harrington","year":"2007","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012513244146500_bts608-B18","doi-asserted-by":"crossref","first-page":"D188","DOI":"10.1093\/nar\/gki096","article-title":"ADDA: a domain database with global coverage of the protein universe","volume":"33","author":"Heger","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012513244146500_bts608-B19","doi-asserted-by":"crossref","first-page":"S2","DOI":"10.1186\/1471-2105-9-S5-S2","article-title":"Gene ontology annotations: what they mean and where they come from","volume":"9","author":"Hill","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012513244146500_bts608-B21","doi-asserted-by":"crossref","first-page":"E167","DOI":"10.1371\/journal.pcbi.0030167","article-title":"Functional representation of enzymes by specific peptides","volume":"3","author":"Kunik","year":"2007","journal-title":"PLoS Comput. Biol."},{"key":"2023012513244146500_bts608-B22","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1002\/prot.21651","article-title":"Assessment of predictions submitted for the casp7 function prediction category","volume":"69","author":"Lopez","year":"2007","journal-title":"Proteins"},{"key":"2023012513244146500_bts608-B23","first-page":"REVIEWS2001","article-title":"Tools and resources for identifying protein families, domains and motifs","volume":"3","author":"Mulder","year":"2002","journal-title":"Genome Biol."},{"key":"2023012513244146500_bts608-B24","author":"Owen","year":"1843","journal-title":"Lectures on the Comparative Anatomy and Physiology of the Invertebrate Animals: Delivered at the Royal College of Surgeons, in 1843"},{"key":"2023012513244146500_bts608-B25","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1186\/1471-2105-7-277","article-title":"Everest: automatic identification and classification of protein domains in all protein sequences","volume":"7","author":"Portugaly","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012513244146500_bts608-B26","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1101\/gr.10.4.483","article-title":"Genome annotation assessment in drosophila melanogaster","volume":"10","author":"Reese","year":"2000","journal-title":"Genome Res."},{"key":"2023012513244146500_bts608-B27","doi-asserted-by":"crossref","first-page":"S1","DOI":"10.1186\/1471-2105-8-S4-S1","article-title":"The 2006 automated function prediction meeting","volume":"8","author":"Rodrigues","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023012513244146500_bts608-B28","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1016\/S0022-2836(02)00016-5","article-title":"Enzyme function less conserved than anticipated","volume":"318","author":"Rost","year":"2002","journal-title":"J. Mol. Biol."},{"key":"2023012513244146500_bts608-B29","doi-asserted-by":"crossref","first-page":"226","DOI":"10.1093\/nar\/25.1.226","article-title":"The HSSP database of protein structure-sequence alignments","volume":"25","author":"Schneider","year":"1997","journal-title":"Nucleic Acids Res."},{"key":"2023012513244146500_bts608-B30","doi-asserted-by":"crossref","first-page":"E1000605","DOI":"10.1371\/journal.pcbi.1000605","article-title":"Annotation error in public databases: misannotation of molecular function in enzyme superfamilies","volume":"5","author":"Schnoes","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023012513244146500_bts608-B31","doi-asserted-by":"crossref","first-page":"648","DOI":"10.1101\/gr.222902","article-title":"predicting gene ontology functions from prodom and cdd protein domains","volume":"12","author":"Schug","year":"2002","journal-title":"Genome Res."},{"key":"2023012513244146500_bts608-B32","doi-asserted-by":"crossref","first-page":"258","DOI":"10.1038\/nature08284","article-title":"Photosystem I gene cassettes are present in marine virus genomes","volume":"461","author":"Sharon","year":"2009","journal-title":"Nature"},{"key":"2023012513244146500_bts608-B33","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1093\/bib\/3.3.265","article-title":"Prosite: a documented database using patterns and profiles as motif descriptors","volume":"3","author":"Sigrist","year":"2002","journal-title":"Brief. Bioinform."},{"key":"2023012513244146500_bts608-B34","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1002\/(SICI)1097-0134(199707)28:3<405::AID-PROT10>3.0.CO;2-L","article-title":"Pfam: a comprehensive database of protein domain families based on seed alignments","volume":"28","author":"Sonnhammer","year":"1997","journal-title":"Proteins"},{"key":"2023012513244146500_bts608-B35","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1002\/prot.20738","article-title":"The prediction of protein function at CASP6","volume":"61","author":"Soro","year":"2005","journal-title":"Proteins"},{"key":"2023012513244146500_bts608-B36","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1186\/1471-2105-4-41","article-title":"The COG database: an updated version includes eukaryotes","volume":"4","author":"Tatusov","year":"2003","journal-title":"BMC Bioinformatics"},{"key":"2023012513244146500_bts608-B37","doi-asserted-by":"crossref","first-page":"863","DOI":"10.1016\/j.jmb.2003.08.057","article-title":"How well is enzyme function conserved as a function of pairwise sequence identity?","volume":"333","author":"Tian","year":"2003","journal-title":"J. Mol. Biol."},{"key":"2023012513244146500_bts608-B38","doi-asserted-by":"crossref","first-page":"1329","DOI":"10.1016\/S0969-2126(02)00854-7","article-title":"Sequence landmark patterns identify and characterize protein families","volume":"10","author":"Wade","year":"2002","journal-title":"Structure"},{"key":"2023012513244146500_bts608-B39","doi-asserted-by":"crossref","first-page":"798","DOI":"10.1093\/bioinformatics\/btn037","article-title":"Confunc\u2013functional annotation in the twilight zone","volume":"24","author":"Wass","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012513244146500_bts608-B40","doi-asserted-by":"crossref","first-page":"2215","DOI":"10.1016\/j.febslet.2007.02.010","article-title":"Making the connections\u2013the crucial role of metabolite transporters at the interface between chloroplast and cytosol","volume":"581","author":"Weber","year":"2007","journal-title":"FEBS Lett."},{"key":"2023012513244146500_bts608-B41","doi-asserted-by":"crossref","first-page":"681","DOI":"10.2174\/092986610791190255","article-title":"Using affinity propagation combined post-processing to cluster protein sequences","volume":"17","author":"Yang","year":"2010","journal-title":"Protein Pept. Lett."},{"key":"2023012513244146500_bts608-B42","doi-asserted-by":"crossref","first-page":"2027","DOI":"10.1111\/j.1462-2920.2005.00843.x","article-title":"Putative novel photosynthetic reaction centre organizations in marine aerobic anoxygenic photosynthetic bacteria: insights from metagenomics and environmental genomics","volume":"7","author":"Yutin","year":"2005","journal-title":"Environ. Microbiol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/24\/3203\/48877748\/bioinformatics_28_24_3203.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/24\/3203\/48877748\/bioinformatics_28_24_3203.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T19:21:50Z","timestamp":1674674510000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/24\/3203\/245893"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,10,18]]},"references-count":41,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2012,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts608","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,12]]},"published":{"date-parts":[[2012,10,18]]}}}