{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,1,19]],"date-time":"2025-01-19T22:10:09Z","timestamp":1737324609768,"version":"3.33.0"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"13","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Current methodologies for the selection of putative transcription factor binding sites (TFBS) rely on various assumptions such as over-representation of motifs occurring on gene promoters, and the use of motif descriptions such as consensus or position-specific scoring matrices (PSSMs). In order to avoid bias introduced by such assumptions, we apply an unsupervised motif extraction (MEX) algorithm to sequences of promoters. The extracted motifs are assessed for their likely cis-regulatory function by calculating the expression coherence (EC) of the corresponding genes, across a set of biological conditions.<\/jats:p><jats:p>Results: Applying MEX to all Saccharomyces cerevisiae promoters, followed by EC analysis across 40 biological conditions, we obtained a high percentage of putative cis-regulatory motifs. We clustered motifs that obtained highly significant EC scores, based on both their sequence similarity and similarity in the biological conditions these motifs appear to regulate. We describe 20 clusters, some of which regroup known TFBS. The clusters display different mRNA expression profiles, correlated with typical changes in the nucleotide composition of their relevant motifs. In several cases, a variation of a single nucleotide is shown to lead to distinct differences in expression patterns. These results are confronted with additional information, such as binding of transcription factors to groups of genes. Detailed analysis is presented for clusters related to MCB\/SCB, STRE and PAC. In the first two cases, we provide evidence for different binding mechanisms of different clusters of motifs. For PAC-related motifs we uncover a new cluster that has so far been overshadowed by the stronger effects of known PAC motifs.<\/jats:p><jats:p>Contact: horn@tau.ac.il<\/jats:p><jats:p>Supplementary information: Supplementary data are available at http:\/\/adios.tau.ac.il\/regmotifs and at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm183","type":"journal-article","created":{"date-parts":[[2007,7,23]],"date-time":"2007-07-23T16:13:46Z","timestamp":1185207226000},"page":"i440-i449","source":"Crossref","is-referenced-by-count":10,"title":["Nucleotide variation of regulatory motifs may lead to distinct expression patterns"],"prefix":"10.1093","volume":"23","author":[{"given":"Liat","family":"Segal","sequence":"first","affiliation":[{"name":"1 Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, 2Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, 3School of Physics and Astronomy and 4School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel"}]},{"given":"Michal","family":"Lapidot","sequence":"additional","affiliation":[{"name":"1 Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, 2Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, 3School of Physics and Astronomy and 4School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel"}]},{"given":"Zach","family":"Solan","sequence":"additional","affiliation":[{"name":"1 Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, 2Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, 3School of Physics and Astronomy and 4School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel"}]},{"given":"Eytan","family":"Ruppin","sequence":"additional","affiliation":[{"name":"1 Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, 2Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, 3School of Physics and Astronomy and 4School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel"},{"name":"1 Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, 2Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, 3School of Physics and Astronomy and 4School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel"}]},{"given":"Yitzhak","family":"Pilpel","sequence":"additional","affiliation":[{"name":"1 Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, 2Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, 3School of Physics and Astronomy and 4School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel"}]},{"given":"David","family":"Horn","sequence":"additional","affiliation":[{"name":"1 Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv 69978, 2Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 76100, 3School of Physics and Astronomy and 4School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel"}]}],"member":"286","published-online":{"date-parts":[[2007,7,1]]},"reference":[{"key":"2023062708514252100_B1","first-page":"25","article-title":"Gene ontology: tool for the unification of biology","volume":"25","author":"Ashburner","year":"2000"},{"key":"2023062708514252100_B2","first-page":"21","article-title":"The value of prior knowledge in discovering motifs with MEME","volume":"3","author":"Bailey","year":"1995","journal-title":"Proc. Int. Conf. Intell. Syst. Mol. Biol"},{"key":"2023062708514252100_B3","first-page":"28","volume-title":"Modeling Dependencies in Protein-DNA Binding Sites","author":"Barash","year":"2003"},{"key":"2023062708514252100_B4","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc., Series B (Methodological)"},{"key":"2023062708514252100_B5","doi-asserted-by":"crossref","first-page":"4442","DOI":"10.1093\/nar\/gkf578","article-title":"Additivity in protein-DNA interactions: how good an approximation is it?","volume":"30","author":"Benos","year":"2002","journal-title":"Nucleic Acids Res"},{"key":"2023062708514252100_B6","doi-asserted-by":"crossref","first-page":"723","DOI":"10.1016\/0022-2836(87)90354-8","article-title":"Selection of DNA binding sites by regulatory proteins: statistical-mechanical theory and application to operators and promoters","volume":"193","author":"Berg","year":"1987","journal-title":"J. Mol. Biol"},{"key":"2023062708514252100_B7","doi-asserted-by":"crossref","first-page":"3710","DOI":"10.1093\/bioinformatics\/bth456","article-title":"GO::TermFinder \u2013 open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes","volume":"20","author":"Boyle","year":"2004","journal-title":"Bioinformatics"},{"key":"2023062708514252100_B8","doi-asserted-by":"crossref","first-page":"1202","DOI":"10.1101\/gr.8.11.1202","article-title":"Predicting gene regulatory elements in silico on a genomic scale","volume":"8","author":"Brazma","year":"1998","journal-title":"Genome Res"},{"key":"2023062708514252100_B9","doi-asserted-by":"crossref","first-page":"1255","DOI":"10.1093\/nar\/30.5.1255","article-title":"Nucleotides of transcription factor binding sites exert interdependent effects on the binding affinities of transcription factors","volume":"30","author":"Bulyk","year":"2002","journal-title":"Nucleic Acids Res"},{"key":"2023062708514252100_B10","doi-asserted-by":"crossref","first-page":"10096","DOI":"10.1073\/pnas.180265397","article-title":"Building a dictionary for genomes: identification of presumptive regulatory sites by statistical analysis","volume":"97","author":"Bussemaker","year":"2000","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023062708514252100_B11","doi-asserted-by":"crossref","first-page":"1114","DOI":"10.1093\/oxfordjournals.molbev.a004169","article-title":"Evolution of transcription factor binding sites in mammalian gene regulatory regions: conservation and turnover","volume":"19","author":"Dermitzakis","year":"2002","journal-title":"Mol. Biol. Evol"},{"key":"2023062708514252100_B12","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511790492","volume-title":"Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids","author":"Durbin","year":"1998"},{"key":"2023062708514252100_B13","doi-asserted-by":"crossref","first-page":"4241","DOI":"10.1091\/mbc.11.12.4241","article-title":"Genomic expression programs in the response of yeast cells to environmental changes","volume":"11","author":"Gasch","year":"2000","journal-title":"Mol. Biol. Cell"},{"key":"2023062708514252100_B14","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1038\/nature02800","article-title":"Transcriptional regulatory code of a eukaryotic genome","volume":"431","author":"Harbison","year":"2004","journal-title":"Nature"},{"key":"2023062708514252100_B15","doi-asserted-by":"crossref","first-page":"1551","DOI":"10.1126\/science.8372350","article-title":"A role for the transcription factors Mbp1 and Swi4 in progression from G1 to S phase","volume":"261","author":"Koch","year":"1993","journal-title":"Science"},{"key":"2023062708514252100_B16","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1126\/science.8211139","article-title":"Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment","volume":"262","author":"Lawrence","year":"1993","journal-title":"Science"},{"key":"2023062708514252100_B17","doi-asserted-by":"crossref","first-page":"3824","DOI":"10.1093\/nar\/gkg593","article-title":"Comprehensive quantitative analyses of the effects of promoter sequence elements on mRNA transcription","volume":"31","author":"Lapidot","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023062708514252100_B18","doi-asserted-by":"crossref","first-page":"885","DOI":"10.1038\/31860","article-title":"Allosteric effects of DNA on transcriptional regulators","volume":"392","author":"Lefstin","year":"1998","journal-title":"Nature"},{"key":"2023062708514252100_B19","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1016\/j.cell.2004.08.007","article-title":"One nucleotide in a [kappa]B site can determine cofactor specificity for NF-[kappa]B dimers","volume":"118","author":"Leung","year":"2004","journal-title":"Cell"},{"key":"2023062708514252100_B21","doi-asserted-by":"crossref","first-page":"W199","DOI":"10.1093\/nar\/gkh465","article-title":"Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes","volume":"32","author":"Pavesi","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023062708514252100_B22","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1038\/ng724","article-title":"Identifying regulatory networks by combinatorial analysis of promoter elements","volume":"29","author":"Pilpel","year":"2001","journal-title":"Nat. Genet"},{"key":"2023062708514252100_B23","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1101\/gr.1739204","article-title":"Whole-genome discovery of transcription factor binding sites by network-level conservation 10.1101\/gr.1739204","volume":"14","author":"Pritsker","year":"2004","journal-title":"Genome Res"},{"key":"2023062708514252100_B24","doi-asserted-by":"crossref","first-page":"939","DOI":"10.1038\/nbt1098-939","article-title":"Finding DNA regulatory motifs within unaligned non-coding sequences clustered by whole-genome mRNA quantitation","volume":"16","author":"Roth","year":"1998","journal-title":"Nat. Biotechnol"},{"key":"2023062708514252100_B25","doi-asserted-by":"crossref","first-page":"772","DOI":"10.1038\/nature04979","article-title":"A genomic code for nucleosome positioning","volume":"442","author":"Segal","year":"2006","journal-title":"Nature"},{"key":"2023062708514252100_B26","doi-asserted-by":"crossref","first-page":"R86","DOI":"10.1186\/gb-2005-6-10-r86","article-title":"A catalog of stability-associated sequence elements in 3\u2032 UTRs of yeast mRNAs","volume":"6","author":"Shalgi","year":"2005","journal-title":"Genome Biol"},{"key":"2023062708514252100_B27","doi-asserted-by":"crossref","first-page":"3586","DOI":"10.1093\/nar\/gkg618","article-title":"YMF: a program for discovery of novel transcription factor binding sites by statistical overrepresentation","volume":"31","author":"Sinha","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023062708514252100_B28","doi-asserted-by":"crossref","first-page":"11629","DOI":"10.1073\/pnas.0409746102","article-title":"Unsupervised learning of natural languages","volume":"102","author":"Solan","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023062708514252100_B29","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1093\/bioinformatics\/16.1.16","article-title":"DNA binding sites: representation and discovery","volume":"16","author":"Stormo","year":"2000","journal-title":"Bioinformatics"},{"key":"2023062708514252100_B30","doi-asserted-by":"crossref","first-page":"1723","DOI":"10.1101\/gr.301202","article-title":"Genome-wide co-occurrence of promoter elements reveals a cis-regulatory cassette of rRNA transcription motifs in Saccharomyces cerevisiae","volume":"12","author":"Sudarsanam","year":"2002","journal-title":"Genome Res"},{"key":"2023062708514252100_B31","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1038\/10343","article-title":"Systematic determination of genetic network architecture","volume":"22","author":"Tavazoie","year":"1999","journal-title":"Nat. Genet"},{"key":"2023062708514252100_B32","doi-asserted-by":"crossref","first-page":"933","DOI":"10.1093\/bioinformatics\/btm055","article-title":"Position dependencies in transcription factor binding sites","volume":"23","author":"Tomovic","year":"2007","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/13\/i440\/50715282\/bioinformatics_23_13_i440.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/13\/i440\/50715282\/bioinformatics_23_13_i440.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,19]],"date-time":"2025-01-19T21:38:29Z","timestamp":1737322709000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/13\/i440\/228439"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,7,1]]},"references-count":31,"journal-issue":{"issue":"13","published-print":{"date-parts":[[2007,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm183","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2007,7]]},"published":{"date-parts":[[2007,7,1]]}}}