{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,5,31]],"date-time":"2024-05-31T18:00:45Z","timestamp":1717178445318},"reference-count":31,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Current approaches for identifying transcriptional regulatory elements are mainly via the combination of two properties, the evolutionary conservation and the overrepresentation of functional elements in the promoters of co-regulated genes. Despite the development of many motif detection algorithms, the discovery of conserved motifs in a wide range of phylogenetically related promoters is still a challenge, especially for the short motifs embedded in distantly related gene promoters or very closely related promoters, or in the situation that there are not enough orthologous genes available.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>A mutation degree model is proposed and a new word counting method is developed for the identification of transcriptional regulatory elements from a set of co-expressed genes. The new method comprises two parts: 1) identifying overrepresented oligo-nucleotides in promoters of co-expressed genes, 2) estimating the conservation of the oligo-nucleotides in promoters of phylogenetically related genes by the mutation degree model. Compared with the performance of other algorithms, our method shows the advantages of low false positive rate and higher specificity, especially the robustness to noisy data. Applying the method to co-expressed gene sets from Arabidopsis, most of known <jats:italic>cis<\/jats:italic>-elements were successfully detected. The tool and example are available at <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"http:\/\/mcube.nju.edu.cn\/jwang\/lab\/soft\/ocw\/OCW.html\" ext-link-type=\"uri\">http:\/\/mcube.nju.edu.cn\/jwang\/lab\/soft\/ocw\/OCW.html<\/jats:ext-link>.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>The mutation degree model proposed in this paper is adapted to phylogenetic data of different qualities, and to a wide range of evolutionary distances. The new word-counting method based on this model has the advantage of better performance in detecting short sequence of <jats:italic>cis<\/jats:italic>-elements from co-expressed genes of eukaryotes and is robust to less complete phylogenetic data.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-12-262","type":"journal-article","created":{"date-parts":[[2011,6,28]],"date-time":"2011-06-28T18:20:33Z","timestamp":1309285233000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["A mutation degree model for the identification of transcriptional regulatory elements"],"prefix":"10.1186","volume":"12","author":[{"given":"Changqing","family":"Zhang","sequence":"first","affiliation":[]},{"given":"Jin","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Xu","family":"Hua","sequence":"additional","affiliation":[]},{"given":"Jinggui","family":"Fang","sequence":"additional","affiliation":[]},{"given":"Huaiqiu","family":"Zhu","sequence":"additional","affiliation":[]},{"given":"Xiang","family":"Gao","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,6,27]]},"reference":[{"issue":"4","key":"4644_CR1","doi-asserted-by":"publisher","first-page":"276","DOI":"10.1038\/nrg1315","volume":"5","author":"WW Wasserman","year":"2004","unstructured":"Wasserman WW, Sandelin A: Applied bioinformatics for the identification of regulatory elements. Nat Rev Genet 2004, 5(4):276\u2013287. 10.1038\/nrg1315","journal-title":"Nat Rev Genet"},{"issue":"6","key":"4644_CR2","doi-asserted-by":"publisher","first-page":"439","DOI":"10.1038\/nrg2765","volume":"11","author":"JR Raab","year":"2010","unstructured":"Raab JR, Kamakaka RT: Insulators and promoters: closer than we think. Nat Rev Genet 2010, 11(6):439\u2013446. 10.1038\/nrg2765","journal-title":"Nat Rev Genet"},{"issue":"5","key":"4644_CR3","doi-asserted-by":"publisher","first-page":"643","DOI":"10.1016\/j.pbi.2009.07.016","volume":"12","author":"HD Priest","year":"2009","unstructured":"Priest HD, Filichkin SA, Mockler TC: Cis-regulatory elements in plant cell signaling. Curr Opin Plant Biol 2009, 12(5):643\u2013649. 10.1016\/j.pbi.2009.07.016","journal-title":"Curr Opin Plant Biol"},{"issue":"5","key":"4644_CR4","doi-asserted-by":"publisher","first-page":"636","DOI":"10.1093\/bioinformatics\/btg459","volume":"20","author":"N Shah","year":"2004","unstructured":"Shah N, Couronne O, Pennacchio LA, Brudno M, Batzoglou S, Bethel EW, Rubin EM, Hamann B, Dubchak I: Phylo-VISTA: interactive visualization of multiple DNA sequence alignments. Bioinformatics 2004, 20(5):636\u2013643. 10.1093\/bioinformatics\/btg459","journal-title":"Bioinformatics"},{"issue":"8","key":"4644_CR5","doi-asserted-by":"publisher","first-page":"1034","DOI":"10.1101\/gr.3715005","volume":"15","author":"A Siepel","year":"2005","unstructured":"Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S, et al.: Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res 2005, 15(8):1034\u20131050. 10.1101\/gr.3715005","journal-title":"Genome Res"},{"key":"4644_CR6","first-page":"348","volume-title":"Pac Symp Biocomput","author":"A Prakash","year":"2004","unstructured":"Prakash A, Blanchette M, Sinha S, Tompa M: Motif discovery in heterogeneous sequence data. Pac Symp Biocomput 2004, 348\u2013359."},{"issue":"18","key":"4644_CR7","doi-asserted-by":"publisher","first-page":"2369","DOI":"10.1093\/bioinformatics\/btg329","volume":"19","author":"T Wang","year":"2003","unstructured":"Wang T, Stormo GD: Combining phylogenetic data with co-regulated genes to identify regulatory motifs. Bioinformatics 2003, 19(18):2369\u20132380. 10.1093\/bioinformatics\/btg329","journal-title":"Bioinformatics"},{"issue":"7","key":"4644_CR8","doi-asserted-by":"publisher","first-page":"e67","DOI":"10.1371\/journal.pcbi.0010067","volume":"1","author":"R Siddharthan","year":"2005","unstructured":"Siddharthan R, Siggia ED, van Nimwegen E: PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny. PLoS Comput Biol 2005, 1(7):e67. 10.1371\/journal.pcbi.0010067","journal-title":"PLoS Comput Biol"},{"key":"4644_CR9","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1007\/978-1-59745-514-5_19","volume":"395","author":"S Sinha","year":"2007","unstructured":"Sinha S: PhyME: a software tool for finding motifs in sets of orthologous sequences. Methods Mol Biol 2007, 395: 309\u2013318. 10.1007\/978-1-59745-514-5_19","journal-title":"Methods Mol Biol"},{"key":"4644_CR10","first-page":"324","volume-title":"Pac Symp Biocomput","author":"AM Moses","year":"2004","unstructured":"Moses AM, Chiang DY, Eisen MB: Phylogenetic motif detection by expectation-maximization on evolutionary mixtures. Pac Symp Biocomput 2004, 324\u2013335."},{"key":"4644_CR11","doi-asserted-by":"publisher","first-page":"46","DOI":"10.1186\/1471-2105-8-46","volume":"8","author":"G Pavesi","year":"2007","unstructured":"Pavesi G, Zambelli F, Pesole G: WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences. BMC Bioinformatics 2007, 8: 46. 10.1186\/1471-2105-8-46","journal-title":"BMC Bioinformatics"},{"issue":"4","key":"4644_CR12","doi-asserted-by":"publisher","first-page":"1589","DOI":"10.1104\/pp.106.085639","volume":"142","author":"G Haberer","year":"2006","unstructured":"Haberer G, Mader MT, Kosarev P, Spannagl M, Yang L, Mayer KF: Large-scale cis-element detection by analysis of correlated expression and sequence conservation between Arabidopsis and Brassica oleracea. Plant Physiol 2006, 142(4):1589\u20131602. 10.1104\/pp.106.085639","journal-title":"Plant Physiol"},{"issue":"10","key":"4644_CR13","doi-asserted-by":"publisher","first-page":"939","DOI":"10.1038\/nbt1098-939","volume":"16","author":"FP Roth","year":"1998","unstructured":"Roth FP, Hughes JD, Estep PW, Church GM: Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation. Nat Biotechnol 1998, 16(10):939\u2013945. 10.1038\/nbt1098-939","journal-title":"Nat Biotechnol"},{"issue":"4","key":"4644_CR14","doi-asserted-by":"publisher","first-page":"e1000071","DOI":"10.1371\/journal.pcbi.1000071","volume":"4","author":"MC Frith","year":"2008","unstructured":"Frith MC, Saunders NF, Kobe B, Bailey TL: Discovering sequence motifs with arbitrary insertions and deletions. PLoS Comput Biol 2008, 4(4):e1000071.","journal-title":"PLoS Comput Biol"},{"key":"4644_CR15","series-title":"Nucleic Acids Res","volume-title":"Pscan: finding over-represented transcription factor binding site motifs in sequences from co-regulated or co-expressed genes","author":"F Zambelli","year":"2009","unstructured":"Zambelli F, Pesole G, Pavesi G: Pscan: finding over-represented transcription factor binding site motifs in sequences from co-regulated or co-expressed genes. Nucleic Acids Res 2009, (37 Web Server):W247\u2013252."},{"issue":"1","key":"4644_CR16","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1038\/nbt1053","volume":"23","author":"M Tompa","year":"2005","unstructured":"Tompa M, Li N, Bailey TL, Church GM, De Moor B, Eskin E, Favorov AV, Frith MC, Fu Y, Kent WJ, et al.: Assessing computational tools for the discovery of transcription factor binding sites. Nat Biotechnol 2005, 23(1):137\u2013144. 10.1038\/nbt1053","journal-title":"Nat Biotechnol"},{"key":"4644_CR17","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1007\/978-3-540-74126-8_14","volume":"4645","author":"CBD Boucher","year":"2007","unstructured":"Boucher CBD, Church P: A Graph Clustering Approach to Weak Motif Recognition. Lecture Notes in Computer Science 2007, 4645: 149\u2013160. 10.1007\/978-3-540-74126-8_14","journal-title":"Lecture Notes in Computer Science"},{"issue":"3","key":"4644_CR18","doi-asserted-by":"publisher","first-page":"279","DOI":"10.1046\/j.1365-313X.2002.01359.x","volume":"31","author":"M Seki","year":"2002","unstructured":"Seki M, Narusaka M, Ishida J, Nanjo T, Fujita M, Oono Y, Kamiya A, Nakajima M, Enju A, Sakurai T, et al.: Monitoring the expression profiles of 7000 Arabidopsis genes under drought, cold and high-salinity stresses using a full-length cDNA microarray. Plant J 2002, 31(3):279\u2013292. 10.1046\/j.1365-313X.2002.01359.x","journal-title":"Plant J"},{"issue":"4","key":"4644_CR19","doi-asserted-by":"publisher","first-page":"1555","DOI":"10.1104\/pp.103.034736","volume":"134","author":"H Goda","year":"2004","unstructured":"Goda H, Sawa S, Asami T, Fujioka S, Shimada Y, Yoshida S: Comprehensive comparison of auxin-regulated and brassinosteroid-regulated genes in Arabidopsis. Plant Physiol 2004, 134(4):1555\u20131573. 10.1104\/pp.103.034736","journal-title":"Plant Physiol"},{"issue":"13","key":"4644_CR20","doi-asserted-by":"publisher","first-page":"3461","DOI":"10.1111\/j.1742-4658.2005.04770.x","volume":"272","author":"S Kamauchi","year":"2005","unstructured":"Kamauchi S, Nakatani H, Nakano C, Urade R: Gene expression in response to endoplasmic reticulum stress in Arabidopsis thaliana. FEBS J 2005, 272(13):3461\u20133476. 10.1111\/j.1742-4658.2005.04770.x","journal-title":"FEBS J"},{"issue":"5","key":"4644_CR21","doi-asserted-by":"publisher","first-page":"683","DOI":"10.1023\/B:PLAN.0000040898.86788.59","volume":"54","author":"Y Gao","year":"2004","unstructured":"Gao Y, Li J, Strickland E, Hua S, Zhao H, Chen Z, Qu L, Deng XW: An arabidopsis promoter microarray and its initial usage in the identification of HY5 binding targets in vitro. Plant Mol Biol 2004, 54(5):683\u2013699.","journal-title":"Plant Mol Biol"},{"issue":"393","key":"4644_CR22","doi-asserted-by":"publisher","first-page":"2709","DOI":"10.1093\/jxb\/erg304","volume":"54","author":"S Oh","year":"2003","unstructured":"Oh S, Park S, Han KH: Transcriptional regulation of secondary growth in Arabidopsis thaliana. J Exp Bot 2003, 54(393):2709\u20132722. 10.1093\/jxb\/erg304","journal-title":"J Exp Bot"},{"key":"4644_CR23","doi-asserted-by":"crossref","unstructured":"Barta E, Sebestyen E, Palfy TB, Toth G, Ortutay CP, Patthy L: DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants. Nucleic Acids Res 2005, (33 Database):D86\u201390.","DOI":"10.1093\/nar\/gki097"},{"issue":"11","key":"4644_CR24","doi-asserted-by":"publisher","first-page":"e82","DOI":"10.1093\/nar\/gkp311","volume":"37","author":"F Colecchia","year":"2009","unstructured":"Colecchia F, Kottwitz D, Wagner M, Pfenninger CV, Thiel G, Tamm I, Peterson C, Nuber UA: Tissue-specific regulatory network extractor (TS-REX): a database and software resource for the tissue and cell type-specific investigation of transcription factor-gene networks. Nucleic Acids Res 2009, 37(11):e82. 10.1093\/nar\/gkp311","journal-title":"Nucleic Acids Res"},{"issue":"7","key":"4644_CR25","doi-asserted-by":"publisher","first-page":"e49","DOI":"10.1093\/nar\/gkp084","volume":"37","author":"B Tokovenko","year":"2009","unstructured":"Tokovenko B, Golda R, Protas O, Obolenskaya M, El'skaya A: COTRASIF: conservation-aided transcription-factor-binding site finder. Nucleic Acids Res 2009, 37(7):e49. 10.1093\/nar\/gkp084","journal-title":"Nucleic Acids Res"},{"issue":"2","key":"4644_CR26","doi-asserted-by":"publisher","first-page":"e8938","DOI":"10.1371\/journal.pone.0008938","volume":"5","author":"V Storms","year":"2010","unstructured":"Storms V, Claeys M, Sanchez A, De Moor B, Verstuyf A, Marchal K: The effect of orthology and coregulation on detecting regulatory motifs. PLoS One 2010, 5(2):e8938. 10.1371\/journal.pone.0008938","journal-title":"PLoS One"},{"issue":"3","key":"4644_CR27","doi-asserted-by":"publisher","first-page":"1162","DOI":"10.1104\/pp.102.017715","volume":"132","author":"S Rombauts","year":"2003","unstructured":"Rombauts S, Florquin K, Lescot M, Marchal K, Rouze P, van de Peer Y: Computational approaches to identify promoters and cis-regulatory elements in plant genomes. Plant Physiol 2003, 132(3):1162\u20131176. 10.1104\/pp.102.017715","journal-title":"Plant Physiol"},{"issue":"9","key":"4644_CR28","doi-asserted-by":"publisher","first-page":"1215","DOI":"10.3724\/SP.J.1206.2009.00088","volume":"36","author":"CQWJ ZHANG","year":"2009","unstructured":"ZHANG CQWJ, ZHU H, GAO X: The transcriptional regulatory mechanism of CYP72B1 and AUR3 in response to light, auxin and brassinosteroid. Prog Biochem Biophys 2009, 36(9):1215\u20131221.","journal-title":"Prog Biochem Biophys"},{"issue":"4","key":"4644_CR29","doi-asserted-by":"publisher","first-page":"569","DOI":"10.1093\/bioinformatics\/btg450","volume":"20","author":"W Xue","year":"2004","unstructured":"Xue W, Wang J, Shen Z, Zhu H: Enrichment of transcriptional regulatory sites in non-coding genomic region. Bioinformatics 2004, 20(4):569\u2013575. 10.1093\/bioinformatics\/btg450","journal-title":"Bioinformatics"},{"issue":"5871","key":"4644_CR30","doi-asserted-by":"publisher","first-page":"1785","DOI":"10.1126\/science.1151651","volume":"319","author":"O Hobert","year":"2008","unstructured":"Hobert O: Gene regulation by transcription factors and microRNAs. Science 2008, 319(5871):1785\u20131786. 10.1126\/science.1151651","journal-title":"Science"},{"key":"4644_CR31","series-title":"Nucleic Acids Res","volume-title":"BLAST: improvements for better sequence analysis","author":"J Ye","year":"2006","unstructured":"Ye J, McGinnis S, Madden TL: BLAST: improvements for better sequence analysis. Nucleic Acids Res 2006, (34 Web Server):W6\u20139."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-262.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T15:05:40Z","timestamp":1630508740000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-262"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,6,27]]},"references-count":31,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4644"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-262","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,6,27]]},"assertion":[{"value":"12 November 2010","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 June 2011","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 June 2011","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"262"}}