{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,23]],"date-time":"2025-12-23T10:26:37Z","timestamp":1766485597621},"reference-count":43,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2005,5,18]],"date-time":"2005-05-18T00:00:00Z","timestamp":1116374400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0\/"},{"start":{"date-parts":[[2005,5,18]],"date-time":"2005-05-18T00:00:00Z","timestamp":1116374400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0\/"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec>\n                        <jats:title>Background<\/jats:title>\n                        <jats:p>Searching for approximate patterns in large promoter sequences frequently produces an exceedingly high numbers of results. Our aim was to exploit biological knowledge for definition of a sheltered search space and of appropriate search parameters, in order to develop a method for identification of a tractable number of sequence motifs.<\/jats:p>\n                     <\/jats:sec><jats:sec>\n                        <jats:title>Results<\/jats:title>\n                        <jats:p>Novel software (COOP) was developed for extraction of sequence motifs, based on clustering of exact or approximate patterns according to the frequency of their overlapping occurrences. Genomic sequences of 1 Kb upstream of 91 genes differentially expressed and\/or encoding proteins with relevant function in adult human retina were analyzed. Methodology and results were tested by analysing 1,000 groups of putatively unrelated sequences, randomly selected among 17,156 human gene promoters. When applied to a sample of human promoters, the method identified 279 putative motifs frequently occurring in retina promoters sequences. Most of them are localized in the proximal portion of promoters, less variable in central region than in lateral regions and similar to known regulatory sequences. COOP software and reference manual are freely available upon request to the Authors.<\/jats:p>\n                     <\/jats:sec><jats:sec>\n                        <jats:title>Conclusion<\/jats:title>\n                        <jats:p>The approach described in this paper seems effective for identifying a tractable number of sequence motifs with putative regulatory role.<\/jats:p>\n                     <\/jats:sec>","DOI":"10.1186\/1471-2105-6-121","type":"journal-article","created":{"date-parts":[[2005,5,18]],"date-time":"2005-05-18T18:14:05Z","timestamp":1116440045000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["A multistep bioinformatic approach detects putative regulatory elements in gene promoters"],"prefix":"10.1186","volume":"6","author":[{"given":"Stefania","family":"Bortoluzzi","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alessandro","family":"Coppe","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andrea","family":"Bisognin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Cinzia","family":"Pizzi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gian Antonio","family":"Danieli","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2005,5,18]]},"reference":[{"key":"446_CR1","doi-asserted-by":"publisher","first-page":"400","DOI":"10.1016\/S0959-440X(99)80054-2","volume":"9","author":"P Bucher","year":"1999","unstructured":"Bucher P: Regulatory elements and expression profiles. Curr Opin Struct Biol 1999, 9: 400\u2013407. 10.1016\/S0959-440X(99)80054-2","journal-title":"Curr Opin Struct Biol"},{"key":"446_CR2","doi-asserted-by":"publisher","first-page":"168","DOI":"10.1007\/s003359900963","volume":"10","author":"T Werner","year":"1999","unstructured":"Werner T: Models for prediction and recognition of eukaryotic promoters. Mamm Genome 1999, 10: 168\u201375. 10.1007\/s003359900963","journal-title":"Mamm Genome"},{"key":"446_CR3","doi-asserted-by":"crossref","first-page":"1202","DOI":"10.1101\/gr.8.11.1202","volume":"8","author":"A Brazma","year":"1998","unstructured":"Brazma A, Jonassen I, Vilo J, Ukkonen E: Predicting gene regulatory elements in silico on a genomic scale. Genome Res 1998, 8: 1202\u20131215.","journal-title":"Genome Res"},{"key":"446_CR4","first-page":"249","volume":"2","author":"T Werner","year":"2002","unstructured":"Werner T: Finding and decrypting of promoters contributes to the elucidation of gene function. In Silico Biol 2002, 2: 249\u2013255.","journal-title":"In Silico Biol"},{"key":"446_CR5","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1038\/84792","volume":"27","author":"HJ Bussemaker","year":"2001","unstructured":"Bussemaker HJ, Li H, Saggia ED: Regulatory element detection using correlation with expression. Nat Genet 2001, 27: 167\u2013171. 10.1038\/84792","journal-title":"Nat Genet"},{"key":"446_CR6","doi-asserted-by":"publisher","first-page":"482","DOI":"10.1038\/ng776","volume":"29","author":"H Ge","year":"2001","unstructured":"Ge H, Liu Z, Church GM, Vidal M: Correlation between transcriptome and interactome mapping data from Saccharomyces cerevisiae. Nat Genet 2001, 29: 482\u2013486. 10.1038\/ng776","journal-title":"Nat Genet"},{"key":"446_CR7","volume-title":"The Analysis of Gene Expression Data: Methods and Software","author":"J Vilo","year":"2003","unstructured":"Vilo J, Kapushesky M, Kemmeren P, Sarkans U, Brazma A: Expression Profiler. In The Analysis of Gene Expression Data: Methods and Software. Edited by: Parmigiani G, Garrett ES, Irizarry R, Zeger SL. Springer Verlag, New York, NY; 2003."},{"key":"446_CR8","volume-title":"\"Algorithms in C\"","author":"R Sedgewick","year":"1998","unstructured":"Sedgewick R: \"Algorithms in C\". Third edition. Addison-Wesley editor, Reading, MA; 1998.","edition":"Third"},{"key":"446_CR9","doi-asserted-by":"publisher","first-page":"4673","DOI":"10.1093\/nar\/22.22.4673","volume":"22","author":"D Higgins","year":"1994","unstructured":"Higgins D, Thompson J, Gibson T, Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 1994, 22: 4673\u20134680.","journal-title":"Nucleic Acids Res"},{"key":"446_CR10","doi-asserted-by":"publisher","first-page":"238","DOI":"10.1093\/nar\/24.1.238","volume":"24","author":"E Wingender","year":"1996","unstructured":"Wingender E, Dietze P, Karas H, Knuppel R: TRANSFAC: a database on transcription factors and their DNA binding sites. Nucleic Acids Res 1996, 24: 238\u2013241. 10.1093\/nar\/24.1.238","journal-title":"Nucleic Acids Res"},{"key":"446_CR11","doi-asserted-by":"crossref","first-page":"986","DOI":"10.1101\/gr.7.10.986","volume":"7","author":"S Audic","year":"1997","unstructured":"Audic S, Claverie JM: The significance of digital gene expression profiles. Genome Res 1997, 7: 986\u2013995.","journal-title":"Genome Res"},{"key":"446_CR12","doi-asserted-by":"publisher","first-page":"656","DOI":"10.1101\/gr.229202. Article published online before March 2002","volume":"12","author":"WJ Kent","year":"2002","unstructured":"Kent WJ: BLAT \u2013 the BLAST-like alignment tool. Genome Res 2002, 12: 656\u2013664. 10.1101\/gr.229202. Article published online before March 2002","journal-title":"Genome Res"},{"key":"446_CR13","unstructured":"Supplementary material[http:\/\/telethon.bio.unipd.it\/bioinfo\/Retina\/suppl_material.html]"},{"key":"446_CR14","doi-asserted-by":"publisher","first-page":"3554","DOI":"10.1093\/nar\/gkg549","volume":"31","author":"AS Halees","year":"2003","unstructured":"Halees AS, Leyfer D, Weng Z: PromoSer: A large-scale mammalian promoter and transcription start site identification service. Nucleic Acids Res 2003, 31: 3554\u20133559. 10.1093\/nar\/gkg549","journal-title":"Nucleic Acids Res"},{"key":"446_CR15","unstructured":"PromoSer[http:\/\/biowulf.bu.edu\/zlab\/PromoSer\/]"},{"key":"446_CR16","volume-title":"Current Protocols in Bioinformatics","author":"G Petsko","year":"2002","unstructured":"Petsko G: Modeling Structure from Sequence. In Current Protocols in Bioinformatics. Edited by: Baxevanis AD. John Wiley & Sons Inc; 2002."},{"key":"446_CR17","unstructured":"TESS[http:\/\/www.cbil.upenn.edu\/cgi-bin\/tess\/tess?RQ=WELCOME]"},{"key":"446_CR18","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1038\/nbt1053","volume":"23","author":"M Tompa","year":"2005","unstructured":"Tompa M, Li N, Bailey TL, Church GM, De Moor B, Eskin E, Favorov AV, Frith MC, Fu Y, Kent WJ, Makeev VJ, Mironov AA, Noble WS, Pavesi G, Pesole G, Regnier M, Simonis N, Sinha S, Thijs G, van Helden J, Vandenbogaert M, Weng Z, Workman C, Ye C, Zhu Z: Assessing computational tools for the discovery of transcription factor binding sites. Nat Biotechnol 2005, 23: 137\u2013144. 10.1038\/nbt1053","journal-title":"Nat Biotechnol"},{"key":"446_CR19","unstructured":"Assessment of Computational Motif Discovery Tools[http:\/\/bio.cs.washington.edu\/assessment\/index.html]"},{"key":"446_CR20","doi-asserted-by":"publisher","first-page":"345","DOI":"10.1089\/106652700750050826","volume":"7","author":"L Marsan","year":"2000","unstructured":"Marsan L, Sagot MF: Algorithms for extracting structured motifs using a suffix tree with an application to promoter and regulatory site consensus identification. J Comput Biol 2000, 7: 345\u2013362. 10.1089\/106652700750050826","journal-title":"J Comput Biol"},{"key":"446_CR21","first-page":"269","volume":"8","author":"PA Pevzner","year":"2000","unstructured":"Pevzner PA, Sze SH: Combinatorial approaches to finding subtle signals in DNA sequences. Proc Int Conf Intell Syst Mol Biol 2000, 8: 269\u2013278.","journal-title":"Proc Int Conf Intell Syst Mol Biol"},{"key":"446_CR22","first-page":"417","volume":"2","author":"G Pavesi","year":"2001","unstructured":"Pavesi G, Mauri G, Pesole G: Methods for pattern discovery in unaligned biological sequences. Briefings in Bioinformatics 2001, 2: 417\u2013430.","journal-title":"Briefings in Bioinformatics"},{"key":"446_CR23","doi-asserted-by":"publisher","first-page":"225","DOI":"10.1089\/10665270252935430","volume":"9","author":"J Buhler","year":"2002","unstructured":"Buhler J, Tompa M: Finding motifs using random projections. J Comput Biol 2002, 9: 225\u2013242. 10.1089\/10665270252935430","journal-title":"J Comput Biol"},{"issue":"Suppl 1","key":"446_CR24","doi-asserted-by":"publisher","first-page":"S354","DOI":"10.1093\/bioinformatics\/18.suppl_1.S354","volume":"18","author":"E Eskin","year":"2002","unstructured":"Eskin E, Pevzner PA: Finding composite regulatory patterns in DNA sequences. Bioinformatics 2002, 18(Suppl 1):S354\u2013363.","journal-title":"Bioinformatics"},{"key":"446_CR25","doi-asserted-by":"publisher","first-page":"283","DOI":"10.1089\/10665270360688020","volume":"10","author":"A Apostolico","year":"2003","unstructured":"Apostolico A, Bock ME, Lonardi S: Monotony of surprise and large-scale quest for unusual words. J Comput Biol 2003, 10: 283\u2013311. 10.1089\/10665270360688020","journal-title":"J Comput Biol"},{"key":"446_CR26","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1186\/1471-2105-5-18","volume":"5","author":"J Allocco","year":"2004","unstructured":"Allocco J, Kohane IS, Butte AJ: Quantifying the relationship between co-expression, co-regulation and gene function. BMC Bioinformatics 2004, 5: 18\u201328. 10.1186\/1471-2105-5-18","journal-title":"BMC Bioinformatics"},{"key":"446_CR27","doi-asserted-by":"publisher","first-page":"1382","DOI":"10.1093\/bioinformatics\/18.10.1382","volume":"18","author":"U Keich","year":"2002","unstructured":"Keich U, Pevzner PA: Subtle motifs: defining the limits of motif finding algorithms. Bioinformatics 2002, 18: 1382\u20131390. 10.1093\/bioinformatics\/18.10.1382","journal-title":"Bioinformatics"},{"key":"446_CR28","doi-asserted-by":"publisher","first-page":"827","DOI":"10.1006\/jmbi.1998.1947","volume":"281","author":"J van Helden","year":"1998","unstructured":"van Helden J, Andre B, Collado-Vides J: Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. J Mol Biol 1998, 281: 827\u2013842. 10.1006\/jmbi.1998.1947","journal-title":"J Mol Biol"},{"key":"446_CR29","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1186\/1471-2105-3-7","volume":"3","author":"M Caselle","year":"2002","unstructured":"Caselle M, Di Cunto F, Provero P: Correlating overrepresented upstream motifs to gene expression: a computational approach to regulatory element discovery in eukaryotes. BMC Bioinformatics 2002, 3: 7. 10.1186\/1471-2105-3-7","journal-title":"BMC Bioinformatics"},{"key":"446_CR30","doi-asserted-by":"publisher","first-page":"308","DOI":"10.1101\/gr.794803","volume":"13","author":"ND Trinklein","year":"2003","unstructured":"Trinklein ND, Aldred SJ, Saldanha AJ, Myers RM: Identification and functional analysis of human transcriptional promoters. Genome Res 2003, 13: 308\u2013312. 10.1101\/gr.794803","journal-title":"Genome Res"},{"key":"446_CR31","doi-asserted-by":"publisher","first-page":"3863","DOI":"10.1093\/nar\/25.19.3863","volume":"25","author":"A Di Polo","year":"1997","unstructured":"Di Polo A, Lerner LE, Farber DB: Transcriptional activation of the human rod cGMP-phosphodiesterase beta-subunit gene is mediated by an upstream AP-1 element. Nucleic Acids Res 1997, 25: 3863\u20133867. 10.1093\/nar\/25.19.3863","journal-title":"Nucleic Acids Res"},{"key":"446_CR32","doi-asserted-by":"crossref","first-page":"31969","DOI":"10.1016\/S0021-9258(18)31790-3","volume":"269","author":"IR Rodriguez","year":"1994","unstructured":"Rodriguez IR, Mazuruk K, Schoen TJ, Chader GJ: Structural analysis of the human hydroxyindole-O-methyltransferase gene. Presence of two distinct promoters. J Biol Chem 1994, 269: 31969\u201331977.","journal-title":"J Biol Chem"},{"key":"446_CR33","doi-asserted-by":"publisher","first-page":"1398","DOI":"10.1093\/emboj\/21.6.1398","volume":"21","author":"KD","year":"2002","unstructured":"KD , Wagner N, Vidal VP, Schley G, Wilhelm D, Schedl A, Englert C, Scholz H: The Wilms' tumor gene Wt1 is required for normal development of the retina. EMBO J 2002, 21: 1398\u20131405. 10.1093\/emboj\/21.6.1398","journal-title":"EMBO J"},{"key":"446_CR34","unstructured":"HGXP[http:\/\/telethon.bio.unipd.it\/bioinfo\/HGXP]"},{"key":"446_CR35","unstructured":"OMIM[http:\/\/www.ncbi.nlm.nih.gov\/entrez\/query.fcgi?db=OMIM]"},{"key":"446_CR36","unstructured":"RetNet[http:\/\/www.sph.uth.tmc.edu\/Retnet\/]"},{"key":"446_CR37","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1093\/nar\/29.1.137","volume":"29","author":"KD Pruitt","year":"2001","unstructured":"Pruitt KD, Maglott DR: RefSeq and LocusLink: NCBI gene-centered resources. Nucleic Acids Res 2001, 29: 137\u2013140. 10.1093\/nar\/29.1.137","journal-title":"Nucleic Acids Res"},{"key":"446_CR38","doi-asserted-by":"publisher","first-page":"1542","DOI":"10.1093\/bioinformatics\/18.11.1542","volume":"18","author":"M Safran","year":"2002","unstructured":"Safran M, Solomonm I, Shmueli O, Lapidot M, Shen-Orr S, Adato A, Ben-Dor U, Esterman N, Rosen N, Peter I, Olender T, Chalifa-Caspi V, Lancet D: GeneCards 2002: towards a complete, object-oriented, human gene compendium. Bioinformatics 2002, 18: 1542\u20131543. 10.1093\/bioinformatics\/18.11.1542","journal-title":"Bioinformatics"},{"key":"446_CR39","unstructured":"GeneCards[http:\/\/bioinfo.weizmann.ac.il\/cards\/]"},{"key":"446_CR40","unstructured":"BLAT[http:\/\/genome.ucsc.edu\/cgi-bin\/hgBlat]"},{"key":"446_CR41","unstructured":"Acembly[http:\/\/www.infobiogen.fr\/doc\/ACEDBdoc\/Acembly.doc.html]"},{"key":"446_CR42","unstructured":"RepeatMasker[http:\/\/www.repeatmasker.org]"},{"key":"446_CR43","unstructured":"Biobase[http:\/\/www.biobase.de\/]"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-6-121.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/1471-2105-6-121\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-6-121.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,7]],"date-time":"2024-10-07T12:12:23Z","timestamp":1728303143000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-6-121"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,5,18]]},"references-count":43,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2005,12]]}},"alternative-id":["446"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-6-121","relation":{},"ISSN":["1471-2105"],"issn-type":[{"type":"electronic","value":"1471-2105"}],"subject":[],"published":{"date-parts":[[2005,5,18]]},"assertion":[{"value":"12 November 2004","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 May 2005","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"18 May 2005","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"121"}}