{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,1,16]],"date-time":"2025-01-16T21:40:08Z","timestamp":1737063608464,"version":"3.33.0"},"publisher-location":"Berlin, Heidelberg","reference-count":27,"publisher":"Springer Berlin Heidelberg","isbn-type":[{"type":"print","value":"9783540425168"},{"type":"electronic","value":"9783540446965"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2001]]},"DOI":"10.1007\/3-540-44696-6_7","type":"book-chapter","created":{"date-parts":[[2007,6,1]],"date-time":"2007-06-01T03:45:10Z","timestamp":1180669510000},"page":"85-97","source":"Crossref","is-referenced-by-count":7,"title":["Assessing the Statistical Significance of Overrepresented Oligonucleotides"],"prefix":"10.1007","author":[{"given":"Alain","family":"Denise","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mireille","family":"R\u00e9gnier","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mathias","family":"Vandenbogaert","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2001,8,17]]},"reference":[{"key":"7_CR1","doi-asserted-by":"publisher","first-page":"1001","DOI":"10.1101\/gr.10.7.1001","volume":"10","author":"E. Beaudoing","year":"2000","unstructured":"E. Beaudoing, S. Freier, J. Wyatt, J.M. Claverie, and D. Gautheret. Patterns of Variant Polyadenylation Signal Usage in Human Genes. Genome Research., 10:1001\u20131010, 2000.","journal-title":"Genome Research"},{"key":"7_CR2","doi-asserted-by":"crossref","unstructured":"J. Buhler and M. Tompa. Finding Motifs Using Random Projections. In RECOMB\u2019 01, pages 69\u201376. ACM-, 2001. Proc.RECOMBrs01, Montr\u00e9al.","DOI":"10.1145\/369133.369172"},{"key":"7_CR3","unstructured":"A. Denise and M. R\u00e9gnier. Word statistics conditioned by overrepresented words, 2001. in preparation; http:\/\/algo.inria.fr\/regnier\/index.html ."},{"issue":"12","key":"7_CR4","doi-asserted-by":"publisher","first-page":"2430","DOI":"10.1093\/nar\/25.12.2430","volume":"25","author":"M.S. Gelfand","year":"1997","unstructured":"M.S. Gelfand and E.V. Koonin. Avoidance of palindromic words in bacterial and ar-chaeal genomes: a close connection with restriction enzymess. Nucleic Acids Research, 25(12):2430\u20132439, 1997.","journal-title":"Nucleic Acids Research"},{"key":"7_CR5","doi-asserted-by":"publisher","first-page":"877","DOI":"10.2307\/3215201","volume":"32","author":"M. Geske","year":"1995","unstructured":"M. Geske, A. Godbole, A. Schafner, A. Skolnick, and G. Wallstrom. Compound Poisson Approximations for Word Patterns Under Markovian Hypotheses. J. Appl. Prob., 32:877\u2013892, 1995.","journal-title":"J. Appl. Prob."},{"key":"7_CR6","doi-asserted-by":"publisher","first-page":"5873","DOI":"10.1073\/pnas.90.12.5873","volume":"90","author":"R. Karlin","year":"1993","unstructured":"R. Karlin and S.F. Altschul. Applications and statistics for multiple high-scoring segments in molecular sequences. Proc. Natl. Acad. Sci. U.S.A., 90:5873\u20135877, 1993.","journal-title":"Proc. Natl. Acad. Sci. U.S.A."},{"issue":"1","key":"7_CR7","doi-asserted-by":"publisher","first-page":"57","DOI":"10.1016\/S0097-8485(99)00047-9","volume":"24","author":"M. Klaerr-Blanchard","year":"2000","unstructured":"Maude Klaerr-Blanchard, H\u00e9l\u00e8ne Chiapello, and Eivind Coward. Detecting localized repeats in genomic sequences: A new strategy and its application to B. subtilis and A. thaliana sequences. Comput. Chem., 24(1):57\u201370, 2000.","journal-title":"Comput. Chem."},{"key":"7_CR8","first-page":"433","volume":"8","author":"J. Kleffe","year":"1992","unstructured":"J. Kleffe and M. Borodovsky. First and second moment counts of words in random texts generated by Markov chains. Comput. Appl. Biosci., 8, 433\u2013441, 1992.","journal-title":"Comput. Appl. Biosci."},{"key":"7_CR9","doi-asserted-by":"crossref","unstructured":"X. Liu, D.L. Brutlag, and J. Liu. Bioprospector: Discovering conserved dna motifs in upstream regulatory regions of co-expressed gene. In 6-th Pacific Symposium on Biocomputing, pages 127\u2013138, 2001.","DOI":"10.1142\/9789814447362_0014"},{"key":"7_CR10","doi-asserted-by":"crossref","unstructured":"L. Marsan and M.F. Sagot. Extracting structured motifs using a suffix tree-algorithms and application to promoter consensus identification. In RECOMB\u201900, pages 210\u2013219. ACM-, 2000. Proceedings RECOMB\u201900, Tokyo.","DOI":"10.1145\/332306.332553"},{"key":"7_CR11","unstructured":"P. Nicod\u00e8me. The symbolic package Regexpcount. In GCB\u201900, 2000. presented at GCB\u201900, Heidelberg, October 2000; available at http:\/\/algo.inria.fr\/libraries\/software.html ."},{"key":"7_CR12","series-title":"Phd thesis","volume-title":"Grandes d\u00e9viations et chaines de Markov pour l\u2019\u00e8tude des mots exceptionnels dans les s\u00e9quences biologiques","author":"G. Nuel","year":"2001","unstructured":"G. Nuel. Grandes d\u00e9viations et chaines de Markov pour l\u2019\u00e8tude des mots exceptionnels dans les s\u00e9quences biologiques. Phd thesis, Universit\u00e9 Ren\u00e9 Descartes, Paris V, 2001. to be defended in July, 2001."},{"key":"7_CR13","doi-asserted-by":"crossref","first-page":"1013","DOI":"10.1080\/07391102.1989.10506528","volume":"6","author":"P.A. Pevzner","year":"1991","unstructured":"P.A. Pevzner, M. Borodovski, and A. Mironov. Linguistic of Nucleotide sequences:The Significance of Deviations from the Mean: Statistical Characteristics and Prediction of the Frequency of Occurrences of Words. J. Biomol. Struct. Dynam., 6:1013\u20131026, 1991.","journal-title":"J. Biomol. Struct. Dynam."},{"issue":"2","key":"7_CR14","first-page":"215","volume":"34","author":"E.M. Panina","year":"2000","unstructured":"E.M. Panina, A.A. Mironov, and M.S. Gelfand. Statistical analysis of Complete Bacterial Genomes: Avoidance of Palindromes and Restriction-Modification Systems. Genomics. Proteomics. Bioinformatics, 34(2):215\u2013221, 2000.","journal-title":"Genomics. Proteomics. Bioinformatics"},{"key":"7_CR15","doi-asserted-by":"publisher","first-page":"939","DOI":"10.1038\/nbt1098-939","volume":"16","author":"F.R. Roth","year":"1998","unstructured":"F.R. Roth, J.D. Hughes, P.E. Estep, and G.M. Church. Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation. Nature Biotechnol., 16:939\u2013945, 1998.","journal-title":"Nature Biotechnol."},{"key":"7_CR16","unstructured":"M. R\u00e9gnier, A. Lifanov, and V. Makeev. Three variations on word counting. In GCB\u201900, pages 75\u201382. Logos-Verlag, 2000. Proc. German Conference on Bioinformatics, Heidelberg; submitted to BioInformatics."},{"issue":"4","key":"7_CR17","doi-asserted-by":"publisher","first-page":"631","DOI":"10.1007\/PL00009244","volume":"22","author":"M. R\u00e9gnier","year":"1998","unstructured":"M. R\u00e9gnier and W. Szpankowski. On Pattern Frequency Occurrences in a Markovian Sequence. Algorithmica, 22(4):631\u2013649, 1998. preliminary draft at ISIT\u201997.","journal-title":"Algorithmica"},{"key":"7_CR18","doi-asserted-by":"crossref","unstructured":"G. Reinert and S. Schbath. Compound Poisson Approximation for Occurrences of Multiple Words in Markov Chains. Journal of Computational Biology, 5(2):223\u2013253","DOI":"10.1089\/cmb.1998.5.223"},{"issue":"1","key":"7_CR19","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1089\/10665270050081360","volume":"7","author":"G. Reinert","year":"2000","unstructured":"G. Reinert, S. Schbath, and M. Waterman. Probabilistic and Statistical Properties of Words: An Overview. Journal of Computational Biology, 7(1):1\u201346, 2000.","journal-title":"Journal of Computational Biology"},{"issue":"1","key":"7_CR20","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1239\/jap\/1032374240","volume":"36","author":"S. Robin","year":"1999","unstructured":"S. Robin and J. J. Daudin. Exact distribution of word occurrences in a random sequence of letters. J. Appl. Prob., 36(1): 179\u2013193, 1999.","journal-title":"J. Appl. Prob."},{"key":"7_CR21","doi-asserted-by":"publisher","first-page":"2971","DOI":"10.1093\/nar\/26.12.2971","volume":"26","author":"E. Rocha","year":"1998","unstructured":"E. Rocha, A. Viari, and A. Danchin. Oligonucleotides bias in bacillus subtilis: general trands and taxonomic comparisons. Nucl. Acids Research, 26:2971\u20132980, 1998.","journal-title":"Nucl. Acids Research"},{"issue":"11","key":"7_CR22","doi-asserted-by":"publisher","first-page":"968","DOI":"10.1093\/bioinformatics\/16.11.968","volume":"16","author":"A.T. Vasconcelos","year":"2000","unstructured":"A.T. Vasconcelos, M.A. Grivet-Mattoso-Maia, and D.F. de Almeida. Short interrupted palindromes on the extragenic DNA of Escherichia coli K-12, Haemophilus influenzae and Neis-seria meningitidis. BioInformatics, 16(11):968\u2013977, 2000.","journal-title":"BioInformatics"},{"key":"7_CR23","doi-asserted-by":"publisher","first-page":"827","DOI":"10.1006\/jmbi.1998.1947","volume":"281","author":"J. Helden van","year":"1998","unstructured":"J. van Helden, B. Andre, and J. Collado-Vides. Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. J. Mol. Biol., 281:827\u2013842, 1998.","journal-title":"J. Mol. Biol."},{"key":"7_CR24","unstructured":"Martin Tompa. An exact method for finding short motifs in sequences, with application to the ribosome binding site problem. In ISMB\u201999, pages 262\u2013271. AAAI Press, 1999. Seventh International Conference on Intelligent Systems for Molecular Biology, Heidelberg,Germany."},{"key":"7_CR25","doi-asserted-by":"publisher","first-page":"779","DOI":"10.1016\/S0923-2508(99)00115-1","volume":"150","author":"A. Vanet","year":"1999","unstructured":"A. Vanet, L. Marsan, and M.-F. Sagot. Promoter sequences and algorithmical methods for identifying them. Res. Microbiol., 150:779\u2013799, 1999.","journal-title":"Res. Microbiol"},{"key":"7_CR26","doi-asserted-by":"crossref","first-page":"515","DOI":"10.1007\/BF02459500","volume":"45","author":"M.S. Waterman","year":"1984","unstructured":"M.S. Waterman, R. Arratia, and D.J. Galas. Pattern recognition in several sequences: consensus and alignment. Bull. Math. Biol., 45, 515\u2013527, 1984.","journal-title":"Bull. Math. Biol."},{"key":"7_CR27","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4899-6846-3","volume-title":"Introduction to Computational Biology","author":"M. Waterman","year":"1995","unstructured":"M. Waterman. Introduction to Computational Biology. Chapman and Hall, London, 1995."}],"container-title":["Lecture Notes in Computer Science","Algorithms in Bioinformatics"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/3-540-44696-6_7","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,16]],"date-time":"2025-01-16T21:01:36Z","timestamp":1737061296000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/3-540-44696-6_7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2001]]},"ISBN":["9783540425168","9783540446965"],"references-count":27,"URL":"https:\/\/doi.org\/10.1007\/3-540-44696-6_7","relation":{},"ISSN":["0302-9743"],"issn-type":[{"type":"print","value":"0302-9743"}],"subject":[],"published":{"date-parts":[[2001]]}}}