{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T14:56:55Z","timestamp":1781103415213,"version":"3.54.1"},"reference-count":0,"publisher":"National Academy of Sciences","issue":"18","content-domain":{"domain":["www.pnas.org"],"crossmark-restriction":true},"short-container-title":["Proc. Natl. Acad. Sci. U.S.A."],"published-print":{"date-parts":[[1995,8,29]]},"abstract":"<jats:p>The bithorax complex (BX-C) of Drosophila, one of two complexes that act as master regulators of the body plan of the fly, has now been entirely sequenced and comprises approximately 315,000 bp, only 1.4% of which codes for protein. Analysis of this sequence reveals significantly overrepresented DNA motifs of unknown, as well as known, functions in the non-protein-coding portion of the sequence. The following types of motifs in that portion are analyzed: (i) concatamers of mono-, di-, and trinucleotides; (ii) tightly clustered hexanucleotides (spaced &lt; or = 5 bases apart); (iii) direct and reverse repeats longer than 20 bp; and (iv) a number of motifs known from biochemical studies to play a role in the regulation of the BX-C. The hexanucleotide AGATAC is remarkably overrepresented and is surmised to play a role in chromosome pairing. The positions of sites of highly overrepresented motifs are plotted for those that occur at more than five sites in the sequence, when &lt; 0.5 case is expected. Expected values are based on a third-order Markov chain, which is the optimal order for representing the BXCALL sequence.<\/jats:p>","DOI":"10.1073\/pnas.92.18.8403","type":"journal-article","created":{"date-parts":[[2006,5,31]],"date-time":"2006-05-31T09:17:21Z","timestamp":1149067041000},"page":"8403-8407","update-policy":"https:\/\/doi.org\/10.1073\/pnas.cm10313","source":"Crossref","is-referenced-by-count":24,"title":["Sequence analysis of the cis-regulatory regions of the bithorax complex of Drosophila."],"prefix":"10.1073","volume":"92","author":[{"given":"E B","family":"Lewis","sequence":"first","affiliation":[{"name":"Division of Biology, California Institute of Technology, Pasadena 91125, USA."}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"J D","family":"Knafels","sequence":"additional","affiliation":[{"name":"Division of Biology, California Institute of Technology, Pasadena 91125, USA."}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"D R","family":"Mathog","sequence":"additional","affiliation":[{"name":"Division of Biology, California Institute of Technology, Pasadena 91125, USA."}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"S E","family":"Celniker","sequence":"additional","affiliation":[{"name":"Division of Biology, California Institute of Technology, Pasadena 91125, USA."}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"341","published-online":{"date-parts":[[1995,8,29]]},"container-title":["Proceedings of the National Academy of Sciences"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/pnas.org\/doi\/pdf\/10.1073\/pnas.92.18.8403","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,4,13]],"date-time":"2022-04-13T13:49:46Z","timestamp":1649857786000},"score":1,"resource":{"primary":{"URL":"https:\/\/pnas.org\/doi\/full\/10.1073\/pnas.92.18.8403"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1995,8,29]]},"references-count":0,"journal-issue":{"issue":"18","published-print":{"date-parts":[[1995,8,29]]}},"alternative-id":["10.1073\/pnas.92.18.8403"],"URL":"https:\/\/doi.org\/10.1073\/pnas.92.18.8403","relation":{},"ISSN":["0027-8424","1091-6490"],"issn-type":[{"value":"0027-8424","type":"print"},{"value":"1091-6490","type":"electronic"}],"subject":[],"published":{"date-parts":[[1995,8,29]]},"assertion":[{"value":"1995-08-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}