{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,31]],"date-time":"2025-10-31T14:21:42Z","timestamp":1761920502422},"reference-count":33,"publisher":"Oxford University Press (OUP)","issue":"20","license":[{"start":{"date-parts":[[2018,9,11]],"date-time":"2018-09-11T00:00:00Z","timestamp":1536624000000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"name":"InDAM Projects"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2018,10,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Although the nucleosome occupancy along a genome can be in part predicted by in vitro experiments, it has been recently observed that the chromatin organization presents important differences in vitro with respect to in vivo. Such differences mainly regard the hierarchical and regular structures of the nucleosome fiber, whose existence has long been assumed, and in part also observed in vitro, but that does not apparently occur in vivo. It is also well known that the DNA sequence has a role in determining the nucleosome occupancy. Therefore, an important issue is to understand if, and to what extent, the structural differences in the chromatin organization between in vitro and in vivo have a counterpart in terms of the underlying genomic sequences.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We present the first quantitative comparison between the in vitro and in vivo nucleosome maps of two model organisms (S. cerevisiae and C. elegans). The comparison is based on the construction of weighted k-mer dictionaries. 
Our findings show that there is a good level of sequence conservation between in vitro and in vivo in both the two organisms, in contrast to the abovementioned important differences in chromatin structural organization. Moreover, our results provide evidence that the two organisms predispose themselves differently, in terms of sequence composition and both in vitro and in vivo, for the nucleosome occupancy. This leads to the conclusion that, although the notion of a genome encoding for its own nucleosome occupancy is general, the intrinsic histone k-mer sequence preferences tend to be species-specific.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The files containing the dictionaries and the main results of the analysis are available at http:\/\/math.unipa.it\/rombo\/material.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/bty799","type":"journal-article","created":{"date-parts":[[2018,9,8]],"date-time":"2018-09-08T11:14:50Z","timestamp":1536405290000},"page":"3454-3460","source":"Crossref","is-referenced-by-count":9,"title":["<i>In vitro<\/i> versus <i>in vivo<\/i> compositional landscapes of histone sequence preferences in eucaryotic genomes"],"prefix":"10.1093","volume":"34","author":[{"given":"Raffaele","family":"Giancarlo","sequence":"first","affiliation":[{"name":"Dipartimento di Matematica ed Informatica, Universit\u00e0 degli Studi di Palermo, Palermo, Italy"}]},{"given":"Simona E","family":"Rombo","sequence":"additional","affiliation":[{"name":"Dipartimento di Matematica ed Informatica, Universit\u00e0 degli Studi di Palermo, Palermo, 
Italy"}]},{"given":"Filippo","family":"Utro","sequence":"additional","affiliation":[{"name":"Computational Biology Center, IBM T. J. Watson Research, Yorktown Heights, NY, USA"}]}],"member":"286","published-online":{"date-parts":[[2018,9,10]]},"reference":[{"key":"2023012712413318400_bty799-B1","volume-title":"Molecular Biology of the Cell","author":"Alberts","year":"2002"},{"key":"2023012712413318400_bty799-B2","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1093\/bioinformatics\/btw562","article-title":"Prediction of nucleosome positioning by the incorporation of frequencies and distributions of three different nucleotide segment lengths into a general pseudo k-tuple nucleotide composition","volume":"33","author":"Awazu","year":"2017","journal-title":"Bioinformatics"},{"key":"2023012712413318400_bty799-B3","first-page":"245","article-title":"Protein binding microarrays (PBMs) for rapid, high-throughput characterization of the sequence specificities of DNA binding proteins","volume":"338","author":"Berger","year":"2006","journal-title":"Methods Mol. Biol"},{"key":"2023012712413318400_bty799-B4","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1016\/j.molcel.2012.06.028","article-title":"DNA sequence preferences of transcriptional activators correlate more strongly than repressors with nucleosomes","volume":"47","author":"Charoensawan","year":"2012","journal-title":"Mol. Cell"},{"key":"2023012712413318400_bty799-B5","first-page":"1096","article-title":"2D motif basis applied to the classification of digital images","volume":"60","author":"Furfaro","year":"2017","journal-title":"Comput. J"},{"key":"2023012712413318400_bty799-B6","doi-asserted-by":"crossref","first-page":"884","DOI":"10.15252\/msb.20167131","article-title":"A gene-centered C. elegans protein\u2013DNA interaction network provides a framework for functional predictions","volume":"12","author":"Fuxman Bass","year":"2016","journal-title":"Mol. Syst. 
Biol"},{"key":"2023012712413318400_bty799-B7","doi-asserted-by":"crossref","first-page":"390","DOI":"10.1093\/bib\/bbt088","article-title":"Compressive biological sequence analysis and archival in the era of high-throughput sequencing technologies","volume":"15","author":"Giancarlo","year":"2014","journal-title":"Brief. Bioinform"},{"key":"2023012712413318400_bty799-B8","doi-asserted-by":"crossref","first-page":"2939","DOI":"10.1093\/bioinformatics\/btv295","article-title":"Epigenomic k-mer dictionaries: shedding light on how sequence composition influences in vivo nucleosome positioning","volume":"31","author":"Giancarlo","year":"2015","journal-title":"Bioinformatics"},{"key":"2023012712413318400_bty799-B9","article-title":"DNA combinatorial messages and epigenomics: the case of chromatin organization and nucleosome occupancy in eukaryotic genomes","author":"Giancarlo","year":"2018","journal-title":"Theor. Comput. Sci"},{"key":"2023012712413318400_bty799-B10","doi-asserted-by":"crossref","first-page":"644","DOI":"10.1038\/nbt.1883","article-title":"Full-length transcriptome assembly from RNA-Seq data without a reference genome","volume":"29","author":"Grabherr","year":"2011","journal-title":"Nat. 
Biotechnol"},{"key":"2023012712413318400_bty799-B11","doi-asserted-by":"crossref","first-page":"1621","DOI":"10.1038\/emboj.2012.66","article-title":"Human mitotic chromosome structure: what happened to the 30-nm fibre?","volume":"31","author":"Hansen","year":"2012","journal-title":"EMBO J"},{"key":"2023012712413318400_bty799-B12","doi-asserted-by":"crossref","first-page":"362","DOI":"10.1038\/nature07667","article-title":"The DNA-encoded nucleosome organization of a eukaryotic genome","volume":"458","author":"Kaplan","year":"2009","journal-title":"Nature"},{"key":"2023012712413318400_bty799-B13","doi-asserted-by":"crossref","first-page":"709","DOI":"10.1016\/j.cell.2016.09.045","article-title":"Genomic nucleosome organization reconstituted with pure proteins","volume":"167","author":"Krietenstein","year":"2016","journal-title":"Cell"},{"key":"2023012712413318400_bty799-B14","doi-asserted-by":"crossref","first-page":"707","DOI":"10.1016\/j.cell.2007.01.015","article-title":"The role of chromatin during transcription","volume":"128","author":"Li","year":"2007","journal-title":"Cell"},{"key":"2023012712413318400_bty799-B15","first-page":"114","volume-title":"Proceedings of CIBB","author":"Lo Bosco","year":"2016"},{"key":"2023012712413318400_bty799-B16","doi-asserted-by":"crossref","first-page":"284","DOI":"10.1186\/1471-2164-14-284","article-title":"Global remodeling of nucleosome positions in C. 
elegans","volume":"14","author":"Locke","year":"2013","journal-title":"BMC Genomics"},{"key":"2023012712413318400_bty799-B17","doi-asserted-by":"crossref","first-page":"2492","DOI":"10.1101\/gad.250704.114","article-title":"Role of DNA sequence in chromatin remodeling and the formation of nucleosome-free regions","volume":"28","author":"Lorch","year":"2014","journal-title":"Genes Dev"},{"key":"2023012712413318400_bty799-B18","doi-asserted-by":"crossref","first-page":"1826","DOI":"10.1093\/bioinformatics\/bty018","article-title":"Informational and linguistic analysis of large genomic sequence collections via efficient Hadoop cluster algorithms","volume":"34","author":"Petrillo","year":"2018","journal-title":"Bioinformatics"},{"key":"2023012712413318400_bty799-B19","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1186\/s13015-016-0072-x","article-title":"MissMax: alignment-free sequence comparison with mismatches through filtering and heuristics","volume":"11","author":"Pizzi","year":"2016","journal-title":"Algorithms Mol. Biol"},{"key":"2023012712413318400_bty799-B20","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1109\/TCBB.2016.2620143","article-title":"Efficient algorithms for sequence analysis with entropic profiles","volume":"15","author":"Pizzi","year":"2018","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform"},{"key":"2023012712413318400_bty799-B21","doi-asserted-by":"crossref","first-page":"258","DOI":"10.1016\/j.ydbio.2009.06.012","article-title":"Nucleosome positioning: how is it established, and why does it matter?","volume":"339","author":"Radman-Livaja","year":"2010","journal-title":"Dev. 
Biol"},{"key":"2023012712413318400_bty799-B22","doi-asserted-by":"crossref","first-page":"653","DOI":"10.4161\/epi.28297","article-title":"Chromatin without the 30-nm fiber: constrained disorder instead of hierarchical folding","volume":"9","author":"Razin","year":"2014","journal-title":"Epigenetics"},{"key":"2023012712413318400_bty799-B23","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1016\/j.cell.2015.01.054","article-title":"Chromatin fibers are formed by heterogeneous groups of nucleosomes in vivo","volume":"160","author":"Ricci","year":"2015","journal-title":"Cell"},{"key":"2023012712413318400_bty799-B24","doi-asserted-by":"crossref","first-page":"6506","DOI":"10.1073\/pnas.0601212103","article-title":"EM measurements define the dimensions of the 30-nm chromatin fiber: evidence for a compact, interdigitated structure","volume":"103","author":"Robinson","year":"2006","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023012712413318400_bty799-B25","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1016\/j.tcs.2012.06.021","article-title":"Extracting string motif bases for quorum higher than two","volume":"460","author":"Rombo","year":"2012","journal-title":"Theor. Comput. Sci"},{"key":"2023012712413318400_bty799-B26","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/j.sbi.2009.01.004","article-title":"Poly(dA:dT) tracts: major determinants of nucleosome organization","volume":"19","author":"Segal","year":"2009","journal-title":"Curr. Opin. Struct. Biol"},{"key":"2023012712413318400_bty799-B27","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1038\/nsmb.2506","article-title":"Determinants of nucleosome positioning","volume":"20","author":"Struhl","year":"2013","journal-title":"Nat. Struct. Mol. 
Biol"},{"key":"2023012712413318400_bty799-B28","doi-asserted-by":"crossref","first-page":"D158","DOI":"10.1093\/nar\/gkw1099","article-title":"UniProt: the universal protein knowledgebase","volume":"45","author":"The UniProt Consortium","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023012712413318400_bty799-B29","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1016\/j.bpj.2016.12.041","article-title":"Genomes of multicellular organisms have evolved to attract nucleosomes to promoter regions","volume":"112","author":"Tompitak","year":"2017","journal-title":"Biophys. J"},{"key":"2023012712413318400_bty799-B30","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1016\/j.cell.2007.02.008","article-title":"Higher-order structures of chromatin: the elusive 30 nm fiber","volume":"128","author":"Tremethick","year":"2007","journal-title":"Cell"},{"key":"2023012712413318400_bty799-B31","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1093\/bioinformatics\/btv679","article-title":"The intrinsic combinatorial organization and information theoretic content of a sequence are correlated to the DNA encoded nucleosome organization of eukaryotic genomes","volume":"32","author":"Utro","year":"2016","journal-title":"Bioinformatics"},{"key":"2023012712413318400_bty799-B32","doi-asserted-by":"crossref","first-page":"977","DOI":"10.1126\/science.1200508","article-title":"A packing mechanism for nucleosome organization reconstituted across a eukaryotic genome","volume":"332","author":"Zhang","year":"2011","journal-title":"Science"},{"key":"2023012712413318400_bty799-B33","article-title":"SlopMap: a software application tool for quick and flexible identification of similar sequences using exact k-mer matching","volume":"4","author":"Zhbannikov","year":"2013","journal-title":"J. Data Min. 
Genomics Proteomics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/20\/3454\/48918851\/bioinformatics_34_20_3454.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/34\/20\/3454\/48918851\/bioinformatics_34_20_3454.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,27]],"date-time":"2023-01-27T13:33:52Z","timestamp":1674826432000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/34\/20\/3454\/5094779"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2018,9,10]]},"references-count":33,"journal-issue":{"issue":"20","published-print":{"date-parts":[[2018,10,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bty799","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2018,10,15]]},"published":{"date-parts":[[2018,9,10]]}}}