{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,24]],"date-time":"2025-10-24T08:24:57Z","timestamp":1761294297555},"reference-count":25,"publisher":"Oxford University Press (OUP)","issue":"Supplement_1","license":[{"start":{"date-parts":[[2021,7,12]],"date-time":"2021-07-12T00:00:00Z","timestamp":1626048000000},"content-version":"vor","delay-in-days":11,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Department of Biotechnology, Government of India","award":["BT\/IN\/BMBF-BioHr\/32\/LN\/2018-19"],"award-info":[{"award-number":["BT\/IN\/BMBF-BioHr\/32\/LN\/2018-19"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,8,4]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>High-throughput chromatin immunoprecipitation (ChIP) sequencing-based assays capture genomic regions associated with the profiled transcription factor (TF). ChIP-exo is a modified protocol, which uses lambda exonuclease to digest DNA close to the TF-DNA complex, in order to improve on the positional resolution of the TF-DNA contact. Because the digestion occurs in the 5\u2032\u20133\u2032 orientation, the protocol produces directional footprints close to the complex, on both sides of the double stranded DNA. Like all ChIP-based methods, ChIP-exo reports a mixture of different regions associated with the TF: those bound directly to the TF as well as via intermediaries. However, the distribution of footprints are likely to be indicative of the complex forming at the DNA.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We present ExoDiversity, which uses a model-based framework to learn a joint distribution over footprints and motifs, thus resolving the mixture of ChIP-exo footprints into diverse binding modes. It uses no prior motif or TF information and automatically learns the number of different modes from the data. We show its application on a wide range of TFs and organisms\/cell-types. Because its goal is to explain the complete set of reported regions, it is able to identify co-factor TF motifs that appear in a small fraction of the dataset. Further, ExoDiversity discovers small nucleotide variations within and outside canonical motifs, which co-occur with variations in footprints, suggesting that the TF-DNA structural configuration at those regions is likely to be different. Finally, we show that detected modes have specific DNA shape features and conservation signals, giving insights into the structure and function of the putative TF-DNA complexes.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The code for ExoDiversity is available on https:\/\/github.com\/NarlikarLab\/exoDIVERSITY.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab274","type":"journal-article","created":{"date-parts":[[2021,4,23]],"date-time":"2021-04-23T00:04:18Z","timestamp":1619136258000},"page":"i367-i375","source":"Crossref","is-referenced-by-count":3,"title":["Resolving diverse protein\u2013DNA footprints from exonuclease-based ChIP experiments"],"prefix":"10.1093","volume":"37","author":[{"given":"Anushua","family":"Biswas","sequence":"first","affiliation":[{"name":"Department of Chemical Engineering, CSIR-National Chemical Laboratory , Pune 411008, India"},{"name":"Academy of Scientific and Innovative Research , Ghaziabad 201002, India"}]},{"given":"Leelavati","family":"Narlikar","sequence":"additional","affiliation":[{"name":"Department of Chemical Engineering, CSIR-National Chemical Laboratory , Pune 411008, India"},{"name":"Academy of Scientific and Innovative Research , Ghaziabad 201002, India"}]}],"member":"286","published-online":{"date-parts":[[2021,7,12]]},"reference":[{"key":"2023062410164622300_btab274-B1","doi-asserted-by":"crossref","first-page":"354","DOI":"10.1038\/s41588-021-00782-6","article-title":"Base-resolution models of transcription-factor binding reveal soft motif syntax","volume":"53","author":"Avsec","year":"2021","journal-title":"Nat. Genet"},{"key":"2023062410164622300_btab274-B2","article-title":"A universal framework for detecting cis-regulatory diversity in DNA regulatory regions","author":"Biswas","year":"2020","journal-title":"bioRxiv"},{"key":"2023062410164622300_btab274-B3","doi-asserted-by":"crossref","first-page":"e1002770","DOI":"10.1371\/journal.pgen.1002770","article-title":"Genome-wide location analysis reveals distinct transcriptional circuitry by paralogous regulators Foxa1 and Foxa2","volume":"8","author":"Bochkis","year":"2012","journal-title":"PLoS Genet"},{"key":"2023062410164622300_btab274-B4","doi-asserted-by":"crossref","first-page":"e101177","DOI":"10.1371\/journal.pone.0101177","article-title":"Microsatellite repeat instability fuels evolution of embryonic enhancers in Hawaiian Drosophila","volume":"9","author":"Brittain","year":"2014","journal-title":"PLoS One"},{"key":"2023062410164622300_btab274-B5","doi-asserted-by":"crossref","first-page":"D103","DOI":"10.1093\/nar\/gku977","article-title":"GBshape: a genome browser database for DNA shape annotations","volume":"43","author":"Chiu","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"2023062410164622300_btab274-B6","doi-asserted-by":"crossref","first-page":"e85629","DOI":"10.1371\/journal.pone.0085629","article-title":"On the value of intra-motif dependencies of human insulator protein CTCF","volume":"9","author":"Eggeling","year":"2014","journal-title":"PLoS One"},{"key":"2023062410164622300_btab274-B7","doi-asserted-by":"crossref","first-page":"1728","DOI":"10.1038\/nprot.2012.101","article-title":"Identifying ChIP-seq enrichment using MACS","volume":"7","author":"Feng","year":"2012","journal-title":"Nat. Protoc"},{"key":"2023062410164622300_btab274-B8","doi-asserted-by":"crossref","first-page":"840","DOI":"10.1038\/nrg3306","article-title":"ChIP-seq and beyond: new and improved methodologies to detect and characterize protein-DNA interactions","volume":"13","author":"Furey","year":"2012","journal-title":"Nat. Rev. Genet"},{"key":"2023062410164622300_btab274-B9","doi-asserted-by":"crossref","first-page":"e1003711","DOI":"10.1371\/journal.pcbi.1003711","article-title":"Enhanced regulatory sequence prediction using gapped k-mer features","volume":"10","author":"Ghandi","year":"2014","journal-title":"PLoS Comput. Biol"},{"key":"2023062410164622300_btab274-B10","doi-asserted-by":"crossref","first-page":"1093","DOI":"10.1016\/j.celrep.2013.03.014","article-title":"Genomic regions flanking E-box binding sites influence DNA binding specificity of bHLH transcription factors through DNA shape","volume":"3","author":"Gordan","year":"2013","journal-title":"Cell Rep"},{"key":"2023062410164622300_btab274-B11","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1038\/nbt.3121","article-title":"ChIP-nexus enables improved detection of in vivo transcription factor binding footprints","volume":"33","author":"He","year":"2015","journal-title":"Nat. Biotechnol"},{"key":"2023062410164622300_btab274-B12","doi-asserted-by":"crossref","first-page":"D493","DOI":"10.1093\/nar\/gkh103","article-title":"The UCSC Table Browser data retrieval tool","volume":"32","author":"Karolchik","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023062410164622300_btab274-B13","doi-asserted-by":"crossref","first-page":"958","DOI":"10.1080\/01621459.1994.10476829","article-title":"The collapsed Gibbs sampler with applications to a gene regulation problem","volume":"89","author":"Liu","year":"1994","journal-title":"J. Am. Stat. Assoc"},{"key":"2023062410164622300_btab274-B14","doi-asserted-by":"crossref","first-page":"e1003501","DOI":"10.1371\/journal.pcbi.1003501","article-title":"An integrated model of multiple-condition ChIP-Seq data reveals predeterminants of Cdx2 binding","volume":"10","author":"Mahony","year":"2014","journal-title":"PLoS Comput. Biol"},{"key":"2023062410164622300_btab274-B15","doi-asserted-by":"crossref","first-page":"e1006090","DOI":"10.1371\/journal.pcbi.1006090","article-title":"Diversity in binding, regulation, and evolution revealed from high-throughput chip","volume":"14","author":"Mitra","year":"2018","journal-title":"PLoS Comput. Biol"},{"key":"2023062410164622300_btab274-B16","volume-title":"Probabilistic Machine Learning: An Introduction","author":"Murphy","year":"2021"},{"key":"2023062410164622300_btab274-B17","doi-asserted-by":"crossref","first-page":"1678","DOI":"10.1016\/j.celrep.2013.04.024","article-title":"A genome-wide map of CTCF multivalency redefines the CTCF code","volume":"3","author":"Nakahashi","year":"2013","journal-title":"Cell Rep"},{"key":"2023062410164622300_btab274-B18","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1093\/nar\/gks950","article-title":"MuMoD: a Bayesian approach to detect multiple modes of protein-DNA binding from genome-wide ChIP data","volume":"41","author":"Narlikar","year":"2013","journal-title":"Nucleic Acids Res"},{"key":"2023062410164622300_btab274-B19","doi-asserted-by":"crossref","first-page":"research0087","DOI":"10.1186\/gb-2002-3-12-research0087","article-title":"Computational analysis of core promoters in the Drosophila genome","volume":"3","author":"Ohler","year":"2002","journal-title":"Genome Biol"},{"key":"2023062410164622300_btab274-B20","doi-asserted-by":"crossref","first-page":"1408","DOI":"10.1016\/j.cell.2011.11.013","article-title":"Comprehensive genome-wide protein\u2013DNA interactions detected at single-nucleotide resolution","volume":"147","author":"Rhee","year":"2011","journal-title":"Cell"},{"key":"2023062410164622300_btab274-B21","doi-asserted-by":"crossref","first-page":"2842","DOI":"10.1038\/s41467-018-05265-7","article-title":"Simplified ChIP-exo assays","volume":"9","author":"Rossi","year":"2018","journal-title":"Nat. Commun"},{"key":"2023062410164622300_btab274-B22","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1093\/nar\/12.1Part2.505","article-title":"Computer methods to locate signals in nucleic acid sequences","volume":"12","author":"Staden","year":"1984","journal-title":"Nucleic Acids Res"},{"key":"2023062410164622300_btab274-B23","doi-asserted-by":"crossref","first-page":"825","DOI":"10.1101\/gr.185157.114","article-title":"Chip-exo signal associated with DNA-binding motifs provides insight into the genomic binding of the glucocorticoid receptor and cooperating transcription factors","volume":"25","author":"Starick","year":"2015","journal-title":"Genome Res"},{"key":"2023062410164622300_btab274-B24","doi-asserted-by":"crossref","first-page":"903","DOI":"10.1093\/bioinformatics\/bty703","article-title":"Characterizing protein\u2013DNA binding event subtypes in chip-exo data","volume":"35","author":"Yamada","year":"2019","journal-title":"Bioinformatics"},{"key":"2023062410164622300_btab274-B25","doi-asserted-by":"crossref","first-page":"1147","DOI":"10.1101\/gr.169243.113","article-title":"Dissection of thousands of cell type-specific enhancers identifies dinucleotide repeat motifs as general enhancer features","volume":"24","author":"Yanez-Cuna","year":"2014","journal-title":"Genome Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/Supplement_1\/i367\/50694027\/btab274.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/Supplement_1\/i367\/50694027\/btab274.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,25]],"date-time":"2023-06-25T00:15:29Z","timestamp":1687652129000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/Supplement_1\/i367\/6319667"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,1]]},"references-count":25,"journal-issue":{"issue":"Supplement_1","published-print":{"date-parts":[[2021,8,4]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab274","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,7,1]]},"published":{"date-parts":[[2021,7,1]]}}}