{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T17:56:12Z","timestamp":1775930172757,"version":"3.50.1"},"reference-count":43,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2011,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Genome-wide mapping of protein-DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computational methods for systematically evaluating sequence specificity.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We present a simple, unbiased quantitative measure for DNA sequence specificity called the Motif Independent Measure (MIM). By analyzing both simulated and real experimental data, we found that the MIM measure can be used to detect sequence specificity independent of the presence of transcription factor (TF) binding motifs. We also found that the level of specificity associated with H3K4me1 target sequences is highly cell-type specific and highest in embryonic stem (ES) cells. 
We predicted H3K4me1 target sequences by using the N-score model and found that the prediction accuracy is indeed high in ES cells. The software to compute the MIM is freely available at: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" xlink:href=\"https:\/\/github.com\/lucapinello\/mim\" ext-link-type=\"uri\">https:\/\/github.com\/lucapinello\/mim<\/jats:ext-link>.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>Our method provides a unified framework for quantifying DNA sequence specificity and serves as a guide for the development of sequence-based prediction models.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1471-2105-12-408","type":"journal-article","created":{"date-parts":[[2012,1,6]],"date-time":"2012-01-06T07:51:52Z","timestamp":1325836312000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":21,"title":["A motif-independent metric for DNA sequence specificity"],"prefix":"10.1186","volume":"12","author":[{"given":"Luca","family":"Pinello","sequence":"first","affiliation":[]},{"given":"Giosu\u00e8","family":"Lo Bosco","sequence":"additional","affiliation":[]},{"given":"Bret","family":"Hanlon","sequence":"additional","affiliation":[]},{"given":"Guo-Cheng","family":"Yuan","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,10,21]]},"reference":[{"issue":"7146","key":"4986_CR1","doi-asserted-by":"publisher","first-page":"799","DOI":"10.1038\/nature05874","volume":"447","author":"E Birney","year":"2007","unstructured":"Birney E, et al.: Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 2007, 447(7146):799\u2013816. 
10.1038\/nature05874","journal-title":"Nature"},{"issue":"7216","key":"4986_CR2","doi-asserted-by":"publisher","first-page":"1061","DOI":"10.1038\/nature07385","volume":"455","author":"TCGA","year":"2008","unstructured":"TCGA: Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 2008, 455(7216):1061\u20138. 10.1038\/nature07385","journal-title":"Nature"},{"key":"4986_CR3","first-page":"67","volume":"8","author":"HJ Bussemaker","year":"2000","unstructured":"Bussemaker HJ, Li H, Siggia ED: Regulatory element detection using a probabilistic segmentation model. Proc Int Conf Intell Syst Mol Biol 2000, 8: 67\u201374.","journal-title":"Proc Int Conf Intell Syst Mol Biol"},{"issue":"4","key":"4986_CR4","doi-asserted-by":"publisher","first-page":"823","DOI":"10.1016\/j.cell.2007.05.009","volume":"129","author":"A Barski","year":"2007","unstructured":"Barski A, et al.: High-resolution profiling of histone methylations in the human genome. Cell 2007, 129(4):823\u201337. 10.1016\/j.cell.2007.05.009","journal-title":"Cell"},{"issue":"7153","key":"4986_CR5","doi-asserted-by":"publisher","first-page":"553","DOI":"10.1038\/nature06008","volume":"448","author":"TS Mikkelsen","year":"2007","unstructured":"Mikkelsen TS, et al.: Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature 2007, 448(7153):553\u201360. 10.1038\/nature06008","journal-title":"Nature"},{"issue":"7243","key":"4986_CR6","doi-asserted-by":"publisher","first-page":"108","DOI":"10.1038\/nature07829","volume":"459","author":"ND Heintzman","year":"2009","unstructured":"Heintzman ND, et al.: Histone modifications at human enhancers reflect global cell-type-specific gene expression. Nature 2009, 459(7243):108\u201312. 
10.1038\/nature07829","journal-title":"Nature"},{"issue":"1","key":"4986_CR7","doi-asserted-by":"publisher","first-page":"123","DOI":"10.1101\/gr.4074106","volume":"16","author":"GE Crawford","year":"2006","unstructured":"Crawford GE, et al.: Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS). Genome Res 2006, 16(1):123\u201331.","journal-title":"Genome Res"},{"issue":"2-3","key":"4986_CR8","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1089\/1066527041410382","volume":"11","author":"CH Yeang","year":"2004","unstructured":"Yeang CH, Ideker T, Jaakkola T: Physical network models. J Comput Biol 2004, 11(2\u20133):243\u201362. 10.1089\/1066527041410382","journal-title":"J Comput Biol"},{"issue":"7004","key":"4986_CR9","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1038\/nature02800","volume":"431","author":"CT Harbison","year":"2004","unstructured":"Harbison CT, et al.: Transcriptional regulatory code of a eukaryotic genome. Nature 2004, 431(7004):99\u2013104. 10.1038\/nature02800","journal-title":"Nature"},{"issue":"42","key":"4986_CR10","doi-asserted-by":"publisher","first-page":"16438","DOI":"10.1073\/pnas.0701014104","volume":"104","author":"Q Zhou","year":"2007","unstructured":"Zhou Q, et al.: A gene regulatory network in mouse embryonic stem cells. Proc Natl Acad Sci USA 2007, 104(42):16438\u201343. 10.1073\/pnas.0701014104","journal-title":"Proc Natl Acad Sci USA"},{"issue":"2","key":"4986_CR11","doi-asserted-by":"publisher","first-page":"R38","DOI":"10.1186\/gb-2008-9-2-r38","volume":"9","author":"LW Chang","year":"2008","unstructured":"Chang LW, et al.: Computational identification of the normal and perturbed genetic networks involved in myeloid differentiation and acute promyelocytic leukemia. Genome Biol 2008, 9(2):R38. 
10.1186\/gb-2008-9-2-r38","journal-title":"Genome Biol"},{"issue":"4","key":"4986_CR12","doi-asserted-by":"publisher","first-page":"693","DOI":"10.1016\/j.cell.2007.02.005","volume":"128","author":"T Kouzarides","year":"2007","unstructured":"Kouzarides T: Chromatin modifications and their function. Cell 2007, 128(4):693\u2013705. 10.1016\/j.cell.2007.02.005","journal-title":"Cell"},{"issue":"3","key":"4986_CR13","doi-asserted-by":"publisher","first-page":"161","DOI":"10.1038\/nrg2522","volume":"10","author":"C Jiang","year":"2009","unstructured":"Jiang C, Pugh BF: Nucleosome positioning and gene regulation: advances through genomics. Nat Rev Genet 2009, 10(3):161\u201372.","journal-title":"Nat Rev Genet"},{"issue":"6","key":"4986_CR14","doi-asserted-by":"publisher","first-page":"735","DOI":"10.1016\/j.molcel.2005.05.003","volume":"18","author":"EA Sekinger","year":"2005","unstructured":"Sekinger EA, Moqtaderi Z, Struhl K: Intrinsic histone-DNA interactions and low nucleosome density are important for preferential accessibility of promoter regions in yeast. Mol Cell 2005, 18(6):735\u201348. 10.1016\/j.molcel.2005.05.003","journal-title":"Mol Cell"},{"issue":"5734","key":"4986_CR15","doi-asserted-by":"publisher","first-page":"626","DOI":"10.1126\/science.1112178","volume":"309","author":"GC Yuan","year":"2005","unstructured":"Yuan GC, et al.: Genome-scale identification of nucleosome positions in S. cerevisiae. Science 2005, 309(5734):626\u201330. 10.1126\/science.1112178","journal-title":"Science"},{"issue":"8","key":"4986_CR16","doi-asserted-by":"publisher","first-page":"1170","DOI":"10.1101\/gr.6101007","volume":"17","author":"HE Peckham","year":"2007","unstructured":"Peckham HE, et al.: Nucleosome positioning signals in genomic DNA. Genome Res 2007, 17(8):1170\u20137. 
10.1101\/gr.6101007","journal-title":"Genome Res"},{"key":"4986_CR17","doi-asserted-by":"publisher","first-page":"442","DOI":"10.1186\/1471-2105-10-442","volume":"10","author":"D Tillo","year":"2009","unstructured":"Tillo D, Hughes TR: G+C content dominates intrinsic nucleosome occupancy. BMC Bioinformatics 2009, 10: 442. 10.1186\/1471-2105-10-442","journal-title":"BMC Bioinformatics"},{"issue":"11","key":"4986_CR18","doi-asserted-by":"publisher","first-page":"e1000216","DOI":"10.1371\/journal.pcbi.1000216","volume":"4","author":"Y Field","year":"2008","unstructured":"Field Y, et al.: Distinct modes of regulation by chromatin encoded through nucleosome positioning signals. PLoS Comput Biol 2008, 4(11):e1000216. 10.1371\/journal.pcbi.1000216","journal-title":"PLoS Comput Biol"},{"issue":"1","key":"4986_CR19","doi-asserted-by":"publisher","first-page":"e13","DOI":"10.1371\/journal.pcbi.0040013","volume":"4","author":"GC Yuan","year":"2008","unstructured":"Yuan GC, Liu JS: Genomic sequence is highly predictive of local nucleosome depletion. PLoS Comput Biol 2008, 4(1):e13. 10.1371\/journal.pcbi.0040013","journal-title":"PLoS Comput Biol"},{"issue":"10","key":"4986_CR20","doi-asserted-by":"publisher","first-page":"e1000242","DOI":"10.1371\/journal.pgen.1000242","volume":"4","author":"M Ku","year":"2008","unstructured":"Ku M, et al.: Genomewide analysis of PRC1 and PRC2 occupancy identifies two classes of bivalent domains. PLoS Genet 2008, 4(10):e1000242. 10.1371\/journal.pgen.1000242","journal-title":"PLoS Genet"},{"issue":"2","key":"4986_CR21","doi-asserted-by":"publisher","first-page":"341","DOI":"10.1089\/cmb.2008.18TT","volume":"16","author":"GC Yuan","year":"2009","unstructured":"Yuan GC: Targeted recruitment of histone modifications in humans predicted by genomic sequences. J Comput Biol 2009, 16(2):341\u201355. 
10.1089\/cmb.2008.18TT","journal-title":"J Comput Biol"},{"issue":"3","key":"4986_CR22","doi-asserted-by":"publisher","first-page":"e26","DOI":"10.1371\/journal.pgen.0020026","volume":"2","author":"C Bock","year":"2006","unstructured":"Bock C, et al.: CpG island methylation in human lymphocytes is highly correlated with DNA sequence, repeats, and predicted DNA structure. PLoS Genet 2006, 2(3):e26. 10.1371\/journal.pgen.0020026","journal-title":"PLoS Genet"},{"issue":"28","key":"4986_CR23","doi-asserted-by":"publisher","first-page":"10713","DOI":"10.1073\/pnas.0602949103","volume":"103","author":"R Das","year":"2006","unstructured":"Das R, et al.: Computational prediction of methylation status in human genomic sequences. Proc Natl Acad Sci USA 2006, 103(28):10713\u20136. 10.1073\/pnas.0602949103","journal-title":"Proc Natl Acad Sci USA"},{"issue":"4","key":"4986_CR24","first-page":"365","volume":"13","author":"SL Salzberg","year":"1997","unstructured":"Salzberg SL: A method for identifying splice sites and translational start sites in eukaryotic mRNA. Comput Appl Biosci 1997, 13(4):365\u201376.","journal-title":"Comput Appl Biosci"},{"issue":"9","key":"4986_CR25","doi-asserted-by":"publisher","first-page":"1389","DOI":"10.1101\/gr.6558107","volume":"17","author":"D DeCaprio","year":"2007","unstructured":"DeCaprio D, et al.: Conrad: gene prediction using conditional random fields. Genome Res 2007, 17(9):1389\u201398. 10.1101\/gr.6558107","journal-title":"Genome Res"},{"issue":"3","key":"4986_CR26","doi-asserted-by":"publisher","first-page":"381","DOI":"10.1101\/gr.098657.109","volume":"20","author":"L Narlikar","year":"2010","unstructured":"Narlikar L, et al.: Genome-wide discovery of human heart enhancers. Genome Res 2010, 20(3):381\u201392. 
10.1101\/gr.098657.109","journal-title":"Genome Res"},{"issue":"3","key":"4986_CR27","doi-asserted-by":"publisher","first-page":"645","DOI":"10.1111\/j.1541-0420.2006.00625.x","volume":"62","author":"H Ji","year":"2006","unstructured":"Ji H, Wong WH: Computational biology: toward deciphering gene regulatory information in mammalian genomes. Biometrics 2006, 62(3):645\u201363. 10.1111\/j.1541-0420.2006.00625.x","journal-title":"Biometrics"},{"issue":"1","key":"4986_CR28","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1214\/aoms\/1177729694","volume":"22","author":"S Kullback","year":"1951","unstructured":"Kullback S, Leibler RA: On Information and Sufficiency. The Annals of Mathematical Statistics 1951, 22(1):79\u201386. 10.1214\/aoms\/1177729694","journal-title":"The Annals of Mathematical Statistics"},{"key":"4986_CR29","doi-asserted-by":"publisher","first-page":"D91","DOI":"10.1093\/nar\/gkh012","volume":"32","author":"A Sandelin","year":"2004","unstructured":"Sandelin A, et al.: JASPAR: an open-access database for eukaryotic transcription factor binding profiles. Nucleic Acids Res 2004, (32 Database):D91\u20134.","journal-title":"Nucleic Acids Res"},{"issue":"18","key":"4986_CR30","doi-asserted-by":"publisher","first-page":"10096","DOI":"10.1073\/pnas.180265397","volume":"97","author":"HJ Bussemaker","year":"2000","unstructured":"Bussemaker HJ, Li H, Siggia ED: Building a dictionary for genomes: identification of presumptive regulatory sites by statistical analysis. Proc Natl Acad Sci USA 2000, 97(18):10096\u2013100.","journal-title":"Proc Natl Acad Sci USA"},{"issue":"1","key":"4986_CR31","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1038\/nbt.1518","volume":"27","author":"J Rozowsky","year":"2009","unstructured":"Rozowsky J, et al.: PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls. Nat Biotechnol 2009, 27(1):66\u201375. 
10.1038\/nbt.1518","journal-title":"Nat Biotechnol"},{"issue":"7","key":"4986_CR32","doi-asserted-by":"publisher","first-page":"1017","DOI":"10.1093\/bioinformatics\/btr064","volume":"27","author":"CE Grant","year":"2011","unstructured":"Grant CE, Bailey TL, Noble WS: FIMO: scanning for occurrences of a given motif. Bioinformatics 2011, 27(7):1017\u20138. 10.1093\/bioinformatics\/btr064","journal-title":"Bioinformatics"},{"issue":"51","key":"4986_CR33","doi-asserted-by":"publisher","first-page":"30264","DOI":"10.1074\/jbc.270.51.30264","volume":"270","author":"DC Look","year":"1995","unstructured":"Look DC, et al.: Stat1 depends on transcriptional synergy with Sp1. J Biol Chem 1995, 270(51):30264\u20137. 10.1074\/jbc.270.51.30264","journal-title":"J Biol Chem"},{"issue":"5","key":"4986_CR34","doi-asserted-by":"publisher","first-page":"e10868","DOI":"10.1371\/journal.pone.0010868","volume":"5","author":"R Panchanathan","year":"2010","unstructured":"Panchanathan R, et al.: Mutually positive regulatory feedback loop between interferons and estrogen receptor-alpha in mice: implications for sex bias in autoimmunity. PLoS One 2010, 5(5):e10868. 10.1371\/journal.pone.0010868","journal-title":"PLoS One"},{"issue":"1","key":"4986_CR35","doi-asserted-by":"publisher","first-page":"80","DOI":"10.1016\/j.stem.2008.11.011","volume":"4","author":"K Cui","year":"2009","unstructured":"Cui K, et al.: Chromatin signatures in multipotent human hematopoietic stem cells indicate the fate of bivalent genes during differentiation. Cell Stem Cell 2009, 4(1):80\u201393. 10.1016\/j.stem.2008.11.011","journal-title":"Cell Stem Cell"},{"issue":"11","key":"4986_CR36","doi-asserted-by":"publisher","first-page":"1293","DOI":"10.1038\/nbt.1505","volume":"26","author":"H Ji","year":"2008","unstructured":"Ji H, et al.: An integrated software system for analyzing ChIP-chip and ChIP-seq data. Nat Biotechnol 2008, 26(11):1293\u2013300. 
10.1038\/nbt.1505","journal-title":"Nat Biotechnol"},{"issue":"3","key":"4986_CR37","doi-asserted-by":"publisher","first-page":"610","DOI":"10.1016\/j.cell.2009.08.037","volume":"139","author":"S Hu","year":"2009","unstructured":"Hu S, et al.: Profiling the human protein-DNA interactome reveals ERK2 as a transcriptional repressor of interferon signaling. Cell 2009, 139(3):610\u201322. 10.1016\/j.cell.2009.08.037","journal-title":"Cell"},{"key":"4986_CR38","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1186\/1471-2105-10-48","volume":"10","author":"E Eden","year":"2009","unstructured":"Eden E, et al.: GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists. BMC Bioinformatics 2009, 10: 48. 10.1186\/1471-2105-10-48","journal-title":"BMC Bioinformatics"},{"key":"4986_CR39","volume-title":"Dictionary of distances","author":"E Deza","year":"2006","unstructured":"Deza E, Deza MM: Dictionary of distances. Elsevier; 2006."},{"key":"4986_CR40","volume-title":"Pattern Recognition","author":"S Theodoridis","year":"2009","unstructured":"Theodoridis S, Koutroumbas K: Pattern Recognition. Fourth edition. Academic Press; 2009.","edition":"Fourth"},{"issue":"1","key":"4986_CR41","doi-asserted-by":"publisher","first-page":"52","DOI":"10.1109\/TCOM.1967.1089532","volume":"15","author":"T Kailath","year":"1967","unstructured":"Kailath T: The Divergence and Bhattacharyya Distance Measures in Signal Selection. Communications, IEEE Transactions on [legacy, pre - 1988] 1967, 15(1):52\u201360.","journal-title":"Communications, IEEE Transactions on [legacy, pre - 1988]"},{"key":"4986_CR42","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198523963.001.0001","volume-title":"Applied Smoothing Techniques for Data Analysis","author":"AW Bowman","year":"1997","unstructured":"Bowman AW, Azzalini A: Applied Smoothing Techniques for Data Analysis. 
Oxford University Press; 1997."},{"issue":"10","key":"4986_CR43","doi-asserted-by":"publisher","first-page":"1235","DOI":"10.1038\/ng2117","volume":"39","author":"W Lee","year":"2007","unstructured":"Lee W, et al.: A high-resolution atlas of nucleosome occupancy in yeast. Nat Genet 2007, 39(10):1235\u201344. 10.1038\/ng2117","journal-title":"Nat Genet"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-12-408.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,17]],"date-time":"2024-04-17T05:25:10Z","timestamp":1713331510000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-12-408"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,10,21]]},"references-count":43,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,12]]}},"alternative-id":["4986"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-12-408","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,10,21]]},"assertion":[{"value":"26 July 2011","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 October 2011","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"21 October 2011","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"408"}}