{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,26]],"date-time":"2025-02-26T05:30:44Z","timestamp":1740547844369,"version":"3.38.0"},"reference-count":19,"publisher":"Oxford University Press (OUP)","issue":"18","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":2220,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Gene regulation commonly involves interaction among DNA, proteins and biochemical conditions. Using chromatin immunoprecipitation (ChIP) technologies, protein\u2013DNA interactions are routinely detected in the genome scale. Computational methods that detect weak protein-binding signals and simultaneously maintain a high specificity yet remain to be challenging. An attractive approach is to incorporate biologically relevant data, such as protein co-occupancy, to improve the power of protein-binding detection. We call the additional data related with the target protein binding as supporting tracks.<\/jats:p><jats:p>Results: We propose a novel but rigorous statistical method to identify protein occupancy in ChIP data using multiple supporting tracks (PASS2). We demonstrate that utilizing biologically related information can significantly increase the discovery of true protein-binding sites, while still maintaining a desired level of false positive calls. Applying the method to GATA1 restoration in mouse erythroid cell line, we detected many new GATA1-binding sites using GATA1 co-occupancy data.<\/jats:p><jats:p>Availability: \u00a0http:\/\/stat.psu.edu\/\u223cyuzhang\/pass2.tar<\/jats:p><jats:p>Contact: \u00a0yuzhang@stat.psu.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq379","type":"journal-article","created":{"date-parts":[[2010,9,7]],"date-time":"2010-09-07T17:41:46Z","timestamp":1283881306000},"page":"i504-i510","source":"Crossref","is-referenced-by-count":7,"title":["A varying threshold method for ChIP peak-calling using multiple sources of information"],"prefix":"10.1093","volume":"26","author":[{"given":"Kuan-Bei","family":"Chen","sequence":"first","affiliation":[{"name":"1 Department of Computer Science and Engineering, The Pennsylvania State University, University Park and 2Department of Statistics, The Pennsylvania State University, 422A Thomas, University Park, PA 16802, USA"}]},{"given":"Yu","family":"Zhang","sequence":"additional","affiliation":[{"name":"1 Department of Computer Science and Engineering, The Pennsylvania State University, University Park and 2Department of Statistics, The Pennsylvania State University, 422A Thomas, University Park, PA 16802, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,9,4]]},"reference":[{"key":"2023012508284432300_B1","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1007\/PL00011680","article-title":"Multivariate discretization for set mining","volume":"3","author":"Bay","year":"2001","journal-title":"Knowl. Inf. Syst."},{"key":"2023012508284432300_B2","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. Roy. Stat. Soc. Ser. B"},{"key":"2023012508284432300_B3","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1101\/gr.4887606","article-title":"Unbiased locationanalysis of E2F1-binding sites suggests a widespread role for E2F1 in the human genome","volume":"16","author":"Bieda","year":"2006","journal-title":"Genome Res."},{"key":"2023012508284432300_B4","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1016\/j.cell.2007.12.014","article-title":"High-resolution mapping and characterization of open chromatin across the genome","volume":"132","author":"Boyle","year":"2008","journal-title":"Cell"},{"key":"2023012508284432300_B5","doi-asserted-by":"crossref","first-page":"2537","DOI":"10.1093\/bioinformatics\/btn480","article-title":"F-Seq: a feature density estimator for high-throughput sequence tags","volume":"24","author":"Boyle","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508284432300_B6","doi-asserted-by":"crossref","first-page":"1896","DOI":"10.1101\/gr.083089.108","article-title":"Transcriptional enhancement by GATA1-occupied DNA segments is strongly associated with evolutionary constraint on the binding site motif","volume":"18","author":"Cheng","year":"2008","journal-title":"Genome Res."},{"key":"2023012508284432300_B7","doi-asserted-by":"crossref","first-page":"2172","DOI":"10.1101\/gr.098921.109","article-title":"Erythroid GATA1 function revealed by genome-wide analysis of transcription factor occupancy, histone modifications, and mRNA expression","volume":"19","author":"Cheng","year":"2009","journal-title":"Genome Res."},{"key":"2023012508284432300_B8","doi-asserted-by":"crossref","DOI":"10.1201\/b14832","volume-title":"Theoretical Statistics.","author":"Cox","year":"1979"},{"key":"2023012508284432300_B9","doi-asserted-by":"crossref","first-page":"545","DOI":"10.1093\/bioinformatics\/btm523","article-title":"Statistical methods to infer cooperative binding among transcription factors in Saccharomyces cerevisiae","volume":"24","author":"Datta","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508284432300_B10","doi-asserted-by":"crossref","first-page":"1424","DOI":"10.1093\/bioinformatics\/btm096","article-title":"Unsupervised segmentation of continuous genomic data","volume":"23","author":"Day","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012508284432300_B11","doi-asserted-by":"crossref","first-page":"3016","DOI":"10.1093\/bioinformatics\/btl515","article-title":"A supervised hidden markov model framework for efficiently segmenting tiling array data in transcriptional and chIP-chip experiments: systematically incorporating validated biological knowledge","volume":"22","author":"Du","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012508284432300_B12","first-page":"1022","article-title":"Multi-Interval discretization of continuous-valued attributes for classification learning","volume-title":"Proceedings of the 13th International Conference on Artificial Intelligence","author":"Fayyad","year":"1993"},{"key":"2023012508284432300_B13","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1038\/ng1966","article-title":"Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome","volume":"39","author":"Heintzman","year":"2007","journal-title":"Nat. Genet."},{"key":"2023012508284432300_B14","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1016\/S0092-8674(02)00976-5","article-title":"Histone methyltransferase activity of a Drosophila Polycomb group repressor complex","volume":"111","author":"Muller","year":"2002","journal-title":"Cell"},{"key":"2023012508284432300_B15","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1093\/nar\/gkl842","article-title":"NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins","volume":"35","author":"Pruitt","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012508284432300_B16","doi-asserted-by":"crossref","first-page":"3145","DOI":"10.1093\/emboj\/16.11.3145","article-title":"The LIM-only protein Lmo2 is a bridging molecule assembling an erythroid, DNA-binding complex which includes the TAL1, E47, GATA-1 and Ldb1\/NLI proteins","volume":"16","author":"Wadman","year":"1997","journal-title":"EMBO J."},{"key":"2023012508284432300_B17","doi-asserted-by":"crossref","first-page":"2825","DOI":"10.1093\/bioinformatics\/btn549","article-title":"Poisson approximation for significance in genome-wide ChIP-chip tiling arrays","volume":"24","author":"Zhang","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012508284432300_B18","doi-asserted-by":"crossref","first-page":"7024","DOI":"10.1093\/nar\/gkp747","article-title":"Primary sequence and epigenetic determinants of in vivo occupancy of genomic DNA by GATA1","volume":"37","author":"Zhang","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012508284432300_B19","doi-asserted-by":"crossref","first-page":"787","DOI":"10.1111\/j.1541-0420.2007.00768.x","article-title":"ChIP-chip: data, model, and analysis","volume":"63","author":"Zheng","year":"2007","journal-title":"Biometrics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/18\/i504\/48859264\/bioinformatics_26_18_i504.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/18\/i504\/48859264\/bioinformatics_26_18_i504.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,25]],"date-time":"2025-02-25T16:38:32Z","timestamp":1740501512000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/18\/i504\/205834"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,9,4]]},"references-count":19,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2010,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq379","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2010,9,15]]},"published":{"date-parts":[[2010,9,4]]}}}