{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:38:27Z","timestamp":1740184707353,"version":"3.37.3"},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"17","funder":[{"DOI":"10.13039\/100000051","name":"NHGRI","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000051","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Epigenetic data are invaluable when determining the regulatory programs governing a cell. Based on use of next-generation sequencing data for characterizing epigenetic marks and transcription factor binding, numerous peak-calling approaches have been developed to determine sites of genomic significance in these data. Such analyses can produce a large number of false positive predictions, suggesting that sites supported by multiple algorithms provide a stronger foundation for inferring and characterizing regulatory programs associated with the epigenetic data. Few methodologies integrate epigenetic based predictions of multiple approaches when combining profiles generated by different tools.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>The SigSeeker peak-calling ensemble uses multiple tools to identify peaks, and with user-defined thresholds for peak overlap and signal strength it retains only those peaks that are concordant across multiple tools. Peaks predicted to be co-localized by only a very small number of tools, discovered to be only marginally overlapping, or found to represent significant outliers to the approximation model are removed from the results, providing concise and high quality epigenetic datasets. SigSeeker has been validated using established benchmarks for transcription factor binding and histone modification ChIP-Seq data. These comparisons indicate that the quality of our ensemble technique exceeds that of single tool approaches, enhances existing peak-calling ensembles, and results in epigenetic profiles of higher confidence.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>http:\/\/sigseeker.org<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btx276","type":"journal-article","created":{"date-parts":[[2017,4,21]],"date-time":"2017-04-21T03:11:13Z","timestamp":1492744273000},"page":"2615-2621","source":"Crossref","is-referenced-by-count":6,"title":["SigSeeker: a peak-calling ensemble approach for constructing epigenetic signatures"],"prefix":"10.1093","volume":"33","author":[{"given":"Jens","family":"Lichtenberg","sequence":"first","affiliation":[{"name":"Genetics and Molecular Biology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA"}]},{"given":"Laura","family":"Elnitski","sequence":"additional","affiliation":[{"name":"Translational and Functional Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA"}]},{"given":"David M","family":"Bodine","sequence":"additional","affiliation":[{"name":"Genetics and Molecular Biology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA"}]}],"member":"286","published-online":{"date-parts":[[2017,4,25]]},"reference":[{"key":"2023020206281129700_btx276-B1","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1002\/jcb.22077","article-title":"Genomic location analysis by ChIP-Seq","volume":"107","author":"Barski","year":"2009","journal-title":"J. Cell. Biochem"},{"key":"2023020206281129700_btx276-B2","doi-asserted-by":"crossref","first-page":"396","DOI":"10.1038\/nature05913","article-title":"Perceptions of epigenetics","volume":"447","author":"Bird","year":"2007","journal-title":"Nature"},{"key":"2023020206281129700_btx276-B3","doi-asserted-by":"crossref","first-page":"609","DOI":"10.1038\/nmeth.1985","article-title":"Systematic evaluation of factors influencing ChIP-seq fidelity","volume":"9","author":"Chen","year":"2012","journal-title":"Nat. Methods"},{"key":"2023020206281129700_btx276-B4","doi-asserted-by":"crossref","first-page":"882","DOI":"10.1093\/bioinformatics\/btn012","article-title":"OutlierD: an R package for outlier detection using quantile regression on mass spectrometry data","volume":"24","author":"Cho","year":"2008","journal-title":"Bioinformatics"},{"key":"2023020206281129700_btx276-B5","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1038\/nature11247","article-title":"An integrated encyclopedia of DNA elements in the human genome","volume":"489","author":"Dunham","year":"2012","journal-title":"Nature"},{"volume-title":"Measurement Error Models","year":"2009","author":"Fuller","key":"2023020206281129700_btx276-B6"},{"key":"2023020206281129700_btx276-B7","doi-asserted-by":"crossref","first-page":"318","DOI":"10.1137\/1015032","article-title":"Some modified matrix eigenvalue problems","volume":"15","author":"Golub","year":"1973","journal-title":"Siam Rev"},{"key":"2023020206281129700_btx276-B8","doi-asserted-by":"crossref","first-page":"883","DOI":"10.1137\/0717073","article-title":"An analysis of the total least-squares problem","volume":"17","author":"Golub","year":"1980","journal-title":"Siam J. Numer. Anal"},{"key":"2023020206281129700_btx276-B9","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1093\/bioinformatics\/btw672","article-title":"Optimizing ChIP-seq peak detectors using visual labels and supervised machine learning","volume":"33","author":"Hocking","year":"2017","journal-title":"Bioinformatics"},{"key":"2023020206281129700_btx276-B10","doi-asserted-by":"crossref","first-page":"1407","DOI":"10.1101\/gr.132878.111","article-title":"Genome-wide DNA methylation profiles in hematopoietic stem and progenitor cells reveal over-representation of ETS transcription factor binding sites","volume":"22","author":"Hogart","year":"2012","journal-title":"Genome Res"},{"key":"2023020206281129700_btx276-B11","doi-asserted-by":"crossref","first-page":"280","DOI":"10.1186\/1471-2105-14-280","article-title":"Peak Finder Metaserver \u2013 a novel application for finding peaks in ChIP-seq data","volume":"14","author":"Kruczyk","year":"2013","journal-title":"BMC Bioinformatics"},{"key":"2023020206281129700_btx276-B12","doi-asserted-by":"crossref","first-page":"618","DOI":"10.1186\/1471-2164-10-618","article-title":"A practical comparison of methods for detecting transcription factor binding sites in ChIP-seq experiments","volume":"10","author":"Laajala","year":"2009","journal-title":"BMC Genomics"},{"key":"2023020206281129700_btx276-B13","doi-asserted-by":"crossref","first-page":"1813","DOI":"10.1101\/gr.136184.111","article-title":"ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia","volume":"22","author":"Landt","year":"2012","journal-title":"Genome Res"},{"key":"2023020206281129700_btx276-B14","doi-asserted-by":"crossref","first-page":"466","DOI":"10.1093\/bfgp\/elq022","article-title":"Processing and analyzing ChIP-seq data: from short reads to regulatory interactions","volume":"9","author":"Leleu","year":"2010","journal-title":"Brief. Funct. Genomics"},{"key":"2023020206281129700_btx276-B15","doi-asserted-by":"crossref","first-page":"1752","DOI":"10.1214\/11-AOAS466","article-title":"Measuring reproducibility of high-throughput experiments","volume":"5","author":"Li","year":"2011","journal-title":"Ann. Appl. Stat"},{"year":"2015","author":"Liu","key":"2023020206281129700_btx276-B16"},{"key":"2023020206281129700_btx276-B17","doi-asserted-by":"crossref","first-page":"e25260","DOI":"10.1371\/journal.pone.0025260","article-title":"Comparison of four ChIP-Seq analytical algorithms using rice endosperm H3K27 trimethylation profiling data","volume":"6","author":"Malone","year":"2011","journal-title":"PLoS One"},{"key":"2023020206281129700_btx276-B18","doi-asserted-by":"crossref","first-page":"2283","DOI":"10.1016\/j.sigpro.2007.04.004","article-title":"Overview of total least-squares methods","volume":"87","author":"Markovsky","year":"2007","journal-title":"Signal Process"},{"key":"2023020206281129700_btx276-B19","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1038\/nbt.1630","article-title":"GREAT improves functional interpretation of cis-regulatory regions","volume":"28","author":"McLean","year":"2010","journal-title":"Nat. Biotechnol"},{"key":"2023020206281129700_btx276-B20","doi-asserted-by":"crossref","first-page":"e70-e70","DOI":"10.1093\/nar\/gks048","article-title":"Picking ChIP-seq peak detectors for analyzing chromatin modification experiments","volume":"40","author":"Micsinai","year":"2012","journal-title":"Nucleic Acids Res"},{"key":"2023020206281129700_btx276-B21","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1038\/nature06008","article-title":"Genome-wide maps of chromatin state in pluripotent and lineage-committed cells","volume":"448","author":"Mikkelsen","year":"2007","journal-title":"Nature"},{"key":"2023020206281129700_btx276-B22","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1038\/nmeth.1226","article-title":"Mapping and quantifying mammalian transcriptomes by RNA-Seq","volume":"5","author":"Mortazavi","year":"2008","journal-title":"Nat. Methods"},{"key":"2023020206281129700_btx276-B23","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1186\/1756-0500-6-133","article-title":"The PinkThing for analysing ChIP profiling data in their genomic context","volume":"6","author":"Nielsen","year":"2013","journal-title":"BMC Res. Notes"},{"key":"2023020206281129700_btx276-B24","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1038\/nrg2641","article-title":"ChIP\u2013seq: advantages and challenges of a maturing technology","volume":"10","author":"Park","year":"2009","journal-title":"Nat. Rev. Genet"},{"key":"2023020206281129700_btx276-B25","doi-asserted-by":"crossref","first-page":"S22","DOI":"10.1038\/nmeth.1371","article-title":"Computation for ChIP-seq and RNA-seq studies","volume":"6","author":"Pepke","year":"2009","journal-title":"Nat. Methods"},{"key":"2023020206281129700_btx276-B26","first-page":"37","article-title":"Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation","volume-title":"J. Mach. Learn. Tech","author":"Powers","year":"2011"},{"key":"2023020206281129700_btx276-B27","doi-asserted-by":"crossref","first-page":"11.12.1","DOI":"10.1002\/0471250953.bi1112s47","article-title":"BEDTools: The Swiss-Army Tool for genome feature analysis","volume":"47","author":"Quinlan","year":"2014","journal-title":"Curr. Protoc. Bioinformatics"},{"key":"2023020206281129700_btx276-B28","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1093\/bioinformatics\/btq033","article-title":"BEDTools: a flexible suite of utilities for comparing genomic features","volume":"26","author":"Quinlan","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020206281129700_btx276-B29","doi-asserted-by":"crossref","first-page":"395","DOI":"10.1038\/nrg1834","article-title":"Inherited epigenetic variation\u2013revisiting soft inheritance","volume":"7","author":"Richards","year":"2006","journal-title":"Nat. Rev. Genet"},{"key":"2023020206281129700_btx276-B30","doi-asserted-by":"crossref","first-page":"e25","DOI":"10.1093\/nar\/gkq1187","article-title":"A manually curated ChIP-seq benchmark demonstrates room for improvement in current peak-finder programs","volume":"39","author":"Rye","year":"2011","journal-title":"Nucleic Acids Res"},{"key":"2023020206281129700_btx276-B31","doi-asserted-by":"crossref","first-page":"590","DOI":"10.1101\/gr.086983.108","article-title":"Genomic distribution of CHD7 on chromatin tracks H3K4 methylation patterns","volume":"19","author":"Schnetz","year":"2009","journal-title":"Genome Res"},{"key":"2023020206281129700_btx276-B32","doi-asserted-by":"crossref","first-page":"144.","DOI":"10.1186\/s12859-016-0991-z","article-title":"Detecting broad domains and narrow peaks in ChIP-seq data with hiddenDomains","volume":"17","author":"Starmer","year":"2016","journal-title":"BMC Bioinformatics"},{"key":"2023020206281129700_btx276-B33","first-page":"953","article-title":"A comprehensive comparison of tools for differential ChIP-seq analysis","volume":"17","author":"Steinhauser","year":"2016","journal-title":"Brief. Bioinf"},{"key":"2023020206281129700_btx276-B34","doi-asserted-by":"crossref","first-page":"e11471.","DOI":"10.1371\/journal.pone.0011471","article-title":"Evaluation of algorithm performance in ChIP-seq peak detection","volume":"5","author":"Wilbanks","year":"2010","journal-title":"PLoS One"},{"year":"2010","author":"Wilder","key":"2023020206281129700_btx276-B35"},{"key":"2023020206281129700_btx276-B36","doi-asserted-by":"crossref","first-page":"1199","DOI":"10.1093\/bioinformatics\/btq128","article-title":"A signal-noise model for significance analysis of ChIP-seq with negative control","volume":"26","author":"Xu","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020206281129700_btx276-B37","doi-asserted-by":"crossref","first-page":"2382","DOI":"10.1093\/bioinformatics\/btv145","article-title":"ChIPseeker: an R\/Bioconductor package for ChIP peak annotation, comparison and visualization","volume":"31","author":"Yu","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020206281129700_btx276-B38","doi-asserted-by":"crossref","first-page":"355","DOI":"10.1038\/nature13992","article-title":"A comparative encyclopedia of DNA elements in the mouse genome","volume":"515","author":"Yue","year":"2014","journal-title":"Nature"},{"key":"2023020206281129700_btx276-B39","doi-asserted-by":"crossref","first-page":"1952","DOI":"10.1093\/bioinformatics\/btp340","article-title":"A clustering approach for identification of enriched domains from histone modification ChIP-Seq data","volume":"25","author":"Zang","year":"2009","journal-title":"Bioinformatics"},{"key":"2023020206281129700_btx276-B40","doi-asserted-by":"crossref","first-page":"R137","DOI":"10.1186\/gb-2008-9-9-r137","article-title":"Model-based analysis of ChIP-Seq (MACS)","volume":"9","author":"Zhang","year":"2008","journal-title":"Genome Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/17\/2615\/49041155\/bioinformatics_33_17_2615.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/17\/2615\/49041155\/bioinformatics_33_17_2615.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T06:30:23Z","timestamp":1675319423000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/17\/2615\/3760101"}},"subtitle":[],"editor":[{"given":"John","family":"Hancock","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2017,4,25]]},"references-count":40,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2017,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx276","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2017,9,1]]},"published":{"date-parts":[[2017,4,25]]}}}