{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T10:26:43Z","timestamp":1776680803906,"version":"3.51.2"},"reference-count":38,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2016,4,22]],"date-time":"2016-04-22T00:00:00Z","timestamp":1461283200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2016,4,22]],"date-time":"2016-04-22T00:00:00Z","timestamp":1461283200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000288","name":"Royal Society","doi-asserted-by":"publisher","award":["NAF 164914"],"award-info":[{"award-number":["NAF 164914"]}],"id":[{"id":"10.13039\/501100000288","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004963","name":"Seventh Framework Programme","doi-asserted-by":"publisher","award":["305428"],"award-info":[{"award-number":["305428"]}],"id":[{"id":"10.13039\/501100004963","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>There is growing evidence that DNA methylation alterations may contribute to carcinogenesis. Recent data also suggest that DNA methylation field defects in normal pre-neoplastic tissue represent infrequent stochastic \u201coutlier\u201d events. This presents a statistical challenge for standard feature selection algorithms, which assume frequent alterations in a disease phenotype. Although differential variability has emerged as a novel feature selection paradigm for the discovery of outliers, a growing concern is that these could result from technical confounders, in principle thus favouring algorithms which are robust to outliers.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Here we evaluate five differential variability algorithms in over 700 DNA methylomes, including two of the largest cohorts profiling precursor cancer lesions, and demonstrate that most of the novel proposed algorithms lack the sensitivity to detect epigenetic field defects at genome-wide significance. In contrast, algorithms which recognise heterogeneous outlier DNA methylation patterns are able to identify many sites in pre-neoplastic lesions, which display progression in invasive cancer. Thus, we show that many DNA methylation outliers are not technical artefacts, but define epigenetic field defects which are selected for during cancer progression.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>Given that cancer studies aiming to find epigenetic field defects are likely to be limited by sample size, adopting the novel feature selection paradigm advocated here will be critical to increase assay sensitivity.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12859-016-1056-z","type":"journal-article","created":{"date-parts":[[2016,4,21]],"date-time":"2016-04-21T23:55:30Z","timestamp":1461282930000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":46,"title":["Stochastic epigenetic outliers can define field defects in cancer"],"prefix":"10.1186","volume":"17","author":[{"given":"Andrew E.","family":"Teschendorff","sequence":"first","affiliation":[]},{"given":"Allison","family":"Jones","sequence":"additional","affiliation":[]},{"given":"Martin","family":"Widschwendter","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2016,4,22]]},"reference":[{"key":"1056_CR1","doi-asserted-by":"publisher","first-page":"747","DOI":"10.1038\/35021093","volume":"406","author":"CM Perou","year":"2000","unstructured":"Perou CM, Sorlie T, Eisen MB, van de Rijn M, Jeffrey SS, Rees CA, Pollack JR, Ross DT, Johnsen H, Akslen LA, et al. Molecular portraits of human breast tumours. Nature. 2000;406:747\u201352.","journal-title":"Nature"},{"key":"1056_CR2","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1101\/sqb.1999.64.71","volume":"64","author":"A Alizadeh","year":"1999","unstructured":"Alizadeh A, Eisen M, Davis RE, Ma C, Sabet H, Tran T, Powell JI, Yang L, Marti GE, Moore DT, et al. The lymphochip: a specialized cDNA microarray for the genomic-scale analysis of gene expression in normal and malignant lymphocytes. Cold Spring Harb Symp Quant Biol. 1999;64:71\u20138.","journal-title":"Cold Spring Harb Symp Quant Biol"},{"key":"1056_CR3","doi-asserted-by":"publisher","first-page":"41","DOI":"10.1038\/14385","volume":"23","author":"JR Pollack","year":"1999","unstructured":"Pollack JR, Perou CM, Alizadeh AA, Eisen MB, Pergamenschikov A, Williams CF, Jeffrey SS, Botstein D, Brown PO. Genome-wide analysis of DNA copy-number changes using cDNA microarrays. Nat Genet. 1999;23:41\u20136.","journal-title":"Nat Genet"},{"key":"1056_CR4","doi-asserted-by":"publisher","first-page":"5116","DOI":"10.1073\/pnas.091062498","volume":"98","author":"VG Tusher","year":"2001","unstructured":"Tusher VG, Tibshirani R, Chu G. Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci U S A. 2001;98:5116\u201321.","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1056_CR5","doi-asserted-by":"crossref","first-page":"Article3","DOI":"10.2202\/1544-6115.1027","volume":"3","author":"GK Smyth","year":"2004","unstructured":"Smyth GK. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3:Article3.","journal-title":"Stat Appl Genet Mol Biol"},{"key":"1056_CR6","doi-asserted-by":"publisher","first-page":"3705","DOI":"10.1093\/bioinformatics\/bth449","volume":"20","author":"JM Wettenhall","year":"2004","unstructured":"Wettenhall JM, Smyth GK. limmaGUI: a graphical user interface for linear modeling of microarray data. Bioinformatics. 2004;20:3705\u20136.","journal-title":"Bioinformatics"},{"key":"1056_CR7","doi-asserted-by":"publisher","first-page":"80","DOI":"10.2307\/3001968","volume":"1","author":"F Wilcoxon","year":"1945","unstructured":"Wilcoxon F. Individual comparisons by ranking methods. Biom Bull. 1945;1:80\u20133.","journal-title":"Biom Bull"},{"key":"1056_CR8","doi-asserted-by":"crossref","unstructured":"Feinberg AP. Epigenetic stochasticity, nuclear structure and cancer: the implications for medicine. J Intern Med. 2014;276(1):5-11.","DOI":"10.1111\/joim.12224"},{"issue":"Suppl 1","key":"1056_CR9","doi-asserted-by":"publisher","first-page":"1757","DOI":"10.1073\/pnas.0906183107","volume":"107","author":"AP Feinberg","year":"2010","unstructured":"Feinberg AP, Irizarry RA. Evolution in health and medicine Sackler colloquium: Stochastic epigenetic variation as a driving force of development, evolutionary adaptation, and disease. Proc Natl Acad Sci U S A. 2010;107 Suppl 1:1757\u201364.","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1056_CR10","doi-asserted-by":"publisher","first-page":"768","DOI":"10.1038\/ng.865","volume":"43","author":"KD Hansen","year":"2011","unstructured":"Hansen KD, Timp W, Bravo HC, Sabunciyan S, Langmead B, McDonald OG, Wen B, Wu H, Liu Y, Diep D, et al. Increased methylation variation in epigenetic domains across cancer types. Nat Genet. 2011;43:768\u2013U777.","journal-title":"Nat Genet"},{"key":"1056_CR11","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1038\/nrg1748","volume":"7","author":"AP Feinberg","year":"2006","unstructured":"Feinberg AP, Ohlsson R, Henikoff S. The epigenetic progenitor origin of human cancer. Nat Rev Genet. 2006;7:21\u201333.","journal-title":"Nat Rev Genet"},{"key":"1056_CR12","doi-asserted-by":"publisher","first-page":"724","DOI":"10.1038\/ng.897","volume":"43","author":"JP Issa","year":"2011","unstructured":"Issa JP. Epigenetic variation and cellular Darwinism. Nat Genet. 2011;43:724\u20136.","journal-title":"Nat Genet"},{"key":"1056_CR13","doi-asserted-by":"publisher","first-page":"24","DOI":"10.1186\/gm323","volume":"4","author":"AE Teschendorff","year":"2012","unstructured":"Teschendorff AE, Jones A, Fiegl H, Sargent A, Zhuang JJ, Kitchener HC, Widschwendter M. Epigenetic variability in cells of normal cytology is associated with the risk of future morphological transformation. Genome Med. 2012;4:24.","journal-title":"Genome Med"},{"key":"1056_CR14","doi-asserted-by":"publisher","first-page":"1487","DOI":"10.1093\/bioinformatics\/bts170","volume":"28","author":"AE Teschendorff","year":"2012","unstructured":"Teschendorff AE, Widschwendter M. Differential variability improves the identification of cancer risk markers in DNA methylation studies profiling precursor cancer lesions. Bioinformatics. 2012;28:1487\u201394.","journal-title":"Bioinformatics"},{"key":"1056_CR15","doi-asserted-by":"publisher","first-page":"e1003709","DOI":"10.1371\/journal.pcbi.1003709","volume":"10","author":"AE Teschendorff","year":"2014","unstructured":"Teschendorff AE, Liu X, Caren H, Pollard SM, Beck S, Widschwendter M, Chen L. The dynamics of DNA methylation covariation patterns in carcinogenesis. PLoS Comput Biol. 2014;10:e1003709.","journal-title":"PLoS Comput Biol"},{"key":"1056_CR16","doi-asserted-by":"publisher","first-page":"402","DOI":"10.1038\/ng0406-402","volume":"38","author":"D Shibata","year":"2006","unstructured":"Shibata D. Clonal diversity in tumor progression. Nat Genet. 2006;38:402\u20133.","journal-title":"Nat Genet"},{"key":"1056_CR17","doi-asserted-by":"publisher","first-page":"43","DOI":"10.4251\/wjgo.v5.i3.43","volume":"5","author":"C Bernstein","year":"2013","unstructured":"Bernstein C, Nfonsam V, Prasad AR, Bernstein H. Epigenetic field defects in progression to cancer. World J Gastrointest Oncol. 2013;5:43\u20139.","journal-title":"World J Gastrointest Oncol"},{"key":"1056_CR18","doi-asserted-by":"publisher","first-page":"342","DOI":"10.1038\/onc.2011.241","volume":"31","author":"M Katsurano","year":"2012","unstructured":"Katsurano M, Niwa T, Yasui Y, Shigematsu Y, Yamashita S, Takeshima H, Lee MS, Kim YJ, Tanaka T, Ushijima T. Early-stage formation of an epigenetic field defect in a mouse colitis model, and non-essential roles of T- and B-cells in DNA methylation induction. Oncogene. 2012;31:342\u201351.","journal-title":"Oncogene"},{"key":"1056_CR19","doi-asserted-by":"publisher","first-page":"465","DOI":"10.1186\/s13059-014-0465-4","volume":"15","author":"B Phipson","year":"2014","unstructured":"Phipson B, Oshlack A. DiffVar: a new method for detecting differential variability with application to methylation in cancer and aging. Genome Biol. 2014;15:465.","journal-title":"Genome Biol"},{"key":"1056_CR20","doi-asserted-by":"crossref","unstructured":"Ahn S, Wang T. A powerful statistical method for identifying differentially methylated markers in complex diseases. Pac Symp Biocomput. 2013;69\u201379.","DOI":"10.1142\/9789814447973_0008"},{"key":"1056_CR21","doi-asserted-by":"publisher","first-page":"232","DOI":"10.1186\/1471-2105-15-232","volume":"15","author":"S Wahl","year":"2014","unstructured":"Wahl S, Fenske N, Zeilinger S, Suhre K, Gieger C, Waldenberger M, Grallert H, Schmid M. On the potential of models for location and scale for genome-wide DNA methylation data. BMC Bioinformatics. 2014;15:232.","journal-title":"BMC Bioinformatics"},{"key":"1056_CR22","doi-asserted-by":"publisher","first-page":"10478","DOI":"10.1038\/ncomms10478","volume":"7","author":"AE Teschendorff","year":"2016","unstructured":"Teschendorff AE, Gao Y, Jones A, Ruebner M, Beckmann MW, Wachter DL, Fasching PA, Widschwendter M. DNA methylation outliers in normal breast tissue identify field defects that are enriched in cancer. Nat Commun. 2016;7:10478.","journal-title":"Nat Commun"},{"key":"1056_CR23","doi-asserted-by":"publisher","first-page":"1363","DOI":"10.1093\/bioinformatics\/btu049","volume":"30","author":"MJ Aryee","year":"2014","unstructured":"Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta C, Feinberg AP, Hansen KD, Irizarry RA. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics. 2014;30:1363\u20139.","journal-title":"Bioinformatics"},{"key":"1056_CR24","doi-asserted-by":"publisher","first-page":"189","DOI":"10.1093\/bioinformatics\/bts680","volume":"29","author":"AE Teschendorff","year":"2013","unstructured":"Teschendorff AE, Marabita F, Lechner M, Bartlett T, Tegner J, Gomez-Cabrero D, Beck S. A beta-mixture quantile normalization method for correcting probe design bias in Illumina Infinium 450\u00a0k DNA methylation data. Bioinformatics. 2013;29:189\u201396.","journal-title":"Bioinformatics"},{"key":"1056_CR25","volume-title":"Statistical methods","author":"GW Snedecor","year":"1989","unstructured":"Snedecor GW, Cochran WG. Statistical methods. 1989."},{"key":"1056_CR26","doi-asserted-by":"crossref","unstructured":"Xu X, Su S, Barnes VA, De Miguel C, Pollock J, Ownby D, Shi H, Zhu H, Snieder H, Wang X. A genome-wide methylation study on obesity: Differential variability and differential methylation. Epigenetics. 2013;8(5):522-33.","DOI":"10.4161\/epi.24506"},{"key":"1056_CR27","doi-asserted-by":"publisher","first-page":"9440","DOI":"10.1073\/pnas.1530509100","volume":"100","author":"JD Storey","year":"2003","unstructured":"Storey JD, Tibshirani R. Statistical significance for genomewide studies. Proc Natl Acad Sci U S A. 2003;100:9440\u20135.","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1056_CR28","doi-asserted-by":"publisher","first-page":"e8274","DOI":"10.1371\/journal.pone.0008274","volume":"4","author":"AE Teschendorff","year":"2009","unstructured":"Teschendorff AE, Menon U, Gentry-Maharaj A, Ramus SJ, Gayther SA, Apostolidou S, Jones A, Lechner M, Beck S, Jacobs IJ, Widschwendter M. An epigenetic signature in peripheral blood predicts active ovarian cancer. PLoS One. 2009;4:e8274.","journal-title":"PLoS One"},{"key":"1056_CR29","doi-asserted-by":"publisher","first-page":"726","DOI":"10.1002\/emmm.201100801","volume":"3","author":"S Dedeurwaerder","year":"2011","unstructured":"Dedeurwaerder S, Desmedt C, Calonne E, Singhal SK, Haibe-Kains B, Defrance M, Michiels S, Volkmar M, Deplus R, Luciani J, et al. DNA methylation profiling reveals a predominant immune component in breast cancers. EMBO Mol Med. 2011;3:726\u201341.","journal-title":"EMBO Mol Med"},{"key":"1056_CR30","doi-asserted-by":"publisher","first-page":"R157","DOI":"10.1186\/gb-2007-8-8-r157","volume":"8","author":"AE Teschendorff","year":"2007","unstructured":"Teschendorff AE, Miremadi A, Pinder SE, Ellis IO, Caldas C. An immune response gene expression module identifies a good prognosis subtype in estrogen receptor negative breast cancer. Genome Biol. 2007;8:R157.","journal-title":"Genome Biol"},{"key":"1056_CR31","doi-asserted-by":"publisher","first-page":"15545","DOI":"10.1073\/pnas.0506580102","volume":"102","author":"A Subramanian","year":"2005","unstructured":"Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102:15545\u201350.","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1056_CR32","doi-asserted-by":"publisher","first-page":"e1004996","DOI":"10.1371\/journal.pgen.1004996","volume":"11","author":"T Yuan","year":"2015","unstructured":"Yuan T, Jiao Y, de Jong S, Ophoff RA, Beck S, Teschendorff AE. An integrative multi-scale analysis of the dynamic DNA methylation landscape in aging. PLoS Genet. 2015;11:e1004996.","journal-title":"PLoS Genet"},{"key":"1056_CR33","doi-asserted-by":"publisher","first-page":"203","DOI":"10.4161\/epi.23470","volume":"8","author":"YA Chen","year":"2013","unstructured":"Chen YA, Lemire M, Choufani S, Butcher DT, Grafodatskaya D, Zanke BW, Gallinger S, Hudson TJ, Weksberg R. Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray. Epigenetics. 2013;8:203\u20139.","journal-title":"Epigenetics"},{"key":"1056_CR34","doi-asserted-by":"publisher","first-page":"1496","DOI":"10.1093\/bioinformatics\/btr171","volume":"27","author":"AE Teschendorff","year":"2011","unstructured":"Teschendorff AE, Zhuang J, Widschwendter M. Independent surrogate variable analysis to deconvolve confounding factors in large-scale microarray profiling studies. Bioinformatics. 2011;27:1496\u2013505.","journal-title":"Bioinformatics"},{"key":"1056_CR35","doi-asserted-by":"publisher","first-page":"18909","DOI":"10.1038\/srep18909","volume":"6","author":"N Wang","year":"2016","unstructured":"Wang N, Hoffman EP, Chen L, Chen L, Zhang Z, Liu C, Yu G, Herrington DM, Clarke R, Wang Y. Mathematical modelling of transcriptional heterogeneity identifies novel markers and subpopulations in complex tissues. Sci Rep. 2016;6:18909.","journal-title":"Sci Rep"},{"key":"1056_CR36","doi-asserted-by":"publisher","first-page":"859","DOI":"10.1172\/JCI70941","volume":"124","author":"S Santagata","year":"2014","unstructured":"Santagata S, Thakkar A, Ergonul A, Wang B, Woo T, Hu R, Harrell JC, McNamara G, Schwede M, Culhane AC, et al. Taxonomy of breast cancer based on normal cell phenotype predicts outcome. J Clin Invest. 2014;124:859\u201370.","journal-title":"J Clin Invest"},{"key":"1056_CR37","doi-asserted-by":"publisher","first-page":"1385","DOI":"10.1586\/14737140.2014.956096","volume":"14","author":"S Santagata","year":"2014","unstructured":"Santagata S, Ince TA. Normal cell phenotypes of breast epithelial cells provide the foundation of a breast cancer taxonomy. Expert Rev Anticancer Ther. 2014;14:1385\u20139.","journal-title":"Expert Rev Anticancer Ther"},{"key":"1056_CR38","doi-asserted-by":"publisher","first-page":"529","DOI":"10.1038\/nrg3000","volume":"12","author":"VK Rakyan","year":"2011","unstructured":"Rakyan VK, Down TA, Balding DJ, Beck S. Epigenome-wide association studies for common human diseases. Nat Rev Genet. 2011;12:529\u201341.","journal-title":"Nat Rev Genet"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1056-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-016-1056-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1056-z","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-016-1056-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,2]],"date-time":"2025-06-02T21:06:09Z","timestamp":1748898369000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-016-1056-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,4,22]]},"references-count":38,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2016,12]]}},"alternative-id":["1056"],"URL":"https:\/\/doi.org\/10.1186\/s12859-016-1056-z","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2016,4,22]]},"assertion":[{"value":"19 December 2015","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 April 2016","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 April 2016","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"178"}}