{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,13]],"date-time":"2025-11-13T02:01:08Z","timestamp":1762999268726,"version":"3.37.3"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"17","license":[{"start":{"date-parts":[[2017,5,4]],"date-time":"2017-05-04T00:00:00Z","timestamp":1493856000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/about_us\/legal\/notices"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61572327"],"award-info":[{"award-number":["61572327"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2017,9,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Tumor sample classification has long been an important task in cancer research. Classifying tumors into different subtypes greatly benefits therapeutic development and facilitates application of precision medicine on patients. In practice, solid tumor tissue samples obtained from clinical settings are always mixtures of cancer and normal cells. Thus, the data obtained from these samples are mixed signals. The \u2018tumor purity\u2019, or the percentage of cancer cells in cancer tissue sample, will bias the clustering results if not properly accounted for.<\/jats:p>\n               <jats:p>Results: In this article, we developed a model-based clustering method and an R function which uses DNA methylation microarray data to infer tumor subtypes with the consideration of tumor purity. Simulation studies and the analyses of The Cancer Genome Atlas data demonstrate improved results compared with existing methods.<\/jats:p>\n               <jats:p>Availability and implementation: InfiniumClust is part of R package InfiniumPurify, which is freely available from CRAN (https:\/\/cran.r-project.org\/web\/packages\/InfiniumPurify\/index.html).<\/jats:p>\n               <jats:p>Contact: \u00a0hao.wu@emory.edu or xqzheng@shnu.edu.cn<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btx303","type":"journal-article","created":{"date-parts":[[2017,5,3]],"date-time":"2017-05-03T11:08:18Z","timestamp":1493809698000},"page":"2651-2657","source":"Crossref","is-referenced-by-count":28,"title":["Accounting for tumor purity improves cancer subtype classification from DNA methylation data"],"prefix":"10.1093","volume":"33","author":[{"given":"Weiwei","family":"Zhang","sequence":"first","affiliation":[{"name":"1Department of Mathematics, Shanghai Normal University, Shanghai 200234, China"},{"name":"2School of Science, East China University of Technology, Nanchang, Jiangxi 330013, China"}]},{"given":"Hao","family":"Feng","sequence":"additional","affiliation":[{"name":"3Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA"}]},{"given":"Hao","family":"Wu","sequence":"additional","affiliation":[{"name":"3Department of Biostatistics and Bioinformatics, Rollins School of Public Health, Emory University, Atlanta, GA 30322, USA"}]},{"given":"Xiaoqi","family":"Zheng","sequence":"additional","affiliation":[{"name":"1Department of Mathematics, Shanghai Normal University, Shanghai 200234, China"}]}],"member":"286","published-online":{"date-parts":[[2017,5,4]]},"reference":[{"key":"2023020206272948200_btx303-B1","doi-asserted-by":"crossref","first-page":"1865","DOI":"10.1093\/bioinformatics\/btt301","article-title":"DeMix: deconvolution for mixed cancer transcriptomes using raw measured data","volume":"29","author":"Ahn","year":"2013","journal-title":"Bioinformatics"},{"key":"2023020206272948200_btx303-B2","doi-asserted-by":"crossref","first-page":"8971.","DOI":"10.1038\/ncomms9971","article-title":"Systematic pan-cancer analysis of tumour purity","volume":"6","author":"Aran","year":"2015","journal-title":"Nat. Commun"},{"key":"2023020206272948200_btx303-B3","doi-asserted-by":"crossref","first-page":"1056","DOI":"10.1093\/bioinformatics\/btt759","article-title":"AbsCN-seq: a statistical method to estimate tumor purity, ploidy and absolute copy numbers from next-generation sequencing data","volume":"30","author":"Bao","year":"2014","journal-title":"Bioinformatics"},{"key":"2023020206272948200_btx303-B4","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1101\/gad.947102","article-title":"DNA methylation patterns and epigenetic memory","volume":"16","author":"Bird","year":"2002","journal-title":"Genes Dev"},{"key":"2023020206272948200_btx303-B5","doi-asserted-by":"crossref","first-page":"4164","DOI":"10.1073\/pnas.0308531101","article-title":"Metagenes and molecular pattern discovery using matrix factorization","volume":"101","author":"Brunet","year":"2004","journal-title":"Proc. Natl Acad. Sci. U. S. A"},{"key":"2023020206272948200_btx303-B6","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1038\/nbt.2203","article-title":"Absolute quantification of somatic DNA alterations in human cancer","volume":"30","author":"Carter","year":"2012","journal-title":"Nat. Biotechnol"},{"key":"2023020206272948200_btx303-B7","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1038\/ng1038","article-title":"Molecular portraits and the family tree of cancer","volume":"32","author":"Chung","year":"2002","journal-title":"Nat. Genet"},{"key":"2023020206272948200_btx303-B8","doi-asserted-by":"crossref","first-page":"4632","DOI":"10.1200\/JCO.2004.07.151","article-title":"DNA methylation and cancer","volume":"22","author":"Das","year":"2004","journal-title":"J. Clin. Oncol"},{"key":"2023020206272948200_btx303-B9","doi-asserted-by":"crossref","first-page":"20110328.","DOI":"10.1098\/rstb.2011.0328","article-title":"DNA methylation dynamics during the mammalian life cycle","volume":"368","author":"Hackett","year":"2013","journal-title":"Philos. Trans. R. Soc. Lond. B Biol. Sci"},{"key":"2023020206272948200_btx303-B10","doi-asserted-by":"crossref","first-page":"768","DOI":"10.1038\/ng.865","article-title":"Increased methylation variation in epigenetic domains across cancer types","volume":"43","author":"Hansen","year":"2011","journal-title":"Nat. Genet"},{"key":"2023020206272948200_btx303-B11","doi-asserted-by":"crossref","first-page":"929","DOI":"10.1016\/j.cell.2014.06.049","article-title":"Multiplatform analysis of 12 cancer types reveals molecular classification within and across tissues of origin","volume":"158","author":"Hoadley","year":"2014","journal-title":"Cell"},{"key":"2023020206272948200_btx303-B12","doi-asserted-by":"crossref","first-page":"365.","DOI":"10.1186\/1471-2105-9-365","article-title":"Model-based clustering of DNA methylation array data: a recursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distributions","volume":"9","author":"Houseman","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023020206272948200_btx303-B14","doi-asserted-by":"crossref","first-page":"2849","DOI":"10.1093\/bioinformatics\/btq553","article-title":"A statistical framework for Illumina DNA methylation arrays","volume":"26","author":"Kuan","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020206272948200_btx303-B15","doi-asserted-by":"crossref","first-page":"633","DOI":"10.5306\/wjco.v5.i4.633","article-title":"Advances in adjuvant systemic therapy for non-small-cell lung cancer","volume":"5","author":"Leong","year":"2014","journal-title":"World J. Clin. Oncol"},{"key":"2023020206272948200_btx303-B16","doi-asserted-by":"crossref","first-page":"362","DOI":"10.1038\/366362a0","article-title":"Role for DNA methylation in genomic imprinting","volume":"366","author":"Li","year":"1993","journal-title":"Nature"},{"key":"2023020206272948200_btx303-B17","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1093\/biomet\/80.2.267","article-title":"Maximum likelihood estimation via the ECM algorithm: a general framework","volume":"80","author":"Meng","year":"1993","journal-title":"Biometrika"},{"key":"2023020206272948200_btx303-B18","doi-asserted-by":"crossref","first-page":"515","DOI":"10.1016\/j.ccr.2006.10.008","article-title":"A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes","volume":"10","author":"Neve","year":"2006","journal-title":"Cancer Cell"},{"key":"2023020206272948200_btx303-B19","doi-asserted-by":"crossref","first-page":"11.","DOI":"10.1186\/s13148-014-0039-z","article-title":"DNA methylation-based subtype prediction for pediatric acute lymphoblastic leukemia","volume":"7","author":"Nordlund","year":"2015","journal-title":"Clin. Epigenetics"},{"key":"2023020206272948200_btx303-B20","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1586\/erm.12.46","article-title":"How many molecular subtypes? Implications of the unique tumor principle in personalized medicine","volume":"12","author":"Ogino","year":"2012","journal-title":"Expert. Rev. Mol. Diagn"},{"key":"2023020206272948200_btx303-B21","doi-asserted-by":"crossref","first-page":"1446","DOI":"10.1093\/bioinformatics\/btw026","article-title":"Differential methylation analysis for BS-seq data under general experimental design","volume":"32","author":"Park","year":"2016","journal-title":"Bioinformatics"},{"key":"2023020206272948200_btx303-B22","doi-asserted-by":"crossref","first-page":"1160","DOI":"10.1200\/JCO.2008.18.1370","article-title":"Supervised risk predictor of breast cancer based on intrinsic subtypes","volume":"27","author":"Parker","year":"2009","journal-title":"J. Clin. Oncol"},{"key":"2023020206272948200_btx303-B23","doi-asserted-by":"crossref","first-page":"555","DOI":"10.1016\/j.molonc.2014.10.012","article-title":"A DNA methylation-based definition of biologically distinct breast cancer subtypes","volume":"9","author":"Stefansson","year":"2015","journal-title":"Mol. Oncol"},{"key":"2023020206272948200_btx303-B24","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1016\/j.ccr.2009.12.020","article-title":"Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1","volume":"17","author":"Verhaak","year":"2010","journal-title":"Cancer Cell"},{"key":"2023020206272948200_btx303-B25","first-page":"291","article-title":"Hierarchical clustering of lung cancer cell lines using DNA methylation markers","volume":"11","author":"Virmani","year":"2002","journal-title":"Cancer Epidemiol. Biomarkers Prev"},{"key":"2023020206272948200_btx303-B32","first-page":"408","article-title":"Tumor purity and differential methylation in cancer epigenomics","volume":"15","author":"Wang","year":"2016","journal-title":"Brief Funct Genomics"},{"key":"2023020206272948200_btx303-B26","doi-asserted-by":"crossref","first-page":"1033","DOI":"10.1038\/nmeth.3583","article-title":"Comparing the performance of biomedical clustering methods","volume":"12","author":"Wiwie","year":"2015","journal-title":"Nat. Methods"},{"key":"2023020206272948200_btx303-B27","doi-asserted-by":"crossref","first-page":"2612.","DOI":"10.1038\/ncomms3612","article-title":"Inferring tumour purity and stromal and immune cell admixture from expression data","volume":"4","author":"Yoshihara","year":"2013","journal-title":"Nat. Commun"},{"key":"2023020206272948200_btx303-B28","doi-asserted-by":"crossref","first-page":"3401","DOI":"10.1093\/bioinformatics\/btv370","article-title":"Predicting tumor purity from methylation microarray data","volume":"31","author":"Zhang","year":"2015","journal-title":"Bioinformatics"},{"key":"2023020206272948200_btx303-B29","doi-asserted-by":"crossref","first-page":"17.","DOI":"10.1186\/s13059-016-1143-5","article-title":"Estimating and accounting for tumor purity in the analysis of DNA methylation data from cancer studies","volume":"18","author":"Zheng","year":"2017","journal-title":"Genome Biol"},{"key":"2023020206272948200_btx303-B30","doi-asserted-by":"crossref","first-page":"419.","DOI":"10.1186\/s13059-014-0419-x","article-title":"MethylPurify: tumor purity deconvolution and differential methylation detection from single tumor DNA methylomes","volume":"15","author":"Zheng","year":"2014","journal-title":"Genome Biol"},{"key":"2023020206272948200_btx303-B31","doi-asserted-by":"crossref","first-page":"e1002517.","DOI":"10.1371\/journal.pgen.1002517","article-title":"The dynamics and prognostic potential of DNA methylation changes at stem cell gene loci in women's cancer","volume":"8","author":"Zhuang","year":"2012","journal-title":"PLoS Genet"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/17\/2651\/49041032\/bioinformatics_33_17_2651.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/33\/17\/2651\/49041032\/bioinformatics_33_17_2651.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T06:30:06Z","timestamp":1675319406000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/33\/17\/2651\/3796398"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,5,4]]},"references-count":31,"journal-issue":{"issue":"17","published-print":{"date-parts":[[2017,9,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btx303","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2017,9,1]]},"published":{"date-parts":[[2017,5,4]]}}}