{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,15]],"date-time":"2026-05-15T02:31:00Z","timestamp":1778812260075,"version":"3.51.4"},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Genomic analyses of many solid cancers have demonstrated extensive genetic heterogeneity between as well as within individual tumors. However, statistical methods for classifying tumors by subtype based on genomic biomarkers generally entail an all-or-none decision, which may be misleading for clinical samples containing a mixture of subtypes and\/or normal cell contamination.<\/jats:p>\n               <jats:p>Results: We have developed a mixed-membership classification model, called glad , that simultaneously learns a sparse biomarker signature for each subtype as well as a distribution over subtypes for each sample. We demonstrate the accuracy of this model on simulated data, in-vitro mixture experiments, and clinical samples from the Cancer Genome Atlas (TCGA) project. We show that many TCGA samples are likely a mixture of multiple subtypes.<\/jats:p>\n               <jats:p>Availability: A python module implementing our algorithm is available from http:\/\/genomics.wpi.edu\/glad\/<\/jats:p>\n               <jats:p>Contact: \u00a0pjflaherty@wpi.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btu618","type":"journal-article","created":{"date-parts":[[2014,9,30]],"date-time":"2014-09-30T05:52:55Z","timestamp":1412056375000},"page":"225-232","source":"Crossref","is-referenced-by-count":11,"title":["GLAD: a mixed-membership model for heterogeneous tumor subtype classification"],"prefix":"10.1093","volume":"31","author":[{"given":"Hachem","family":"Saddiki","sequence":"first","affiliation":[{"name":"1 Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, USA, 2 School of Science and Engineering, Al Akhawayn University, Ifrane, 53000, Morocco, 3 Department of Statistics, University of California, Berkeley, CA 94720, USA, and 4 Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA"},{"name":"1 Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, USA, 2 School of Science and Engineering, Al Akhawayn University, Ifrane, 53000, Morocco, 3 Department of Statistics, University of California, Berkeley, CA 94720, USA, and 4 Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jon","family":"McAuliffe","sequence":"additional","affiliation":[{"name":"1 Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, USA, 2 School of Science and Engineering, Al Akhawayn University, Ifrane, 53000, Morocco, 3 Department of Statistics, University of California, Berkeley, CA 94720, USA, and 4 Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Patrick","family":"Flaherty","sequence":"additional","affiliation":[{"name":"1 Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, USA, 2 School of Science and Engineering, Al Akhawayn University, Ifrane, 53000, Morocco, 3 Department of Statistics, University of California, Berkeley, CA 94720, USA, and 4 Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA"},{"name":"1 Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, USA, 2 School of Science and Engineering, Al Akhawayn University, Ifrane, 53000, Morocco, 3 Department of Statistics, University of California, Berkeley, CA 94720, USA, and 4 Bioinformatics and Computational Biology Program, Worcester Polytechnic Institute, Worcester, MA 01609, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2014,9,29]]},"reference":[{"key":"2023020116144630100_btu618-B1","first-page":"1981","article-title":"Mixed membership stochastic blockmodels","volume":"9","author":"Airoldi","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"2023020116144630100_btu618-B2","volume-title":"Pattern Recognition and Machine Learning","author":"Bishop","year":"2006"},{"key":"2023020116144630100_btu618-B3","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"2023020116144630100_btu618-B4","doi-asserted-by":"crossref","first-page":"4055","DOI":"10.1158\/0008-5472.CAN-11-0153","article-title":"Heterogeneity maintenance in glioblastoma: a social network","volume":"71","author":"Bonavia","year":"2011","journal-title":"Cancer Res."},{"key":"2023020116144630100_btu618-B5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1002\/cne.10874","article-title":"Transient expression of doublecortin during adult neurogenesis","volume":"467","author":"Brown","year":"2003","journal-title":"J. Comp. Neurol."},{"key":"2023020116144630100_btu618-B6","doi-asserted-by":"crossref","first-page":"4164","DOI":"10.1073\/pnas.0308531101","article-title":"Metagenes and molecular pattern discovery using matrix factorization","volume":"101","author":"Brunet","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020116144630100_btu618-B7","doi-asserted-by":"crossref","first-page":"346","DOI":"10.1038\/nature10983","article-title":"The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups","volume":"486","author":"Curtis","year":"2012","journal-title":"Nature"},{"key":"2023020116144630100_btu618-B8","doi-asserted-by":"crossref","first-page":"614","DOI":"10.1038\/nm.3174","article-title":"Poor-prognosis colon cancer is defined by a molecularly distinct subtype and develops from serrated precursor lesions","volume":"19","author":"De Sousa E Melo","year":"2013","journal-title":"Nat. Med."},{"key":"2023020116144630100_btu618-B9","first-page":"3174","article-title":"Heterogeneity of tumor cells from a single mouse mammary tumor","volume":"38","author":"Dexter","year":"1978","journal-title":"Cancer Res."},{"key":"2023020116144630100_btu618-B10","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511761362","volume-title":"Large-scale Inference: Empirical Bayes Methods for Estimation, Testing, and Prediction","author":"Efron","year":"2010"},{"key":"2023020116144630100_btu618-B11","doi-asserted-by":"crossref","first-page":"14863","DOI":"10.1073\/pnas.95.25.14863","article-title":"Cluster analysis and display of genome-wide expression patterns","volume":"95","author":"Eisen","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020116144630100_btu618-B12","doi-asserted-by":"crossref","first-page":"5220","DOI":"10.1073\/pnas.0307760101","article-title":"Mixed-membership models of scientific publications","volume":"101","author":"Erosheva","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020116144630100_btu618-B13","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1111\/j.1471-8286.2007.01758.x","article-title":"Inference of population structure using multilocus genotype data: dominant markers and null alleles","volume":"7","author":"Falush","year":"2007","journal-title":"Mol. Ecol. Notes"},{"key":"2023020116144630100_btu618-B14","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-11-research0059","article-title":"Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering","volume":"3","author":"Gasch","year":"2002","journal-title":"Genome Biol."},{"key":"2023020116144630100_btu618-B15","doi-asserted-by":"crossref","first-page":"883","DOI":"10.1056\/NEJMoa1113205","article-title":"Intratumor heterogeneity and branched evolution revealed by multiregion sequencing","volume":"366","author":"Gerlinger","year":"2012","journal-title":"N. Engl. J. Med."},{"key":"2023020116144630100_btu618-B16","doi-asserted-by":"crossref","first-page":"746","DOI":"10.1198\/016214501753168398","article-title":"Model selection and the principle of minimum description length","volume":"96","author":"Hansen","year":"2001","journal-title":"J. Am. Stat. Assoc."},{"key":"2023020116144630100_btu618-B17","first-page":"2259","article-title":"Tumor heterogeneity","volume":"44","author":"Heppner","year":"1984","journal-title":"Cancer Res."},{"key":"2023020116144630100_btu618-B18","doi-asserted-by":"crossref","first-page":"1108","DOI":"10.1038\/nmeth.2651","article-title":"Network-based stratification of tumor mutations","volume":"10","author":"Hofree","year":"2013","journal-title":"Nat. Methods"},{"key":"2023020116144630100_btu618-B19","first-page":"1457","article-title":"Non-negative matrix factorization with sparseness constraints","volume":"5","author":"Hoyer","year":"2004","journal-title":"J. Mach. Learn. Res."},{"key":"2023020116144630100_btu618-B20","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1023\/A:1007665907178","article-title":"An introduction to variational methods for graphical models","volume":"37","author":"Jordan","year":"1999","journal-title":"Mach. Learn."},{"key":"2023020116144630100_btu618-B21","doi-asserted-by":"crossref","first-page":"1271","DOI":"10.1016\/j.patrec.2007.02.010","article-title":"On bayesian classification with laplace priors","volume":"28","author":"Kab\u00e1n","year":"2007","journal-title":"Pattern Recognit. Lett."},{"key":"2023020116144630100_btu618-B22","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1038\/nature11412","article-title":"Comprehensive molecular portraits of human breast tumours","volume":"490","author":"Koboldt","year":"2012","journal-title":"Nature"},{"key":"2023020116144630100_btu618-B23","first-page":"1167","article-title":"Periostin: novel diagnostic and therapeutic target for cancer","volume":"22","author":"Kudo","year":"2007","journal-title":"Histol. Histopathol."},{"key":"2023020116144630100_btu618-B24","article-title":"Efficient sparse coding algorithms","author":"Lee","year":"2006","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"2023020116144630100_btu618-B25","doi-asserted-by":"crossref","first-page":"415","DOI":"10.1162\/neco.1992.4.3.415","article-title":"Bayesian interpolation","volume":"4","author":"MacKay","year":"1992","journal-title":"Neural. Comput."},{"key":"2023020116144630100_btu618-B26","doi-asserted-by":"crossref","first-page":"1061","DOI":"10.1038\/nature07385","article-title":"Comprehensive genomic characterization defines human glioblastoma genes and core pathways","volume":"455","author":"McLendon","year":"2008","journal-title":"Nature"},{"key":"2023020116144630100_btu618-B27","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1007\/BF02294245","article-title":"An examination of procedures for determining the number of clusters in a data set","volume":"50","author":"Milligan","year":"1985","journal-title":"Psychometrika"},{"key":"2023020116144630100_btu618-B28","doi-asserted-by":"crossref","first-page":"1160","DOI":"10.1200\/JCO.2008.18.1370","article-title":"Supervised risk predictor of breast cancer based on intrinsic subtypes","volume":"27","author":"Parker","year":"2009","journal-title":"J. Clin. Oncol."},{"key":"2023020116144630100_btu618-B29","doi-asserted-by":"crossref","first-page":"1807","DOI":"10.1126\/science.1164382","article-title":"An integrated genomic analysis of human glioblastoma multiforme","volume":"321","author":"Parsons","year":"2008","journal-title":"Science"},{"key":"2023020116144630100_btu618-B30","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1109\/TCBB.2005.29","article-title":"The latent process decomposition of cDNA microarray data sets","volume":"2","author":"Rogers","year":"2005","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"2023020116144630100_btu618-B31","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: a graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw","year":"1987","journal-title":"J. Comput. Appl. Math."},{"key":"2023020116144630100_btu618-B32","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1214\/aos\/1176344136","article-title":"Estimating the dimension of a model","volume":"6","author":"Schwarz","year":"1978","journal-title":"Ann. Stat."},{"key":"2023020116144630100_btu618-B33","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1038\/nmeth.1439","article-title":"Cell type-specific gene expression differences in complex tissues","volume":"7","author":"Shen-Orr","year":"2010","journal-title":"Nat. Methods"},{"key":"2023020116144630100_btu618-B34","volume-title":"Machine Learning and Knowledge Discovery in Databases","author":"Singh","year":"2008"},{"key":"2023020116144630100_btu618-B35","doi-asserted-by":"crossref","first-page":"750","DOI":"10.1198\/016214503000000666","article-title":"Finding the number of clusters in a dataset","volume":"98","author":"Sugar","year":"2003","journal-title":"J. Am. Stat. Assoc."},{"key":"2023020116144630100_btu618-B36","doi-asserted-by":"crossref","first-page":"755","DOI":"10.1080\/01621459.2012.734168","article-title":"Multinomial inverse regression for text analysis","volume":"108","author":"Taddy","year":"2012","journal-title":"J. Am. Stat. Assoc."},{"key":"2023020116144630100_btu618-B37","doi-asserted-by":"crossref","first-page":"1566","DOI":"10.1198\/016214506000000302","article-title":"Hierarchical dirichlet processes","volume":"101","author":"Teh","year":"2006","journal-title":"J. Am. Stat. Assoc."},{"key":"2023020116144630100_btu618-B38","doi-asserted-by":"crossref","first-page":"2483","DOI":"10.1056\/NEJMoa030847","article-title":"The role of the Wnt-signaling antagonist DKK1 in the development of osteolytic lesions in multiple myeloma","volume":"349","author":"Tian","year":"2003","journal-title":"N. Engl. J. Med."},{"key":"2023020116144630100_btu618-B39","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1016\/j.ccr.2009.12.020","article-title":"Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1","volume":"17","author":"Verhaak","year":"2010","journal-title":"Cancer Cell"},{"key":"2023020116144630100_btu618-B40","first-page":"1982","article-title":"Decoupling sparsity and smoothness in the discrete hierarchical dirichlet process","author":"Wang","year":"2009","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"2023020116144630100_btu618-B41","first-page":"1005","article-title":"Variational inference in nonconjugate models","volume":"14","author":"Wang","year":"2013","journal-title":"J. Mach. Learn. Res."},{"key":"2023020116144630100_btu618-B42","doi-asserted-by":"crossref","first-page":"893","DOI":"10.1093\/biomet\/asq061","article-title":"Consistent selection of the number of clusters via crossvalidation","volume":"97","author":"Wang","year":"2010","journal-title":"Biometrika"},{"key":"2023020116144630100_btu618-B43","doi-asserted-by":"crossref","DOI":"10.1145\/1150402.1150450","volume-title":"Topics Over Time: a Non-Markov Continuous-time Model of Topical Trends. A Non-Markov Continuous-time Model of Topical trends","author":"Wang","year":"2006"},{"key":"2023020116144630100_btu618-B44","doi-asserted-by":"crossref","first-page":"2612","DOI":"10.1038\/ncomms3612","article-title":"Inferring tumour purity and stromal and immune cell admixture from expression data","volume":"4","author":"Yoshihara","year":"2013","journal-title":"Nat. Commun."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/2\/225\/49010924\/bioinformatics_31_2_225.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/2\/225\/49010924\/bioinformatics_31_2_225.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T00:23:32Z","timestamp":1675297412000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/2\/225\/2365684"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,9,29]]},"references-count":44,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2015,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btu618","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,1,15]]},"published":{"date-parts":[[2014,9,29]]}}}