{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,1]],"date-time":"2025-02-01T05:20:47Z","timestamp":1738387247940,"version":"3.35.0"},"reference-count":27,"publisher":"Oxford University Press (OUP)","issue":"16","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,8,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Unsupervised class discovery in gene expression data relies on the statistical signals in the data to exclusively drive the results. It is often the case, however, that one is interested in constraining the search space to respect certain biological prior knowledge while still allowing a flexible search within these boundaries.<\/jats:p><jats:p>Results: We develop an approach to semi-supervised class discovery. One component of our approach uses clinical sample information to constrain the search space and guide the class discovery process to yield biologically relevant partitions. A second component consists of using known biological annotation of genes to drive the search, seeking partitions that manifest strong differential expression in specific sets of genes. We develop efficient algorithmics for these tasks, implementing both approaches and combinations thereof. We show that our method is robust enough to detect known clinical parameters in accordance with expected clinical values. We also use our method to elucidate cardiovascular disease (CVD) putative risk factors.<\/jats:p><jats:p>Availability: MonoClaD (Monotone Class Discovery). See http:\/\/bioinfo.cs.technion.ac.il\/people\/zohar\/MonoClad\/<\/jats:p><jats:p>Supplementary information: Supplementary data is available at http:\/\/bioinfo.cs.technion.ac.il\/people\/zohar\/MonoClad\/software.html<\/jats:p><jats:p>Contact: \u00a0zohar_yakhini@agilent.com<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn279","type":"journal-article","created":{"date-parts":[[2008,8,9]],"date-time":"2008-08-09T13:08:02Z","timestamp":1218287282000},"page":"i90-i97","source":"Crossref","is-referenced-by-count":16,"title":["Clinically driven semi-supervised class discovery in gene expression data"],"prefix":"10.1093","volume":"24","author":[{"given":"Israel","family":"Steinfeld","sequence":"first","affiliation":[{"name":"1 Agilent Laboratories, Tel Aviv, Israel and 2Departments of Internal Medicine and Biomedical Sciences, University of Parma, Parma, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Roy","family":"Navon","sequence":"additional","affiliation":[{"name":"1 Agilent Laboratories, Tel Aviv, Israel and 2Departments of Internal Medicine and Biomedical Sciences, University of Parma, Parma, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Diego","family":"Ardig\u00f2","sequence":"additional","affiliation":[{"name":"1 Agilent Laboratories, Tel Aviv, Israel and 2Departments of Internal Medicine and Biomedical Sciences, University of Parma, Parma, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ivana","family":"Zavaroni","sequence":"additional","affiliation":[{"name":"1 Agilent Laboratories, Tel Aviv, Israel and 2Departments of Internal Medicine and Biomedical Sciences, University of Parma, Parma, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zohar","family":"Yakhini","sequence":"additional","affiliation":[{"name":"1 Agilent Laboratories, Tel Aviv, Israel and 2Departments of Internal Medicine and Biomedical Sciences, University of Parma, Parma, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2008,8,9]]},"reference":[{"key":"2023020210495740300_B1","doi-asserted-by":"crossref","first-page":"503","DOI":"10.1038\/35000501","article-title":"Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling","volume":"403","author":"Alizadeh","year":"2000","journal-title":"Nature"},{"key":"2023020210495740300_B2","doi-asserted-by":"crossref","first-page":"1109","DOI":"10.1515\/CCLM.2007.261","article-title":"Application of leukocyte transcriptomes to assess systemic consequences of risk factors for cardiovascular disease","volume":"45","author":"Ardigo","year":"2007","journal-title":"Clin. Chem. Lab. Med"},{"key":"2023020210495740300_B3","doi-asserted-by":"crossref","first-page":"559","DOI":"10.1089\/106652700750050943","article-title":"Tissue classification with gene expression profiles","volume":"7","author":"Ben-Dor","year":"2001","journal-title":"J. Comput. Biol"},{"key":"2023020210495740300_B4","first-page":"31","article-title":"Class discovery in gene expression data. In","author":"Ben-Dor","year":"2001"},{"key":"2023020210495740300_B5","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. B"},{"key":"2023020210495740300_B6","doi-asserted-by":"crossref","first-page":"536","DOI":"10.1038\/35020115","article-title":"Molecular classification of cutaneous malignant melanoma by gene expression profiling","volume":"406","author":"Bittner","year":"2000","journal-title":"Nature"},{"key":"2023020210495740300_B7","doi-asserted-by":"crossref","first-page":"2181","DOI":"10.1185\/030079906X148472","article-title":"Carotid intima-media thickness as a surrogate marker for cardiovascular disease in intervention studies","volume":"22","author":"Bots","year":"2006","journal-title":"Curr. Med. Res. Opin"},{"key":"2023020210495740300_B8","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1186\/gb-2003-4-4-210","article-title":"Statistical tests for differential expression in cDNA microarray experiments","volume":"4","author":"Cui","year":"2003","journal-title":"Genome Biol"},{"key":"2023020210495740300_B9","doi-asserted-by":"crossref","first-page":"1010","DOI":"10.1093\/bioinformatics\/btl070","article-title":"Mayday - a microarray data analysis workbench","volume":"22","author":"Dietzsch","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020210495740300_B10","doi-asserted-by":"crossref","first-page":"e39","DOI":"10.1371\/journal.pcbi.0030039","article-title":"Discovering motifs in ranked lists of DNA Sequences","volume":"3","author":"Eden","year":"2007","journal-title":"PloS Comput. Biol"},{"key":"2023020210495740300_B11","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023020210495740300_B12","doi-asserted-by":"crossref","first-page":"D258","DOI":"10.1093\/nar\/gkh036","article-title":"The Gene Ontology (GO) database and informatics resource","volume":"32","author":"Harris","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023020210495740300_B13","doi-asserted-by":"crossref","first-page":"907","DOI":"10.1016\/j.echo.2007.02.028","article-title":"Clinical use of carotid intima-media thickness: review of the literature","volume":"20","author":"Hurst","year":"2007","journal-title":"J. Am. Soc. Echocardiogr"},{"key":"2023020210495740300_B14","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1002\/0470857897.ch8","article-title":"The KEGG database","volume":"247","author":"Kanehisa","year":"2002","journal-title":"Novartis Found Symp"},{"key":"2023020210495740300_B15","doi-asserted-by":"crossref","first-page":"671","DOI":"10.1126\/science.220.4598.671","article-title":"Optimization by simulated annealing","volume":"220","author":"Kirkpatrick","year":"1983","journal-title":"Science"},{"key":"2023020210495740300_B16","doi-asserted-by":"crossref","first-page":"1135","DOI":"10.1161\/hc0902.104353","article-title":"Inflammation and atherosclerosis","volume":"105","author":"Libby","year":"2002","journal-title":"Circulation"},{"key":"2023020210495740300_B17","doi-asserted-by":"crossref","first-page":"459","DOI":"10.1161\/CIRCULATIONAHA.106.628875","article-title":"Prediction of clinical cardiovascular events with carotid intima-media thickness: a systematic review and meta-analysis","volume":"115","author":"Lorenz","year":"2007","journal-title":"Circulation"},{"key":"2023020210495740300_B18","doi-asserted-by":"crossref","first-page":"D54","DOI":"10.1093\/nar\/gki031","article-title":"Entrez gene: gene-centered information at NCBI","volume":"33","author":"Maglott","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023020210495740300_B19","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1373\/clinchem.2007.097360","article-title":"Inflammation in atherosclerosis: from vascular biology to biomarker discovery and risk prediction","volume":"54","author":"Packard","year":"2008","journal-title":"Clin. Chem"},{"key":"2023020210495740300_B20","doi-asserted-by":"crossref","first-page":"S6","DOI":"10.1186\/1471-2105-8-S8-S6","article-title":"Semi-supervised class discovery using quantitative phenotypes \u2013 CVD as a case study","volume":"8","author":"Steinfeld","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023020210495740300_B21","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210495740300_B22","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1159\/000097034","article-title":"Mannheim carotid intima-media thickness consensus (2004\u20132006). An update on behalf of the advisory board of the 3rd and 4th watching the risk symposium, 13th and 15th European stroke conferences, Mannheim, Germany, 2004, and Brussels, Belgium, 2006","volume":"23","author":"Touboul","year":"2007","journal-title":"Cerebrovasc Dis"},{"key":"2023020210495740300_B23","doi-asserted-by":"crossref","first-page":"5116","DOI":"10.1073\/pnas.091062498","article-title":"Significance analysis of microarrays applied to the ionizing radiation response","volume":"98","author":"Tusher","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020210495740300_B24","doi-asserted-by":"crossref","first-page":"S107","DOI":"10.1093\/bioinformatics\/17.suppl_1.S107","article-title":"Identifying splits with clear separation: a new class discovery method for gene expression data","volume":"17","author":"Heydebreck","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020210495740300_B25","doi-asserted-by":"crossref","first-page":"530","DOI":"10.1038\/415530a","article-title":"Gene expression profiling predicts clinical outcome of breast cancer","volume":"415","author":"Veer","year":"2002","journal-title":"Nature"},{"key":"2023020210495740300_B26","doi-asserted-by":"crossref","first-page":"702","DOI":"10.1056\/NEJM198903163201105","article-title":"Risk factors for coronary artery disease in healthy persons with hyperinsulinemia and normal glucose tolerance","volume":"320","author":"Zavaroni","year":"1989","journal-title":"N. Engl. J. Med"},{"key":"2023020210495740300_B27","doi-asserted-by":"crossref","first-page":"989","DOI":"10.1016\/S0026-0495(99)90195-6","article-title":"Hyperinsulinemia in a normal population as a predictor of non-insulin-dependent diabetes mellitus, hypertension, and coronary heart disease: the Barilla factory revisited","volume":"48","author":"Zavaroni","year":"1999","journal-title":"Metabolism"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/16\/i90\/49049918\/bioinformatics_24_16_i90.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/16\/i90\/49049918\/bioinformatics_24_16_i90.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,31]],"date-time":"2025-01-31T11:20:30Z","timestamp":1738322430000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/16\/i90\/200240"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,8,9]]},"references-count":27,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2008,8,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn279","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2008,8,15]]},"published":{"date-parts":[[2008,8,9]]}}}