{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,5]],"date-time":"2026-03-05T11:20:23Z","timestamp":1772709623913,"version":"3.50.1"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"16","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,8,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Methods for analyzing cancer microarray data often face two distinct challenges: the models they infer need to perform well when classifying new tissue samples while at the same time providing an insight into the patterns and gene interactions hidden in the data. State-of-the-art supervised data mining methods often cover well only one of these aspects, motivating the development of methods where predictive models with a solid classification performance would be easily communicated to the domain expert.<\/jats:p>\n               <jats:p>Results: Data visualization may provide for an excellent approach to knowledge discovery and analysis of class-labeled data. We have previously developed an approach called VizRank that can score and rank point-based visualizations according to degree of separation of data instances of different class. We here extend VizRank with techniques to uncover outliers, score features (genes) and perform classification, as well as to demonstrate that the proposed approach is well suited for cancer microarray analysis. Using VizRank and radviz visualization on a set of previously published cancer microarray data sets, we were able to find simple, interpretable data projections that include only a small subset of genes yet do clearly differentiate among different cancer types. 
We also report that our approach to classification through visualization achieves performance that is comparable to state-of-the-art supervised data mining techniques.<\/jats:p>\n               <jats:p>Availability: VizRank and radviz are implemented as part of the Orange data mining suite (http:\/\/www.ailab.si\/orange).<\/jats:p>\n               <jats:p>Contact: \u00a0blaz.zupan@fri.uni-lj.si<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available from http:\/\/www.ailab.si\/supp\/bi-cancer.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm312","type":"journal-article","created":{"date-parts":[[2007,6,23]],"date-time":"2007-06-23T00:39:53Z","timestamp":1182559193000},"page":"2147-2154","source":"Crossref","is-referenced-by-count":60,"title":["Visualization-based cancer microarray data classification analysis"],"prefix":"10.1093","volume":"23","author":[{"given":"Minca","family":"Mramor","sequence":"first","affiliation":[{"name":"1 Faculty of Computer and Information Science, University of Ljubljana, Tr\u017ea\u0161ka 25, 1000 Ljubljana, Slovenia and 2Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA"}]},{"given":"Gregor","family":"Leban","sequence":"additional","affiliation":[{"name":"1 Faculty of Computer and Information Science, University of Ljubljana, Tr\u017ea\u0161ka 25, 1000 Ljubljana, Slovenia and 2Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA"}]},{"given":"Janez","family":"Dem\u0161ar","sequence":"additional","affiliation":[{"name":"1 Faculty of Computer and Information Science, University of Ljubljana, Tr\u017ea\u0161ka 25, 1000 Ljubljana, Slovenia and 2Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA"}]},{"given":"Bla\u017e","family":"Zupan","sequence":"additional","affiliation":[{"name":"1 Faculty of Computer and Information 
Science, University of Ljubljana, Tr\u017ea\u0161ka 25, 1000 Ljubljana, Slovenia and 2Department of Molecular and Human Genetics, Baylor College of Medicine, 1 Baylor Plaza, Houston, TX 77030, USA"}]}],"member":"286","published-online":{"date-parts":[[2007,6,22]]},"reference":[{"key":"2024121118001066900_B1","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1038\/nrg1749","article-title":"Microarray data analysis: from disarray to consolidation and consensus","author":"Allison","year":"2006","journal-title":"Nat. Rev. Genet"},{"key":"2024121118001066900_B2","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1038\/ng765","article-title":"MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia","volume":"30","author":"Armstrong","year":"2002","journal-title":"Nat. Genet"},{"key":"2024121118001066900_B3","doi-asserted-by":"crossref","first-page":"55","DOI":"10.2174\/157489306775330615","article-title":"Gene expression profile classification: a review","volume":"1","author":"Asyali","year":"2006","journal-title":"Curr. Bioinformatics"},{"key":"2024121118001066900_B4","doi-asserted-by":"crossref","first-page":"13790","DOI":"10.1073\/pnas.191502998","article-title":"Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses","volume":"98","author":"Bhattacharjee","year":"2001","journal-title":"Proc. Natl Acad. Sci. 
USA"},{"key":"2024121118001066900_B5","doi-asserted-by":"crossref","first-page":"2584","DOI":"10.1182\/blood.V80.10.2584.2584","article-title":"Expression of the FMS\/KIT-like gene FLT3 in human acute leukemias of the myeloid and lymphoid lineages","volume":"80","author":"Birg","year":"1992","journal-title":"Blood"},{"key":"2024121118001066900_B6","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1075","article-title":"PLS dimension reduction for classification with microarray data","volume":"3","author":"Boulesteix","year":"2004","journal-title":"Stat. Appl. Genet. Mol. Biol"},{"key":"2024121118001066900_B7","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1093\/bib\/bbl016","article-title":"Partial least squares: a versatile tool for the analysis of high-dimensional genomic data","volume":"8","author":"Boulesteix","year":"2007","journal-title":"Brief. Bioinformatics"},{"key":"2024121118001066900_B8","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1093\/bioinformatics\/btg419","article-title":"Is cross-validation valid for small-sample microarray classification?","volume":"20","author":"Braga-Neto","year":"2004","journal-title":"Bioinformatics"},{"key":"2024121118001066900_B9","first-page":"55","article-title":"An investigation of methods for visualising highly multivariate datasets","volume-title":"Case Studies of Visualization in the Social Sciences","author":"Brunsdon","year":"1998"},{"key":"2024121118001066900_B10","doi-asserted-by":"crossref","first-page":"1252","DOI":"10.1093\/bioinformatics\/btg150","article-title":"Graphical methods for class prediction using dimension reduction techniques on DNA microarray data","volume":"19","author":"Bura","year":"2003","journal-title":"Bioinformatics"},{"key":"2024121118001066900_B11","doi-asserted-by":"crossref","first-page":"396","DOI":"10.1093\/bioinformatics\/bth474","article-title":"Microarray data mining with visual 
programming","volume":"21","author":"Curk","year":"2005","journal-title":"Bioinformatics"},{"key":"2024121118001066900_B12","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1147","article-title":"Dimension reduction for classification with gene expression microarray data","volume":"5","author":"Dai","year":"2006","journal-title":"Stat. Appl. Genet. Mol. Biol"},{"key":"2024121118001066900_B13","volume-title":"Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques","author":"Dasarathy","year":"1991"},{"key":"2024121118001066900_B14","article-title":"Orange: from experimental machine learning to interactive data mining","author":"Demsar","year":"2004"},{"key":"2024121118001066900_B15","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1016\/S0169-5002(99)00043-4","article-title":"Molecular genetic characteristics of lung cancer\u2013useful as \u2018real\u2019 tumor markers?","volume":"25","author":"Fleischhacker","year":"1999","journal-title":"Lung Cancer"},{"key":"2024121118001066900_B16","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2024121118001066900_B17","doi-asserted-by":"crossref","first-page":"97","DOI":"10.1186\/1471-2105-6-97","article-title":"Many accurate small-discriminatory feature subsets exist in microarray transcript data: biomarker discovery","volume":"6","author":"Grate","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2024121118001066900_B18","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1146\/annurev.immunol.19.1.595","article-title":"B cell development pathways","volume":"19","author":"Hardy","year":"2001","journal-title":"Annu. Rev. 
Immunol"},{"key":"2024121118001066900_B19","first-page":"437","article-title":"DNA visual and analytic data mining","volume-title":"In the Proceedings of the IEEE Visualization","author":"Hoffman","year":"1997"},{"key":"2024121118001066900_B20","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1038\/89044","article-title":"Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks","volume":"7","author":"Khan","year":"2001","journal-title":"Nat. Med"},{"key":"2024121118001066900_B21","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1186\/1471-2105-7-235","article-title":"A comparison of univariate and multivariate gene selection techniques for classification of cancer datasets","volume":"7","author":"Lai","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2024121118001066900_B22","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1007\/s10618-005-0031-5","article-title":"VizRank: data visualization guided by machine learning","volume":"13","author":"Leban","year":"2006","journal-title":"Data Mining Knowl. Discov"},{"key":"2024121118001066900_B23","doi-asserted-by":"crossref","first-page":"239","DOI":"10.1196\/annals.1310.020","article-title":"Application of machine learning and high-dimensional visualization in cancer detection, diagnosis, and management","volume":"1020","author":"McCarthy","year":"2004","journal-title":"Ann. N.Y. Acad. Sci"},{"key":"2024121118001066900_B24","doi-asserted-by":"crossref","first-page":"37","DOI":"10.2174\/157489306775330642","article-title":"Analysis of microarray gene expression data","volume":"1","author":"Pham","year":"2006","journal-title":"Curr. 
bioinformatics"},{"key":"2024121118001066900_B25","article-title":"C4.5:","author":"Quinlan","year":"1993","journal-title":"Programs for Machine Learning"},{"key":"2024121118001066900_B26","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1038\/nm0102-68","article-title":"Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning","volume":"8","author":"Shipp","year":"2002","journal-title":"Nat. Med"},{"key":"2024121118001066900_B27","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1093\/jnci\/95.1.14","article-title":"Pitfalls in the use of DNA microarray data for diagnostic and prognostic classification","volume":"95","author":"Simon","year":"2003","journal-title":"J. Natl Cancer Inst"},{"key":"2024121118001066900_B28","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/S1535-6108(02)00030-2","article-title":"Gene expression correlates of clinical prostate cancer behavior","volume":"1","author":"Singh","year":"2002","journal-title":"Cancer Cell"},{"key":"2024121118001066900_B29","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1093\/bioinformatics\/bti033","article-title":"A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis","volume":"21","author":"Statnikov","year":"2005","journal-title":"Bioinformatics"},{"key":"2024121118001066900_B30","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques with Java 
Implementations","author":"Witten","year":"2005","edition":"2nd"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/16\/2147\/61051783\/bioinformatics_23_16_2147.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/16\/2147\/61051783\/bioinformatics_23_16_2147.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,11]],"date-time":"2024-12-11T22:23:25Z","timestamp":1733955805000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/16\/2147\/198827"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,6,22]]},"references-count":30,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2007,8,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm312","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,8,15]]},"published":{"date-parts":[[2007,6,22]]}}}