{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,8,3]],"date-time":"2024-08-03T21:41:34Z","timestamp":1722721294643},"reference-count":36,"publisher":"Oxford University Press (OUP)","issue":"9","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: High-throughput measurements of mRNA abundances from microarrays involve several stages of preprocessing. At each stage, a user has access to a large number of algorithms with no universally agreed guidance on which of these to use. We show that binary representations of gene expressions, retaining only information on whether a gene is expressed or not, reduces the variability in results caused by algorithmic choice, while also improving the quality of inference drawn from microarray studies.<\/jats:p>\n               <jats:p>Results: Binary representation of transcriptome data has the desirable property of reducing the variability introduced at the preprocessing stages due to algorithmic choice. We compare the effect of the choice of algorithms on different problems and suggest that using binary representation of microarray data with Tanimoto kernel for support vector machine reduces the effect of the choice of algorithm and simultaneously improves the performance of classification of phenotypes.<\/jats:p>\n               <jats:p>Contact: \u00a0mn@ecs.soton.ac.uk<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq104","type":"journal-article","created":{"date-parts":[[2010,3,9]],"date-time":"2010-03-09T04:53:55Z","timestamp":1268110435000},"page":"1185-1191","source":"Crossref","is-referenced-by-count":5,"title":["Reducing the algorithmic variability in transcriptome-based inference"],"prefix":"10.1093","volume":"26","author":[{"given":"Salih","family":"Tuna","sequence":"first","affiliation":[{"name":"School of Electronics and Computer Science, University of Southampton, Southampton, UK"}]},{"given":"Mahesan","family":"Niranjan","sequence":"additional","affiliation":[{"name":"School of Electronics and Computer Science, University of Southampton, Southampton, UK"}]}],"member":"286","published-online":{"date-parts":[[2010,3,8]]},"reference":[{"key":"2023012508164602500_B1","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1038\/nrg1749","article-title":"Microarray data analysis: from disarray to consolidation and consensus","volume":"7","author":"Allison","year":"2006","journal-title":"Nat. Rev. Genet."},{"key":"2023012508164602500_B2","doi-asserted-by":"crossref","first-page":"839","DOI":"10.1093\/bioinformatics\/btg487","article-title":"Comparative analysis of algorithms for signal quantitation from oligonucleotide microarrays","volume":"20","author":"Barash","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508164602500_B3","doi-asserted-by":"crossref","first-page":"711","DOI":"10.1182\/blood-2006-02-002824","article-title":"Biologic pathways associated with relapse in childhood acute lymphoblastic leukemia: a Children's Oncology Group study","volume":"108","author":"Bhojwani","year":"2006","journal-title":"Blood"},{"key":"2023012508164602500_B4","volume-title":"Pattern Recognition and Machine Learning.","author":"Bishop","year":"2006"},{"key":"2023012508164602500_B5","doi-asserted-by":"crossref","first-page":"1324","DOI":"10.1002\/ijc.23237","article-title":"A stromal gene signature associated with inflammatory breast cancer","volume":"122","author":"Boersma","year":"2007","journal-title":"Int. J. Cancer"},{"key":"2023012508164602500_B6","author":"Bolstad","year":"2004","journal-title":"affy: Built-in Processing Methods."},{"key":"2023012508164602500_B7","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1073\/pnas.97.1.262","article-title":"Knowledge-based analysis of microarray gene expression data by using support vector machines","volume":"97","author":"Brown","year":"2000","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508164602500_B8","doi-asserted-by":"crossref","first-page":"R16","DOI":"10.1186\/gb-2005-6-2-r16","article-title":"Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset","volume":"6","author":"Choe","year":"2005","journal-title":"Genome Biol."},{"key":"2023012508164602500_B9","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1093\/bioinformatics\/btg410","article-title":"A benchmark for Affymetrix GeneChip expression measures","volume":"20","author":"Cope","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508164602500_B10","doi-asserted-by":"crossref","first-page":"3583","DOI":"10.1093\/bioinformatics\/bth447","article-title":"BagBoosting for tumor classification with gene expression data","volume":"20","author":"Dettling","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508164602500_B11","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1016\/j.tig.2005.12.005","article-title":"Reliability and reproducibility issues in DNA microarray measurements","volume":"22","author":"Draghici","year":"2006","journal-title":"Trends Genet."},{"key":"2023012508164602500_B12","volume-title":"Pattern Classification.","author":"Duda","year":"2001"},{"key":"2023012508164602500_B13","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1093\/bioinformatics\/btg405","article-title":"affy\u2014analysis of Affymetrix GeneChip data at the probe level","volume":"20","author":"Gautier","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012508164602500_B14","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1126\/science.286.5439.531","article-title":"Molecular classification of cancer: class discovery and class prediction by gene expression monitoring","volume":"286","author":"Golub","year":"1999","journal-title":"Science"},{"key":"2023012508164602500_B15","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1007\/0-387-21679-0_4","article-title":"An R package for analyses of Affymetrix oligonucleotide arrays","volume-title":"The analysis of gene expression data: methods and software","author":"Irizarry","year":"2003"},{"key":"2023012508164602500_B16","doi-asserted-by":"crossref","first-page":"e15","DOI":"10.1093\/nar\/gng015","article-title":"Summaries of Affymetrix GeneChip probe level data","volume":"31","author":"Irizarry","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012508164602500_B17","doi-asserted-by":"crossref","first-page":"e1651","DOI":"10.1371\/journal.pone.0001651","article-title":"Gene expression signature of cigarette smoking and its role in lung adenocarcinoma development and survival","volume":"3","author":"Landi","year":"2008","journal-title":"PLoS ONE"},{"key":"2023012508164602500_B18","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1002\/path.1921","article-title":"Differential expression of a gene signature for scavenger\/lectin receptors by endothelial cells and macrophages in human lymph node sinuses, the primary sites of regional metastasis","volume":"208","author":"Martens","year":"2006","journal-title":"J. Pathol."},{"key":"2023012508164602500_B19","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1186\/1471-2105-7-137","article-title":"How to decide? different methods of calculating gene expression from short oligonucleotide array data will give different results","volume":"7","author":"Millenaar","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508164602500_B20","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1186\/1471-2105-9-164","article-title":"A comprehensive re-analysis of the Golden Spike data: towards a benchmark for differential expression methods","volume":"9","author":"Pearson","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012508164602500_B21","doi-asserted-by":"crossref","first-page":"80","DOI":"10.1186\/1471-2105-6-80","article-title":"Correlation test to assess low-level processing of high-density oligonucleotide microarray data","volume":"6","author":"Ploner","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012508164602500_B22","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1186\/1471-2105-7-23","article-title":"Evaluation of methods for oligonucleotide array data via quantitative real-time PCR","volume":"7","author":"Qin","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012508164602500_B23","doi-asserted-by":"crossref","first-page":"1093","DOI":"10.1016\/j.neunet.2005.07.009","article-title":"Graph kernels for chemical informatics","volume":"18","author":"Ralaivola","year":"2005","journal-title":"Neural Netw."},{"key":"2023012508164602500_B24","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1186\/1471-2105-6-26","article-title":"Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data","volume":"6","author":"Shedden","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012508164602500_B25","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/S1535-6108(02)00030-2","article-title":"Gene expression correlates of clinical prostate cancer behavior","volume":"1","author":"Singh","year":"2002","journal-title":"Cancer cell"},{"key":"2023012508164602500_B26","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1093\/bioinformatics\/bti033","article-title":"A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis","volume":"21","author":"Statnikov","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012508164602500_B27","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1186\/1471-2164-8-140","article-title":"Selection of DDX5 as a novel internal control for Q-RT-PCR from microarray data using a block bootstrap re-sampling scheme","volume":"8","author":"Su","year":"2007","journal-title":"BMC Genomics"},{"issue":"Suppl. 1","key":"2023012508164602500_B28","doi-asserted-by":"crossref","first-page":"i359","DOI":"10.1093\/bioinformatics\/bti1055","article-title":"Kernels for small molecules and the prediction of mutagenicity, toxicity and anti-cancer activity","volume":"21","author":"Swamidass","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012508164602500_B29","article-title":"Support vector machines for Drug Discovery","volume-title":"PhD Thesis","author":"Trotter","year":"2006"},{"key":"2023012508164602500_B30","doi-asserted-by":"crossref","first-page":"390","DOI":"10.4236\/jbise.2009.26056","article-title":"Classification with binary gene expressions","volume":"2","author":"Tuna","year":"2009","journal-title":"J. biomed. sci. eng."},{"key":"2023012508164602500_B31","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1007\/s11265-009-0363-2","article-title":"Inference from low precision transcriptome data representation","volume":"58","author":"Tuna","year":"2010","journal-title":"J. Sign. Process. syst."},{"key":"2023012508164602500_B32","doi-asserted-by":"crossref","first-page":"927","DOI":"10.1158\/0008-5472.CAN-07-2608","article-title":"Tumor immunobiological differences in prostate cancer between African-American and European-American men","volume":"68","author":"Wallace","year":"2008","journal-title":"Cancer Res."},{"key":"2023012508164602500_B33","doi-asserted-by":"crossref","first-page":"11462","DOI":"10.1073\/pnas.201162998","article-title":"Predicting the clinical status of human breast cancer by using gene expression profiles","volume":"98","author":"West","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012508164602500_B34","doi-asserted-by":"crossref","first-page":"1046","DOI":"10.1016\/j.drudis.2006.10.005","article-title":"Similarity-based virtual screening using 2D fingerprints","volume":"11","author":"Willett","year":"2006","journal-title":"Drug Discov. Today"},{"key":"2023012508164602500_B35","first-page":"679","article-title":"Binarization of microarray data on the basis of a mixture model","volume":"2","author":"Zhou","year":"2003","journal-title":"Mol. Cancer Ther."},{"key":"2023012508164602500_B36","doi-asserted-by":"crossref","first-page":"911","DOI":"10.1038\/nmeth1102","article-title":"A gene expression bar code for microarray data","volume":"4","author":"Zilliox","year":"2007","journal-title":"Nat. Methods"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/9\/1185\/48857031\/bioinformatics_26_9_1185.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/9\/1185\/48857031\/bioinformatics_26_9_1185.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T08:17:18Z","timestamp":1674634638000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/9\/1185\/199235"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,3,8]]},"references-count":36,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2010,5,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq104","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2010,5,1]]},"published":{"date-parts":[[2010,3,8]]}}}