{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,5,18]],"date-time":"2024-05-18T01:35:27Z","timestamp":1715996127808},"reference-count":78,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2007,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Independently derived expression profiles of the same biological condition often have few genes in common. In this study, we created populations of expression profiles from publicly available microarray datasets of cancer (breast, lymphoma and renal) samples linked to clinical information with an iterative machine learning algorithm. ROC curves were used to assess the prediction error of each profile for classification. We compared the prediction error of profiles correlated with molecular phenotype against profiles correlated with relapse-free status. Prediction error of profiles identified with supervised univariate feature selection algorithms were compared to profiles selected randomly from a) all genes on the microarray platform and b) a list of known disease-related genes (a priori selection). We also determined the relevance of expression profiles on test arrays from independent datasets, measured on either the same or different microarray platforms.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>Highly discriminative expression profiles were produced on both simulated gene expression data and expression data from breast cancer and lymphoma datasets on the basis of ER and BCL-6 expression, respectively. 
Use of relapse-free status to identify profiles for prognosis prediction resulted in poorly discriminative decision rules. Supervised feature selection resulted in more accurate classifications than random or a priori selection; however, the difference in prediction error decreased as the number of features increased. These results held when decision rules were applied across datasets to samples profiled on the same microarray platform.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>Our results show that many gene sets predict molecular phenotypes accurately. Given this, expression profiles identified using different training datasets should be expected to show little agreement. In addition, we demonstrate the difficulty in predicting relapse directly from microarray data using supervised machine learning approaches. These findings are relevant to the use of molecular profiling for the identification of candidate biomarker panels.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-8-415","type":"journal-article","created":{"date-parts":[[2007,10,26]],"date-time":"2007-10-26T16:40:06Z","timestamp":1193416806000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":21,"title":["Prediction potential of candidate biomarker sets identified and validated on gene expression data from multiple 
datasets"],"prefix":"10.1186","volume":"8","author":[{"given":"Michael","family":"Gormley","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"William","family":"Dampier","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Adam","family":"Ertel","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bilge","family":"Karacali","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Aydin","family":"Tozeren","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2007,10,26]]},"reference":[{"key":"1787_CR1","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1517\/14796694.1.1.37","volume":"1","author":"SK Chatterjee","year":"2005","unstructured":"Chatterjee SK, Zetter BR: Cancer biomarkers: knowing the present and predicting the future. Future Oncol 2005, 1: 37\u201350. 10.1517\/14796694.1.1.37","journal-title":"Future Oncol"},{"key":"1787_CR2","doi-asserted-by":"publisher","first-page":"499","DOI":"10.1097\/CAD.0b013e3280262427","volume":"18","author":"AM Parissenti","year":"2007","unstructured":"Parissenti AM, Hembruff SL, Villeneuve DJ, Veitch Z, Guo B, Eng J: Gene expression profiles as biomarkers for the prediction of chemotherapy drug response in human tumour cells. Anticancer Drugs 2007, 18: 499\u2013523. 10.1097\/CAD.0b013e3280262427","journal-title":"Anticancer Drugs"},{"key":"1787_CR3","doi-asserted-by":"publisher","first-page":"305","DOI":"10.1097\/01.LAB.0000059936.28369.19","volume":"83","author":"F Bertucci","year":"2003","unstructured":"Bertucci F, Viens P, Tageet R, Nguyen C, Houlgatte R, Birnbaum D: DNA Arrays in Clinical Oncology: Promises and Challenges. 
Lab Invest 2003, 83: 305\u2013316.","journal-title":"Lab Invest"},{"key":"1787_CR4","doi-asserted-by":"publisher","first-page":"311","DOI":"10.1038\/ng1106","volume":"33","author":"SD Patterson","year":"2003","unstructured":"Patterson SD, Aebersold RH: Proteomics: the first decade and beyond. Nat Genet 2003, 33: 311\u2013323. 10.1038\/ng1106","journal-title":"Nat Genet"},{"key":"1787_CR5","doi-asserted-by":"publisher","first-page":"1321","DOI":"10.1083\/jcb.151.6.1321","volume":"151","author":"YW Chen","year":"2000","unstructured":"Chen YW, Zhao P, Borup R, Hoffman EP: Expression profiling in the muscular dystrophies: identification of novel aspects of molecular pathophysiology. J Cell Biol 2000, 151: 1321\u20131336. 10.1083\/jcb.151.6.1321","journal-title":"J Cell Biol"},{"key":"1787_CR6","doi-asserted-by":"publisher","first-page":"228","DOI":"10.1016\/j.nbd.2006.03.004","volume":"23","author":"E Sterrenburg","year":"2006","unstructured":"Sterrenburg E, van der Wees CG, White SJ, Turk R, de Menezes RX, van Ommen GJ, den Dunnen JT, t Hoen PA: Gene expression profiling highlights defective myogenesis in DMD patients and a possible role for bone morphogenetic protein 4. Neurobiol Dis 2006, 23: 228\u2013236. 10.1016\/j.nbd.2006.03.004","journal-title":"Neurobiol Dis"},{"key":"1787_CR7","doi-asserted-by":"publisher","first-page":"636","DOI":"10.1053\/j.ajkd.2003.12.028","volume":"43","author":"HJ Baelde","year":"2004","unstructured":"Baelde HJ, Eikmans M, Doran PP, Lappin DW, de Heer E, Bruijn JA: Gene expression profiling in glomeruli from human kidneys with diabetic nephropathy. Am J Kidney Dis 2004, 43: 636\u2013650. 
10.1053\/j.ajkd.2003.12.028","journal-title":"Am J Kidney Dis"},{"key":"1787_CR8","doi-asserted-by":"publisher","first-page":"3507","DOI":"10.1210\/jc.2006-0274","volume":"91","author":"L Puricelli","year":"2006","unstructured":"Puricelli L, Iori E, Millioni R, Arrigoni G, James P, Vedovato M, Tessari P: Proteome analysis of cultured fibroblasts from type 1 diabetic patients and normal subjects. J Clin Endocrinol Metab 2006, 91: 3507\u20133514. 10.1210\/jc.2006-0274","journal-title":"J Clin Endocrinol Metab"},{"key":"1787_CR9","doi-asserted-by":"publisher","first-page":"973","DOI":"10.1093\/rheumatology\/keh224","volume":"43","author":"MG Barnes","year":"2004","unstructured":"Barnes MG, Aronow BJ, Luyrink LK, Moroldo MB, Pavlidis P, Passo MH, Grom AA, Hirsch R, Giannini EH, Colbert RA, Glass DN, Thompson SD: Gene expression in juvenile arthritis and spondyloarthropathy: pro-angiogenic ELR+ chemokine genes relate to course of arthritis. Rheumatology 2004, 43: 973\u2013979. 10.1093\/rheumatology\/keh224","journal-title":"Rheumatology"},{"key":"1787_CR10","doi-asserted-by":"publisher","first-page":"993","DOI":"10.1016\/S0022-2828(03)00179-2","volume":"35","author":"J Ma","year":"2003","unstructured":"Ma J, Liew CC: Gene profiling identifies secreted protein transcripts from peripheral blood cells in coronary artery disease. J Mol Cell Cardiol 2003, 35: 993\u2013998. 10.1016\/S0022-2828(03)00179-2","journal-title":"J Mol Cell Cardiol"},{"key":"1787_CR11","doi-asserted-by":"publisher","first-page":"530","DOI":"10.1038\/415530a","volume":"415","author":"LJ van't Veer","year":"2002","unstructured":"van't Veer LJ, Dai HY, van de Vijver MJ, He YDD, Hart AAM, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, Schreiber GJ, Kerkhoven RM, Roberts C, Linsely PS, Bernards R, Friend SH: Gene expression profiling predicts clinical outcome of breast cancer. Nature 2002, 415: 530\u2013536. 
10.1038\/415530a","journal-title":"Nature"},{"key":"1787_CR12","doi-asserted-by":"publisher","first-page":"68","DOI":"10.1038\/nm0102-68","volume":"8","author":"MA Shipp","year":"2002","unstructured":"Shipp MA, Ross KN, Tamayo P, Weng AP, Kutok JL, Aguiar RCT, Gaasenbeek M, Angelo M, Reich M, Pinkus GS, Ray TS, Konal MA, Last KW, Norton A, Lister TA, Mesirov J, Neuberg DS, Lander ES, Aster JC, Golub TR: Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning. Nat Med 2002, 8: 68\u201374. 10.1038\/nm0102-68","journal-title":"Nat Med"},{"key":"1787_CR13","doi-asserted-by":"publisher","first-page":"1929","DOI":"10.1091\/mbc.02-02-0023.","volume":"13","author":"X Chen","year":"2002","unstructured":"Chen X, Cheung ST, So S, Fan ST, Barry C, Higgins J, Lai KM, Ji J, Dudoit S, Ng IO, van de Rijn M, Botstein D, Brown PO: Gene expression patterns in human liver cancers. Mol Biol Cell 2002, 13: 1929\u20131939. 10.1091\/mbc.02-02-0023.","journal-title":"Mol Biol Cell"},{"key":"1787_CR14","doi-asserted-by":"publisher","first-page":"13784","DOI":"10.1073\/pnas.241500798","volume":"98","author":"ME Garber","year":"2001","unstructured":"Garber ME, Troyanskaya OG, Schluens K, Petersen S, Thaesler Z, Pacyna-Gengelbach M, van de Rijn M, Rosen GD, Perou CM, Whyte RI, Altman RB, Brown PO, Botstein D, Petersen I: Diversity of gene expression in adenocarcinoma of the lung. Proc Natl Acad Sci USA 2001, 98: 13784\u201313789. 10.1073\/pnas.241500798","journal-title":"Proc Natl Acad Sci USA"},{"key":"1787_CR15","doi-asserted-by":"publisher","first-page":"4587","DOI":"10.1038\/sj.onc.1205570","volume":"21","author":"T Crnogorac-Jurcevic","year":"2002","unstructured":"Crnogorac-Jurcevic T, Efthimiou E, Nielsen T, Loader J, Terris B, Stamp G, Baron A, Scarpa A, Lemoine NR: Expression profiling of microdissected pancreatic adenocarcinomas. Oncogene 2002, 21: 4587\u20134594. 
10.1038\/sj.onc.1205570","journal-title":"Oncogene"},{"key":"1787_CR16","doi-asserted-by":"publisher","first-page":"536","DOI":"10.1038\/35020115","volume":"406","author":"M Bittner","year":"2000","unstructured":"Bittner M, Meltzer P, Chen Y, Jiang Y, Seftor E, Hendrix M, Radmacher M, Simon R, Yakhini Z, Ben-Dor A, Sampas N, Dougherty E, Wang E, Marincola F, Gooden C, Lueders J, Glatfelter A, Pollock P, Carpten J, Gillanders E, Leja D, Dietrich K, Beaudry C, Berens M, Alberts D, Sondak V, Hayward N, Trent J: Molecular classification of cutaneous malignant melanoma by gene expression profiling. Nature 2000, 406: 536\u2013540. 10.1038\/35020115","journal-title":"Nature"},{"key":"1787_CR17","doi-asserted-by":"publisher","first-page":"1599","DOI":"10.1038\/sj.bjc.6601326","volume":"89","author":"R Simon","year":"2003","unstructured":"Simon R: Diagnostic and prognostic prediction using gene expression profiles in high-dimensional microarray data. Br J Cancer 2003, 89: 1599\u20131604. 10.1038\/sj.bjc.6601326","journal-title":"Br J Cancer"},{"key":"1787_CR18","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1198\/016214502753479248","volume":"97","author":"S Dudoit","year":"2002","unstructured":"Dudoit S, Fridlyand J, Speed TP: Comparison of discrimination methods for the classification of tumors using gene expression data. J Am Stat Assoc 2002, 97: 77\u201387. 10.1198\/016214502753479248","journal-title":"J Am Stat Assoc"},{"key":"1787_CR19","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1089\/106652700750050943","volume":"7","author":"A Ben-Dor","year":"2000","unstructured":"Ben-Dor A, Bruhn L, Friedman N, Nachman I, Schummer M, Yakhini Z: Tissue classification with gene expression profiles. J Comput Biol 2000, 7: 559\u2013583. 
10.1089\/106652700750050943","journal-title":"J Comput Biol"},{"key":"1787_CR20","doi-asserted-by":"publisher","first-page":"1157","DOI":"10.1162\/153244303322753616","volume":"3","author":"I Guyon","year":"2003","unstructured":"Guyon I, Elisseeff A: An introduction to variable and feature selection. J Mach Learn Res 2003, 3: 1157\u20131182. 10.1162\/153244303322753616","journal-title":"J Mach Learn Res"},{"key":"1787_CR21","doi-asserted-by":"publisher","first-page":"5116","DOI":"10.1073\/pnas.091062498","volume":"98","author":"VG Tusher","year":"2001","unstructured":"Tusher VG, Tibshirani R, Chu G: Significance analysis of microarrays applied to the ionizing radiation response. Proc Natl Acad Sci USA 2001, 98: 5116\u20135121. 10.1073\/pnas.091062498","journal-title":"Proc Natl Acad Sci USA"},{"key":"1787_CR22","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1023\/A:1012487302797","volume":"46","author":"I Guyon","year":"2002","unstructured":"Guyon I, Weston J, Barnhill S: Gene selection for cancer classification using support vector machines. Machine Learning 2002, 46: 389\u2013422. 10.1023\/A:1012487302797","journal-title":"Machine Learning"},{"key":"1787_CR23","doi-asserted-by":"publisher","first-page":"727","DOI":"10.2174\/1386207013330733","volume":"4","author":"L Li","year":"2001","unstructured":"Li L, Darden TA, Weinberg CR, Levine AJ, Pedersen LG: Gene assessment and sample classification for gene expression data using a genetic algorithm\/k-nearest neighbor method. Comb Chem High Throughput Screen 2001, 4: 727\u2013739.","journal-title":"Comb Chem High Throughput Screen"},{"key":"1787_CR24","doi-asserted-by":"publisher","first-page":"2691","DOI":"10.1093\/bioinformatics\/bti419","volume":"21","author":"JJ Liu","year":"2005","unstructured":"Liu JJ, Cutler G, Li W, Pan Z, Peng S, Hoey T, Chen L, Ling X: Multiclass cancer classification and biomarker discovery using GA-based algorithms. Bioinformatics 2005, 21: 2691\u20132697. 
10.1093\/bioinformatics\/bti419","journal-title":"Bioinformatics"},{"key":"1787_CR25","doi-asserted-by":"publisher","first-page":"93","DOI":"10.1016\/j.febslet.2004.03.081","volume":"565","author":"JK Choi","year":"2004","unstructured":"Choi JK, Choi JY, Kim DG, Choi DW, Kim BY, Lee KH, Yeom YI, Yoo HS, Yoo OJ, Kim S: Integrative analysis of multiple gene expression profiles applied to liver cancer study. FEBS Lett 2004, 565: 93\u2013100. 10.1016\/j.febslet.2004.05.087","journal-title":"FEBS Lett"},{"key":"1787_CR26","doi-asserted-by":"publisher","first-page":"15545","DOI":"10.1073\/pnas.0506580102","volume":"102","author":"A Subramanian","year":"2005","unstructured":"Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA 2005, 102: 15545\u201315550. 10.1073\/pnas.0506580102","journal-title":"Proc Natl Acad Sci USA"},{"key":"1787_CR27","doi-asserted-by":"publisher","first-page":"393","DOI":"10.1126\/science.1086384","volume":"302","author":"NO Fortunel","year":"2003","unstructured":"Fortunel NO, Otu HH, Ng HH, Chen J, Mu X, Chevassut T, Li X, Joseph M, Bailey C, Hatzfeld JA, Hatzfield A, Usta F, Vega VB, Long PM, Libermann TA, Lim B: Comment on \" 'Stemness': transcriptional profiling of embryonic and adult stem cells\" and \"a stem cell molecular signature\". Science 2003, 302: 393. 10.1126\/science.1086384","journal-title":"Science"},{"key":"1787_CR28","doi-asserted-by":"publisher","first-page":"214","DOI":"10.1186\/1471-2105-6-214","volume":"6","author":"SO Zakharkin","year":"2005","unstructured":"Zakharkin SO, Kim K, Mehta T, Chen L, Barnes S, Scheirer KE, Parrish RS, Allison DB, Page GP: Sources of variation in Affymetrix microarray experiments. BMC Bioinformatics 2005, 6: 214. 
10.1186\/1471-2105-6-214","journal-title":"BMC Bioinformatics"},{"key":"1787_CR29","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1186\/1471-2164-6-71","volume":"6","author":"H Wang","year":"2005","unstructured":"Wang H, He X, Band M, Wilson C, Liu L: A study of inter-lab and inter-platform agreement of DNA microarray data. BMC Genomics 2005, 6: 71. 10.1186\/1471-2164-6-71","journal-title":"BMC Genomics"},{"key":"1787_CR30","doi-asserted-by":"publisher","first-page":"488","DOI":"10.1016\/S0140-6736(05)17866-0","volume":"365","author":"S Michiels","year":"2005","unstructured":"Michiels S, Koscielny S, Hill C: Prediction of cancer outcome with microarrays: a multiple random validation strategy. Lancet 2005, 365: 488\u2013492. 10.1016\/S0140-6736(05)17866-0","journal-title":"Lancet"},{"key":"1787_CR31","doi-asserted-by":"publisher","first-page":"407","DOI":"10.1186\/1471-2105-7-407","volume":"7","author":"SG Baker","year":"2006","unstructured":"Baker SG, Kramer BS: Identifying genes that contribute most to good classification in microarrays. BMC Bioinformatics 2006, 7: 407. 10.1186\/1471-2105-7-407","journal-title":"BMC Bioinformatics"},{"key":"1787_CR32","doi-asserted-by":"publisher","first-page":"171","DOI":"10.1093\/bioinformatics\/bth469","volume":"21","author":"L Ein-Dor","year":"2005","unstructured":"Ein-Dor L, Kela I, Getz G, Givol D, Domany E: Outcome signature genes in breast cancer: is there a unique set? Bioinformatics 2005, 21: 171\u2013178. 10.1093\/bioinformatics\/bth469","journal-title":"Bioinformatics"},{"key":"1787_CR33","doi-asserted-by":"publisher","first-page":"97","DOI":"10.1186\/1471-2105-6-97","volume":"6","author":"LR Grate","year":"2005","unstructured":"Grate LR: Many accurate small-discriminatory feature subsets exist in microarray transcript data: biomarker discovery. BMC Bioinformatics 2005, 6: 97. 
10.1186\/1471-2105-6-97","journal-title":"BMC Bioinformatics"},{"key":"1787_CR34","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1177\/117693510600200011","volume":"2","author":"ER Dougherty","year":"2006","unstructured":"Dougherty ER, Brun M: On the Number of Close-to-Optimal Feature Sets. Cancer Informatics 2006, 2: 189\u2013196.","journal-title":"Cancer Informatics"},{"issue":"38","key":"1787_CR35","doi-asserted-by":"publisher","first-page":"13550","DOI":"10.1073\/pnas.0506230102","volume":"102","author":"LD Miller","year":"2005","unstructured":"Miller LD, Smeds J, George J, Vega VB, Vergara L, Ploner A, Pawitan Y, Hall P, Klaar S, Liu ET, Bergh J: An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival. Proc Natl Acad Sci U S A 2005, 102(38):13550\u201313555. 10.1073\/pnas.0506230102","journal-title":"Proc Natl Acad Sci U S A"},{"key":"1787_CR36","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1016\/S0140-6736(05)70933-8","volume":"365","author":"Y Wang","year":"2005","unstructured":"Wang Y, Klijn JGM, Zhang Y, Sieuwerts AM, Look MP, Yang F, Talantov D, Timmermans M, Meijer-van Gelder ME, Yu J, Jatkoe T, Berns EMJJ, Atkins D, Foekens JA: Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet 2005, 365: 671\u201379.","journal-title":"Lancet"},{"key":"1787_CR37","doi-asserted-by":"publisher","first-page":"1999","DOI":"10.1056\/NEJMoa021967","volume":"347","author":"MJ Van de Vijver","year":"2002","unstructured":"Van de Vijver MJ, He YD, Van 't veer LJ, Dai H, Hart AAM, Voskuil DW, Schreiber GJ, Peterse JL, Roberts C, Marton MJ, Parrish M, Atsma D, Wittevenn A, Glas A, Delahaye L, Van der velde T, Bartelink H, Rodenhuis S, Rutgers ET, Friend SH, Bernards R: A gene-expression signature as a predictor of survival in breast cancer. New Engl J Med 2002, 347: 1999\u20132009. 
10.1056\/NEJMoa021967","journal-title":"New Engl J Med"},{"key":"1787_CR38","doi-asserted-by":"publisher","first-page":"8418","DOI":"10.1073\/pnas.0932692100","volume":"100","author":"T Sorlie","year":"2003","unstructured":"Sorlie T, Tibshirani R, Parker J, Hastie T, Marron JS, Nobel A, Deng S, Johnsen H, Pesich R, Geisler S, Demeter J, Perou CM, Lonning PE, Brown PO, Borresen-Dale A, Botstein D: Repeated observation of breast tumor subtypes in independent gene expression datasets. Proc Natl Acad Sci USA 2003, 100: 8418\u20138423. 10.1073\/pnas.0932692100","journal-title":"Proc Natl Acad Sci USA"},{"key":"1787_CR39","doi-asserted-by":"publisher","first-page":"1851","DOI":"10.1182\/blood-2004-07-2947","volume":"105","author":"S Monti","year":"2005","unstructured":"Monti S, Savage KJ, Kutok JL, Feuerhake F, Kurtin P, Mihm M, Wu B, Pasqualucci L, Neuberg D, Aguiar RCT, Dal Cin P, Ladd C, Pinkus GS, Salles G, Harris NL, Dalla-Favera R, Habermann TM, Aster JC, Golub TR, Shipp MA: Molecular profiling of diffuse large B-cell lymphoma identifies robust subtypes including one characterized by host inflammatory response. Blood 2005, 105: 1851\u20131861. 10.1182\/blood-2004-07-2947","journal-title":"Blood"},{"key":"1787_CR40","doi-asserted-by":"publisher","first-page":"2419","DOI":"10.1056\/NEJMoa055351","volume":"354","author":"M Hummel","year":"2006","unstructured":"Hummel M, Bentink S, Berger H, Klapper W, Wessendorf S, Barth TFE, Bernd H, Cogliatti SB, Dierlamm J, Feller AC, Hansmann M, Haralambieva E, Harder L, Hasenclever D, Kuhn M, Lenze D, Lichter P, Martin-Subero JI, Moller P, Muller-Hermelink H, Ott G, Parwaresh RM, Pott C, Rosenwald A, Rosolowski M, Schwaenen C, Sturzenhofecker B, Szczepanowski M, Trautmann H, Wacker H, Spang R, Loeffler M, Trumper L, Stein H, Siebert R: A biologic definition of Burkitt's lymphoma from transcriptional and genomic profiling. New Engl J Med 2006, 354: 2419\u20132430. 
10.1056\/NEJMoa055351","journal-title":"New Engl J Med"},{"issue":"1","key":"1787_CR41","doi-asserted-by":"publisher","first-page":"e13","DOI":"10.1371\/journal.pmed.0030013","volume":"3","author":"H Zhao","year":"2005","unstructured":"Zhao H, Ljungberg B, Grankvist K, Rasmuson T, Tibshirani R, Brooks JD: Gene expression profiling predicts survival in conventional renal cell carcinoma. PLoS Med 2005, 3(1):e13-e13. 10.1371\/journal.pmed.0030013","journal-title":"PLoS Med"},{"key":"1787_CR42","doi-asserted-by":"publisher","first-page":"1252","DOI":"10.1093\/bioinformatics\/btg150","volume":"19","author":"E Bura","year":"2003","unstructured":"Bura E, Pfeiffer RM: Graphical methods for class prediction using dimension reduction techniques on DNA microarray data. Bioinformatics 2003, 19: 1252\u20131258. 10.1093\/bioinformatics\/btg150","journal-title":"Bioinformatics"},{"key":"1787_CR43","first-page":"5979","volume":"61","author":"S Gruvberger","year":"2001","unstructured":"Gruvberger S, Ringner M, Chen Y, Panavally S, Sall LH, Borg A, Ferno M, Peterson C, Meltzer PS: Estrogen receptor status in breast cancer is associated with remarkably distinct gene expression patterns. Cancer Res 2001, 61: 5979\u20135984.","journal-title":"Cancer Res"},{"key":"1787_CR44","doi-asserted-by":"publisher","first-page":"833","DOI":"10.1210\/me.2004-0486","volume":"19","author":"L Bjornstrom","year":"2005","unstructured":"Bjornstrom L, Sjoberg M: Mechanisms of estrogen receptor signaling: Convergence of genomic and nongenomic actions on target genes. Mol Endocrinol 2005, 19: 833\u2013842. 10.1210\/me.2004-0486","journal-title":"Mol Endocrinol"},{"key":"1787_CR45","doi-asserted-by":"publisher","first-page":"6947","DOI":"10.1073\/pnas.93.14.6947","volume":"93","author":"CC Chang","year":"1996","unstructured":"Chang CC, Ye BH, Chaganti RSK, Dalla-Favera R: BCL-6 a POZ\/zinc-finger protein, is a sequence-specific transcriptional repressor. Proc Natl Acad Sci USA 1996, 93: 6947\u20136952. 
10.1073\/pnas.93.14.6947","journal-title":"Proc Natl Acad Sci USA"},{"key":"1787_CR46","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1182\/blood.V86.1.45.bloodjournal86145","volume":"86","author":"G Cattoretti","year":"1995","unstructured":"Cattoretti G, Chang CC, Cechova K, Zhang J, Ye BH, Falini B, Louie DC, Offit K, Chaganti RSK, Dalla-Favera R: BCL-6 protein is expressed in germinal-center B cells. Blood 1995, 86: 45\u201353.","journal-title":"Blood"},{"key":"1787_CR47","doi-asserted-by":"publisher","first-page":"275","DOI":"10.1182\/blood-2003-05-1545","volume":"103","author":"CP Hans","year":"2004","unstructured":"Hans CP, Weisenburger DD, Greiner TC, Gascoyne RD, Delabie J, Ott G, Muller-Hermelink HK, Campo E, Braziel RM, Jaffe ES, Pan Z, Farinha P, Smith LM, Falini B, Banham AH, Rosenwald A, Staudt LM, Connors JM, Armitage JO, Chan WC: Confirmation of the molecular classification of diffuse large B-cell lymphoma by immunohistochemistry using a tissue microarray. Blood 2004, 103: 275\u2013281. 10.1182\/blood-2003-05-1545","journal-title":"Blood"},{"key":"1787_CR48","doi-asserted-by":"publisher","first-page":"945","DOI":"10.1182\/blood.V98.4.945","volume":"98","author":"IS Lossos","year":"2001","unstructured":"Lossos IS, Jones CD, Warnke R, Natkunam Y, Kaizer H, Zehnder JL, Tibshirani R, Levy R: Expression of a single gene, BCL-6, strongly predicts survival in patients with diffuse large B-cell lymphoma. Blood 2001, 98: 945\u2013951. 10.1182\/blood.V98.4.945","journal-title":"Blood"},{"key":"1787_CR49","doi-asserted-by":"publisher","first-page":"1572","DOI":"10.1136\/bmj.317.7172.1572","volume":"317","author":"JM Bland","year":"1998","unstructured":"Bland JM, Altman DG: Survival probabilities (the Kaplan-Meier method). 
BMJ 1998, 317: 1572.","journal-title":"BMJ"},{"key":"1787_CR50","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1186\/1471-2164-6-71","volume":"6","author":"H Wang","year":"2005","unstructured":"Wang H, He X, Band M, Wilson C, Liu L: A study of inter-lab and inter-platform agreement of DNA microarray data. BMC Genomics 2005, 6: 71. 10.1186\/1471-2164-6-71","journal-title":"BMC Genomics"},{"key":"1787_CR51","doi-asserted-by":"publisher","first-page":"e74","DOI":"10.1093\/nar\/gnh071","volume":"32","author":"BH Mecham","year":"2004","unstructured":"Mecham BH, Klus GT, Strovel J, Augustus M, Byrne D, Bozso P, Wetmore DZ, Mariani TJ, Kohane IS, Szallasi Z: Sequence-matched probes produce increased cross-platform consistency and more reproducible biological results in microarray-based gene expression measurements. Nucleic Acids Res 2004, 32: e74. 10.1093\/nar\/gnh071","journal-title":"Nucleic Acids Res"},{"key":"1787_CR52","first-page":"4427","volume":"62","author":"DR Rhodes","year":"2002","unstructured":"Rhodes DR, Barrette TR, Rubin MA, Ghosh D, Chinnaiyan AM: Meta-analysis of microarrays: Interstudy validation of gene expression profiles reveals pathway deregulation in prostate cancer. Cancer Res 2002, 62: 4427\u20134433.","journal-title":"Cancer Res"},{"key":"1787_CR53","doi-asserted-by":"publisher","first-page":"i84","DOI":"10.1093\/bioinformatics\/btg1010","volume":"19","author":"JK Choi","year":"2003","unstructured":"Choi JK, Yu U, Kim S, Yoo OJ: Combining multiple microarray studies and modeling interstudy variation. Bioinformatics 2003, 19: i84-i90. 10.1093\/bioinformatics\/btg1010","journal-title":"Bioinformatics"},{"key":"1787_CR54","doi-asserted-by":"publisher","first-page":"2292","DOI":"10.1158\/1078-0432.CCR-03-0490","volume":"10","author":"G Parmigiani","year":"2004","unstructured":"Parmigiani G, Garrett-Mayer ES, Anbazhagan R, Gabrielson E: A cross-study comparison of gene expression studies for the molecular classification of lung cancer. 
Clin Cancer Res 2004, 10: 2292\u20132927. 10.1158\/1078-0432.CCR-03-0490","journal-title":"Clin Cancer Res"},{"key":"1787_CR55","doi-asserted-by":"publisher","first-page":"265","DOI":"10.1186\/1471-2105-6-265","volume":"6","author":"P Warnat","year":"2005","unstructured":"Warnat P, Eils R, Brors B: Cross-platform analysis of cancer microarray data improves gene expression based classification of phenotypes. BMC Bioinformatics 2005, 6: 265. 10.1186\/1471-2105-6-265","journal-title":"BMC Bioinformatics"},{"key":"1787_CR56","volume-title":"Proceedings of the 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference","author":"L Li","year":"2005","unstructured":"Li L, Chen L, Goldgof D, George F, Chen Z, Rao A, Cragun J, Sutphen R, Lancaster JM: Integration of clinical information and gene expression profiles for prediction of chemo-response for ovarian cancer. In Proceedings of the 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference. Shanghai, China; 2005."},{"key":"1787_CR57","doi-asserted-by":"crossref","unstructured":"Sun Y, Goodison S, Li J, Liu L, Farmerie W: Improved breast cancer prognosis through the combination of clinical and genetic markers. Bioinformatics 23: 30\u201337. 10.1093\/bioinformatics\/btl543","DOI":"10.1093\/bioinformatics\/btl543"},{"key":"1787_CR58","doi-asserted-by":"crossref","unstructured":"Pittman J, Huang E, Dressman H, Horng C, Cheng SH, Tsou M, Chen C, Bild A, Iversen ES, Huang AT, Nevins JR, West M: Integrated modeling of clinical and gene expression information for personalized prediction of disease outcomes. Proc Natl Acad Sci USA 101: 8431\u20138436. 10.1073\/pnas.0401736101","DOI":"10.1073\/pnas.0401736101"},{"key":"1787_CR59","doi-asserted-by":"publisher","first-page":"207","DOI":"10.1093\/nar\/30.1.207","volume":"30","author":"R Edgar","year":"2002","unstructured":"Edgar R, Domrachev M, Lash AE: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. 
Nucleic Acids Res 2002, 30: 207\u2013210. 10.1093\/nar\/30.1.207","journal-title":"Nucleic Acids Res"},{"key":"1787_CR60","doi-asserted-by":"publisher","first-page":"152","DOI":"10.1093\/nar\/29.1.152","volume":"29","author":"G Sherlock","year":"2001","unstructured":"Sherlock G, Hernandez-Boussard T, Kasarskis A, Binkley G, Matese JC, Dwight SS, Kaloper M, Weng S, Jin H, Ball CA, Eisen MB, Spellman PT, Brown PO, Botstein D, Cherry JM: The Stanford Microarray Database. Nucleic Acids Res 2001, 29: 152\u2013155. 10.1093\/nar\/29.1.152","journal-title":"Nucleic Acids Res"},{"key":"1787_CR61","doi-asserted-by":"publisher","first-page":"299","DOI":"10.2307\/1390807","volume":"3","author":"R Ihaka","year":"1996","unstructured":"Ihaka R, Gentleman RC: R: A language for data analysis and graphics. J Comput Graph Stat 1996, 3: 299\u2013314. 10.2307\/1390807","journal-title":"J Comput Graph Stat"},{"key":"1787_CR62","doi-asserted-by":"publisher","first-page":"249","DOI":"10.1093\/biostatistics\/4.2.249","volume":"4","author":"RA Irizarry","year":"2003","unstructured":"Irizarry RA, Hobbs B, Collin F, Beazer-Barclay YD, Antonellis KJ, Scherf U, Speed TP: Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics 2003, 4: 249\u2013264. 10.1093\/biostatistics\/4.2.249","journal-title":"Biostatistics"},{"key":"1787_CR63","unstructured":"Statistical Algorithms Description Document[http:\/\/www.affymetrix.com\/support\/technical\/whitepapers\/sadd_whitepaper.pdf]"},{"key":"1787_CR64","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1093\/bioinformatics\/19.2.185","volume":"19","author":"BM Bolstad","year":"2003","unstructured":"Bolstad BM, Irizarry RA, Astrand M, Speed TP: A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 2003, 19: 185\u2013193. 
10.1093\/bioinformatics\/19.2.185","journal-title":"Bioinformatics"},{"key":"1787_CR65","doi-asserted-by":"publisher","first-page":"R80","DOI":"10.1186\/gb-2004-5-10-r80","volume":"5","author":"RC Gentleman","year":"2004","unstructured":"Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JY, Zhang J: Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 2004, 5: R80. 10.1186\/gb-2004-5-10-r80","journal-title":"Genome Biol"},{"key":"1787_CR66","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1093\/bioinformatics\/17.6.520","volume":"17","author":"O Troyanskaya","year":"2001","unstructured":"Troyanskaya O, Cantor M, Sherlock G, Brown P, Hastie T, Tibshirani R, Botstein D, Altman RB: Missing value estimation methods for DNA microarrays. Bioinformatics 2001, 17: 520\u2013525. 10.1093\/bioinformatics\/17.6.520","journal-title":"Bioinformatics"},{"key":"1787_CR67","unstructured":"Liu Ting-Yuan, Lin Chen, Falcon Seth, Zhang Jianhua, MacDonald JamesW: Hgu133a: Affymetrix Human Genome U133 Set Annotation Data (hgu133a). R package version 1.14.0"},{"key":"1787_CR68","doi-asserted-by":"publisher","first-page":"219","DOI":"10.1093\/nar\/gkg014","volume":"31","author":"M Diehn","year":"2003","unstructured":"Diehn M, Sherlock G, Binkley G, Jin H, Matese JC, Hernandez-Boussard T, Rees CA, Cherry JM, Botstein D, Brown PO, Alizadeh AA: SOURCE: a unified genomic resource of functional annotations, ontologies, and gene expression data. Nucleic Acids Res 2003, 31: 219\u2013223. 
10.1093\/nar\/gkg014","journal-title":"Nucleic Acids Res"},{"key":"1787_CR69","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1093\/nar\/28.1.10","volume":"28","author":"DL Wheeler","year":"2000","unstructured":"Wheeler DL, Chappey C, Lash AE, Leipe DD, Madden TL, Schuler GD, Tatusova TA, Rapp BA: Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 2000, 28: 10\u201314. 10.1093\/nar\/28.1.10","journal-title":"Nucleic Acids Res"},{"key":"1787_CR70","unstructured":"GeneChip\u00aeExpression Analysis Data Analysis Fundamentals[http:\/\/www.affymetrix.com\/support\/downloads\/manuals\/data_analysis_fundamentals_manual.pdf]"},{"key":"1787_CR71","doi-asserted-by":"publisher","first-page":"3301","DOI":"10.1093\/bioinformatics\/bti499","volume":"21","author":"AM Molinaro","year":"2005","unstructured":"Molinaro AM, Simon R, Pfeiffer RM: Prediction error estimation: a comparison of resampling methods. Bioinformatics 2005, 21: 3301\u20133307. 10.1093\/bioinformatics\/bti499","journal-title":"Bioinformatics"},{"key":"1787_CR72","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1198\/016214502753479248","volume":"97","author":"S Dudoit","year":"2002","unstructured":"Dudoit S, Fridlyand J, Speed TP: Comparison of discrimination methods for the classification of tumors using gene expression data. J Am Stat Assoc 2002, 97: 77\u201387. 10.1198\/016214502753479248","journal-title":"J Am Stat Assoc"},{"key":"1787_CR73","unstructured":"Ingenuity Pathway Analysis [Ingenuity\u00aeSystems[http:\/\/www.ingenuity.com]"},{"key":"1787_CR74","doi-asserted-by":"publisher","first-page":"559","DOI":"10.1089\/106652700750050943","volume":"7","author":"A Ben-Dor","year":"2000","unstructured":"Ben-Dor A, Bruhn L, Friedman N, Nachman I, Schummer M, Yakhini Z: Tissue classification with gene expression profiles. J Comput Biol 2000, 7: 559\u2013583. 
10.1089\/106652700750050943","journal-title":"J Comput Biol"},{"key":"1787_CR75","doi-asserted-by":"publisher","first-page":"404","DOI":"10.1016\/j.jbi.2005.02.008","volume":"38","author":"TA Lasko","year":"2005","unstructured":"Lasko TA, Bhagwat JG, Zou KH, Ohno-Machado L: The use of receiver operating characteristic curves in biomedical informatics. J Biomed Inform 2005, 38: 404\u2013415. 10.1016\/j.jbi.2005.02.008","journal-title":"J Biomed Inform"},{"key":"1787_CR76","unstructured":"Carey V, Redestig H: ROC: utilities for ROC, with uarray focus. R package version 1.8.0 [http:\/\/www.bioconductor.org]"},{"key":"1787_CR77","volume-title":"Proceedings of the 22nd International Conference on Machine Learning","author":"SA Macskassy","year":"2005","unstructured":"Macskassy SA, Provost F, Rosset S: Confidence Bands for ROC Curves: Methods and an Empirical Study. In Proceedings of the 22nd International Conference on Machine Learning. Bonn, Germany; 2005."},{"key":"1787_CR78","doi-asserted-by":"publisher","first-page":"368","DOI":"10.1093\/bioinformatics\/btf877","volume":"19","author":"A Reiner","year":"2003","unstructured":"Reiner A, Yekutieli D, Benjamini Y: Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics 2003, 19: 368\u2013375. 
10.1093\/bioinformatics\/btf877","journal-title":"Bioinformatics"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-8-415.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T10:25:46Z","timestamp":1630491946000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-8-415"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,10,26]]},"references-count":78,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2007,12]]}},"alternative-id":["1787"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-8-415","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2007,10,26]]},"assertion":[{"value":"19 June 2007","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 October 2007","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 October 2007","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"415"}}