{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,19]],"date-time":"2026-03-19T08:09:51Z","timestamp":1773907791056,"version":"3.50.1"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2013,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Microarray technology can acquire information about thousands of genes simultaneously. We analyzed published breast cancer microarray databases to predict five-year recurrence and compared the performance of three data mining algorithms of artificial neural networks (ANN), decision trees (DT) and logistic regression (LR) and two composite models of DT-ANN and DT-LR. The collection of microarray datasets from the Gene Expression Omnibus, four breast cancer datasets were pooled for predicting five-year breast cancer relapse. After data compilation, 757 subjects, 5 clinical variables and 13,452 genetic variables were aggregated. The bootstrap method, Mann-Whitney <jats:italic>U<\/jats:italic> test and 20-fold cross-validation were performed to investigate candidate genes with 100 most-significant p-values. The predictive powers of DT, LR and ANN models were assessed using accuracy and the area under ROC curve. The associated genes were evaluated using Cox regression.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>The DT models exhibited the lowest predictive power and the poorest extrapolation when applied to the test samples. The ANN models displayed the best predictive power and showed the best extrapolation. 
The 21 most-associated genes, as determined by integration of each model, were analyzed using Cox regression with a 3.53-fold (95% CI: 2.24-5.58) increased risk of breast cancer five-year recurrence\u2026<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>The 21 selected genes can predict breast cancer recurrence. Among these genes, CCNB1, PLK1 and TOP2A are in the cell cycle\u00a0G2\/M DNA damage checkpoint pathway. Oncologists can offer the genetic information for patients when understanding the gene expression profiles on breast cancer recurrence.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-14-100","type":"journal-article","created":{"date-parts":[[2013,3,19]],"date-time":"2013-03-19T07:07:00Z","timestamp":1363676820000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":31,"title":["Gene expression profiling of breast cancer survivability by pooled cDNA microarray analysis using logistic regression, artificial neural networks and decision 
trees"],"prefix":"10.1186","volume":"14","author":[{"given":"Hsiu-Ling","family":"Chou","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chung-Tay","family":"Yao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sui-Lun","family":"Su","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chia-Yi","family":"Lee","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kuang-Yu","family":"Hu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Harn-Jing","family":"Terng","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yun-Wen","family":"Shih","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu-Tien","family":"Chang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu-Fen","family":"Lu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chi-Wen","family":"Chang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mark L","family":"Wahlqvist","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thomas","family":"Wetter","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chi-Ming","family":"Chu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2013,3,19]]},"reference":[{"key":"5760_CR1","unstructured":"American Cancer Society. 2011. 
http:\/\/www.cancer.org\/docroot\/home\/index.asp"},{"key":"5760_CR2","doi-asserted-by":"publisher","first-page":"979","DOI":"10.1093\/jnci\/93.13.979","volume":"93","author":"P Eifel","year":"2001","unstructured":"Eifel P, Axelson JA, Costa J, Crowley J, Curran WJ Jr, Deshler A, Fulton S, Hendricks CB, Kemeny M, Kornblith AB: National Institutes of Health Consensus Development Conference Statement: adjuvant therapy for breast cancer: 1-3 November 2000. J Natl Cancer Inst 2001, 93: 979-989.","journal-title":"J Natl Cancer Inst"},{"key":"5760_CR3","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1093\/jnci\/83.3.154","volume":"83","author":"WL McGuire","year":"1991","unstructured":"McGuire WL: Breast cancer prognostic factors: evaluation guidelines. J Natl Cancer Inst 1991, 83: 154-155. 10.1093\/jnci\/83.3.154","journal-title":"J Natl Cancer Inst"},{"key":"5760_CR4","doi-asserted-by":"publisher","first-page":"2356","DOI":"10.1093\/bioinformatics\/btl400","volume":"22","author":"CA Davis","year":"2006","unstructured":"Davis CA, Gerick F, Hintermair V, Friedel CC, Fundel K, Kuffner R, Zimmer R: Reliable gene signatures for microarray classification: assessment of stability and performance. Bioinformatics 2006, 22: 2356-2363. 10.1093\/bioinformatics\/btl400","journal-title":"Bioinformatics"},{"key":"5760_CR5","doi-asserted-by":"crossref","first-page":"2051","DOI":"10.1093\/clinchem\/48.11.2051","volume":"48","author":"F Gemignani","year":"2002","unstructured":"Gemignani F, Perra C, Landi S, Canzian F, Kurg A, Tonisson N, Galanello R, Cao A, Metspalu A, Romeo G: Reliable detection of beta-thalassemia and G6PD mutations by a DNA microarray. 
Clin Chem 2002, 48: 2051-2054.","journal-title":"Clin Chem"},{"key":"5760_CR6","doi-asserted-by":"publisher","first-page":"675","DOI":"10.1039\/b418765b","volume":"5","author":"O Gutmann","year":"2005","unstructured":"Gutmann O, Kuehlewein R, Reinbold S, Niekrawietz R, Steinert CP, de Heij B, Zengerle R, Daub M: Fast and reliable protein microarray production by a new drop-in-drop technique. Lab Chip 2005, 5: 675-681. 10.1039\/b418765b","journal-title":"Lab Chip"},{"key":"5760_CR7","doi-asserted-by":"publisher","first-page":"211","DOI":"10.1007\/s00109-008-0419-y","volume":"87","author":"S Lassmann","year":"2009","unstructured":"Lassmann S, Kreutz C, Schoepflin A, Hopt U, Timmer J, Werner M: A novel approach for reliable microarray analysis of microdissected tumor cells from formalin-fixed and paraffin-embedded colorectal cancer resection specimens. J Mol Med 2009, 87: 211-224. 10.1007\/s00109-008-0419-y","journal-title":"J Mol Med"},{"key":"5760_CR8","doi-asserted-by":"publisher","first-page":"10","DOI":"10.1016\/j.copbio.2007.11.003","volume":"19","author":"L Shi","year":"2008","unstructured":"Shi L, Perkins RG, Fang H, Tong W: Reproducible and reliable microarray results through quality control: good laboratory proficiency and appropriate data analysis practices are essential. Curr Opin Biotechnol 2008, 19: 10-18. 10.1016\/j.copbio.2007.11.003","journal-title":"Curr Opin Biotechnol"},{"key":"5760_CR9","doi-asserted-by":"publisher","first-page":"321","DOI":"10.1016\/j.ygeno.2003.08.008","volume":"83","author":"DL Stirewalt","year":"2004","unstructured":"Stirewalt DL, Pogosova-Agadjanyan EL, Khalid N, Hare DR, Ladne PA, Sala-Torra O, Zhao LP, Radich JP: Single-stranded linear amplification protocol results in reproducible and reliable microarray data from nanogram amounts of starting RNA. Genomics 2004, 83: 321-331. 
10.1016\/j.ygeno.2003.08.008","journal-title":"Genomics"},{"key":"5760_CR10","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1016\/S1672-0229(03)01003-9","volume":"1","author":"PJ van der Spek","year":"2003","unstructured":"van der Spek PJ, Kremer A, Murry L, Walker MG: Are gene expression microarray analyses reliable? A review of studies of retinoic acid responsive genes. Genomics Proteomics Bioinformatics 2003, 1: 9-14.","journal-title":"Genomics Proteomics Bioinformatics"},{"key":"5760_CR11","doi-asserted-by":"publisher","first-page":"3789","DOI":"10.1021\/jf048368t","volume":"53","author":"X Xu","year":"2005","unstructured":"Xu X, Li Y, Zhao H, Wen SY, Wang SQ, Huang J, Huang KL, Luo YB: Rapid and reliable detection and identification of GM events using multiplex PCR coupled with oligonucleotide microarray. J Agric Food Chem 2005, 53: 3789-3794. 10.1021\/jf048368t","journal-title":"J Agric Food Chem"},{"key":"5760_CR12","doi-asserted-by":"publisher","first-page":"3207","DOI":"10.1158\/1078-0432.CCR-06-2765","volume":"13","author":"C Desmedt","year":"2007","unstructured":"Desmedt C, Piette F, Loi S, Wang Y, Lallemand F, Haibe-Kains B, Viale G, Delorenzi M, Zhang Y, d'Assignies MS: Strong time dependence of the 76-gene prognostic signature for node-negative breast cancer patients in the TRANSBIG multicenter independent validation series. Clin Cancer Res 2007, 13: 3207-3214. 10.1158\/1078-0432.CCR-06-2765","journal-title":"Clin Cancer Res"},{"key":"5760_CR13","doi-asserted-by":"publisher","first-page":"10292","DOI":"10.1158\/0008-5472.CAN-05-4414","volume":"66","author":"AV Ivshina","year":"2006","unstructured":"Ivshina AV, George J, Senko O, Mow B, Putti TC, Smeds J, Lindahl T, Pawitan Y, Hall P, Nordgren H: Genetic reclassification of histologic grade delineates new clinical subtypes of breast cancer. Cancer Res 2006, 66: 10292-10301. 
10.1158\/0008-5472.CAN-05-4414","journal-title":"Cancer Res"},{"key":"5760_CR14","doi-asserted-by":"publisher","first-page":"262","DOI":"10.1093\/jnci\/djj052","volume":"98","author":"C Sotiriou","year":"2006","unstructured":"Sotiriou C, Wirapati P, Loi S, Harris A, Fox S, Smeds J, Nordgren H, Farmer P, Praz V, Haibe-Kains B: Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis. J Natl Cancer Inst 2006, 98: 262-272. 10.1093\/jnci\/djj052","journal-title":"J Natl Cancer Inst"},{"key":"5760_CR15","doi-asserted-by":"publisher","first-page":"671","DOI":"10.1016\/S0140-6736(05)70933-8","volume":"365","author":"Y Wang","year":"2005","unstructured":"Wang Y, Klijn JG, Zhang Y, Sieuwerts AM, Look MP, Yang F, Talantov D, Timmermans M, Meijer-van Gelder ME, Yu J: Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer. Lancet 2005, 365: 671-679.","journal-title":"Lancet"},{"key":"5760_CR16","doi-asserted-by":"publisher","first-page":"e29860","DOI":"10.1371\/journal.pone.0029860","volume":"7","author":"Y Wang","year":"2012","unstructured":"Wang Y, Sun G, Ji Z, Xing C, Liang Y: Weighted change-point method for detecting differential gene expression in breast cancer microarray data. PLoS One 2012, 7: e29860. 10.1371\/journal.pone.0029860","journal-title":"PLoS One"},{"key":"5760_CR17","doi-asserted-by":"publisher","first-page":"15545","DOI":"10.1073\/pnas.0506580102","volume":"102","author":"A Subramanian","year":"2005","unstructured":"Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005, 102: 15545-15550. 
10.1073\/pnas.0506580102","journal-title":"Proc Natl Acad Sci U S A"},{"key":"5760_CR18","volume-title":"Net Reclassification Improvement (NRI) has been proposed as an alternative to the area under the curve of the ROC","author":"A Padoan","year":"2010","unstructured":"Padoan A: Net Reclassification Improvement (NRI) has been proposed as an alternative to the area under the curve of the ROC. The MathWorks, Inc; 2010. http:\/\/www.mathworks.com\/matlabcentral\/fileexchange\/28579-net-reclassification-improvement&watching=28579 . Accessed 22. November 2012"},{"key":"5760_CR19","doi-asserted-by":"publisher","first-page":"157","DOI":"10.1002\/sim.2929","volume":"27","author":"MJ Pencina","year":"2008","unstructured":"Pencina MJ, D'Agostino RB Sr, D'Agostino RB Jr, Vasan RS: Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat Med 2008, 27: 157-172. discussion 207-212 10.1002\/sim.2929","journal-title":"Stat Med"},{"key":"5760_CR20","doi-asserted-by":"publisher","first-page":"101","DOI":"10.1002\/sim.4348","volume":"31","author":"MJ Pencina","year":"2012","unstructured":"Pencina MJ, D'Agostino RB Sr, Demler OV: Novel metrics for evaluating improvement in discrimination: net reclassification and integrated discrimination improvement for normal variables and nested models. Stat Med 2012, 31: 101-113. 10.1002\/sim.4348","journal-title":"Stat Med"},{"key":"5760_CR21","doi-asserted-by":"publisher","first-page":"11","DOI":"10.1002\/sim.4085","volume":"30","author":"MJ Pencina","year":"2011","unstructured":"Pencina MJ, D'Agostino RB Sr, Steyerberg EW: Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers. Stat Med 2011, 30: 11-21. 
10.1002\/sim.4085","journal-title":"Stat Med"},{"key":"5760_CR22","doi-asserted-by":"publisher","first-page":"397","DOI":"10.1186\/1756-0500-4-397","volume":"4","author":"SJ Beyer","year":"2011","unstructured":"Beyer SJ, Zhang X, Jimenez RE, Lee ML, Richardson AL, Huang K, Jhiang SM: Microarray analysis of genes associated with cell surface NIS protein levels in breast cancer. BMC Res Notes 2011, 4: 397. 10.1186\/1756-0500-4-397","journal-title":"BMC Res Notes"},{"key":"5760_CR23","doi-asserted-by":"publisher","first-page":"113","DOI":"10.1016\/j.artmed.2004.07.002","volume":"34","author":"D Delen","year":"2005","unstructured":"Delen D, Walker G, Kadam A: Predicting breast cancer survivability: a comparison of three data mining methods. Artif Intell Med 2005, 34: 113-127. 10.1016\/j.artmed.2004.07.002","journal-title":"Artif Intell Med"},{"key":"5760_CR24","doi-asserted-by":"publisher","first-page":"21","DOI":"10.4103\/0975-7406.92726","volume":"4","author":"R Kumar","year":"2012","unstructured":"Kumar R, Sharma A, Tiwari RK: Application of microarray in breast cancer: An overview. J Pharm Bioallied Sci 2012, 4: 21-26.","journal-title":"J Pharm Bioallied Sci"},{"key":"5760_CR25","doi-asserted-by":"publisher","first-page":"1673","DOI":"10.1002\/1097-0142(20010415)91:8+<1673::AID-CNCR1182>3.0.CO;2-T","volume":"91","author":"PB Snow","year":"2001","unstructured":"Snow PB, Kerr DJ, Brandt JM, Rodvold DM: Neural network and regression predictions of 5-year survival after colon carcinoma treatment. Cancer 2001, 91: 1673-1678. 
10.1002\/1097-0142(20010415)91:8+<1673::AID-CNCR1182>3.0.CO;2-T","journal-title":"Cancer"},{"key":"5760_CR26","volume-title":"Investigating the Models of Logistic Regression, Decision Tree, Artificial Neural Network and Hybrid Analysis for Predicting Coronary Artery Disease","author":"YH Hsu","year":"2007","unstructured":"Hsu YH: Investigating the Models of Logistic Regression, Decision Tree, Artificial Neural Network and Hybrid Analysis for Predicting Coronary Artery Disease. Taipei, Taiwan: Master Thesis of National Defense Medical Center; 2007."},{"key":"5760_CR27","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1186\/1471-2105-9-125","volume":"9","author":"L Xu","year":"2008","unstructured":"Xu L, Tan AC, Winslow RL, Geman D: Merging microarray data from separate breast cancer studies provides a robust prognostic test. BMC Bioinformatics 2008, 9: 125. 10.1186\/1471-2105-9-125","journal-title":"BMC Bioinformatics"},{"key":"5760_CR28","doi-asserted-by":"publisher","first-page":"R953","DOI":"10.1186\/bcr1325","volume":"7","author":"Y Pawitan","year":"2005","unstructured":"Pawitan Y, Bjohle J, Amler L, Borg AL, Egyhazi S, Hall P, Han X, Holmberg L, Huang F, Klaar S: Gene expression profiling spares early breast cancer patients from adjuvant therapy: derived and validated in two population-based cohorts. Breast Cancer Res 2005, 7: R953-R964. 
10.1186\/bcr1325","journal-title":"Breast Cancer Res"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-14-100.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T22:48:42Z","timestamp":1630536522000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-14-100"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,3,19]]},"references-count":28,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2013,12]]}},"alternative-id":["5760"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-14-100","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2013,3,19]]},"assertion":[{"value":"12 July 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 February 2013","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 March 2013","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"100"}}