{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,20]],"date-time":"2026-02-20T22:07:59Z","timestamp":1771625279822,"version":"3.50.1"},"reference-count":19,"publisher":"Springer Science and Business Media LLC","issue":"S2","license":[{"start":{"date-parts":[[2005,7,1]],"date-time":"2005-07-01T00:00:00Z","timestamp":1120176000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/2.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2005,7]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Systematic evaluation and study of single nucleotide polymorphisms (SNPs) made possible by high throughput genotyping technologies and bioinformatics promises to provide breakthroughs in the understanding of complex diseases. Understanding how the millions of SNPs in the human genome are involved in conferring susceptibility or resistance to disease, or in rendering a drug efficacious or toxic in the individual is a major goal of the relatively new fields of pharmacogenomics. Esophageal squamous cell carcinoma is a high-mortality cancer with complex etiology and progression involving both genetic and environmental factors. We examined the association between esophageal cancer risk and patterns of 61 SNPs in a case-control study for a population from Shanxi Province in North Central China that has among the highest rates of esophageal squamous cell carcinoma in the world.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Methods<\/jats:title>\n            <jats:p>High-throughput Masscode mass spectrometry genotyping was done on genomic DNA from 574 individuals (394 cases and 180 age-frequency matched controls). SNPs were chosen from among genes involving DNA repair enzymes, and Phase I and Phase II enzymes.<\/jats:p>\n            <jats:p>We developed a novel adaptation of the Decision Forest pattern recognition method named Decision Forest for SNPs (DF-SNPs). The method was designated to analyze the SNP data.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>The classifier in separating the cases from the controls developed with DF-SNPs gave concordance, sensitivity and specificity, of 94.7%, 99.0% and 85.1%, respectively; suggesting its usefulness for hypothesizing what SNPs or combinations of SNPs could be involved in susceptibility to esophageal cancer. Importantly, the DF-SNPs algorithm incorporated a randomization test for assessing the relevance (or importance) of individual SNPs, SNP types (Homozygous common, heterozygous and homozygous variant) and patterns of SNP types (SNP patterns) that differentiate cases from controls. For example, we found that the different genotypes of SNP GADD45B E1122 are all associated with cancer risk.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>The DF-SNPs method can be used to differentiate esophageal squamous cell carcinoma cases from controls based on individual SNPs, SNP types and SNP patterns. The method could be useful to identify potential biomarkers from the SNP data and complement existing methods for genotype analyses.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-6-s2-s4","type":"journal-article","created":{"date-parts":[[2005,7,16]],"date-time":"2005-07-16T09:00:41Z","timestamp":1121504441000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":48,"title":["Decision Forest Analysis of 61 Single Nucleotide Polymorphisms in a Case-Control Study of Esophageal Cancer; a novel method"],"prefix":"10.1186","volume":"6","author":[{"given":"Qian","family":"Xie","sequence":"first","affiliation":[]},{"given":"Luke D","family":"Ratnasinghe","sequence":"additional","affiliation":[]},{"given":"Huixiao","family":"Hong","sequence":"additional","affiliation":[]},{"given":"Roger","family":"Perkins","sequence":"additional","affiliation":[]},{"given":"Ze-Zhong","family":"Tang","sequence":"additional","affiliation":[]},{"given":"Nan","family":"Hu","sequence":"additional","affiliation":[]},{"given":"Philip R","family":"Taylor","sequence":"additional","affiliation":[]},{"given":"Weida","family":"Tong","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2005,7,15]]},"reference":[{"issue":"1","key":"662_CR1","doi-asserted-by":"publisher","first-page":"308","DOI":"10.1093\/nar\/29.1.308","volume":"29","author":"ST Sherry","year":"2001","unstructured":"Sherry ST, Ward MH, Kholodov M, Baker J, Phan L, Smigielski EM, Sirotkin K: dbSNP: the NCBI database of genetic variation. Nucleic Acids Res 2001, 29(1):308\u2013311. 10.1093\/nar\/29.1.308","journal-title":"Nucleic Acids Res"},{"issue":"1","key":"662_CR2","doi-asserted-by":"publisher","first-page":"124","DOI":"10.1086\/338007","volume":"70","author":"HJ Cordell","year":"2002","unstructured":"Cordell HJ, Clayton DG: A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case\/control or family data: application to HLA in type 1 diabetes. Am J Hum Genet 2002, 70(1):124\u2013141. 10.1086\/338007","journal-title":"Am J Hum Genet"},{"issue":"5","key":"662_CR3","doi-asserted-by":"publisher","first-page":"1231","DOI":"10.1086\/375140","volume":"72","author":"LP Zhao","year":"2003","unstructured":"Zhao LP, Li SS, Khalid N: A method for the assessment of disease associations with single-nucleotide polymorphism haplotypes and environmental variables in case-control studies. Am J Hum Genet 2003, 72(5):1231\u20131250. 10.1086\/375140","journal-title":"Am J Hum Genet"},{"issue":"1","key":"662_CR4","doi-asserted-by":"publisher","first-page":"30","DOI":"10.1186\/1471-2105-4-30","volume":"4","author":"RM Hubley","year":"2003","unstructured":"Hubley RM, Zitzler E, Roach JC: Evolutionary algorithms for the selection of single nucleotide polymorphisms. BMC Bioinformatics 2003, 4(1):30. 10.1186\/1471-2105-4-30","journal-title":"BMC Bioinformatics"},{"issue":"6909","key":"662_CR5","doi-asserted-by":"publisher","first-page":"832","DOI":"10.1038\/nature01140","volume":"419","author":"PC Sabeti","year":"2002","unstructured":"Sabeti PC, Reich DE, Higgins JM, Levine HZ, Richter DJ, Schaffner SF, Gabriel SB, Platko JV, Patterson NJ, McDonald GJ, et al.: Detecting recent positive selection in the human genome from haplotype structure. Nature 2002, 419(6909):832\u2013837. 10.1038\/nature01140","journal-title":"Nature"},{"issue":"4","key":"662_CR6","doi-asserted-by":"publisher","first-page":"513","DOI":"10.1093\/biostatistics\/4.4.513","volume":"4","author":"SS Li","year":"2003","unstructured":"Li SS, Khalid N, Carlson C, Zhao LP: Estimating haplotype frequencies and standard errors for multiple single nucleotide polymorphisms. Biostatistics 2003, 4(4):513\u2013522. 10.1093\/biostatistics\/4.4.513","journal-title":"Biostatistics"},{"issue":"2","key":"662_CR7","doi-asserted-by":"publisher","first-page":"425","DOI":"10.1086\/338688","volume":"70","author":"DJ Schaid","year":"2002","unstructured":"Schaid DJ, Rowland CM, Tines DE, Jacobson RM, Poland GA: Score tests for association between traits and haplotypes when linkage phase is ambiguous. Am J Hum Genet 2002, 70(2):425\u2013434. 10.1086\/338688","journal-title":"Am J Hum Genet"},{"issue":"1","key":"662_CR8","doi-asserted-by":"publisher","first-page":"143","DOI":"10.1101\/gr.148401","volume":"11","author":"D Fallin","year":"2001","unstructured":"Fallin D, Cohen A, Essioux L, Chumakov I, Blumenfeld M, Cohen D, Schork NJ: Genetic analysis of case\/control data using estimated haplotype frequencies: application to APOE locus variation and Alzheimer's disease. Genome Res 2001, 11(1):143\u2013151. 10.1101\/gr.148401","journal-title":"Genome Res"},{"issue":"2","key":"662_CR9","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1002\/jso.20070","volume":"87","author":"A Kleespies","year":"2004","unstructured":"Kleespies A, Guba M, Jauch KW, CJ B: Vascular endothelial growth factor in esophageal cancer. J Surg Oncol 2004, 87(2):95\u2013104. 10.1002\/jso.20070","journal-title":"J Surg Oncol"},{"key":"662_CR10","doi-asserted-by":"publisher","first-page":"2241","DOI":"10.1056\/NEJMra035010","volume":"349","author":"PC Enzinger","year":"2003","unstructured":"Enzinger PC, Mayer RJ: Esophageal Cancer. N Engl J Med 2003, 349: 2241\u20132252. 10.1056\/NEJMra035010","journal-title":"N Engl J Med"},{"issue":"3","key":"662_CR11","doi-asserted-by":"publisher","first-page":"475","DOI":"10.1002\/ijc.20619","volume":"113","author":"CH Lee","year":"2005","unstructured":"Lee CH, Lee JM, Wu DC, Hsu HK, Kao EL, Huang HL, Wang TN, Huang MC, Wu MT: Independent and combined effects of alcohol intake, tobacco smoking and betel quid chewing on the risk of esophageal cancer in Taiwan. Int J Cancer 2005, 113(3):475\u2013482. 10.1002\/ijc.20619","journal-title":"Int J Cancer"},{"key":"662_CR12","doi-asserted-by":"publisher","first-page":"721","DOI":"10.1023\/A:1011290704728","volume":"12","author":"AH Wu","year":"2001","unstructured":"Wu AH, Wan P, Bernstein L: A multiethnic population-based study of smoking, alcohol and body size and risk of adenocarcinoma of the stomach and esophagus. Cancer Causes Control 2001, 12: 721\u2013732. 10.1023\/A:1011290704728","journal-title":"Cancer Causes Control"},{"key":"662_CR13","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1093\/aje\/153.2.114","volume":"153","author":"LM Brown","year":"2001","unstructured":"Brown LM, Hoover R, Silverman D: Excess incidence of squamous cell esophageal cancer among US Black men: role of social class and other risk factors. Am J Epidemiol 2001, 153: 114\u2013122. 10.1093\/aje\/153.8.815","journal-title":"Am J Epidemiol"},{"key":"662_CR14","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-2719-7","volume-title":"Modern Applied Statistics with S-PLUS","author":"WN Venables","year":"1997","unstructured":"Venables WN, Ripley BD: Modern Applied Statistics with S-PLUS. Volume 14. 2nd edition. New York: Springer; 1997.","edition":"2"},{"issue":"2","key":"662_CR15","doi-asserted-by":"publisher","first-page":"525","DOI":"10.1021\/ci020058s","volume":"43","author":"W Tong","year":"2003","unstructured":"Tong W, Hong H, Fang H, Xie Q, Perkins R: Decision Forest: Combining the Predictions of Multiple Independent Decision Tree Model. Journal of Chemical Information and Computer Science 2003, 43(2):525\u2013531. 10.1021\/ci020058s","journal-title":"Journal of Chemical Information and Computer Science"},{"issue":"1","key":"662_CR16","doi-asserted-by":"publisher","first-page":"29","DOI":"10.1289\/ehp.0211029","volume":"110","author":"H Hong","year":"2002","unstructured":"Hong H, Tong W, Fang H, Shi LM, Xie Q, Wu J, Perkins R, Walker J, Branham W, Sheehan D: Prediction of Estrogen Receptor Binding for 58,000 chemicals Using an Integrated system of a tree-based model with structural alerts. Environ Health Perspect 2002, 110(1):29\u201336.","journal-title":"Environ Health Perspect"},{"issue":"5","key":"662_CR17","doi-asserted-by":"publisher","first-page":"365","DOI":"10.1093\/mutage\/geh043","volume":"19","author":"J R Votano","year":"2004","unstructured":"R Votano J, Parham M, H Hall L, B Kier L, Oloff S, Tropsha A, Xie Q, Tong W: Three new consensus QSAR models for the prediction of Ames genotoxicity. Mutagenesis 2004, 19(5):365\u2013377. 10.1093\/mutage\/geh043","journal-title":"Mutagenesis"},{"issue":"10","key":"662_CR18","doi-asserted-by":"publisher","first-page":"685","DOI":"10.1089\/dna.2004.23.685","volume":"23","author":"H Hong","year":"2004","unstructured":"Hong H, Tong W, Perkins R, Fang H, Xie Q, Shi L: Multiclass decision forest \u2013 a novel pattern recognition method for multiclass classification in microarray data analysis. DNA AND CELL BIOLOGY 2004, 23(10):685\u2013694.","journal-title":"DNA AND CELL BIOLOGY"},{"issue":"12","key":"662_CR19","doi-asserted-by":"publisher","first-page":"1249","DOI":"10.1289\/ehp.7125","volume":"112","author":"W Tong","year":"2004","unstructured":"Tong W, Xie Q, Hong H, Shi L, Fang H, Perkins R: Assessment of prediction confidence and domain extrapolation of two structure activity relationship models for predicting estrogen receptor binding activity. Environmental Health Perspectives 2004, 112(12):1249\u20131254.","journal-title":"Environmental Health Perspectives"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-6-S2-S4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1471-2105-6-S2-S4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-6-S2-S4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T21:24:10Z","timestamp":1630445050000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-6-S2-S4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2005,7]]},"references-count":19,"journal-issue":{"issue":"S2","published-print":{"date-parts":[[2005,7]]}},"alternative-id":["662"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-6-s2-s4","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2005,7]]},"assertion":[{"value":"15 July 2005","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S4"}}