{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,7,2]],"date-time":"2023-07-02T15:40:51Z","timestamp":1688312451568},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"S3","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Syst Biol"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Bacterial 16S Ribosomal RNAs profiling have been widely used in the classification of microbiota associated diseases. Dimensionality reduction is among the keys in mining high-dimensional 16S rRNAs' expression data. High levels of sparsity and redundancy are common in 16S rRNA gene microbial surveys. Traditional feature selection methods are generally restricted to measuring correlated abundances, and are limited in discrimination when so few microbes are actually shared across communities.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Here we present a Feature Merging and Selection algorithm (FMS) to deal with 16S rRNAs' expression data. By integrating Linear Discriminant Analysis method, FMS can reduce the feature dimension with higher accuracy and preserve the relationship between different features as well. Two 16S rRNAs' expression datasets of pneumonia and dental decay patients were used to test the validity of the algorithm. Combined with SVM, FMS discriminated different classes of both pneumonia and dental caries better than other popular feature selection methods.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusions<\/jats:title><jats:p>FMS projects data into lower dimension with preservation of enough features, and thus improve the intelligibility of the result. The results showed that FMS is a more valid and reliable methods in feature reduction.<\/jats:p><\/jats:sec>","DOI":"10.1186\/1752-0509-6-s3-s12","type":"journal-article","created":{"date-parts":[[2013,6,24]],"date-time":"2013-06-24T14:19:20Z","timestamp":1372083560000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["An improved dimensionality reduction method for meta-transcriptome indexing based diseases classification"],"prefix":"10.1186","volume":"6","author":[{"given":"Yin","family":"Wang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuhua","family":"Zhou","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yixue","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zongxin","family":"Ling","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yan","family":"Zhu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiaokui","family":"Guo","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hong","family":"Sun","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2012,12,17]]},"reference":[{"key":"997_CR1","doi-asserted-by":"publisher","first-page":"228","DOI":"10.1126\/science.1179721","volume":"328","author":"M Vijay-Kumar","year":"2010","unstructured":"Vijay-Kumar M, Aitken JD, Carvalho FA, Cullender TC, Mwangi S, Srinivasan S, Sitaraman SV, Knight R, Ley RE, Gewirtz AT: Metabolic syndrome and altered gut microbiota in mice lacking Toll-like receptor 5. Science. 2010, 328: 228-231. 10.1126\/science.1179721.","journal-title":"Science"},{"key":"997_CR2","doi-asserted-by":"publisher","first-page":"16731","DOI":"10.1073\/pnas.0804812105","volume":"105","author":"H Sokol","year":"2008","unstructured":"Sokol H, Pigneur B, Watterlot L, Lakhdari O, Bermudez-Humaran LG, Gratadoux JJ, Blugeon S, Bridonneau C, Furet JP, Corthier G, et al: Faecalibacterium prausnitzii is an anti-inflammatory commensal bacterium identified by gut microbiota analysis of Crohn disease patients. Proc Natl Acad Sci USA. 2008, 105: 16731-16736. 10.1073\/pnas.0804812105.","journal-title":"Proc Natl Acad Sci USA"},{"key":"997_CR3","doi-asserted-by":"publisher","first-page":"754","DOI":"10.1093\/abbs\/gmq081","volume":"42","author":"Y Zhou","year":"2010","unstructured":"Zhou Y, Lin P, Li Q, Han L, Zheng H, Wei Y, Cui Z, Ni Y, Guo X: Analysis of the microbiota of sputum samples from patients with lower respiratory tract infections. Acta Biochim Biophys Sin (Shanghai). 2010, 42: 754-761. 10.1093\/abbs\/gmq081.","journal-title":"Acta Biochim Biophys Sin (Shanghai)"},{"key":"997_CR4","doi-asserted-by":"publisher","first-page":"677","DOI":"10.1007\/s00248-010-9712-8","volume":"60","author":"Z Ling","year":"2010","unstructured":"Ling Z, Kong J, Jia P, Wei C, Wang Y, Pan Z, Huang W, Li L, Chen H, Xiang C: Analysis of oral microbiota in children with dental caries by PCR-DGGE and barcoded pyrosequencing. Microb Ecol. 2010, 60: 677-690. 10.1007\/s00248-010-9712-8.","journal-title":"Microb Ecol"},{"key":"997_CR5","doi-asserted-by":"publisher","first-page":"3575","DOI":"10.1128\/JCM.00597-10","volume":"48","author":"Z Gao","year":"2010","unstructured":"Gao Z, Perez-Perez GI, Chen Y, Blaser MJ: Quantitation of major human cutaneous bacterial and fungal populations. J Clin Microbiol. 2010, 48: 3575-3581. 10.1128\/JCM.00597-10.","journal-title":"J Clin Microbiol"},{"key":"997_CR6","doi-asserted-by":"publisher","first-page":"732","DOI":"10.1073\/pnas.0506655103","volume":"103","author":"EM Bik","year":"2006","unstructured":"Bik EM, Eckburg PB, Gill SR, Nelson KE, Purdom EA, Francois F, Perez-Perez G, Blaser MJ, Relman DA: Molecular analysis of the bacterial microbiota in the human stomach. Proc Natl Acad Sci USA. 2006, 103: 732-737. 10.1073\/pnas.0506655103.","journal-title":"Proc Natl Acad Sci USA"},{"key":"997_CR7","doi-asserted-by":"publisher","first-page":"355","DOI":"10.1093\/bfgp\/elq011","volume":"9","author":"Y Liu","year":"2010","unstructured":"Liu Y, Zhang C, Zhao L, Nardini C: Adapting functional genomic tools to metagenomic analyses: investigating the role of gut bacteria in relation to obesity. Brief Funct Genomics. 2010, 9: 355-361. 10.1093\/bfgp\/elq011.","journal-title":"Brief Funct Genomics"},{"key":"997_CR8","doi-asserted-by":"publisher","first-page":"343","DOI":"10.1111\/j.1574-6976.2010.00251.x","volume":"35","author":"D Knights","year":"2011","unstructured":"Knights D, Costello EK, Knight R: Supervised classification of human microbiota. FEMS Microbiol Rev. 2011, 35: 343-359. 10.1111\/j.1574-6976.2010.00251.x.","journal-title":"FEMS Microbiol Rev"},{"key":"997_CR9","doi-asserted-by":"publisher","first-page":"96","DOI":"10.1186\/1471-2180-9-96","volume":"9","author":"A Rani","year":"2009","unstructured":"Rani A, Sharma A, Rajagopal R, Adak T, Bhatnagar RK: Bacterial diversity analysis of larvae and adult midgut microflora using culture-dependent and culture-independent methods in lab-reared and field-collected Anopheles stephensi-an Asian malarial vector. BMC Microbiol. 2009, 9: 96-10.1186\/1471-2180-9-96.","journal-title":"BMC Microbiol"},{"key":"997_CR10","doi-asserted-by":"publisher","first-page":"869","DOI":"10.1016\/j.csda.2004.03.017","volume":"48","author":"JW Lee","year":"2005","unstructured":"Lee JW, Lee JB, Park M, Song SH: An extensive comparison of recent classification tools applied to microarray data. Computational Statistics & Data Analysis. 2005, 48: 869-885. 10.1016\/j.csda.2004.03.017.","journal-title":"Computational Statistics & Data Analysis"},{"key":"997_CR11","doi-asserted-by":"publisher","first-page":"77","DOI":"10.1198\/016214502753479248","volume":"97","author":"S Dudoit","year":"2002","unstructured":"Dudoit S, Fridlyand J, Speed TP: Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data. Journal of the American Statistical Association. 2002, 97: 77-87. 10.1198\/016214502753479248.","journal-title":"Journal of the American Statistical Association"},{"key":"997_CR12","doi-asserted-by":"publisher","first-page":"109","DOI":"10.1186\/1471-2105-11-109","volume":"11","author":"KL Tang","year":"2010","unstructured":"Tang KL, Li TH, Xiong WW, Chen K: Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data. BMC Bioinformatics. 2010, 11: 109-10.1186\/1471-2105-11-109.","journal-title":"BMC Bioinformatics"},{"key":"997_CR13","first-page":"1157","volume":"3","author":"AE Isabelle Guyon","year":"2003","unstructured":"Isabelle Guyon AE: An Introduction to Variable and Feature Selection. J Mach Learn Res. 2003, 3: 1157-1182.","journal-title":"J Mach Learn Res"},{"key":"997_CR14","doi-asserted-by":"publisher","first-page":"1942","DOI":"10.1109\/TSMCB.2004.831770","volume":"34","author":"XY Jing","year":"2004","unstructured":"Jing XY, Zhang D, Tang YY: An improved LDA approach. IEEE Trans Syst Man Cybern B Cybern. 2004, 34: 1942-1951. 10.1109\/TSMCB.2004.831770.","journal-title":"IEEE Trans Syst Man Cybern B Cybern"},{"key":"997_CR15","doi-asserted-by":"publisher","first-page":"2117","DOI":"10.1073\/pnas.0712038105","volume":"105","author":"M Li","year":"2008","unstructured":"Li M, Wang B, Zhang M, Rantalainen M, Wang S, Zhou H, Zhang Y, Shen J, Pang X, Wei H, et al: Symbiotic gut microbes modulate human metabolic phenotypes. Proc Natl Acad Sci USA. 2008, 105: 2117-2122. 10.1073\/pnas.0712038105.","journal-title":"Proc Natl Acad Sci USA"},{"key":"997_CR16","doi-asserted-by":"publisher","first-page":"1109","DOI":"10.1038\/nature07336","volume":"455","author":"L Wen","year":"2008","unstructured":"Wen L, Ley RE, Volchkov PY, Stranges PB, Avanesyan L, Stonebraker AC, Hu C, Wong FS, Szot GL, Bluestone JA, et al: Innate immunity and intestinal microbiota in the development of Type 1 diabetes. Nature. 2008, 455: 1109-1113. 10.1038\/nature07336.","journal-title":"Nature"},{"key":"997_CR17","doi-asserted-by":"publisher","first-page":"1190","DOI":"10.1126\/science.1171700","volume":"324","author":"EA Grice","year":"2009","unstructured":"Grice EA, Kong HH, Conlan S, Deming CB, Davis J, Young AC, Bouffard GG, Blakesley RW, Murray PR, Green ED, et al: Topographical and temporal diversity of the human skin microbiome. Science. 2009, 324: 1190-1192. 10.1126\/science.1171700.","journal-title":"Science"},{"key":"997_CR18","first-page":"223","volume-title":"The Elements of Statistical Learning Data Mining, Inference, and Prediction","author":"RT Trevor Hastie","year":"2009","unstructured":"Trevor Hastie RT, Jerome Firedman: The Elements of Statistical Learning Data Mining, Inference, and Prediction. 2009, 223-249."},{"key":"997_CR19","first-page":"65","volume-title":"McGraw-Hill","author":"M-H Tom Mitchell","year":"1997","unstructured":"Tom Mitchell M-H: Machine Learning. McGraw-Hill. 1997, 65-66."},{"key":"997_CR20","doi-asserted-by":"publisher","first-page":"185","DOI":"10.1142\/S0219720005001004","volume":"3","author":"C Ding","year":"2005","unstructured":"Ding C, Peng H: Minimum redundancy feature selection from microarray gene expression data. J Bioinform Comput Biol. 2005, 3: 185-205. 10.1142\/S0219720005001004.","journal-title":"J Bioinform Comput Biol"},{"key":"997_CR21","volume-title":"C4. 5: Programs for Machine Learning","author":"JR Quinlan","year":"1993","unstructured":"Quinlan JR: C4. 5: Programs for Machine Learning. 1993"},{"key":"997_CR22","volume-title":"Chi2: Feature Selection and Discretization of Numeric Attributes","author":"RS Huan Liu","year":"1995","unstructured":"Huan Liu RS: Chi2: Feature Selection and Discretization of Numeric Attributes. 1995"},{"key":"997_CR23","first-page":"1006","volume-title":"Journal of the American Statistical Association","author":"LJ Wei","year":"1981","unstructured":"Wei LJ: Asymptotic Conservativeness and Efficiency of Kruskal-Wallis Test for K Dependent Samples. Journal of the American Statistical Association. 1981, 1006-1009."},{"key":"997_CR24","first-page":"164","volume-title":"AAAI Press","author":"K Philip","year":"1998","unstructured":"Philip K, Chan SJS: Toward scalable learning with non-uniform class and cost distributions: A case study in credit card fraud detection. AAAI Press. 1998, 164-168."},{"key":"997_CR25","first-page":"217","volume-title":"Proceedings of the 11th International Conference of Machine Learning, New Brunswick Morgan Kaufmann","author":"M Pazzani","year":"1994","unstructured":"Pazzani M, Merz C, Murphy P, Ali K, Hume T, Brunk C: Reducing Misclassification Costs. Proceedings of the 11th International Conference of Machine Learning, New Brunswick Morgan Kaufmann. 1994, 217-225."},{"key":"997_CR26","doi-asserted-by":"publisher","first-page":"282","DOI":"10.1186\/1471-2105-9-282","volume":"9","author":"G Zheng","year":"2008","unstructured":"Zheng G, Qian Z, Yang Q, Wei C, Xie L, Zhu Y, Li Y: The combination approach of SVM and ECOC for powerful identification and classification of transcription factor. BMC Bioinformatics. 2008, 9: 282-10.1186\/1471-2105-9-282.","journal-title":"BMC Bioinformatics"},{"key":"997_CR27","volume-title":"Elements of Information Theory","author":"M Thomas","year":"1991","unstructured":"Thomas M, Cover JAT: Elements of Information Theory. 1991"},{"key":"997_CR28","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1128\/JCM.01429-08","volume":"47","author":"T Kawanami","year":"2009","unstructured":"Kawanami T, Fukuda K, Yatera K, Kido T, Yoshii C, Taniguchi H, Kido M: Severe pneumonia with Leptotrichia sp. detected predominantly in bronchoalveolar lavage fluid by use of 16S rRNA gene sequencing analysis. J Clin Microbiol. 2009, 47: 496-498. 10.1128\/JCM.01429-08.","journal-title":"J Clin Microbiol"},{"key":"997_CR29","doi-asserted-by":"crossref","first-page":"487","DOI":"10.7883\/yoken.JJID.2008.487","volume":"61","author":"M Koide","year":"2008","unstructured":"Koide M, Furugen M, Haranaga S, Higa F, Tateyama M, Yamane N, Fujita J: Characteristics of Legionella pneumophila serogroup 2 strains by colony morphology. Jpn J Infect Dis. 2008, 61: 487-489.","journal-title":"Jpn J Infect Dis"},{"key":"997_CR30","doi-asserted-by":"crossref","first-page":"3336","DOI":"10.1128\/AEM.64.9.3336-3345.1998","volume":"64","author":"AH Franks","year":"1998","unstructured":"Franks AH, Harmsen HJ, Raangs GC, Jansen GJ, Schut F, Welling GW: Variations of bacterial populations in human feces measured by fluorescent in situ hybridization with group-specific 16S rRNA-targeted oligonucleotide probes. Appl Environ Microbiol. 1998, 64: 3336-3345.","journal-title":"Appl Environ Microbiol"},{"key":"997_CR31","first-page":"44","volume-title":"2000","author":"O Richard","year":"2000","unstructured":"Richard O, Duda PEH, Stork David: Pattern Classification. . 2000, Published by Wiley-Interscience, 44-51. Chapter 4, 2","edition":"2"},{"key":"997_CR32","doi-asserted-by":"publisher","first-page":"906","DOI":"10.1093\/bioinformatics\/16.10.906","volume":"16","author":"TS Furey","year":"2000","unstructured":"Furey TS, Cristianini N, Duffy N, Bednarski DW, Schummer M, Haussler D: Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics. 2000, 16: 906-914. 10.1093\/bioinformatics\/16.10.906.","journal-title":"Bioinformatics"},{"key":"997_CR33","first-page":"459","volume-title":"The Elements of Statistical Learning Data Mining, Inference, and Prediction","author":"RT Trevor Hastie","year":"2009","unstructured":"Trevor Hastie RT, Firedman Jerome: The Elements of Statistical Learning Data Mining, Inference, and Prediction. 2009, 459-475."},{"key":"997_CR34","first-page":"233","volume-title":"Machine Learning","author":"M-H Tom Mitchell","year":"1997","unstructured":"Tom Mitchell M-H: Machine Learning. 1997, 233-234."},{"key":"997_CR35","doi-asserted-by":"publisher","first-page":"1226","DOI":"10.1109\/TPAMI.2005.159","volume":"27","author":"H Peng","year":"2005","unstructured":"Peng H, Long F, Ding C: Feature selection based on mutual information: criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell. 2005, 27: 1226-1238.","journal-title":"IEEE Trans Pattern Anal Mach Intell"}],"container-title":["BMC Systems Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1752-0509-6-S3-S12.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,7,2]],"date-time":"2023-07-02T15:29:29Z","timestamp":1688311769000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcsystbiol.biomedcentral.com\/articles\/10.1186\/1752-0509-6-S3-S12"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,12]]},"references-count":35,"journal-issue":{"issue":"S3","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["997"],"URL":"https:\/\/doi.org\/10.1186\/1752-0509-6-s3-s12","relation":{},"ISSN":["1752-0509"],"issn-type":[{"value":"1752-0509","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,12]]},"assertion":[{"value":"17 December 2012","order":1,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"S12"}}