{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,15]],"date-time":"2026-01-15T09:40:04Z","timestamp":1768470004044,"version":"3.49.0"},"reference-count":36,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,2,23]],"date-time":"2022-02-23T00:00:00Z","timestamp":1645574400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["BDCC"],"abstract":"<jats:p>In high-dimensional data analysis, Feature Selection (FS) is one of the most fundamental issues in machine learning and requires the attention of researchers. These datasets are characterized by huge space due to a high number of features, out of which only a few are significant for analysis. Thus, significant feature extraction is crucial. There are various techniques available for feature selection; among them, the filter techniques are significant in this community, as they can be used with any type of learning algorithm and drastically lower the running time of optimization algorithms and improve the performance of the model. Furthermore, the application of a filter approach depends on the characteristics of the dataset as well as on the machine learning model. Thus, to avoid these issues in this research, a combination of feature reduction (CFR) is considered designing a pipeline of filter approaches for high-dimensional microarray data classification. Considering four filter approaches, sixteen combinations of pipelines are generated. The feature subset is reduced in different levels, and ultimately, the significant feature set is evaluated. The pipelined filter techniques are Correlation-Based Feature Selection (CBFS), Chi-Square Test (CST), Information Gain (InG), and Relief Feature Selection (RFS), and the classification techniques are Decision Tree (DT), Logistic Regression (LR), Random Forest (RF), and k-Nearest Neighbor (k-NN). The performance of CFR depends highly on the datasets as well as on the classifiers. Thereafter, the Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) method is used for ranking all reduction combinations and evaluating the superior filter combination among all.<\/jats:p>","DOI":"10.3390\/bdcc6010024","type":"journal-article","created":{"date-parts":[[2022,2,23]],"date-time":"2022-02-23T09:34:38Z","timestamp":1645608878000},"page":"24","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":26,"title":["Combination of Reduction Detection Using TOPSIS for Gene Expression Data Analysis"],"prefix":"10.3390","volume":"6","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6441-147X","authenticated-orcid":false,"given":"Jogeswar","family":"Tripathy","sequence":"first","affiliation":[{"name":"ITER, Siksha \u2018O\u2019 Anusandhan Deemed to be University, Bhubaneswar 751030, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2235-7516","authenticated-orcid":false,"given":"Rasmita","family":"Dash","sequence":"additional","affiliation":[{"name":"ITER, Siksha \u2018O\u2019 Anusandhan Deemed to be University, Bhubaneswar 751030, India"}]},{"given":"Binod Kumar","family":"Pattanayak","sequence":"additional","affiliation":[{"name":"ITER, Siksha \u2018O\u2019 Anusandhan Deemed to be University, Bhubaneswar 751030, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3737-5223","authenticated-orcid":false,"given":"Sambit Kumar","family":"Mishra","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, SRM University-AP, Amaravati 522502, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6363-5017","authenticated-orcid":false,"given":"Tapas Kumar","family":"Mishra","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering, SRM University-AP, Amaravati 522502, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8332-278X","authenticated-orcid":false,"given":"Deepak","family":"Puthal","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science, Khalifa University, Abu Dhabi 127788, United Arab Emirates"}]}],"member":"1968","published-online":{"date-parts":[[2022,2,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"485","DOI":"10.1093\/nar\/gkh421","article-title":"New challenges in gene expression data analysis and the extended GEPAS","volume":"32","author":"Herrero","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"78533","DOI":"10.1109\/ACCESS.2019.2922987","article-title":"A survey on hybrid feature selection methods in microarray gene expression data for cancer classification","volume":"7","author":"Almugren","year":"2019","journal-title":"IEEE Access"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1016\/j.procs.2015.04.060","article-title":"Feature selection of gene expression data for cancer classification: A review","volume":"50","author":"Singh","year":"2015","journal-title":"Procedia Comput. Sci."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1016\/j.asoc.2016.07.006","article-title":"Pipelining the ranking techniques for microarray data classification: A case study","volume":"48","author":"Dash","year":"2016","journal-title":"Appl. Soft Comput."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Glaab, E., Bacardit, J., Garibaldi, J.M., and Krasnogor, N. (2012). Using rule-based machine learning for candidate disease gene prioritization and sample classification of cancer gene expression data. PLoS ONE, 7.","DOI":"10.1371\/journal.pone.0039932"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"114485","DOI":"10.1016\/j.eswa.2020.114485","article-title":"Theoretical and empirical analysis of filter ranking methods: Experimental study on benchmark DNA microarray data","volume":"169","author":"Ghosh","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.imu.2017.07.004","article-title":"Feature selection model based on clustering and ranking in pipeline for microarray data","volume":"9","author":"Sahu","year":"2017","journal-title":"Inform. Med. Unlocked"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"104396","DOI":"10.1016\/j.chemolab.2021.104396","article-title":"A hybrid ensemble-filter wrapper feature selection approach for medical data classification","volume":"217","author":"Singh","year":"2021","journal-title":"Chemom. Intell. Lab. Syst."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"106839","DOI":"10.1016\/j.csda.2019.106839","article-title":"Benchmark for filter methods for feature selection in high-dimensional classification data","volume":"143","author":"Bommert","year":"2020","journal-title":"Comput. Stat. Data Anal."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1106","DOI":"10.1109\/TCBB.2012.33","article-title":"A survey on filter techniques for feature selection in gene expression microarray analysis","volume":"9","author":"Lazar","year":"2012","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1504\/IJDMB.2018.093683","article-title":"A multi-objective feature selection and classifier ensemble technique for microarray data analysis","volume":"20","author":"Dash","year":"2018","journal-title":"Int. J. Data Min. Bioinform."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"2429","DOI":"10.1093\/bioinformatics\/bth267","article-title":"A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression","volume":"20","author":"Li","year":"2004","journal-title":"Bioinformatics"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2164-9-S1-S13","article-title":"A comparative study of different machine learning methods on microarray gene expression data","volume":"9","author":"Pirooznia","year":"2008","journal-title":"BMC Genom."},{"key":"ref_14","first-page":"754","article-title":"A top-r feature selection algorithm for microarray gene expression data","volume":"9","author":"Sharma","year":"2011","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform."},{"key":"ref_15","first-page":"68","article-title":"Aiding classification of gene expression data with feature selection: A comparative study","volume":"1","author":"Shen","year":"2006","journal-title":"J. Comput. Intell. Res. (IJCIR)"},{"key":"ref_16","first-page":"6837498","article-title":"A multiple-classifier framework for Parkinson\u2019s disease detection based on various vocal tests","volume":"2016","author":"Behroozi","year":"2016","journal-title":"Int. J. Telemed. Appl."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1016\/j.neucom.2016.07.080","article-title":"A hybrid feature selection algorithm for gene expression data classification","volume":"256","author":"Lu","year":"2017","journal-title":"Neurocomputing"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Jirapech-Umpai, T., and Aitken, S. (2005). Feature selection and classification for microarray data analysis: Evolutionary methods for identifying predictive genes. BMC Bioinform., 6.","DOI":"10.1186\/1471-2105-6-148"},{"key":"ref_19","first-page":"232","article-title":"A two stage grading approach for feature selection and classification of microarray data using Pareto based feature ranking techniques: A case study","volume":"32","author":"Dash","year":"2020","journal-title":"J. King Saud-Univ.-Comput. Inf. Sci."},{"key":"ref_20","first-page":"4598","article-title":"TOPSIS based multi-criteria decision making of feature selection techniques for network traffic dataset","volume":"5","author":"Singh","year":"2014","journal-title":"Int. J. Eng. Technol."},{"key":"ref_21","unstructured":"GeeksforGeeks (2018, March 06). Data Normalization in Data Mining. Available online: https:\/\/www.geeksforgeeks.org\/data-normalization-in-data-mining\/."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1016\/j.procs.2013.10.003","article-title":"A comparative study of feature selection and classification methods for gene expression data of glioma","volume":"23","author":"Abusamra","year":"2013","journal-title":"Procedia Comput. Sci."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Hemphill, E., Lindsay, J., Lee, C.M., Oiu, I.I., and Nelson, C.E. (2014). Feature selection and classifier performance on diverse bio-logical datasets. BMC Bioinform., 15.","DOI":"10.1186\/1471-2105-15-S13-S4"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Spencer, R., Thabtah, F., Abdelhamid, N., and Thompson, M. (2020). Exploring feature selection and classification methods for predicting heart disease. Digit. Health, 6.","DOI":"10.1177\/2055207620914777"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Liu, S., Xu, C., Zhang, Y., Liu, J., Yu, B., Liu, X., and Dehmer, M. (2018). Feature selection of gene expression data for cancer classification using double RBF-kernels. BMC Bioinform., 19.","DOI":"10.1186\/s12859-018-2400-2"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"102452","DOI":"10.1016\/j.bspc.2021.102452","article-title":"An efficient dimensionality reduction method using filter-based feature selection and variational autoencoders on Parkinson\u2019s disease classification","volume":"66","author":"Gunduz","year":"2021","journal-title":"Biomed. Signal Process. Control."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Mohapatra, D., Tripathy, J., and Patra, T.K. (2021). Rice Disease Detection and Monitoring Using CNN and Naive Bayes Classification. Soft Computing Techniques and Applications, Springer.","DOI":"10.1007\/978-981-15-7394-1_2"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"660","DOI":"10.1016\/j.compeleceng.2018.01.037","article-title":"Evaluating a bag-of-visual features approach using spatio-temporal features for action recognition","volume":"72","author":"Nazir","year":"2018","journal-title":"Comput. Electr. Eng."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Assiri, A.S., Nazir, S., and Velastin, S.A. (2020). Breast tumor classification using an ensemble machine learning method. J. Imaging, 6.","DOI":"10.3390\/jimaging6060039"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/j.media.2016.06.002","article-title":"Machine learning for medical images analysis","volume":"33","author":"Criminisi","year":"2016","journal-title":"Med Image Anal."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1141","DOI":"10.1007\/s10278-011-9380-3","article-title":"X-ray image classification using random forests with local wavelet-based CS-local binary patterns","volume":"24","author":"Ko","year":"2011","journal-title":"J. Digit. Imaging"},{"key":"ref_32","unstructured":"Tripathy, J., Dash, R., Pattanayak, B.K., and Mohanty, B. Agutomated Phrase Mining Using POST: The Best Approach. Proceedings of the IEEE International Conference on Electrical Power Engineering, Communication and Computing Technology (ODICON), Odisha, India."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"105784","DOI":"10.1016\/j.asoc.2019.105784","article-title":"An integrated TOPSIS crow search based classifier ensemble: In application to stock index price movement prediction","volume":"85","author":"Dash","year":"2019","journal-title":"Appl. Soft Comput."},{"key":"ref_34","unstructured":"(2018, March 06). Microarray Datasets. Available online: http:\/\/csse.szu.edu.cn\/staff\/zhuzx\/Datasets.html."},{"key":"ref_35","unstructured":"(2018, March 06). Brain Tumor Dataset. Available online: https:\/\/www.kaggle.com\/ahmedhamada0\/brain-tumor-detection\/metadata."},{"key":"ref_36","unstructured":"(2018, March 06). Adenoma Datasets. Available online: http:\/\/biogps.org\/dataset\/tag\/adenoma\/."}],"container-title":["Big Data and Cognitive Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2504-2289\/6\/1\/24\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:26:19Z","timestamp":1760135179000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2504-2289\/6\/1\/24"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,2,23]]},"references-count":36,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,3]]}},"alternative-id":["bdcc6010024"],"URL":"https:\/\/doi.org\/10.3390\/bdcc6010024","relation":{},"ISSN":["2504-2289"],"issn-type":[{"value":"2504-2289","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,2,23]]}}}