{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,16]],"date-time":"2026-03-16T20:50:07Z","timestamp":1773694207710,"version":"3.50.1"},"reference-count":35,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2022,10,19]],"date-time":"2022-10-19T00:00:00Z","timestamp":1666137600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>The features of a dataset play an important role in the construction of a machine learning model. Because big datasets often have a large number of features, they may contain features that are less relevant to the machine learning task, which makes the process more time-consuming and complex. In order to facilitate learning, it is always recommended to remove the less significant features. The process of eliminating the irrelevant features and finding an optimal feature set involves comprehensively searching the dataset and considering every subset in the data. In this research, we present a distributed fuzzy cognitive map based learning-based wrapper method for feature selection that is able to extract those features from a dataset that play the most significant role in decision making. Fuzzy cognitive maps (FCMs) represent a hybrid computing technique combining elements of both fuzzy logic and cognitive maps. Using Spark\u2019s resilient distributed datasets (RDDs), the proposed model can work effectively in a distributed manner for quick, in-memory processing along with effective iterative computations. According to the experimental results, when the proposed model is applied to a classification task, the features selected by the model help to expedite the classification process. The selection of relevant features using the proposed algorithm is on par with existing feature selection algorithms. In conjunction with a random forest classifier, the proposed model produced an average accuracy above 90%, as opposed to 85.6% accuracy when no feature selection strategy was adopted.<\/jats:p>","DOI":"10.3390\/a15100383","type":"journal-article","created":{"date-parts":[[2022,10,19]],"date-time":"2022-10-19T20:32:23Z","timestamp":1666211543000},"page":"383","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["Distributed Fuzzy Cognitive Maps for Feature Selection in Big Data Classification"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1725-4416","authenticated-orcid":false,"given":"K.","family":"Haritha","sequence":"first","affiliation":[{"name":"Department of Computer Applications, Cochin University of Science and Technology, Kochi 682022, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1843-2330","authenticated-orcid":false,"given":"M. V.","family":"Judy","sequence":"additional","affiliation":[{"name":"Department of Computer Applications, Cochin University of Science and Technology, Kochi 682022, India"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6117-1220","authenticated-orcid":false,"given":"Konstantinos","family":"Papageorgiou","sequence":"additional","affiliation":[{"name":"Institute of Educational Policy, Tsocha 36, 11521 Athens, Greece"},{"name":"Energy Systems Department, Gaiopolis Campus, University of Thessaly, 41500 Larisa, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9895-7606","authenticated-orcid":false,"given":"Vassilis C.","family":"Georgiannis","sequence":"additional","affiliation":[{"name":"Digital Systems Department, Gaiopolis Campus, University of Thessaly, 41500 Larisa, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2498-9661","authenticated-orcid":false,"given":"Elpiniki","family":"Papageorgiou","sequence":"additional","affiliation":[{"name":"Energy Systems Department, Gaiopolis Campus, University of Thessaly, 41500 Larisa, Greece"}]}],"member":"1968","published-online":{"date-parts":[[2022,10,19]]},"reference":[{"key":"ref_1","first-page":"1157","article-title":"An Introduction to Variable and Feature Selection","volume":"3","author":"Guyon","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/j.knosys.2015.05.014","article-title":"Recent advances and emerging challenges of feature selection in the context of big data","volume":"86","year":"2015","journal-title":"Knowl.-Based Syst."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/S0020-7373(86)80040-2","article-title":"Cognitive fuzzy maps","volume":"24","author":"Kosko","year":"1986","journal-title":"Int. J. Man-Mach. Stud."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1016\/S0004-3702(97)00043-X","article-title":"Wrapper for Feature Subset Selection","volume":"97","author":"Kohavi","year":"1997","journal-title":"Artif. Intell."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1007\/s10115-012-0487-8","article-title":"A review of feature selection methods on synthetic data","volume":"34","year":"2013","journal-title":"Knowl. Inf. Syst."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"531","DOI":"10.1016\/j.patcog.2011.06.006","article-title":"An ensemble of filters and classifiers for microarray data classification","volume":"45","year":"2012","journal-title":"Pattern Recognit."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Saeys, Y., Abeel, T., and van de Peer, Y. (2008). Robust Feature Selection Using Ensemble Feature Selection Techniques. Lecture Notes in Computer Science Book Series (LNAI), Springer Science.","DOI":"10.1007\/978-3-540-87481-2_21"},{"key":"ref_8","first-page":"1341","article-title":"Feature selection with ensembles, artificial variables, and redundancy elimination","volume":"10","author":"Tuv","year":"2009","journal-title":"J. Mach. Learn. Res."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1007\/s10115-010-0348-2","article-title":"Obtaining scalable and accurate classification in large-scale spatio-temporal domains","volume":"29","author":"Vainer","year":"2011","journal-title":"Knowl. Inf. Syst."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Ding, C., and Li, T. (2008). Gene selection algorithm by combining reliefF and mRMR. BMC Genom., 9.","DOI":"10.1186\/1471-2164-9-S2-S27"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1007\/s10115-010-0288-x","article-title":"A two-stage gene selection scheme utilizing MRMR filter and GA wrapper","volume":"26","author":"Amine","year":"2011","journal-title":"Knowl. Inf. Syst."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"12868","DOI":"10.1109\/JSEN.2020.3033153","article-title":"A Review on Soft Sensors for Monitoring, Control, and Optimization of Industrial Processes","volume":"21","author":"Jiang","year":"2021","journal-title":"IEEE Sens. J."},{"key":"ref_13","first-page":"243","article-title":"Prognostic Kalman Filter Based Bayesian Learning Model for Data Accuracy Prediction","volume":"72","author":"Karthik","year":"2022","journal-title":"Comput. Mater. Contin."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"32093","DOI":"10.1007\/s11042-022-12907-y","article-title":"Bunch graph based dimensionality reduction using auto-encoder for character recognition","volume":"81","author":"Bhadoria","year":"2022","journal-title":"Multimed. Tools Appl."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1007\/s13042-021-01347-z","article-title":"Ensemble of feature selection algorithms: A multi-criteria decision-making approach","volume":"13","author":"Hashemi","year":"2022","journal-title":"Int. J. Mach. Learn. Cybern."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"685","DOI":"10.34768\/amcs-2021-0047","article-title":"A weighted wrapper approach to feature selection","volume":"31","author":"Kusy","year":"2021","journal-title":"Int. J. Appl. Math. Comput. Sci."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Chellappan, S., and Ganesan, D. (2018). Practical Apache Spark, Apress.","DOI":"10.1007\/978-1-4842-3652-9"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"5947","DOI":"10.1016\/j.eswa.2010.11.028","article-title":"Feature selection and classification in multiple class datasets: An application to KDD Cup 99 dataset","volume":"38","year":"2011","journal-title":"Expert Syst. Appl."},{"key":"ref_19","first-page":"1289","article-title":"An Extensive Empirical Study of Feature Selection Metrics for Text Classification","volume":"1","author":"Forman","year":"2000","journal-title":"J. Mach. Learn. Res."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1007\/s10115-011-0403-7","article-title":"Highly discriminative statistical features for email classification","volume":"31","author":"Gomez","year":"2012","journal-title":"Knowl. Inf. Syst."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Yu, L., and Liu, H. (2004, January 22\u201325). Redundancy based feature selection for microarray data. Proceedings of the KDD-2004\u2014Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, WA, USA.","DOI":"10.1145\/1014052.1014149"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1802","DOI":"10.1109\/TASL.2010.2101596","article-title":"Generalizability and Simplicity as Criteria in Feature Selection: Application to Mood Classification in Music","volume":"19","author":"Saari","year":"2011","journal-title":"IEEE Trans. Audio Speech Lang. Process."},{"key":"ref_23","unstructured":"Axelrod, R. (1976). Structure of Decisions: The Cognitive Maps of Political Elites, Princeton University Press."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"562","DOI":"10.1016\/j.socscimed.2006.09.007","article-title":"Integrating conventional science and aboriginal perspectives on diabetes using fuzzy cognitive maps","volume":"64","author":"Giles","year":"2007","journal-title":"Soc. Sci. Med."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"3711","DOI":"10.1016\/j.asoc.2012.02.006","article-title":"A fuzzy cognitive map of the psychosocial determinants of obesity","volume":"12","author":"Giabbanelli","year":"2012","journal-title":"Appl. Soft Comput. J."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"123","DOI":"10.1016\/j.cmpb.2015.07.003","article-title":"A risk management model for familial breast cancer: A new application using Fuzzy Cognitive Map method","volume":"122","author":"Papageorgiou","year":"2015","journal-title":"Comput. Methods Programs Biomed."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1007\/s00500-004-0344-0","article-title":"Soft computing for crisis management and political decision making: The use of genetically evolved fuzzy cognitive maps","volume":"9","author":"Andreou","year":"2005","journal-title":"Soft Comput."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zhai, D.S., Chang, Y.N., and Zhang, J. (2009, January 7\u20138). An application of fuzzy cognitive map based on active hebbian learning algorithm in credit risk evaluation of listed companies. Proceedings of the 2009 International Conference on Artificial Intelligence and Computational Intelligence, AICI 2009, Washington, DC, USA.","DOI":"10.1109\/AICI.2009.214"},{"key":"ref_29","unstructured":"Carvalho, J.P., and Tome, J.A.B. (2001, January 2\u20135). Rule based fuzzy cognitive maps expressing time in qualitative system dynamics. Proceedings of the 10th IEEE International Conference on Fuzzy Systems (Cat. No.01CH37297), Melbourne, VIC, Australia."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"7581","DOI":"10.1016\/j.eswa.2010.04.085","article-title":"Modelling grey uncertainty with fuzzy grey cognitive maps","volume":"37","author":"Salmeron","year":"2010","journal-title":"Expert Syst. Appl."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1109\/TITB.2010.2093603","article-title":"Intuitionistic fuzzy cognitive maps for medical decision making","volume":"15","author":"Iakovidis","year":"2011","journal-title":"IEEE Trans. Inf. Technol. Biomed."},{"key":"ref_32","first-page":"260","article-title":"Dynamic Random Fuzzy Cognitive Maps","volume":"7","author":"Aguilar","year":"2004","journal-title":"Comput. Sist."},{"key":"ref_33","first-page":"183","article-title":"Fuzzy cognitive network: A general framework","volume":"1","author":"Kottas","year":"2007","journal-title":"Intell. Decis. Technol."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1016\/j.knosys.2015.10.015","article-title":"Rough Cognitive Networks","volume":"91","author":"Grau","year":"2016","journal-title":"Knowl.-Based Syst."},{"key":"ref_35","unstructured":"Dua, D., and Graff, C. (2017). UCI Machine Learning Repository, University of California, School of Information and Computer Science. Available online: http:\/\/archive.ics.uci.edu\/ml."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/15\/10\/383\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:57:04Z","timestamp":1760144224000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/15\/10\/383"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,10,19]]},"references-count":35,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2022,10]]}},"alternative-id":["a15100383"],"URL":"https:\/\/doi.org\/10.3390\/a15100383","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,10,19]]}}}