{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,2]],"date-time":"2026-05-02T07:06:04Z","timestamp":1777705564866,"version":"3.51.4"},"reference-count":40,"publisher":"SAGE Publications","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IFS"],"published-print":{"date-parts":[[2023,4,3]]},"abstract":"<jats:p>Data pre-processing is one of the crucial phases of data mining that enhances the efficiency of data mining techniques. One of the most important operations performed on data pre-processing is missing values imputation in incomplete datasets. This research presents a new imputation technique using K-means and samples weighting mechanism based on Grey relation (KWGI). The Grey-based K-means algorithm applicable to all samples of incomplete datasets clusters the similar samples, then an appropriate kernel function generates appropriate weights based on the Grey relation. The missing values estimation of the incomplete samples is done based on the weighted mean to reduce the impact of outlier and vague samples. In both clustering and imputation steps, a penalty mechanism has been considered to reduce the similarity of ambiguous samples with a high number of missing values, and consequently, increase the accuracy of clustering and imputation. The KWGI method has been applied on nine natural datasets with eight state-of-the-art and commonly used methods, namely CMIWD, KNNI, HotDeck, MeanI, KmeanI, RKmeanI, ICKmeanI, and FKMI. The imputation results are evaluated by the Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE) criteria. In this study, the missing values are generated at two levels, namely sample and value, and the results are discussed in a wide range of missingness from low rate to high rate. Experimental results of the t-test show that the proposed method performs significantly better than all the other compared methods.<\/jats:p>","DOI":"10.3233\/jifs-200774","type":"journal-article","created":{"date-parts":[[2022,12,23]],"date-time":"2022-12-23T11:48:57Z","timestamp":1671796137000},"page":"5675-5697","source":"Crossref","is-referenced-by-count":4,"title":["A novel missing value imputation relying on K-means clustering and kernel-based weighting using grey relation (KWGI)"],"prefix":"10.1177","volume":"44","author":[{"given":"Alireza","family":"Dehghani","sequence":"first","affiliation":[{"name":"Department of Computer Engineering, Yasooj Branch, Islamic Azad University, Yasooj, Iran"}]},{"given":"Karamolah","family":"Bagherifard","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Yasooj Branch, Islamic Azad University, Yasooj, Iran"}]},{"given":"Samad","family":"Nejatian","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering, Yasooj Branch, Islamic Azad University, Yasooj, Iran"}]},{"given":"Hamid","family":"Parvin","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Nourabad Mamasani Branch, Islamic Azad University, Nourabad Mamasani, Iran"}]}],"member":"179","reference":[{"issue":"11","key":"10.3233\/JIFS-200774_ref1","doi-asserted-by":"publisher","first-page":"2541","DOI":"10.1016\/j.jss.2012.05.073","article-title":"Nearest neighbor selection for iteratively k NNimputation","volume":"85","author":"Zhang","year":"2012","journal-title":"J. Syst. Softw."},{"key":"10.3233\/JIFS-200774_ref3","unstructured":"Allison P.D. , Missing Data. Sage Publications, Thousand Oaks (2001)."},{"key":"10.3233\/JIFS-200774_ref4","doi-asserted-by":"publisher","first-page":"52","DOI":"10.1016\/j.eswa.2017.07.026","article-title":"An extensive analysis of the interaction between missing data types, imputation methods, and supervised classifiers","volume":"89","author":"Garciarena","year":"2017","journal-title":"Expert Syst. Appl."},{"key":"10.3233\/JIFS-200774_ref5","doi-asserted-by":"crossref","unstructured":"Miao X. and Gao Y. , Incomplete data management: A survey, 12(1) (2018), 4\u201325.","DOI":"10.1007\/s11704-016-6195-x"},{"key":"10.3233\/JIFS-200774_ref9","doi-asserted-by":"publisher","DOI":"10.1016\/j.bioeng.2007.04.003"},{"key":"10.3233\/JIFS-200774_ref10","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1016\/j.neucom.2013.02.016","article-title":"Locally linear reconstruction based missing value imputation for supervised learning","volume":"118","author":"Kang","year":"2013","journal-title":"Neurocomputing"},{"issue":"9","key":"10.3233\/JIFS-200774_ref11","doi-asserted-by":"crossref","first-page":"3463","DOI":"10.1016\/j.patcog.2012.03.009","article-title":"A new fuzzy c-means method with total variation regularization for segmentation of images with noisy and incomplete data","volume":"45","author":"He","year":"2012","journal-title":"Pattern Recognit."},{"issue":"2","key":"10.3233\/JIFS-200774_ref12","doi-asserted-by":"crossref","first-page":"206","DOI":"10.7763\/IJFCC.2012.V1.54","article-title":"Missing value imputation method based on clustering and nearest neighbours","volume":"1","author":"Gajawada","year":"2012","journal-title":"in International Journal of Future Computer and Communication"},{"key":"10.3233\/JIFS-200774_ref13","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-017-1118-1"},{"key":"10.3233\/JIFS-200774_ref15","doi-asserted-by":"publisher","DOI":"10.1007\/s10115-015-0822-y"},{"issue":"8","key":"10.3233\/JIFS-200774_ref16","doi-asserted-by":"publisher","first-page":"2993","DOI":"10.1016\/j.patcog.2010.02.006","article-title":"The theoreticframework of local weighted approximation for microarray missingvalue estimation","volume":"43","author":"Liu","year":"2010","journal-title":"Pattern Recognit."},{"key":"10.3233\/JIFS-200774_ref17","doi-asserted-by":"crossref","unstructured":"Liu X. and Yin J. , K-Means Clustering With Incomplete Data, IEEE Access 7 (2019).","DOI":"10.1109\/ACCESS.2019.2960531"},{"key":"10.3233\/JIFS-200774_ref18","doi-asserted-by":"publisher","first-page":"596","DOI":"10.1016\/j.ins.2010.12.017","article-title":"Incomplete-case nearest neighbor imputation in software measurement data q","volume":"259","author":"Van Hulse","year":"2014","journal-title":"Inf. Sci. (Ny)."},{"issue":"3","key":"10.3233\/JIFS-200774_ref21","doi-asserted-by":"publisher","first-page":"709","DOI":"10.1007\/s10115-017-1025-5","article-title":"Missing value estimation for microarray data through cluster analysis","volume":"52","author":"Kumar","year":"2017","journal-title":"Knowl. Inf. Syst."},{"key":"10.3233\/JIFS-200774_ref25","doi-asserted-by":"crossref","first-page":"2977","DOI":"10.1007\/s11227-015-1433-9","article-title":"Incomplete high-dimensional data imputation algorithm using feature selection and clustering analysis on cloud","volume":"72","author":"Bu","year":"2016","journal-title":"J. Supercomput."},{"key":"10.3233\/JIFS-200774_ref27","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1016\/j.neucom.2014.12.073","article-title":"Data imputation via evolutionary computation, clustering and a neural network","volume":"156","author":"Gautam","year":"2015","journal-title":"Neurocomputing"},{"key":"10.3233\/JIFS-200774_ref28","unstructured":"Little D.R.R.J.A. , Statistical analysis with missing data. Wiley and Sons, New York, NY, USA (1987)."},{"key":"10.3233\/JIFS-200774_ref29","doi-asserted-by":"publisher","first-page":"65","DOI":"10.1016\/j.asoc.2014.09.052","article-title":"Single imputation with multilayer perceptron and multiple imputation combining multilayer perceptron and k-nearest neighbours for monotone patterns","volume":"29","author":"Silva-ram\u00edrez","year":"2015","journal-title":"Appl. Soft Comput."},{"issue":"5\u20136","key":"10.3233\/JIFS-200774_ref30","doi-asserted-by":"crossref","first-page":"519","DOI":"10.1080\/713827181","article-title":"An analysis of four missing data treatment methods for supervised learning","volume":"17","author":"Batista","year":"2003","journal-title":"Appl. Artif. Intell."},{"issue":"3","key":"10.3233\/JIFS-200774_ref31","doi-asserted-by":"publisher","first-page":"614","DOI":"10.1007\/s10489-015-0666-x","article-title":"Missing data imputation by K nearest neighbours based on grey relational structure and mutual information","volume":"43","author":"Pan","year":"2015","journal-title":"Appl. Intell."},{"key":"10.3233\/JIFS-200774_ref32","doi-asserted-by":"crossref","unstructured":"Xian Wang Z.J. H.F. and Ao Li , Missing value estimation for DNA microarray gene expression data by Support Vector Regression imputation and orthogonal coding scheme, BMC Bioinformatics 7 (2006).","DOI":"10.1186\/1471-2105-7-32"},{"issue":"3","key":"10.3233\/JIFS-200774_ref33","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1023\/B:STCO.0000035301.49549.88","article-title":"A tutorial on support vector regression","volume":"14","author":"Smola","year":"2004","journal-title":"Stat. Comput."},{"issue":"2","key":"10.3233\/JIFS-200774_ref35","doi-asserted-by":"crossref","first-page":"216","DOI":"10.3844\/jcssp.2011.216.224","article-title":"Predicting Missing Attribute Values Using k-Means Clustering","volume":"7","author":"Suguna","year":"2011","journal-title":"J. Comput. Sci."},{"key":"10.3233\/JIFS-200774_ref36","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2018.07.057"},{"key":"10.3233\/JIFS-200774_ref37","doi-asserted-by":"crossref","unstructured":"Sim J. , Lee J.S. and Kwon O. , Missing values and optimal selection of an imputation method and classification algorithm to improve the accuracy of ubiquitous computing applications, Math. Probl. Eng. 2015 (2015).","DOI":"10.1155\/2015\/538613"},{"key":"10.3233\/JIFS-200774_ref39","first-page":"1","article-title":"Introduction to grey system theory","volume":"1","author":"Julong","year":"1989","journal-title":"J. Grey Syst."},{"issue":"6","key":"10.3233\/JIFS-200774_ref40","doi-asserted-by":"crossref","first-page":"843","DOI":"10.1109\/76.785721","article-title":"The gray prediction search algorithm for block motion estimation","volume":"9","author":"Sun","year":"1999","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"10.3233\/JIFS-200774_ref41","doi-asserted-by":"publisher","DOI":"10.1109\/METRICS.2005.51"},{"issue":"6","key":"10.3233\/JIFS-200774_ref42","doi-asserted-by":"publisher","first-page":"2081","DOI":"10.1109\/25.901877","article-title":"Grey-based power control for DS-CDMA cellular mobile systems","volume":"49","author":"Su","year":"2000","journal-title":"IEEE Trans. Veh. Technol."},{"issue":"10","key":"10.3233\/JIFS-200774_ref44","doi-asserted-by":"publisher","first-page":"617","DOI":"10.1109\/TSMC.1979.4310090","article-title":"Pattern recognition with partly missing data","volume":"9","author":"Dixon","year":"1979","journal-title":"IEEE Trans. Syst. Man. Cybern."},{"key":"10.3233\/JIFS-200774_ref45","doi-asserted-by":"publisher","DOI":"10.1016\/j.csda.2015.04.009"},{"key":"10.3233\/JIFS-200774_ref46","doi-asserted-by":"publisher","DOI":"10.1007\/s00521-017-3036-2"},{"issue":"4","key":"10.3233\/JIFS-200774_ref47","doi-asserted-by":"publisher","first-page":"453","DOI":"10.1016\/j.ijar.2010.01.004","article-title":"International Journal of Approximate Reasoning Gaussian kernel based fuzzy roughsets: Model, uncertainty measures and applications","volume":"51","author":"Hu","year":"2010","journal-title":"Int. J.Approx. Reason."},{"issue":"2","key":"10.3233\/JIFS-200774_ref48","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1109\/72.914517","article-title":"An Introduction to Kernel-Based Learning Algorithms","volume":"12","author":"M\u00fcller","year":"2001","journal-title":"IEEE Trans. NEURAL NETWORKS"},{"key":"10.3233\/JIFS-200774_ref50","unstructured":"Han J. , Pei J. and Kamber M. , Data Mining: Concepts and Techniques, Third edit. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc, 2011."},{"key":"10.3233\/JIFS-200774_ref51","doi-asserted-by":"publisher","first-page":"637","DOI":"10.1109\/ICDAR.1997.620583","article-title":"Combining multiple representations and classifiers for pen-based handwritten digit recognition","volume":"2","author":"Alimoglu","year":"1997","journal-title":"in Proceedings of the Fourth International Conference on Document Analysis and Recognition"},{"issue":"6","key":"10.3233\/JIFS-200774_ref52","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1093\/bioinformatics\/17.6.520","article-title":"Missing value estimation methods for DNA microarrays","volume":"17","author":"Troyanskaya","year":"2001","journal-title":"Bioinformatics"},{"issue":"1","key":"10.3233\/JIFS-200774_ref53","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10044-014-0411-9","article-title":"Techniques for dealing with incomplete data: a tutorial and survey","volume":"18","author":"Aste","year":"2015","journal-title":"Pattern Anal. Appl."},{"key":"10.3233\/JIFS-200774_ref54","doi-asserted-by":"publisher","first-page":"311","DOI":"10.1016\/j.knosys.2013.12.005","article-title":"Knowledge-Based Systems FIMUS: A framework for imputing missing values using co-appearance, correlation and similarity analysis","volume":"56","author":"Rahman","year":"2014","journal-title":"Knowledge-Based Syst."},{"key":"10.3233\/JIFS-200774_ref55","doi-asserted-by":"crossref","first-page":"274","DOI":"10.1016\/j.ins.2016.01.018","article-title":"Missing value imputation for the analysis of incomplete traffic accident data","volume":"339","author":"Deb","year":"2016","journal-title":"Inf. Sci. (Ny)."}],"container-title":["Journal of Intelligent &amp; Fuzzy Systems"],"original-title":[],"link":[{"URL":"https:\/\/content.iospress.com\/download?id=10.3233\/JIFS-200774","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,4,29]],"date-time":"2026-04-29T09:44:15Z","timestamp":1777455855000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/full\/10.3233\/JIFS-200774"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,3]]},"references-count":40,"journal-issue":{"issue":"4"},"URL":"https:\/\/doi.org\/10.3233\/jifs-200774","relation":{},"ISSN":["1064-1246","1875-8967"],"issn-type":[{"value":"1064-1246","type":"print"},{"value":"1875-8967","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,4,3]]}}}