{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,3]],"date-time":"2026-06-03T06:40:44Z","timestamp":1780468844909,"version":"3.54.1"},"reference-count":70,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T00:00:00Z","timestamp":1740009600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"project \u201cSAIL: SustAInable Lifecycle of Intelligent SocioTechnical Systems\u201d","award":["NW21-059B"],"award-info":[{"award-number":["NW21-059B"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Data are crucial components of machine learning and deep learning in real-world applications. However, when collecting data from actual systems, we often encounter issues with missing information, which can harm accuracy and lead to biased results. In the context of video surveillance, missing data may arise due to obstructions, varying camera angles, or technical issues, resulting in incomplete information about the observed scene. This paper introduces a method for handling missing data in tabular formats, specifically focusing on video surveillance. The core idea is to fill in the missing values for a specific feature using values from other related features rather than relying on all available features, which we refer to as the imputation approach based on informative features. The paper presents three sets of experiments. The first set uses synthetic datasets to compare four optimization algorithms\u2014Particle Swarm Optimization (PSO), Grey Wolf Optimization (GWO), Whale Optimization Algorithm (WOA), and the Sine\u2013Cosine Algorithm (SCA)\u2014to determine which one best identifies features related to the target feature. The second set works with real-world datasets, while the third focuses on video-surveillance datasets. Each experiment compares the proposed method, utilizing the best optimizer from the first set, against leading imputation methods. The experiments evaluate different types of data and various missing-data rates, ensuring that randomness does not introduce bias. In the first experiment, using only synthetic data, the results indicate that the WOA-based approach outperforms PSO, GWO, and SCA optimization algorithms. The second experiment used real datasets, while the third used tabular data extracted from a video-surveillance system. Both experiments show that our WOA-based imputation method produces promising results, outperforming other state-of-the-art imputation methods.<\/jats:p>","DOI":"10.3390\/a18030119","type":"journal-article","created":{"date-parts":[[2025,2,20]],"date-time":"2025-02-20T04:03:17Z","timestamp":1740024197000},"page":"119","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Using Optimization Algorithms for Effective Missing-Data Imputation: A Case Study of Tabular Data Derived from Video Surveillance"],"prefix":"10.3390","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0009-0006-2876-2666","authenticated-orcid":false,"given":"Mahmoud M.","family":"Eid","sequence":"first","affiliation":[{"name":"Department of Computer Science, El-Shorouk Academy, Cairo 11837, Egypt"},{"name":"Faculty of Science, Al-Azhar University, Cairo 11651, Egypt"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kamal","family":"ElDahshan","sequence":"additional","affiliation":[{"name":"Faculty of Science, Al-Azhar University, Cairo 11651, Egypt"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Abdelatif H.","family":"Abouali","sequence":"additional","affiliation":[{"name":"Murray State University, Murray, KY 42071, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Alaa","family":"Tharwat","sequence":"additional","affiliation":[{"name":"Bielefeld University of Applied Sciences and Arts, 33619 Bielefeld, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2025,2,20]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1016\/j.jsp.2009.10.001","article-title":"An introduction to modern missing data analyses","volume":"48","author":"Baraldi","year":"2010","journal-title":"J. Sch. Psychol."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Allison, P.D. (2009). Missing data. The SAGE Handbook of Quantitative Methods in Psychology, Sage Publications.","DOI":"10.4135\/9780857020994.n4"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1093\/jpepsy\/jst048","article-title":"On the joys of missing data","volume":"39","author":"Little","year":"2014","journal-title":"J. Pediatr. Psychol."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Little, R.J., and Rubin, D.B. (2019). Statistical Analysis with Missing Data, John Wiley & Sons.","DOI":"10.1002\/9781119482260"},{"key":"ref_5","unstructured":"Enders, C.K. (2022). Applied Missing Data Analysis, Guilford Publications."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"913","DOI":"10.1080\/08839514.2019.1637138","article-title":"Comparison of performance of data imputation methods for numeric dataset","volume":"33","author":"Jadhav","year":"2019","journal-title":"Appl. Artif. Intell."},{"key":"ref_7","first-page":"261","article-title":"Missing data imputation techniques","volume":"2","author":"Song","year":"2007","journal-title":"Int. J. Bus. Intell. Data Min."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1016\/j.artmed.2010.05.002","article-title":"Missing data imputation using statistical and machine learning methods in a real breast cancer problem","volume":"50","author":"Jerez","year":"2010","journal-title":"Artif. Intell. Med."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"J\u00e4ger, S., Allhorn, A., and Bie\u00dfmann, F. (2021). A benchmark for data imputation methods. Front. Big Data, 4.","DOI":"10.3389\/fdata.2021.693674"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Pavithrakannan, R., Fenn, N.B., Raman, S., Kalyanaraman, V., Murugananthan, V.K., and Janarthanan, J. (2021, January 21\u201324). Imputation Analysis of Central Tendencies for Classification. Proceedings of the 2021 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS), Toronto, ON, Canada.","DOI":"10.1109\/IEMTRONICS52119.2021.9422507"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1111\/j.1751-5823.2010.00103.x","article-title":"A review of hot deck imputation for survey non-response","volume":"78","author":"Andridge","year":"2010","journal-title":"Int. Stat. Rev."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1016\/j.csda.2014.09.008","article-title":"A hot deck imputation procedure for multiply imputing nonignorable missing data: The proxy pattern-mixture hot deck","volume":"82","author":"Sullivan","year":"2015","journal-title":"Comput. Stat. Data Anal."},{"key":"ref_13","first-page":"141","article-title":"Analysis of Missing Value Imputation Application with K-Nearest Neighbor (K-NN) Algorithm in Dataset","volume":"5","author":"Sallaby","year":"2021","journal-title":"Int. J. Inform. Comput. Sci."},{"key":"ref_14","unstructured":"Yoon, J., Jordon, J., and Schaar, M. (2018, January 10\u201315). Gain: Missing data imputation using generative adversarial nets. Proceedings of the International Conference on Machine Learning, PMLR, Stockholm, Sweden."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1177\/0962280206074463","article-title":"Multiple imputation of discrete and continuous data by fully conditional specification","volume":"16","year":"2007","journal-title":"Stat. Methods Med. Res."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Jovi\u0107, A., Brki\u0107, K., and Bogunovi\u0107, N. (2015, January 25\u201329). A review of feature selection methods with applications. Proceedings of the 2015 38th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO), Opatija, Croatia.","DOI":"10.1109\/MIPRO.2015.7160458"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1016\/j.jclinepi.2009.12.008","article-title":"Missing covariate data in medical research: To impute is better than to ignore","volume":"63","author":"Janssen","year":"2010","journal-title":"J. Clin. Epidemiol."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"15844","DOI":"10.1109\/ACCESS.2018.2810849","article-title":"Improvement of generalization ability of deep CNN via implicit regularization in two-stage training process","volume":"6","author":"Zheng","year":"2018","journal-title":"IEEE Access"},{"key":"ref_19","unstructured":"Le, L., Patterson, A., and White, M. (2018, January 2\u20138). Supervised autoencoders: Improving generalization performance with unsupervised regularizers. Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montr\u00e9al, QC, Canada."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Little, R.J., and Rubin, D.B. (2002). Single imputation methods. Statistical Analysis with Missing Data, Wiley.","DOI":"10.1002\/9781119013563"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1016\/j.knosys.2018.06.012","article-title":"Handling missing values: A study of popular imputation packages in R","volume":"160","author":"Yadav","year":"2018","journal-title":"Knowl.-Based Syst."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1093\/biomet\/63.3.581","article-title":"Inference and missing data","volume":"63","author":"Rubin","year":"1976","journal-title":"Biometrika"},{"key":"ref_23","first-page":"79","article-title":"Determination of the best single imputation algorithm for missing rainfall data treatment","volume":"12","author":"Zakaria","year":"2016","journal-title":"J. Qual. Meas. Anal. JQMA"},{"key":"ref_24","first-page":"79","article-title":"Missing data imputation: Focusing on single imputation","volume":"4","author":"Zhang","year":"2016","journal-title":"Ann. Transl. Med."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1093\/bioinformatics\/17.6.520","article-title":"Missing value estimation methods for DNA microarrays","volume":"17","author":"Troyanskaya","year":"2001","journal-title":"Bioinformatics"},{"key":"ref_26","unstructured":"Pujianto, U., Wibawa, A.P., Akbar, M.I., and Murti, D.M.P. (2019, January 23\u201324). K-nearest neighbor (k-NN) based missing data imputation. Proceedings of the 2019 5th International Conference on Science in Information Technology (ICSITech), Jogjakarta, Indonesia."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1016\/j.neucom.2021.04.010","article-title":"Imputation of missing data with class imbalance using conditional generative adversarial networks","volume":"453","author":"Awan","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"19685","DOI":"10.1007\/s00521-023-08840-2","article-title":"A systematic review of generative adversarial imputation network in missing data imputation","volume":"35","author":"Zhang","year":"2023","journal-title":"Neural Comput. Appl."},{"key":"ref_29","unstructured":"Rubin, D.B. (2004). Multiple Imputation for Nonresponse in Surveys, John Wiley & Sons."},{"key":"ref_30","first-page":"1","article-title":"The rise of multiple imputation: A review of the reporting and implementation of the method in medical research","volume":"15","author":"Lee","year":"2015","journal-title":"BMC Med. Res. Methodol."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"107079","DOI":"10.1016\/j.knosys.2021.107079","article-title":"Active learning with missing values considering imputation uncertainty","volume":"224","author":"Han","year":"2021","journal-title":"Knowl.-Based Syst."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"142","DOI":"10.1214\/18-STS644","article-title":"Multiple Imputation: A Review of Practical and Theoretical Findings","volume":"33","author":"Murray","year":"2018","journal-title":"Stat. Sci."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Van Buuren, S. (2018). Flexible Imputation of Missing Data, CRC Press.","DOI":"10.1201\/9780429492259"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"76","DOI":"10.1002\/nur.10015","article-title":"Multiple imputation for missing data","volume":"25","author":"Patrician","year":"2002","journal-title":"Res. Nurs. Health"},{"key":"ref_35","first-page":"30","article-title":"Multiple imputation with multivariate imputation by chained equation (MICE) package","volume":"4","author":"Zhang","year":"2016","journal-title":"Ann. Transl. Med."},{"key":"ref_36","first-page":"979","article-title":"Comparison of Single and MICE Imputation Methods for Missing Values: A Simulation Study","volume":"29","author":"Pauzi","year":"2021","journal-title":"Pertanika J. Sci. Technol."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Ding, Y., Chen, L., Hao, K., Ding, Y., Chen, L., and Hao, K. (2018). Bio-inspired optimization algorithms. Bio-Inspired Collaborative Intelligent Control and Optimization, Springer.","DOI":"10.1007\/978-981-10-6689-4"},{"key":"ref_38","unstructured":"Simon, D. (2013). Evolutionary Optimization Algorithms, John Wiley & Sons."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"114430","DOI":"10.1016\/j.eswa.2020.114430","article-title":"A conceptual and practical comparison of PSO-style optimization algorithms","volume":"167","author":"Tharwat","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_40","unstructured":"Kennedy, J., and Eberhart, R. (December, January 27). Particle swarm optimization. Proceedings of the ICNN\u201995-International Conference on Neural Networks, Perth, WA, Australia."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1109\/4235.985692","article-title":"The particle swarm-explosion, stability, and convergence in a multidimensional complex space","volume":"6","author":"Clerc","year":"2002","journal-title":"IEEE Trans. Evol. Comput."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"387","DOI":"10.1007\/s00500-016-2474-6","article-title":"Particle swarm optimization algorithm: An overview","volume":"22","author":"Wang","year":"2018","journal-title":"Soft Comput."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"001","DOI":"10.30574\/gjeta.2020.3.3.0033","article-title":"The particle swarm optimization (PSO) algorithm application\u2013A review","volume":"3","author":"Aje","year":"2020","journal-title":"Glob. J. Eng. Technol. Adv."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1016\/j.chemolab.2015.08.020","article-title":"Particle swarm optimization (PSO). A tutorial","volume":"149","author":"Marini","year":"2015","journal-title":"Chemom. Intell. Lab. Syst."},{"key":"ref_45","first-page":"231","article-title":"An improved particle swarm optimization algorithm","volume":"193","author":"Jiang","year":"2007","journal-title":"Appl. Math. Comput."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/j.advengsoft.2016.01.008","article-title":"The whale optimization algorithm","volume":"95","author":"Mirjalili","year":"2016","journal-title":"Adv. Eng. Softw."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.swevo.2019.03.004","article-title":"A comprehensive survey: Whale Optimization Algorithm and its applications","volume":"48","author":"Gharehchopogh","year":"2019","journal-title":"Swarm Evol. Comput."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"16245","DOI":"10.1007\/s00521-020-04849-z","article-title":"Whale optimization algorithm: A systematic review of contemporary applications, modifications and developments","volume":"32","author":"Rana","year":"2020","journal-title":"Neural Comput. Appl."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s00500-016-2442-1","article-title":"Optimizing connection weights in neural networks using the whale optimization algorithm","volume":"22","author":"Aljarah","year":"2018","journal-title":"Soft Comput."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"4113","DOI":"10.1007\/s11831-023-09928-7","article-title":"A systematic review of the whale optimization algorithm: Theoretical foundation, improvements, and hybridizations","volume":"30","author":"Zamani","year":"2023","journal-title":"Arch. Comput. Methods Eng."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"104558","DOI":"10.1016\/j.engappai.2021.104558","article-title":"A multi-strategy Whale optimization algorithm and its application","volume":"108","author":"Yang","year":"2022","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"46","DOI":"10.1016\/j.advengsoft.2013.12.007","article-title":"Grey wolf optimizer","volume":"69","author":"Mirjalili","year":"2014","journal-title":"Adv. Eng. Softw."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1007\/s10489-014-0645-7","article-title":"How effective is the Grey Wolf optimizer in training multi-layer perceptrons","volume":"43","author":"Mirjalili","year":"2015","journal-title":"Appl. Intell."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"6617","DOI":"10.1007\/s00500-018-3310-y","article-title":"An improved hybrid grey wolf optimization algorithm","volume":"23","author":"Teng","year":"2019","journal-title":"Soft Comput."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1007\/s00521-017-3272-5","article-title":"Grey wolf optimizer: A review of recent variants and applications","volume":"30","author":"Faris","year":"2018","journal-title":"Neural Comput. Appl."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1016\/j.eswa.2015.10.039","article-title":"Multi-objective grey wolf optimizer: A novel algorithm for multi-criterion optimization","volume":"47","author":"Mirjalili","year":"2016","journal-title":"Expert Syst. Appl."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"7711","DOI":"10.1007\/s00521-021-06885-9","article-title":"Adaptive grey wolf optimizer","volume":"34","author":"Meidani","year":"2022","journal-title":"Neural Comput. Appl."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"120","DOI":"10.1016\/j.knosys.2015.12.022","article-title":"SCA: A sine cosine algorithm for solving optimization problems","volume":"96","author":"Mirjalili","year":"2016","journal-title":"Knowl.-Based Syst."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Hafez, A.I., Zawbaa, H.M., Emary, E., and Hassanien, A.E. (2016, January 2\u20135). Sine cosine optimization algorithm for feature selection. Proceedings of the 2016 International Symposium on Innovations in Intelligent Systems and Applications (INISTA), Sinaia, Romania.","DOI":"10.1109\/INISTA.2016.7571853"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"27434","DOI":"10.1109\/ACCESS.2021.3058128","article-title":"A modified sine cosine algorithm for solving optimization problems","volume":"9","author":"Wang","year":"2021","journal-title":"IEEE Access"},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1007\/s42979-019-0050-8","article-title":"Mitigating metaphors: A comprehensible guide to recent nature-inspired algorithms","volume":"1","author":"Lones","year":"2020","journal-title":"SN Comput. Sci."},{"key":"ref_62","unstructured":"Segal, M.R. (2025, February 11). Machine Learning Benchmarks and Random Forest Regression. Available online: https:\/\/escholarship.org\/uc\/item\/35x3v9t4."},{"key":"ref_63","first-page":"18","article-title":"Classification and regression by randomForest","volume":"2","author":"Liaw","year":"2002","journal-title":"R News"},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1016\/j.apenergy.2018.09.182","article-title":"Random forest regression for online capacity estimation of lithium-ion batteries","volume":"232","author":"Li","year":"2018","journal-title":"Appl. Energy"},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"804","DOI":"10.1016\/j.oregeorev.2015.01.001","article-title":"Machine learning predictive models for mineral prospectivity: An evaluation of neural networks, random forest, regression trees and support vector machines","volume":"71","year":"2015","journal-title":"Ore Geol. Rev."},{"key":"ref_66","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1016\/j.rse.2018.12.008","article-title":"A practical method for reducing terrain effect on land surface temperature using random forest regression","volume":"221","author":"Zhao","year":"2019","journal-title":"Remote Sens. Environ."},{"key":"ref_67","doi-asserted-by":"crossref","first-page":"125861","DOI":"10.1016\/j.jhydrol.2020.125861","article-title":"Regional hydrological frequency analysis at ungauged sites with random forest regression","volume":"594","author":"Desai","year":"2021","journal-title":"J. Hydrol."},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Couronn\u00e9, R., Probst, P., and Boulesteix, A.L. (2018). Random forest versus logistic regression: A large-scale benchmark experiment. BMC Bioinform., 19.","DOI":"10.1186\/s12859-018-2264-5"},{"key":"ref_69","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1504\/IJAPR.2016.079733","article-title":"Principal component analysis\u2014A tutorial","volume":"3","author":"Tharwat","year":"2016","journal-title":"Int. J. Appl. Pattern Recognit."},{"key":"ref_70","unstructured":"Shlens, J. (2014). A tutorial on principal component analysis. arXiv."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/3\/119\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T16:38:27Z","timestamp":1760027907000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/18\/3\/119"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,20]]},"references-count":70,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2025,3]]}},"alternative-id":["a18030119"],"URL":"https:\/\/doi.org\/10.3390\/a18030119","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,2,20]]}}}