{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,10]],"date-time":"2026-06-10T21:55:20Z","timestamp":1781128520403,"version":"3.54.1"},"reference-count":51,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2024,4,10]],"date-time":"2024-04-10T00:00:00Z","timestamp":1712707200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Department of Economy, Science, and Innovation (Flanders)"},{"name":"Flemish Institute for Technological Research (VITO)"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Over the past few years, the scale of sensor networks has greatly expanded. This generates extended spatiotemporal datasets, which form a crucial information resource in numerous fields, ranging from sports and healthcare to environmental science and surveillance. Unfortunately, these datasets often contain missing values due to systematic or inadvertent sensor misoperation. This incompleteness hampers the subsequent data analysis, yet addressing these missing observations forms a challenging problem. This is especially the case when both the temporal correlation of timestamps within a single sensor and the spatial correlation between sensors are important. Here, we apply and evaluate 12 imputation methods to complete the missing values in a dataset originating from large-scale environmental monitoring. As part of a large citizen science project, IoT-based microclimate sensors were deployed for six months in 4400 gardens across the region of Flanders, generating 15-min recordings of temperature and soil moisture. Methods based on spatial recovery as well as time-based imputation were evaluated, including Spline Interpolation, MissForest, MICE, MCMC, M-RNN, BRITS, and others. The performance of these imputation methods was evaluated for different proportions of missing data (ranging from 10% to 50%), as well as a realistic missing value scenario. Techniques leveraging the spatial features of the data tend to outperform the time-based methods, with matrix completion techniques providing the best performance. Our results therefore provide a tool to maximize the benefit from costly, large-scale environmental monitoring efforts.<\/jats:p>","DOI":"10.3390\/s24082416","type":"journal-article","created":{"date-parts":[[2024,4,10]],"date-time":"2024-04-10T06:07:46Z","timestamp":1712729266000},"page":"2416","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":39,"title":["Missing Value Imputation of Wireless Sensor Data for Environmental Monitoring"],"prefix":"10.3390","volume":"24","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1763-3210","authenticated-orcid":false,"given":"Thomas","family":"Decorte","sequence":"first","affiliation":[{"name":"Department of Mathematics, University of Antwerp-imec, Middelheimlaan 1, 2000 Antwerp, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8241-6656","authenticated-orcid":false,"given":"Steven","family":"Mortier","sequence":"additional","affiliation":[{"name":"IDLab, Department of Computer Science, University of Antwerp-imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1933-0750","authenticated-orcid":false,"given":"Jonas J.","family":"Lembrechts","sequence":"additional","affiliation":[{"name":"Plants and Ecosystems, Department of Biology, University of Antwerp, Universiteitsplein 1, 2610 Antwerp, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5334-7655","authenticated-orcid":false,"given":"Filip J. R.","family":"Meysman","sequence":"additional","affiliation":[{"name":"Geobiology, Department of Biology, University of Antwerp, Universiteitsplein 1, 2610 Antwerp, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0351-1714","authenticated-orcid":false,"given":"Steven","family":"Latr\u00e9","sequence":"additional","affiliation":[{"name":"IDLab, Department of Computer Science, University of Antwerp-imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7946-4884","authenticated-orcid":false,"given":"Erik","family":"Mannens","sequence":"additional","affiliation":[{"name":"IDLab, Department of Computer Science, University of Antwerp-imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1105-2028","authenticated-orcid":false,"given":"Tim","family":"Verdonck","sequence":"additional","affiliation":[{"name":"Department of Mathematics, University of Antwerp-imec, Middelheimlaan 1, 2000 Antwerp, Belgium"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2024,4,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"5482","DOI":"10.1109\/JSEN.2022.3148128","article-title":"Challenges, Applications, and Future of Wireless Sensors in Internet of Things: A Review","volume":"22","author":"Jamshed","year":"2022","journal-title":"IEEE Sens. J."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"102038","DOI":"10.1016\/j.phycom.2023.102038","article-title":"Delay optimization and energy balancing algorithm for improving network lifetime in fixed wireless sensor networks","volume":"58","author":"Chaitra","year":"2023","journal-title":"Phys. Commun."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Jino Ramson, S.R., and Moni, D.J. (2017, January 3\u20134). Applications of wireless sensor networks\u2014A survey. Proceedings of the International Conference on Innovations in Electrical, Electronics, Instrumentation and Media Technology (ICEEIMT 2017), Coimbatore, India.","DOI":"10.1109\/ICIEEIMT.2017.8116858"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Kandris, D., Nakas, C., Vomvas, D., and Koulouras, G. (2020). Applications of Wireless Sensor Networks: An Up-to-Date Survey. Appl. Syst. Innov., 3.","DOI":"10.3390\/asi3010014"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"2818","DOI":"10.1109\/TPDS.2013.269","article-title":"Data loss and reconstruction in wireless sensor networks","volume":"25","author":"Kong","year":"2014","journal-title":"IEEE Trans. Parallel Distrib. Syst."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"115","DOI":"10.4236\/wsn.2010.22016","article-title":"K-Nearest Neighbor Based Missing Data Estimation Algorithm in Wireless Sensor Networks","volume":"2","author":"Pan","year":"2010","journal-title":"Wirel. Sens. Netw."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"6616","DOI":"10.1111\/gcb.15123","article-title":"SoilTemp: A global database of near-surface temperature","volume":"26","author":"Lembrechts","year":"2020","journal-title":"Glob. Chang. Biol."},{"key":"ref_8","unstructured":"Kreindler, D.M., and Lumsden, C.J. (2016). Nonlinear Dynamical Systems Analysis for the Behavioral Sciences Using Real Data, CRC Press."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1093\/bioinformatics\/17.6.520","article-title":"Missing value estimation methods for DNA microarrays","volume":"17","author":"Troyanskaya","year":"2001","journal-title":"Bioinformatics"},{"key":"ref_10","unstructured":"Rubin, D.B. (2018). Flexible Imputation of Missing Data, Chapman and Hall\/CRC. [2nd ed.]."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"377","DOI":"10.1002\/sim.4067","article-title":"Multiple imputation using chained equations: Issues and guidance for practice","volume":"30","author":"White","year":"2011","journal-title":"Stat. Med."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1007\/s10182-008-0053-6","article-title":"A Markov chain Monte Carlo algorithm for multiple imputation in large surveys","volume":"92","author":"Schunk","year":"2008","journal-title":"AStA Adv. Stat. Anal."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"112","DOI":"10.1093\/bioinformatics\/btr597","article-title":"MissForest\u2014non-parametric missing value imputation for mixed-type data","volume":"28","author":"Stekhoven","year":"2012","journal-title":"Bioinformatics"},{"key":"ref_14","first-page":"2287","article-title":"Spectral Regularization Algorithms for Learning Large Incomplete Matrices","volume":"11","author":"Mazumder","year":"2010","journal-title":"J. Mach. Learn. Res."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1477","DOI":"10.1109\/TBME.2018.2874712","article-title":"Estimating Missing Data in Temporal Data Streams Using Multi-Directional Recurrent Neural Networks","volume":"66","author":"Yoon","year":"2019","journal-title":"IEEE Trans. Biomed. Eng."},{"key":"ref_16","first-page":"260","article-title":"MIDA: Multiple imputation using denoising autoencoders","volume":"Volume 10939 LNAI","author":"Gondara","year":"2018","journal-title":"Proceedings of the PAKDD 2018: Advances in Knowledge Discovery and Data Mining"},{"key":"ref_17","unstructured":"Cao, W., Wang, D., Li, J., Bytedance, H.Z., Lab, A.I., Li, Y., Lab, B.A., and Li, L. (2018). Advances in Neural Information Processing Systems, Proceedings of the NeurIPS, Montreal, ON, Canada, 3\u20138 December 2018, Neural Information Processing Systems Foundation, Inc. (NeurIPS)."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"913","DOI":"10.1080\/08839514.2019.1637138","article-title":"Comparison of performance of data imputation methods for numeric dataset","volume":"33","author":"Jadhav","year":"2019","journal-title":"Appl. Artif. Intell."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"693674","DOI":"10.3389\/fdata.2021.693674","article-title":"A Benchmark for Data Imputation Methods","volume":"4","author":"Allhorn","year":"2021","journal-title":"Front. Big Data"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"768","DOI":"10.14778\/3377369.3377383","article-title":"Mind the gap","volume":"13","author":"Khayati","year":"2020","journal-title":"Proc. VLDB Endow."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1007\/s00704-012-0723-x","article-title":"Comparison of missing value imputation methods in time series: The case of Turkish meteorological data","volume":"112","author":"Yozgatligil","year":"2013","journal-title":"Theor. Appl. Climatol."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Li, Y., Ai, C., Deshmukh, W.P., and Wu, Y. (2008, January 17\u201320). Data Estimation in Sensor Networks Using Physical and Statistical Methodologies. Proceedings of the 2008 The 28th International Conference on Distributed Computing Systems, Beijing, China.","DOI":"10.1109\/ICDCS.2008.22"},{"key":"ref_23","unstructured":"Lembrechts, J., Van de Vondel, S., Allonsius, C., Nijs, I., and Meysman, F. (2022). CurieuzeNeuzen in de tuin: Eerste ervaringen met de microklimaatnetwerken van de toekomst. Bodem: Tijdschrift voor informatie-uitwisseling en discussie over duurzaam bodembeheer.-Alphen aan den Rijn, 1991, currens, 33, Available online: https:\/\/edepot.wur.nl\/651758."},{"key":"ref_24","unstructured":"(2024, February 07). CurieuzeNeuzen in de Tuin (Curious Noses In The Garden). Available online: https:\/\/curieuzeneuzen.be\/home-en\/."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"3110","DOI":"10.1111\/gcb.16060","article-title":"Global maps of soil temperature","volume":"28","author":"Lembrechts","year":"2022","journal-title":"Glob. Chang. Biol."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"1168","DOI":"10.1111\/geb.13290","article-title":"Designing countrywide and regional microclimate networks","volume":"30","author":"Lembrechts","year":"2021","journal-title":"Glob. Ecol. Biogeogr."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1016\/j.agrformet.2018.12.018","article-title":"Climate at ecologically relevant scales: A new temperature and soil moisture logger for long-term microclimate measurement","volume":"268","author":"Wild","year":"2019","journal-title":"Agric. For. Meteorol."},{"key":"ref_28","first-page":"298","article-title":"Landmark based shortest path detection by using A* and Haversine formula","volume":"1","author":"Chopde","year":"2013","journal-title":"Int. J. Innov. Res. Comput. Commun. Eng."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Little, R.J., and Rubin, D.B. (2019). Statistical Analysis with Missing Data, John Wiley & Sons.","DOI":"10.1002\/9781119482260"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"581","DOI":"10.1093\/biomet\/63.3.581","article-title":"Inference and missing data","volume":"63","author":"Rubin","year":"1976","journal-title":"Biometrika"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Schafer, J.L. (1997). Analysis of Incomplete Multivariate Data, CRC Press.","DOI":"10.1201\/9781439821862"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Van Buuren, S. (2018). Flexible Imputation of Missing Data, CRC Press.","DOI":"10.1201\/9780429492259"},{"key":"ref_33","first-page":"339","article-title":"Comparison of missing value imputation methods for crop yield data","volume":"17","author":"Lokupitiya","year":"2006","journal-title":"Environ. Off. J. Int. Environ. Soc."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1038\/s41586-020-2649-2","article-title":"Array programming with NumPy","volume":"585","author":"Harris","year":"2020","journal-title":"Nature"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1002\/mpr.329","article-title":"Multiple imputation by chained equations: What is it and how does it work?","volume":"20","author":"Azur","year":"2011","journal-title":"Int. J. Methods Psychiatr. Res."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1177\/0962280206074463","article-title":"Multiple imputation of discrete and continuous data by fully conditional specification","volume":"16","year":"2007","journal-title":"Stat. Methods Med. Res."},{"key":"ref_37","first-page":"1","article-title":"mice: Multivariate Imputation by Chained Equations in R","volume":"45","year":"2011","journal-title":"J. Stat. Softw."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"278","DOI":"10.1080\/01621459.1994.10476469","article-title":"Sequential imputations and Bayesian missing data problems","volume":"89","author":"Kong","year":"1994","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1109\/TPAMI.1984.4767596","article-title":"Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images","volume":"6","author":"Geman","year":"1984","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_40","unstructured":"Le Gruenwald, M.H. (2005, January 5\u20138). Estimating missing values in related sensor data streams. Proceedings of the COMAD, Goa, India."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Deng, Y., Han, C., Guo, J., and Sun, L. (2021). Temporal and Spatial Nearest Neighbor Values Based Missing Data Imputation in Wireless Sensor Networks. Sensors, 21.","DOI":"10.3390\/s21051782"},{"key":"ref_42","unstructured":"Van Rossum, G., and Drake, F.L. (2009). Python 3 Reference Manual, CreateSpace."},{"key":"ref_43","first-page":"2825","article-title":"Scikit-learn: Machine Learning in Python","volume":"12","author":"Pedregosa","year":"2011","journal-title":"J. Mach. Learn. Res."},{"key":"ref_44","unstructured":"Rubinsteyn, A., and Feldman, S. (2024, February 05). fancyimpute: An Imputation Library for Python. Available online: https:\/\/github.com\/iskandr\/fancyimpute."},{"key":"ref_45","unstructured":"Gonzalez-Nieto, P.L., Flechoso, M.G., Mocoroa, M.A., Martin, A.M., Lorenzo, M.G., Gomez, G.C., Gomez, J.A., Fraile, A.C., Dagan, J.O., and Palomares, R.M. (2020, January 2\u20134). Design and Development of a Virtual Laboratory in Python for the Teaching of Data Analysis and Mathematics in Geology: Geopy. Proceedings of the INTED2020 Proceedings, Valencia, Spain."},{"key":"ref_46","unstructured":"Chollet, F. (2024, February 07). Keras. Available online: https:\/\/keras.io."},{"key":"ref_47","unstructured":"van der Walt, S., and Millman, J. (2010, January 28\u201330). Data Structures for Statistical Computing in Python. Proceedings of the 9th Python in Science Conference, Austin, TX, USA."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1038\/s41592-019-0686-2","article-title":"SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python","volume":"17","author":"Virtanen","year":"2020","journal-title":"Nat. Methods"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"3021","DOI":"10.21105\/joss.03021","article-title":"Seaborn: Statistical data visualization","volume":"6","author":"Waskom","year":"2021","journal-title":"J. Open Source Softw."},{"key":"ref_50","unstructured":"Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G.S., Davis, A., Dean, J., and Devin, M. (2024, February 12). TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. Available online: https:\/\/www.tensorflow.org\/."},{"key":"ref_51","unstructured":"Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., and Antiga, L. (2019). Advances in Neural Information Processing Systems 32, Curran Associates, Inc."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/8\/2416\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:25:35Z","timestamp":1760106335000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/24\/8\/2416"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,10]]},"references-count":51,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2024,4]]}},"alternative-id":["s24082416"],"URL":"https:\/\/doi.org\/10.3390\/s24082416","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,10]]}}}