{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,18]],"date-time":"2026-06-18T13:45:26Z","timestamp":1781790326832,"version":"3.54.5"},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"9","license":[{"start":{"date-parts":[[2025,8,30]],"date-time":"2025-08-30T00:00:00Z","timestamp":1756512000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,8,30]],"date-time":"2025-08-30T00:00:00Z","timestamp":1756512000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100002352","name":"Ain Shams University","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100002352","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Cluster Comput"],"published-print":{"date-parts":[[2025,10]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Water quality plays a pivotal role in ensuring the safety and sustainability of water resources, with significant implications for environmental protection, public health, and various industrial applications. This paper presents both a review of related state-of-the-art works and an implementation and application of adapted versions of these related works for predicting water quality parameters on a new water dataset from Galicia, Spain. The reviewed studies encompass a range of predictive models applied to diverse water quality parameters, including dissolved oxygen levels, pH levels, and other complex water parameters. These models include various machine learning and deep learning methods such as Convolutional Neural Networks (CNN), Long Short-Term Memory (LSTM) networks, and Bidirectional LSTMs. This research contributes by implementing various models on the dataset and experimentally demonstrating the impact of key factors on model performance. These factors include model sophistication, imputation techniques, recurrent architectures, and customized approaches for water quality prediction using deep learning. Notably, K-Nearest Neighbors (KNN) imputation enhances performance by preserving local data relationships, while noise filtering further improves predictive accuracy. Additionally, we observe that smaller batch sizes and learning rates lead to better generalization in sparse datasets, outperforming traditional approaches. The conclusions are guided by comparing the performance of all models on the Galician dataset using the Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and Coefficient of Determination (<jats:inline-formula>\n              <jats:alternatives>\n                <jats:tex-math>$$R^{2}$$<\/jats:tex-math>\n                <mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\">\n                  <mml:msup>\n                    <mml:mi>R<\/mml:mi>\n                    <mml:mn>2<\/mml:mn>\n                  <\/mml:msup>\n                <\/mml:math>\n              <\/jats:alternatives>\n            <\/jats:inline-formula>). This paper provides the first DL-based water quality analysis for Galicia, emphasizing the need for regional model adaptation. Our results guide future research directions, including the exploration of Transformer-based architectures for time-series data, more sophisticated feature selection techniques, and neural-network-based imputation strategies to enhance data completeness.<\/jats:p>","DOI":"10.1007\/s10586-025-05221-3","type":"journal-article","created":{"date-parts":[[2025,8,30]],"date-time":"2025-08-30T10:58:56Z","timestamp":1756551536000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Advancements in water quality prediction: a practical review of machine learning and deep learning approaches"],"prefix":"10.1007","volume":"28","author":[{"given":"Marwah","family":"A. Helaly","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sherine","family":"Rady","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mohamed","family":"Mabrouk","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mostafa","family":"M. Aref","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Sebastian","family":"Villarroya","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jose M.","family":"Cotos","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"David","family":"Mera","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2025,8,30]]},"reference":[{"issue":"1","key":"5221_CR1","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1016\/j.ejar.2021.10.001","volume":"48","author":"AM Abd El-Mageed","year":"2022","unstructured":"Abd El-Mageed, A.M., Enany, T.A., Goher, M.E., Hassouna, M.E.: Forecasting water quality parameters in Wadi El Rayan Upper Lake, Fayoum, Egypt using adaptive neuro-fuzzy inference system. Egypt. J. Aquat. Res. 48(1), 13\u201319 (2022)","journal-title":"Egypt. J. Aquat. Res."},{"issue":"9","key":"5221_CR2","doi-asserted-by":"publisher","first-page":"1657","DOI":"10.3390\/w15091657","volume":"15","author":"I Al-Shourbaji","year":"2023","unstructured":"Al-Shourbaji, I., Duraibi, S.: IWQP4Net: an efficient convolution neural network for irrigation water quality prediction. Water 15(9), 1657 (2023)","journal-title":"Water"},{"key":"5221_CR3","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/2484\/1\/012051","volume":"2484","author":"MV Anand","year":"2023","unstructured":"Anand, M.V., Sohitha, C., Saraswathi, G.N., Lavanya, G.: Water quality prediction using CNN. J. Phys. Conf. Ser. 2484, 012051 (2023)","journal-title":"J. Phys. Conf. Ser."},{"key":"5221_CR4","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1016\/j.ins.2021.04.057","volume":"571","author":"J Bi","year":"2021","unstructured":"Bi, J., Lin, Y., Dong, Q., Yuan, H., Zhou, M.: Large-scale water quality prediction with integrated deep neural network. Inf. Sci. 571, 191\u2013205 (2021)","journal-title":"Inf. Sci."},{"issue":"3","key":"5221_CR5","doi-asserted-by":"publisher","first-page":"67","DOI":"10.3390\/hydrology7030067","volume":"7","author":"SH Ewaid","year":"2020","unstructured":"Ewaid, S.H., Abed, S.A., Al-Ansari, N., Salih, R.M.: Development and evaluation of a water quality index for the Iraqi rivers. Hydrology 7(3), 67 (2020)","journal-title":"Hydrology"},{"key":"5221_CR6","first-page":"2129","volume":"67","author":"M Galal","year":"2020","unstructured":"Galal, M., Soliman, A., Kamel, G., Zaher, K., El-Fakharany, Z.: Prediction and assessment of surface water quality effect on groundwater in El-Galuybia, Egypt. J. Eng. Appl. Sci. 67, 2129\u20132148 (2020)","journal-title":"J. Eng. Appl. Sci."},{"key":"5221_CR7","doi-asserted-by":"publisher","DOI":"10.1016\/j.envpol.2020.116332","volume":"271","author":"S Giri","year":"2021","unstructured":"Giri, S.: Water quality prospective in twenty first century: status of water quality in major river basins, contemporary strategies and impediments: a review. Environ. Pollut. 271, 116332 (2021)","journal-title":"Environ. Pollut."},{"key":"5221_CR8","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2022.117972","volume":"208","author":"MA Helaly","year":"2022","unstructured":"Helaly, M.A., Rady, S., Aref, M.M.: BERT contextual embeddings for taxonomic classification of bacterial DNA sequences. Expert Syst. Appl. 208, 117972 (2022)","journal-title":"Expert Syst. Appl."},{"key":"5221_CR9","first-page":"1","volume":"2022","author":"TO Hodson","year":"2022","unstructured":"Hodson, T.O.: Root mean square error (RMSE) or mean absolute error (MAE): when to use them or not. Geosci. Model Dev. Discuss. 2022, 1\u201310 (2022)","journal-title":"Geosci. Model Dev. Discuss."},{"issue":"23","key":"5221_CR10","doi-asserted-by":"publisher","first-page":"3364","DOI":"10.3390\/w13233364","volume":"13","author":"L Karamoutsou","year":"2021","unstructured":"Karamoutsou, L., Psilovikos, A.: Deep learning in water resources management: the case study of Kastoria Lake in Greece. Water 13(23), 3364 (2021)","journal-title":"Water"},{"issue":"9","key":"5221_CR11","doi-asserted-by":"publisher","first-page":"12875","DOI":"10.1007\/s11356-021-13875-w","volume":"29","author":"S Khullar","year":"2022","unstructured":"Khullar, S., Singh, N.: Water quality assessment of a river using deep learning Bi-LSTM methodology: forecasting and validation. Environ. Sci. Pollut. Res. 29(9), 12875\u201312889 (2022)","journal-title":"Environ. Sci. Pollut. Res."},{"issue":"1","key":"5221_CR12","first-page":"1","volume":"10","author":"H Larochelle","year":"2009","unstructured":"Larochelle, H., Bengio, Y., Louradour, J., Lamblin, P.: Exploring strategies for training deep neural networks. J. Mach. Learn. Res. 10(1), 1\u201340 (2009)","journal-title":"J. Mach. Learn. Res."},{"issue":"4","key":"5221_CR13","doi-asserted-by":"publisher","first-page":"1268","DOI":"10.1093\/ije\/dyad008","volume":"52","author":"KJ Lee","year":"2023","unstructured":"Lee, K.J., Carlin, J.B., Simpson, J.A., Moreno-Betancur, M.: Assumptions and analysis planning in studies with missing data in multiple variables: moving beyond the MCAR\/MAR\/MNAR classification. Int. J. Epidemiol. 52(4), 1268\u20131275 (2023)","journal-title":"Int. J. Epidemiol."},{"key":"5221_CR14","doi-asserted-by":"publisher","first-page":"166","DOI":"10.1016\/j.cageo.2016.12.013","volume":"100","author":"D Mera","year":"2017","unstructured":"Mera, D., Bolon-Canedo, V., Alonso-Betanzos, A.: On the use of feature selection to improve the detection of sea oil spills in SAR images. Comput. Geosci. 100, 166\u2013178 (2017)","journal-title":"Comput. Geosci."},{"key":"5221_CR15","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1016\/j.cageo.2014.07.015","volume":"72","author":"D Mera","year":"2014","unstructured":"Mera, D., Cotos, J.M., Varela-Pet, J., Rodr\u00edguez, P.G., Caro, A.: Automatic decision support system based on SAR data for oil spill detection. Comput. Geosci. 72, 184\u2013191 (2014)","journal-title":"Comput. Geosci."},{"issue":"4","key":"5221_CR16","doi-asserted-by":"publisher","first-page":"76","DOI":"10.1007\/s13201-022-01590-x","volume":"12","author":"A Mokhtar","year":"2022","unstructured":"Mokhtar, A., Elbeltagi, A., Gyasi-Agyei, Y., Al-Ansari, N., Abdel-Fattah, M.K.: Prediction of irrigation water quality indices based on machine learning and regression models. Appl. Water Sci. 12(4), 76 (2022)","journal-title":"Appl. Water Sci."},{"issue":"12","key":"5221_CR17","doi-asserted-by":"publisher","first-page":"35307","DOI":"10.1007\/s11042-023-16737-4","volume":"83","author":"MY Shams","year":"2024","unstructured":"Shams, M.Y., Elshewey, A.M., El-Kenawy, E.S.M., Ibrahim, A., Talaat, F.M., Tarek, Z.: Water quality prediction using machine learning models based on grid search method. Multimed. Tools Appl. 83(12), 35307\u201335334 (2024)","journal-title":"Multimed. Tools Appl."},{"key":"5221_CR18","doi-asserted-by":"crossref","unstructured":"Siami-Namini, S., Tavakoli, N., Namin, A.S.: The performance of LSTM and BiLSTM in forecasting time series. In: 2019 IEEE International Conference on Big Data (Big Data), 2019, pp. 3285\u20133292. IEEE (2019)","DOI":"10.1109\/BigData47090.2019.9005997"},{"issue":"12","key":"5221_CR19","doi-asserted-by":"publisher","first-page":"2295","DOI":"10.1109\/JPROC.2017.2761740","volume":"105","author":"V Sze","year":"2017","unstructured":"Sze, V., Chen, Y.H., Yang, T.J., Emer, J.S.: Efficient processing of deep neural networks: a tutorial and survey. Proc. IEEE 105(12), 2295\u20132329 (2017)","journal-title":"Proc. IEEE"},{"issue":"6","key":"5221_CR20","doi-asserted-by":"publisher","first-page":"4235","DOI":"10.1007\/s11053-021-09922-5","volume":"30","author":"T Tiyasha","year":"2021","unstructured":"Tiyasha, T., Tung, T.M., Yaseen, Z.M.: Deep learning for prediction of water quality index classification: tropical catchment environmental assessment. Nat. Resour. Res. 30(6), 4235\u20134254 (2021)","journal-title":"Nat. Resour. Res."},{"key":"5221_CR21","doi-asserted-by":"crossref","unstructured":"United Nations: United Nations Sustainable Development Goals. United Nations (2023). https:\/\/sdgs.un.org\/goals. Accessed 1 Nov 2023","DOI":"10.1007\/978-3-030-91260-4_95-1"},{"key":"5221_CR22","doi-asserted-by":"publisher","first-page":"418","DOI":"10.1016\/j.future.2020.02.022","volume":"107","author":"D Wu","year":"2020","unstructured":"Wu, D., Wang, H., Seidu, R.: Smart data driven quality prediction for urban water source management. Future Gener. Comput. Syst. 107, 418\u2013432 (2020)","journal-title":"Future Gener. Comput. Syst."},{"issue":"21","key":"5221_CR23","doi-asserted-by":"publisher","first-page":"3408","DOI":"10.3390\/w14213408","volume":"14","author":"X Wu","year":"2022","unstructured":"Wu, X., Zhang, Q., Wen, F., Qi, Y.: A water quality prediction model based on multi-task deep learning: a case study of the Yellow River, China. Water 14(21), 3408 (2022)","journal-title":"Water"},{"key":"5221_CR24","doi-asserted-by":"crossref","unstructured":"Yang, R.: Analyses of approaches to deal with missing data in water quality data set. In: 2022 7th International Conference on Social Sciences and Economic Development (ICSSED 2022), 2022, pp. 1102\u20131108. Atlantis Press (2022)","DOI":"10.2991\/aebmr.k.220405.184"},{"issue":"39","key":"5221_CR25","doi-asserted-by":"publisher","first-page":"55129","DOI":"10.1007\/s11356-021-14687-8","volume":"28","author":"Y Yang","year":"2021","unstructured":"Yang, Y., Xiong, Q., Wu, C., Zou, Q., Yu, Y., Yi, H., Gao, M.: A study on water quality prediction by a hybrid CNN\u2013LSTM model with attention mechanism. Environ. Sci. Pollut. Res. 28(39), 55129\u201355139 (2021)","journal-title":"Environ. Sci. Pollut. Res."},{"key":"5221_CR26","doi-asserted-by":"publisher","first-page":"63","DOI":"10.1016\/j.future.2021.09.033","volume":"128","author":"Y Zhang","year":"2022","unstructured":"Zhang, Y., Thorburn, P.J.: Handling missing data in near real-time environmental monitoring: a system and a review of selected methods. Future Gener. Comput. Syst. 128, 63\u201372 (2022)","journal-title":"Future Gener. Comput. Syst."}],"container-title":["Cluster Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10586-025-05221-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10586-025-05221-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10586-025-05221-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,17]],"date-time":"2025-09-17T21:24:01Z","timestamp":1758144241000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10586-025-05221-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,30]]},"references-count":26,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2025,10]]}},"alternative-id":["5221"],"URL":"https:\/\/doi.org\/10.1007\/s10586-025-05221-3","relation":{},"ISSN":["1386-7857","1573-7543"],"issn-type":[{"value":"1386-7857","type":"print"},{"value":"1573-7543","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,8,30]]},"assertion":[{"value":"28 November 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 February 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 February 2025","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 August 2025","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no Conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}},{"value":"Our research does not involve humans, animals, or any biological material. Therefore, ethical approval and informed consent are not required.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical approval and Informed consent"}}],"article-number":"598"}}