{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,4]],"date-time":"2026-05-04T22:27:29Z","timestamp":1777933649432,"version":"3.51.4"},"reference-count":43,"publisher":"Springer Science and Business Media LLC","issue":"12","license":[{"start":{"date-parts":[[2023,9,29]],"date-time":"2023-09-29T00:00:00Z","timestamp":1695945600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,9,29]],"date-time":"2023-09-29T00:00:00Z","timestamp":1695945600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Kafr El Shiekh University"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Multimed Tools Appl"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Water quality is very dominant for humans, animals, plants, industries, and the environment. In the last decades, the quality of water has been impacted by contamination and pollution. In this paper, the challenge is to anticipate Water Quality Index (WQI) and Water Quality Classification (WQC), such that WQI is a vital indicator for water validity. In this study, parameters optimization and tuning are utilized to improve the accuracy of several machine learning models, where the machine learning techniques are utilized for the process of predicting WQI and WQC. Grid search is a vital method used for optimizing and tuning the parameters for four classification models and also, for optimizing and tuning the parameters for four regression models. Random forest (RF) model, Extreme Gradient Boosting (Xgboost) model, Gradient Boosting (GB) model, and Adaptive Boosting (AdaBoost) model are used as classification models for predicting WQC. K-nearest neighbor (KNN) regressor model, decision tree (DT) regressor model, support vector regressor (SVR) model, and multi-layer perceptron (MLP) regressor model are used as regression models for predicting WQI. In addition, preprocessing step including, data imputation (mean imputation) and data normalization were performed to fit the data and make it convenient for any further processing. The dataset used in this study includes 7 features and 1991 instances. To examine the efficacy of the classification approaches, five assessment metrics were computed: accuracy, recall, precision, Matthews's Correlation Coefficient (MCC), and F1 score. To assess the effectiveness of the regression models, four assessment metrics were computed: Mean Absolute Error (MAE), Median Absolute Error (MedAE), Mean Square Error (MSE), and coefficient of determination (R<jats:sup>2<\/jats:sup>). In terms of classification, the testing findings showed that the GB model produced the best results, with an accuracy of 99.50% when predicting WQC values. According to the experimental results, the MLP regressor model outperformed other models in regression and achieved an R<jats:sup>2<\/jats:sup> value of 99.8% while predicting WQI values.<\/jats:p>","DOI":"10.1007\/s11042-023-16737-4","type":"journal-article","created":{"date-parts":[[2023,9,29]],"date-time":"2023-09-29T17:01:51Z","timestamp":1696006911000},"page":"35307-35334","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":283,"title":["Water quality prediction using machine learning models based on grid search method"],"prefix":"10.1007","volume":"83","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-3021-5902","authenticated-orcid":false,"given":"Mahmoud Y.","family":"Shams","sequence":"first","affiliation":[]},{"given":"Ahmed M.","family":"Elshewey","sequence":"additional","affiliation":[]},{"given":"El-Sayed M.","family":"El-kenawy","sequence":"additional","affiliation":[]},{"given":"Abdelhameed","family":"Ibrahim","sequence":"additional","affiliation":[]},{"given":"Fatma M.","family":"Talaat","sequence":"additional","affiliation":[]},{"given":"Zahraa","family":"Tarek","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,9,29]]},"reference":[{"key":"16737_CR1","doi-asserted-by":"crossref","unstructured":"Jain D, Shah S, Mehta H et al (2021) A Machine Learning Approach to Analyze Marine Life Sustainability. In: Proceedings of International Conference on Intelligent Computing, Information and Control Systems. Springer, pp 619\u2013632","DOI":"10.1007\/978-981-15-8443-5_53"},{"key":"16737_CR2","doi-asserted-by":"publisher","unstructured":"Clark RM, Hakim S, Ostfeld A (2011) Handbook of water and wastewater systems protection. In: Protecting Critical Infrastructure. Springer, pp 1\u201329. https:\/\/doi.org\/10.1007\/978-1-4614-0189-6","DOI":"10.1007\/978-1-4614-0189-6"},{"key":"16737_CR3","doi-asserted-by":"publisher","first-page":"1420","DOI":"10.3390\/s19061420","volume":"19","author":"Z Hu","year":"2019","unstructured":"Hu Z, Zhang Y, Zhao Y et al (2019) A water quality prediction method based on the deep LSTM network considering correlation in smart mariculture. Sensors 19:1420","journal-title":"Sensors"},{"key":"16737_CR4","doi-asserted-by":"publisher","first-page":"1148","DOI":"10.3390\/w10091148","volume":"10","author":"J Zhou","year":"2018","unstructured":"Zhou J, Wang Y, Xiao F et al (2018) Water quality prediction method based on IGRA and LSTM. Water 10:1148","journal-title":"Water"},{"key":"16737_CR5","doi-asserted-by":"publisher","first-page":"5215","DOI":"10.1007\/s10462-022-10143-2","volume":"55","author":"M Waqas","year":"2022","unstructured":"Waqas M, Tu S, Halim Z et al (2022) The role of artificial intelligence and machine learning in wireless networks security: principle, practice and challenges. Artif Intell Rev 55:5215\u20135261. https:\/\/doi.org\/10.1007\/s10462-022-10143-2","journal-title":"Artif Intell Rev"},{"key":"16737_CR6","doi-asserted-by":"publisher","first-page":"106443","DOI":"10.1016\/j.knosys.2020.106443","volume":"208","author":"Z Halim","year":"2020","unstructured":"Halim Z, Waqar M, Tahir M (2020) A machine learning-based investigation utilizing the in-text features for the identification of dominant emotion in an email. Knowl Based Syst 208:106443. https:\/\/doi.org\/10.1016\/j.knosys.2020.106443","journal-title":"Knowl Based Syst"},{"key":"16737_CR7","doi-asserted-by":"publisher","first-page":"610","DOI":"10.3390\/w14040610","volume":"14","author":"J Wu","year":"2022","unstructured":"Wu J, Wang Z (2022) A Hybrid Model for Water Quality Prediction Based on an Artificial Neural Network, Wavelet Transform, and Long Short-Term Memory. Water 14:610","journal-title":"Water"},{"key":"16737_CR8","doi-asserted-by":"publisher","first-page":"1322","DOI":"10.3390\/ijerph15071322","volume":"15","author":"S Lee","year":"2018","unstructured":"Lee S, Lee D (2018) Improved prediction of harmful algal blooms in four Major South Korea\u2019s Rivers using deep learning models. Int J Environ Res Public Health 15:1322","journal-title":"Int J Environ Res Public Health"},{"key":"16737_CR9","doi-asserted-by":"publisher","first-page":"2058","DOI":"10.3390\/su11072058","volume":"11","author":"P Liu","year":"2019","unstructured":"Liu P, Wang J, Sangaiah AK et al (2019) Analysis and prediction of water quality using LSTM deep neural networks in IoT environment. Sustainability 11:2058","journal-title":"Sustainability"},{"key":"16737_CR10","doi-asserted-by":"publisher","first-page":"4259","DOI":"10.3390\/su13084259","volume":"13","author":"M Hmoud Al-Adhaileh","year":"2021","unstructured":"Hmoud Al-Adhaileh M, Waselallah Alsaade F (2021) Modelling and prediction of water quality by using artificial intelligence. Sustainability 13:4259","journal-title":"Sustainability"},{"issue":"5","key":"16737_CR11","first-page":"2496","volume":"8","author":"D Bhardwaj","year":"2017","unstructured":"Bhardwaj D, Verma N (2017) Research paper on analysing impact of various parameters on water quality index. Int J Adv Res Comput Sci 8(5):2496\u2013498","journal-title":"Int J Adv Res Comput Sci"},{"key":"16737_CR12","doi-asserted-by":"publisher","first-page":"1067","DOI":"10.3390\/w14071067","volume":"14","author":"NHA Malek","year":"2022","unstructured":"Malek NHA, Wan Yaacob WF, Md Nasir SA, Shaadan N (2022) Prediction of Water Quality Classification of the Kelantan River Basin, Malaysia, Using Machine Learning Techniques. Water 14:1067","journal-title":"Water"},{"key":"16737_CR13","unstructured":"Slatnia A, Ladjal M, Ouali MA, Imed M (2022) Improving prediction and classification of water quality indices using hybrid machine learning algorithms with features selection analysis. In: Online International Symposium on Applied Mathematics and Engineering (ISAME22), vol 1.\u00a0ISAME22, Istanbul-Turkey, pp 16\u201317"},{"key":"16737_CR14","doi-asserted-by":"publisher","first-page":"112051","DOI":"10.1016\/j.jenvman.2021.112051","volume":"284","author":"T Deng","year":"2021","unstructured":"Deng T, Chau K-W, Duan H-F (2021) Machine learning based marine water quality prediction for coastal hydro-environment management. J Environ Manage 284:112051","journal-title":"J Environ Manage"},{"key":"16737_CR15","doi-asserted-by":"publisher","first-page":"12875","DOI":"10.1007\/s11356-021-13875-w","volume":"29","author":"S Khullar","year":"2022","unstructured":"Khullar S, Singh N (2022) Water quality assessment of a river using deep learning Bi-LSTM methodology: forecasting and validation. Environ Sci Pollut Res 29:12875\u201312889","journal-title":"Environ Sci Pollut Res"},{"key":"16737_CR16","doi-asserted-by":"publisher","first-page":"41524","DOI":"10.1007\/s11356-020-09689-x","volume":"27","author":"SI Abba","year":"2020","unstructured":"Abba SI, Pham QB, Saini G et al (2020) Implementation of data intelligence models coupled with ensemble machine learning for prediction of water quality index. Environ Sci Pollut Res 27:41524\u201341539","journal-title":"Environ Sci Pollut Res"},{"key":"16737_CR17","doi-asserted-by":"publisher","first-page":"17591","DOI":"10.1007\/s11356-021-17064-7","volume":"29","author":"A Elbeltagi","year":"2022","unstructured":"Elbeltagi A, Pande CB, Kouadri S, Islam ARM (2022) Applications of various data-driven models for the prediction of groundwater quality index in the Akot basin, Maharashtra, India. Environ Sci Pollut Res 29:17591\u201317605","journal-title":"Environ Sci Pollut Res"},{"key":"16737_CR18","doi-asserted-by":"publisher","first-page":"104599","DOI":"10.1016\/j.jece.2020.104599","volume":"9","author":"SBHS Asadollah","year":"2021","unstructured":"Asadollah SBHS, Sharafati A, Motta D, Yaseen ZM (2021) River water quality index prediction and uncertainty analysis: A comparative study of machine learning models. J Environ Chem Eng 9:104599","journal-title":"J Environ Chem Eng"},{"key":"16737_CR19","doi-asserted-by":"publisher","first-page":"9318","DOI":"10.1007\/s11356-021-16289-w","volume":"29","author":"AM Nosair","year":"2022","unstructured":"Nosair AM, Shams MY, AbouElmagd LM et al (2022) Predictive model for progressive salinization in a coastal aquifer using artificial intelligence and hydrogeochemical techniques: A case study of the Nile Delta aquifer, Egypt. Environ Sci Pollut Res 29:9318\u20139340","journal-title":"Environ Sci Pollut Res"},{"key":"16737_CR20","doi-asserted-by":"publisher","unstructured":"Garabaghi FH, Benzer S, Benzer R (2021) Performance evaluation of machine learning models with ensemble learning approach in classification of water quality indices based on different subset of features. Res Square 1:1\u201335. https:\/\/doi.org\/10.21203\/rs.3.rs-876980\/v2","DOI":"10.21203\/rs.3.rs-876980\/v2"},{"key":"16737_CR21","doi-asserted-by":"publisher","first-page":"86","DOI":"10.2991\/hcis.k.211203.001","volume":"1","author":"MM Hassan","year":"2021","unstructured":"Hassan MM, Hassan MM, Akter L et al (2021) Efficient Prediction of Water Quality Index (WQI) Using Machine Learning Algorithms. Hum Centric Intell Syst 1:86\u201397","journal-title":"Hum Centric Intell Syst"},{"key":"16737_CR22","doi-asserted-by":"crossref","unstructured":"Radhakrishnan N, Pillai AS (2020) Comparison of Water Quality Classification Models using Machine Learning. In: 2020 5th International Conference on Communication and Electronics Systems (ICCES). IEEE, pp 1183\u20131188","DOI":"10.1109\/ICCES48766.2020.9137903"},{"key":"16737_CR23","doi-asserted-by":"publisher","unstructured":"Khan MSI, Islam N, Uddin J et al (2021) Water quality prediction and classification based on principal component regression and gradient boosting classifier approach. J King Saud Univ \u2013 Comput Inform Sci 34(8):4773\u20134781. https:\/\/doi.org\/10.1016\/j.jksuci.2021.06.003","DOI":"10.1016\/j.jksuci.2021.06.003"},{"key":"16737_CR24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1155\/2020\/6659314","volume":"2020","author":"THH Aldhyani","year":"2020","unstructured":"Aldhyani THH, Al-Yaari M, Alkahtani H, Maashi M (2020) Water quality prediction using artificial intelligence algorithms. Appl Bionics Biomech 2020:1\u201312. https:\/\/doi.org\/10.1155\/2020\/6659314","journal-title":"Appl Bionics Biomech"},{"key":"16737_CR25","doi-asserted-by":"publisher","first-page":"1552","DOI":"10.3390\/w14101552","volume":"14","author":"DN Khoi","year":"2022","unstructured":"Khoi DN, Quan NT, Linh DQ et al (2022) Using Machine Learning Models for Predicting the Water Quality Index in the La Buong River, Vietnam. Water 14:1552","journal-title":"Water"},{"key":"16737_CR26","unstructured":"Forests R, Breiman L (1999) Statistics Department University of California Berkeley. pp 1-29"},{"key":"16737_CR27","first-page":"1063","volume":"13","author":"G Biau","year":"2012","unstructured":"Biau G (2012) Analysis of a random forests model. J Mach Learn Res 13:1063\u20131095","journal-title":"J Mach Learn Res"},{"key":"16737_CR28","doi-asserted-by":"publisher","first-page":"127320","DOI":"10.1016\/j.jhydrol.2021.127320","volume":"605","author":"S Wang","year":"2022","unstructured":"Wang S, Peng H, Liang S (2022) Prediction of estuarine water quality using interpretable machine learning approach. J Hydrol 605:127320","journal-title":"J Hydrol"},{"key":"16737_CR29","doi-asserted-by":"crossref","unstructured":"Chen T, Guestrin C (2016) Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd ACM sigkdd international conference on knowledge discovery and data mining. pp 785\u2013794","DOI":"10.1145\/2939672.2939785"},{"key":"16737_CR30","doi-asserted-by":"crossref","unstructured":"Prakash R, Tharun VP, Devi SR (2018) A comparative study of various classification techniques to determine water quality. In: 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT). IEEE, pp 1501\u20131506","DOI":"10.1109\/ICICCT.2018.8473168"},{"key":"16737_CR31","doi-asserted-by":"publisher","first-page":"367","DOI":"10.1016\/S0167-9473(01)00065-2","volume":"38","author":"JH Friedman","year":"2002","unstructured":"Friedman JH (2002) Stochastic gradient boosting. Comput Stat Data Anal 38:367\u2013378","journal-title":"Comput Stat Data Anal"},{"key":"16737_CR32","doi-asserted-by":"publisher","first-page":"113864","DOI":"10.1016\/j.eswa.2020.113864","volume":"162","author":"Y Zhou","year":"2020","unstructured":"Zhou Y, Mazzuchi TA, Sarkani S (2020) M-adaboost-a based ensemble system for network intrusion detection. Expert Syst Appl 162:113864","journal-title":"Expert Syst Appl"},{"key":"16737_CR33","doi-asserted-by":"crossref","unstructured":"Beyer K, Goldstein J, Ramakrishnan R, Shaft U (1999) When is \u201cnearest neighbor\u201d meaningful? In: International conference on database theory. Springer, pp 217\u2013235","DOI":"10.1007\/3-540-49257-7_15"},{"key":"16737_CR34","doi-asserted-by":"publisher","first-page":"126169","DOI":"10.1016\/j.chemosphere.2020.126169","volume":"249","author":"H Lu","year":"2020","unstructured":"Lu H, Ma X (2020) Hybrid decision tree-based machine learning models for short-term water quality prediction. Chemosphere 249:126169","journal-title":"Chemosphere"},{"key":"16737_CR35","doi-asserted-by":"publisher","first-page":"66","DOI":"10.1016\/j.inffus.2019.06.006","volume":"53","author":"Z Halim","year":"2020","unstructured":"Halim Z, Rehan M (2020) On identification of driving-induced stress using electroencephalogram signals: A framework based on wearable safety-critical scheme and machine learning. Inf Fusion 53:66\u201379. https:\/\/doi.org\/10.1016\/j.inffus.2019.06.006","journal-title":"Inf Fusion"},{"key":"16737_CR36","doi-asserted-by":"publisher","first-page":"105923","DOI":"10.1016\/j.agwat.2019.105923","volume":"228","author":"H Chen","year":"2020","unstructured":"Chen H, Huang JJ, McBean E (2020) Partitioning of daily evapotranspiration using a modified shuttleworth-wallace model, random Forest and support vector regression, for a cabbage farmland. Agric Water Manag 228:105923","journal-title":"Agric Water Manag"},{"key":"16737_CR37","doi-asserted-by":"publisher","first-page":"105834","DOI":"10.1016\/j.cie.2019.04.047","volume":"139","author":"Y Cheng","year":"2020","unstructured":"Cheng Y, Peng J, Gu X et al (2020) An intelligent supplier evaluation model based on data-driven support vector regression in global supply chain. Comput Ind Eng 139:105834","journal-title":"Comput Ind Eng"},{"key":"16737_CR38","doi-asserted-by":"publisher","first-page":"5814","DOI":"10.3390\/su12145814","volume":"12","author":"Z Liao","year":"2020","unstructured":"Liao Z, Li Y, Xiong W et al (2020) An In-Depth Assessment of Water Resource Responses to Regional Development Policies Using Hydrological Variation Analysis and System Dynamics Modeling. Sustainability 12:5814","journal-title":"Sustainability"},{"key":"16737_CR39","doi-asserted-by":"publisher","first-page":"34","DOI":"10.12691\/ajwr-1-3-3","volume":"1","author":"S Tyagi","year":"2013","unstructured":"Tyagi S, Sharma B, Singh P, Dobhal R (2013) Water quality assessment in terms of water quality index. Am J Water Resour 1:34\u201338","journal-title":"Am J Water Resour"},{"key":"16737_CR40","first-page":"61","volume-title":"The Power of Data: Driving Climate Change with Data Science and Artificial Intelligence Innovations","author":"MY Shams","year":"2023","unstructured":"Shams MY, Tarek Z, Elshewey AM et al (2023) A Machine Learning-Based Model for Predicting Temperature Under the Effects of Climate Change. In: Hassanien AE, Darwish A (eds) The Power of Data: Driving Climate Change with Data Science and Artificial Intelligence Innovations. Springer Nature Switzerland, Cham, pp 61\u201381"},{"key":"16737_CR41","doi-asserted-by":"publisher","first-page":"757","DOI":"10.3390\/su15010757","volume":"15","author":"AM Elshewey","year":"2023","unstructured":"Elshewey AM, Shams MY, Elhady AM et al (2023) A Novel WD-SARIMAX Model for Temperature Forecasting Using Daily Delhi Climate Dataset. Sustainability 15:757. https:\/\/doi.org\/10.3390\/su15010757","journal-title":"Sustainability"},{"key":"16737_CR42","doi-asserted-by":"publisher","first-page":"715","DOI":"10.32604\/cmc.2023.032533","volume":"74","author":"Z Tarek","year":"2023","unstructured":"Tarek Z, Shams MY, Elshewey AM et al (2023) Wind Power Prediction Based on Machine Learning and Deep Learning Models. Comput Mater Contin 74:715\u2013732. https:\/\/doi.org\/10.32604\/cmc.2023.032533","journal-title":"Comput Mater Contin"},{"key":"16737_CR43","doi-asserted-by":"publisher","first-page":"765","DOI":"10.32604\/csse.2023.034324","volume":"46","author":"AM Elshewey","year":"2023","unstructured":"Elshewey AM, Shams MY, Tarek Z et al (2023) Weight Prediction Using the Hybrid Stacked-LSTM Food Selection Model. Comput Syst Sci Eng 46:765\u2013781. https:\/\/doi.org\/10.32604\/csse.2023.034324","journal-title":"Comput Syst Sci Eng"}],"container-title":["Multimedia Tools and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11042-023-16737-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11042-023-16737-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11042-023-16737-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,2]],"date-time":"2024-04-02T13:43:01Z","timestamp":1712065381000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11042-023-16737-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,29]]},"references-count":43,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2024,4]]}},"alternative-id":["16737"],"URL":"https:\/\/doi.org\/10.1007\/s11042-023-16737-4","relation":{},"ISSN":["1573-7721"],"issn-type":[{"value":"1573-7721","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,29]]},"assertion":[{"value":"17 June 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 August 2023","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"31 August 2023","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 September 2023","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no conflicts of interest to report regarding the present study.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflicts of interest"}}]}}