{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T01:52:04Z","timestamp":1760233924330,"version":"build-2065373602"},"reference-count":22,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2021,2,22]],"date-time":"2021-02-22T00:00:00Z","timestamp":1613952000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>The ocean occupies more than two-thirds of the earth\u2019s area, providing a lot of oxygen and materials for human survival and development. However, with human activities, a large number of sewage, plastic bags, and other wastes are discharged into the ocean, and the problem of marine water pollution has become a hot topic in the world. In order to extract the characteristics of marine water pollution, this study proposed K-means clustering technology based on cosine distance and discrimination to study the polluted water. In this study, the polygonal area method combined with six parameters of water quality is used to analyze the marine water body anomalies, so as to realize the rapid and real-time monitoring of marine water body anomalies. At the same time, the cosine distance method and discrimination are used to classify marine water pollutants, so as to improve the classification accuracy. The results show that the detection rate of water quality anomalies is more than 88.2%, and the overall classification accuracy of water pollution is 96.3%, which proves the effectiveness of the method. It is hoped that this study can provide timely and effective data support for the detection of marine water bodies.<\/jats:p>","DOI":"10.3390\/sym13020355","type":"journal-article","created":{"date-parts":[[2021,2,22]],"date-time":"2021-02-22T20:42:51Z","timestamp":1614026571000},"page":"355","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Feature Extraction of Marine Water Pollution Based on Data Mining"],"prefix":"10.3390","volume":"13","author":[{"given":"Haixia","family":"Lin","sequence":"first","affiliation":[{"name":"School of Artificial Intelligence and Big Data, Hebei Polytechnic Institute, Shijiazhuang 050091, China"}]},{"given":"Jianhong","family":"Cui","sequence":"additional","affiliation":[{"name":"School of Artificial Intelligence and Big Data, Hebei Polytechnic Institute, Shijiazhuang 050091, China"}]},{"given":"Xiangwei","family":"Bai","sequence":"additional","affiliation":[{"name":"School of Network and Communication, Hebei Polytechnic Institute, Shijiazhuang 050091, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,2,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"e7305","DOI":"10.7717\/peerj.7305","article-title":"Monitoring cyanobacterial toxins in a large reservoir: Relationships with water quality parameters","volume":"7","author":"Subbiah","year":"2019","journal-title":"PeerJ"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1016\/j.scitotenv.2016.11.116","article-title":"Citizen science-based water quality monitoring: Constructing a large database to characterize the impacts of combined sewer overflow in New York City","volume":"580","author":"Farnham","year":"2017","journal-title":"Sci. Total Environ."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1016\/j.watres.2016.02.036","article-title":"Epidemiologic evaluation of multiple alternate microbial water quality monitoring indicators at three California beaches","volume":"94","author":"Griffith","year":"2016","journal-title":"Water Res."},{"key":"ref_4","first-page":"305","article-title":"Evaluation of Ordinary Least Square (OLS) and Geographically Weighted Regression (GWR) for Water Quality Monitoring: A Case Study for the Estimation of Salinity","volume":"2","author":"Majid","year":"2018","journal-title":"J. Ocean Univ. China"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"579","DOI":"10.1016\/j.jclepro.2017.08.060","article-title":"A Multi-Objective Artificial Bee Colony-based optimization approach to design water quality monitoring networks in river basins","volume":"166","author":"Reder","year":"2017","journal-title":"J. Clean. Prod."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1007\/s12665-015-5139-3","article-title":"Environmetric techniques in water quality assessment and monitoring: A case study","volume":"75","author":"Hamid","year":"2016","journal-title":"Environ. Earth Sci."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1007\/s11269-018-2092-4","article-title":"Drinking Water Source Monitoring Using Early Warning Systems Based on Data Mining Techniques","volume":"33","author":"Delpla","year":"2019","journal-title":"Water Resour. Manag."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"136","DOI":"10.2478\/pomr-2018-0122","article-title":"Research and Application of Rule Updating Mining Algorithm for Marine Water Quality Monitoring Data","volume":"25","author":"Sun","year":"2018","journal-title":"Pol. Marit. Res."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"9315","DOI":"10.1029\/2019WR024897","article-title":"Data Mining to Uncover Heterogeneous Water Use Behaviors from Smart Meter Data","volume":"55","author":"Cominola","year":"2019","journal-title":"Water Resour. Res."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Lee, S., Hyun, Y., and Lee, M.J. (2019). Groundwater Potential Mapping Using Data Mining Models of Big Data Analysis in Goyang-si, South Korea. Sustainability, 11.","DOI":"10.3390\/su11061678"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1016\/j.apr.2019.09.009","article-title":"Application of k-means and hierarchical clustering techniques for analysis of air pollution: A review (1980\u20132019)\u2014ScienceDirect","volume":"11","author":"Govender","year":"2020","journal-title":"Atmos. Pollut. Res."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"65","DOI":"10.4018\/IJEHMC.2021030104","article-title":"Prediction of Environmental Pollution Using Hybrid PSO-K-Means Approach","volume":"12","author":"Mahajan","year":"2021","journal-title":"Int. J. E-Health Med. Commun. (IJEHMC)"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"43","DOI":"10.5890\/JEAM.2021.03.005","article-title":"Assessment of the Water Quality of Karun River Catchment Using Artificial Neural Networks-self-Organizing Maps and K-Means Algorithm","volume":"9","author":"Ahmadmoazzam","year":"2020","journal-title":"J. Environ. Account. Manag."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1446","DOI":"10.1016\/j.scitotenv.2018.02.163","article-title":"Using self-organizing map for coastal water quality classification: Towards a better understanding of patterns and processes","volume":"628\u2013629","author":"Li","year":"2018","journal-title":"Sci. Total Environ."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1016\/j.scitotenv.2013.04.005","article-title":"Spatial and temporal characteristics of particulate matter in Beijing, China using the Empirical Mode Decomposition method","volume":"458\u2013460","author":"Hu","year":"2013","journal-title":"Sci. Total Environ."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Samendra, S., Syreeta, M., Luisa, I., Yu, H.W., Snyder, S.A., and Pepper, I.L. (2018). Near Real-Time Detection of E. coli in Reclaimed Water. Sensors, 18.","DOI":"10.3390\/s18072303"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Wang, K., Wen, X., Hou, D., Tu, D., Zhu, N., Pingjie, H., Guangxin, Z., and Zhang, H. (2018). Application of Least-Squares Support Vector Machines for Quantitative Evaluation of Known Contaminant in Water Distribution System Using Online Water Quality Parameters. Sensors, 18.","DOI":"10.3390\/s18040938"},{"key":"ref_18","first-page":"530","article-title":"Analysis of Seawater Pollution Using Neural Networks","volume":"56","author":"Vasilescu","year":"2011","journal-title":"Rom. J. Phys."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1016\/j.jenvman.2019.02.110","article-title":"Real-time detection of potable-reclaimed water pipe cross-connection events by conventional water quality sensors using machine learning methods","volume":"238","author":"Xu","year":"2019","journal-title":"J. Environ. Manag."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1207","DOI":"10.1007\/s10040-018-01921-w","article-title":"Influence of available data on the geostatistical-based design of optimal spatiotemporal groundwater-level-monitoring networks","volume":"27","author":"Herrera","year":"2019","journal-title":"Hydrogeol. J."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1","DOI":"10.2478\/amns.2020.1.00001","article-title":"Improvement of the fast clustering algorithm improved by k-means in the big data","volume":"5","author":"Xie","year":"2020","journal-title":"Appl. Math. Nonlinear Sci."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"191","DOI":"10.2478\/AMNS.2019.1.00025","article-title":"Analysis of fractional factor system for data transmission in SDN","volume":"4","author":"Wu","year":"2019","journal-title":"Appl. Math. Nonlinear Sci."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/13\/2\/355\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:27:50Z","timestamp":1760160470000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/13\/2\/355"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,22]]},"references-count":22,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2021,2]]}},"alternative-id":["sym13020355"],"URL":"https:\/\/doi.org\/10.3390\/sym13020355","relation":{},"ISSN":["2073-8994"],"issn-type":[{"type":"electronic","value":"2073-8994"}],"subject":[],"published":{"date-parts":[[2021,2,22]]}}}