Show simple item record

Title

Groundwater quality assessment: an improved approach to K-means clustering, principal component analysis and spatial analysis: a case study

dc.contributor.authorMarín Celestino, Ana Elizabeth
dc.contributor.authorMartínez Cruz, Diego Armando
dc.contributor.authorOtazo Sánchez, Elena María
dc.contributor.authorGavi Reyes, Francisco
dc.contributor.authorVásquez Soto, David
dc.date.accessioned2018-12-13T21:09:50Z
dc.date.available2018-12-13T21:09:50Z
dc.date.issued2018
dc.identifier.citationMarín Celestino, A.E.; Martínez Cruz, D.A.; Otazo Sánchez, E.M.; Gavi Reyes, F.; Vásquez Soto, D. Groundwater Quality Assessment: An Improved Approach to K-Means Clustering, Principal Component Analysis and Spatial Analysis: A Case Study. Water 2018, 10, 437.
dc.identifier.urihttp://hdl.handle.net/11627/4848
dc.description.abstract"K-means clustering and principal component analysis (PCA) are widely used in water quality analysis and management. Nevertheless, numerous studies have pointed out that K-means with the squared Euclidean distance is not suitable for high-dimensional datasets. We evaluate a methodology (K-means based on PCA) for water quality evaluation. It is based on the PCA method to reduce the dataset from high dimensional to low for the improvement of K-means clustering. For this, a large dataset of 28 hydrogeochemical variables and 582 wells in the coastal aquifer are classified with K-means clustering for high dimensional and K-means clustering based on PCA. The proposed method achieved increased quality cluster cohesion according to the average Silhouette index. It ranged from 0.13 for high dimensional k-means clustering to 5.94 for K-means based on PCA and the practical spatial geographic information systems (GIS) evaluation of clustering indicates more quality results for K-means clustering based on PCA. K-means based on PCA identified three hydrogeochemical classes and their sources. High salinity was attributed to seawater intrusion and the mineralization process, high levels of heavy metals related to domestic-industrial wastewater discharge and low heavy metals concentrations were associated with industrial wastewater punctual discharges. This approach allowed the demarcation of natural and anthropogenic variation sources in the aquifer and provided greater certainty and accuracy to the data classification."
dc.publisherMDPI
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internacional
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectK-means clustering
dc.subjectPCA
dc.subjectSpatial analysis
dc.subjectWater quality
dc.subjectHydrogeochemical
dc.subjectCoastal aquifer
dc.subject.classificationCIENCIAS FÍSICO MATEMÁTICAS Y CIENCIAS DE LA TIERRA
dc.titleGroundwater quality assessment: an improved approach to K-means clustering, principal component analysis and spatial analysis: a case study
dc.typearticle
dc.identifier.doihttps://doi.org/10.3390/w10040437
dc.rights.accessAcceso Abierto


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 Internacional