A comparative study of performance of K-nearest neighbors and support vector machines for classification of groundwater

Sakizadeh, M.; Mirzaei, R.

doi:10.22044/jme.2016.480

Document Type : Case Study

Authors

M. Sakizadeh ¹
R. Mirzaei ²

¹ Department of Environmental Sciences, Faculty of Sciences, Shahid Rajaee Teacher Training University, Tehran, Iran

² Department of Environmental Sciences,University of Kashan, Kashan, Iran

https://doi.org/10.22044/jme.2016.480

Abstract

The aim of this work is to examine the feasibilities of the support vector machines (SVMs) and K-nearest neighbor (K-NN) classifier methods for the classification of an aquifer in the Khuzestan Province, Iran. For this purpose, 17 groundwater quality variables including EC, TDS, turbidity, pH, total hardness, Ca, Mg, total alkalinity, sulfate, nitrate, nitrite, fluoride, phosphate, Fe, Mn, Cu, and Cr(VI) from 41 wells and springs were used during an eight-year time period (2006 to 2013). The cluster analysis was used, leading to a dendrogram that differentiated two distinct groups. The factor analysis extracted eight factors accumulatively, accounting for 90.97% of the total variance. Thus the variations in 17 variables could be covered by just eight factors. K-NN and SVMs were applied for the classification of the aquifer under study. The results of SVMs indicated that the best performed model was related to an exponent of degree one with an accuracy of 94% for the test data set, in which the sensitivity and specificity were 1.00 and 0.87, respectively. In addition, there was no significant difference among the results of different kernels, indicating that an acceptable result can be achieved by selecting the optimum parameters for a kernel. The results of K-NN showed roughly a lower efficiency compared with those of SVMs, where the sensitivity and specificity was reduced to 0.90 and 0.88, respectively, although the accuracy of the model was 93%. A sensitivity analysis was performed on the groundwater quality variables, suggesting that calcium next to nitrate were the most influential parameters in the classification of this aquifer.

Keywords

Journal of Mining and Environment

A comparative study of performance of K-nearest neighbors and support vector machines for classification of groundwater

Volume 7, Issue 2 - Serial Number 2
July and August 2016
Pages 149-164

A comparative study of performance of K-nearest neighbors and support vector machines for classification of groundwater

Volume 7, Issue 2 - Serial Number 2July and August 2016Pages 149-164

Volume 7, Issue 2 - Serial Number 2
July and August 2016
Pages 149-164