Document Type : Original Research Paper

Authors

Department of Mining Engineering, Faculty of Mine, AmirKabir University, Tehran, Iran

Abstract

Various methods have been used for clustering big data. Pattern recognition methods are suitable methods for clustering these data. Due to the large volume of samples taken in the drilling of mines and their analysis for various elements, this category of geochemical data can be considered big data. Examining and evaluating drilling data in the Lar copper mine in Sistan and Baluchistan province located in the southeast of Iran requires the use of these methods. Therefore, the main goal of the article is the clustering of the drilling data in the mentioned mine and its zoning of the geochemical data. To achieve this goal, 3500 samples taken from drilling cores have been used. Elemental analysis for six elements has been done using the ICP-Ms method. Pattern recognition methods including SOM and K-MEANS have been used to evaluate the relation between these elements. The silhouette method has been used to determine and evaluate the number of clusters. Using this method, 4 clusters have been considered for the mentioned data. According to this method, it was found that the accuracy of clustering is higher in the SOM method. By considering the 4 clusters, 4 zones were identified using clustering methods. By comparing the results of the two methods and using the graphical method, it was determined that the SOM method has a better performance for clustering geochemical data in the studied area. Based on that, zones 2 and 4 were recognized as high-grade zones in this area.

Keywords

Main Subjects

[1]. Mostafaei K, Ramazi K (2018) 3D model construction of induced polarization and resistivity data with quantifying uncertainties using geostatistical methods and drilling (Case study: Madan Bozorg, Iran). Journal of Mining & Environment 4: 857-872.
[2]. Mostafaei K, Ramazi H (2019) Mineral resource estimation using a combination of drilling and IP-Rs data using statistical and cokriging methods. Bulletin of the Mineral Research and Exploration 160:177–195.
[3]. Sinclair, W (2007) Porphyry deposits. Mineral deposits of Canada: A synthesis of major deposit types, district metallogeny, the evolution of geological provinces, and exploration methods. Geological Association of Canada, Mineral Deposits Division 5: 223-243.
[4]. Takaew P, Cecilia Xia J, Doucet LS (2024) Machine learning and tectonic setting determination: Bridging the gap between Earth scientists and data scientists. Geoscience Frontiers 15:101726.
[5]. Farhadi, S Tatullo, MB Konari, P Afzal (2024) Evaluating StackingC and ensemble models for enhanced lithological classification in geological mapping‏. Journal of Geochemical Exploration 260, 107441
[6]. Farhadi, P Afzal, M Boveiri Konari, L Daneshvar Saein, B Sadeghi (2022) Combination of machine learning algorithms with a concentration-area fractal method for soil geochemical anomaly detection in sediment-hosted Irankuh Pb-Zn deposit, Central Iran. Minerals 12 (6), 689
[7]. Madhulatha TS (2012) AN OVERVIEW ON CLUSTERING METHODS. IOSR Journal of Engineering 02:719–725.
[8]. Templ M, Filzmoser P, Reimann C (2008) Cluster analysis applied to regional geochemical data: Problems and possibilities. Applied Geochemistry 23:2198–2213.
[9]. Zhang Z, Lin S, Ye Y,Xu z, Zhao H, Zhao Y and Sun J (2022) A Hybrid MCDM Model for Evaluating the Market-Oriented Business Regulatory Risk of Power Grid Enterprises Based on the Bayesian Best-Worst Method and MARCOS Approach. Energies (Basel) 15:2978.
[10]. Bigdeli A, Maghsoudi A, Ghezelbash R (2022) Application of self-organizing map (SOM) and K-means clustering algorithms for portraying geochemical anomaly patterns in Moalleman district, NE Iran. Journal of Geochemical Exploration 233:106923.
[11]. Žalik KR (2008) An efficient k′-means clustering algorithm. Pattern Recognition Letters | 29:1385–1391.
[12]. Sarparandeh M, Hezarkhani A (2017) Application of unsupervised pattern recognition approaches for exploration of rare earth elements in Se-Chahun iron ore, central Iran. Compu Computers & Geosciences ters & Geosciences 6:537–546.
[13]. Sarparandeh M, Hezarkhani A (2016) Application of Self-Organizing Map for Exploration of REEs’ Deposition. Open Journal of Geology 06:571–582.
[14]. Ghezelbash R, Maghsoudi A, Carranza EJM (2020) Optimization of geochemical anomaly detection using a novel genetic K-means clustering (GKMC) algorithm. Computers & Geosciences 134:104335.
[15]. Fu Q, Li Y, Albathan M (2023) A supervised method to enhance distance-based neural network clustering performance by discovering perfect representative neurons. Granular Computing 8:1051–1065.
[16]. Boomeri M, Moradi R, Stein H, Bagheri S (2019) Geology, Re-Os age, S and O isotopic composition of the Lar porphyry Cu-Mo deposit, southeast Iran. Ore Geology Reviews 104:477–494.
[17]. Nakisa M (2002) Results of Exploration Studies and Reserves Estimation of the Lar Cu Ore Deposit-Zahedan. Tehran, Iran: Ministry of Industries and Mines, National Iranian Copper Industries Co.
[18]. MORADI R, BOOMERI M, BAGHERI S, NAKASHIMA K (2016) Mineral chemistry of igneous rocks in the Lar Cu-Mo prospect, southeastern part of Iran: implications for P, T, and ƒO2. turkish journal of earth sciences 25:418–433.
[19]. Chen K, Abtahi F, Carrero J-J, Fernandez- Llatas C and Seoane F (2023) Process mining and data mining applications in the domain of chronic diseases: A systematic review. Artificial Intelligence in Medicine 144:102645.
[20]. Shu X, Ye Y (2023) Knowledge Discovery: Methods from data mining and machine learning. Soc Sci Res 110:102817.
[21]. Chaudhry M, Shafi I, Mahnoor M, Ramirez D. Bautista E and Ashraf I (2023) A Systematic Literature Review on Identifying Patterns Using Unsupervised Clustering Algorithms: A Data Mining Perspective. Symmetry (Basel) 15:1679.
[22]. Abdulnassar AA, Nair LR (2023) Performance analysis of Kmeans with modified initial centroid selection algorithms and developed Kmeans 9+ model. Measurement: Sensors 25:100666.
[23]. Kohonen T (1990) The self-organizing map. Proceedings of the IEEE 78:1464–1480.
[24]. Jensen CA, Reed RD, Marks RJ, El-Sharkawi MA, Jung JB, Miyamoto RT, Anderson G and Eggen C,  (1999) Inversion of feedforward neural networks: algorithms and applications. Proceedings of the IEEE 87:1536–1549
[25]. Li L, Wang Y (2014) What drives the aerosol distribution in Guangdong - the most developed province in Southern China? Scientific Reports 4:5972.
[26]. Vesanto J (1999) SOM-based data visualization methods. Intelligent Data Analysis 3:111–126.