ADASYN-CatBoost Method for Intelligent Identification of Logging Lithology Considering Unbalanced Data: A Case Study of Zhaoxian Gold Deposit in Northwestern Jiaodong Peninsula
Received date: 2023-04-24
Revised date: 2023-06-30
Online published: 2023-11-21
Logging-based lithology identification helps to identify concealed strata and rock masses in covered areas quickly and accurately, which is of great significance for geological prospecting and exploration of metal mines. Based on actual logging data from the Zhaoxian gold deposit in the northwestern Jiaodong Peninsula, this paper applies machine learning methods to the intelligent identification of lithology. Given the diverse and imbalanced distribution of lithologies in the complex rock formations of the deposit, and the strongly nonlinear relationship between logging responses and lithology, an intelligent logging lithology identification method is proposed that combines ADASYN imbalanced-data processing with CatBoost machine learning. First, the ADASYN algorithm was used to process the imbalanced logging sample data, generating synthetic samples according to the weighted distribution of the minority-class samples. Then, the CatBoost algorithm was used to build a machine learning model relating logging features to lithology. Validation curves were used to determine the hyperparameter search ranges of the model, and the parameters were optimized by grid search combined with 10-fold cross-validation to establish the optimal lithology classification model. Finally, the performance of the model was evaluated on the test set using accuracy, recall and F1 score, and the lithology classification results were interpreted using the feature importance output by the model and partial dependence plots. The method was applied to logging data from the Zhaoxian gold deposit in the northwestern Jiaodong Peninsula, where 10 lithology types were identified and interpreted after balancing the sample data.
The model evaluation results show that the accuracy, recall and F1 score on the test set reached 98.21%, 98.20% and 98.20%, respectively. CatBoost lithology classification was compared with the GBDT and LightGBM algorithms; the results show that the CatBoost classifier performs best, and that it outperforms lithology identification on sample data without balancing. Comparison with the core lithology of an example logging section verifies the validity of the model's classification results. The feature importance output by the model indicates that the main logging features contributing to lithology classification are resistivity, spontaneous potential and natural gamma ray. The strong correlation between these logging features and lithology identification provides a good indication for further mineral prospecting.
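The oversampling step described in the abstract (synthetic samples generated according to the weighted distribution of minority-class samples) can be illustrated with a minimal sketch. This is not the authors' code: it is a pure-Python, two-class simplification of the published ADASYN scheme, and the function name `adasyn` and the parameters `k`, `beta` and `seed` are hypothetical names chosen for illustration. In practice a library implementation, such as the `ADASYN` class in imbalanced-learn, would normally be used, and the paper's 10-class setting would need the procedure applied per minority class.

```python
import math
import random


def adasyn(X, y, minority, k=5, beta=1.0, seed=0):
    """Return synthetic minority samples (two-class ADASYN sketch).

    X: list of feature vectors, y: list of labels,
    minority: label of the class to oversample (assumed names).
    """
    rng = random.Random(seed)
    min_idx = [i for i, label in enumerate(y) if label == minority]
    n_min = len(min_idx)
    n_maj = len(y) - n_min
    # Total number of synthetic samples; beta = 1 aims at full balance.
    total = round((n_maj - n_min) * beta)
    if total <= 0 or n_min < 2:
        return []

    def nearest(i, candidates, count):
        # Indices of the `count` candidates closest to sample i (i excluded).
        others = [j for j in candidates if j != i]
        others.sort(key=lambda j: math.dist(X[i], X[j]))
        return others[:count]

    # r_i: share of non-minority points among the k nearest neighbours.
    # A high r_i marks a minority sample that is harder to learn.
    ratios = []
    for i in min_idx:
        neigh = nearest(i, range(len(X)), k)
        ratios.append(sum(y[j] != minority for j in neigh) / len(neigh))

    norm = sum(ratios)
    if norm == 0:
        # Assumption: fall back to uniform weights when no minority
        # sample has a majority neighbour.
        weights = [1.0 / n_min] * n_min
    else:
        weights = [r / norm for r in ratios]

    synthetic = []
    for i, w in zip(min_idx, weights):
        minority_neigh = nearest(i, min_idx, k)
        # Each minority sample receives a share proportional to its weight.
        for _ in range(round(w * total)):
            j = rng.choice(minority_neigh)
            lam = rng.random()
            # Interpolate between x_i and a minority-class neighbour.
            synthetic.append([a + lam * (b - a) for a, b in zip(X[i], X[j])])
    return synthetic
```

Calling `adasyn(X, y, minority=label)` returns only the new samples, which would then be appended to the training set before fitting the classifier. Because per-sample counts are rounded, the number of generated samples may differ slightly from the target total.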
Fangying XU, Yanhong ZOU, Zhuowei YI, Fuqiang YANG, Xiancheng MAO. ADASYN-CatBoost Method for Intelligent Identification of Logging Lithology Considering Unbalanced Data: A Case Study of Zhaoxian Gold Deposit in Northwestern Jiaodong Peninsula[J]. Gold Science and Technology, 2023, 31(5): 721-735. DOI: 10.11872/j.issn.1005-2518.2023.05.063
http://www.goldsci.ac.cn/article/2023/1005-2518/1005-2518-2023-31-5-721.shtml