IDEAS home Printed from https://ideas.repec.org/a/gam/jeners/v13y2020i15p3903-d392375.html
   My bibliography  Save this article

A Data-Driven Approach for Lithology Identification Based on Parameter-Optimized Ensemble Learning

Author

Listed:
  • Zhixue Sun

    (School of Petroleum Engineering, China University of Petroleum (East China), Qingdao 266580, China)

  • Baosheng Jiang

    (School of Petroleum Engineering, China University of Petroleum (East China), Qingdao 266580, China)

  • Xiangling Li

    (PetroChina Research Institute of Petroleum Exploration & Development, Beijing 100083, China)

  • Jikang Li

    (School of Petroleum Engineering, China University of Petroleum (East China), Qingdao 266580, China)

  • Kang Xiao

    (PetroChina Research Institute of Petroleum Exploration & Development, Beijing 100083, China)

Abstract

The identification of underground formation lithology can serve as a basis for petroleum exploration and development. This study integrates Extreme Gradient Boosting (XGBoost) with Bayesian Optimization (BO) for formation lithology identification and comprehensively evaluated the performance of the proposed classifier based on the metrics of the confusion matrix, precision, recall, F1-score and the area under the receiver operating characteristic curve (AUC). The data of this study are derived from Daniudui gas field and the Hangjinqi gas field, which includes 2153 samples with known lithology facies class with each sample having seven measured properties (well log curves), and corresponding depth. The results show that BO significantly improves parameter optimization efficiency. The AUC values of the test sets of the two gas fields are 0.968 and 0.987, respectively, indicating that the proposed method has very high generalization performance. Additionally, we compare the proposed algorithm with Gradient Tree Boosting-Differential Evolution (GTB-DE) using the same dataset. The results demonstrated that the average of precision, recall and F1 score of the proposed method are respectively 4.85%, 5.7%, 3.25% greater than GTB-ED. The proposed XGBoost-BO ensemble model can automate the procedure of lithology identification, and it may also be used in the prediction of other reservoir properties.

Suggested Citation

  • Zhixue Sun & Baosheng Jiang & Xiangling Li & Jikang Li & Kang Xiao, 2020. "A Data-Driven Approach for Lithology Identification Based on Parameter-Optimized Ensemble Learning," Energies, MDPI, vol. 13(15), pages 1-15, July.
  • Handle: RePEc:gam:jeners:v:13:y:2020:i:15:p:3903-:d:392375
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/13/15/3903/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/13/15/3903/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Gang Chen & Mian Chen & Guobin Hong & Yunhu Lu & Bo Zhou & Yanfang Gao, 2020. "A New Method of Lithology Classification Based on Convolutional Neural Network Algorithm by Utilizing Drilling String Vibration Data," Energies, MDPI, vol. 13(4), pages 1-24, February.
    2. Zoubin Ghahramani, 2015. "Probabilistic machine learning and artificial intelligence," Nature, Nature, vol. 521(7553), pages 452-459, May.
    3. Chuanbo Shen & Solomon Asante-Okyere & Yao Yevenyo Ziggah & Liang Wang & Xiangfeng Zhu, 2019. "Group Method of Data Handling (GMDH) Lithology Identification Based on Wavelet Analysis and Dimensionality Reduction as Well Log Data Pre-Processing Techniques," Energies, MDPI, vol. 12(8), pages 1-16, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Junlong Zhang & Youbin He & Yuan Zhang & Weifeng Li & Junjie Zhang, 2022. "Well-Logging-Based Lithology Classification Using Machine Learning Methods for High-Quality Reservoir Identification: A Case Study of Baikouquan Formation in Mahu Area of Junggar Basin, NW China," Energies, MDPI, vol. 15(10), pages 1-15, May.
    2. Cenk Temizel & Uchenna Odi & Karthik Balaji & Hakki Aydin & Javier E. Santos, 2022. "Classifying Facies in 3D Digital Rock Images Using Supervised and Unsupervised Approaches," Energies, MDPI, vol. 15(20), pages 1-15, October.
    3. Timur Merembayev & Darkhan Kurmangaliyev & Bakhbergen Bekbauov & Yerlan Amanbek, 2021. "A Comparison of Machine Learning Algorithms in Predicting Lithofacies: Case Studies from Norway and Kazakhstan," Energies, MDPI, vol. 14(7), pages 1-16, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Masafumi Nakano & Akihiko Takahashi & Soichiro Takahashi, 2018. "State Space Approach to Adaptive Artificial Intelligence Modeling: Application to Financial Portfolio with Fuzzy System," CARF F-Series CARF-F-422, Center for Advanced Research in Finance, Faculty of Economics, The University of Tokyo.
    2. Li, Yi & Liu, Kailong & Foley, Aoife M. & Zülke, Alana & Berecibar, Maitane & Nanini-Maury, Elise & Van Mierlo, Joeri & Hoster, Harry E., 2019. "Data-driven health estimation and lifetime prediction of lithium-ion batteries: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 113(C), pages 1-1.
    3. Amit Kumar Kushwaha & Arpan Kumar Kar, 2024. "MarkBot – A Language Model-Driven Chatbot for Interactive Marketing in Post-Modern World," Information Systems Frontiers, Springer, vol. 26(3), pages 857-874, June.
    4. Zhao, Jingyuan & Wang, Zhenghong & Wu, Yuyan & Burke, Andrew F., 2025. "Predictive pretrained transformer (PPT) for real-time battery health diagnostics," Applied Energy, Elsevier, vol. 377(PD).
    5. Mkono, Christopher N. & Chuanbo, Shen & Mulashani, Alvin K. & Mwakipunda, Grant Charles, 2023. "Deep learning integrated approach for hydrocarbon source rock evaluation and geochemical indicators prediction in the Jurassic - Paleogene of the Mandawa basin, SE Tanzania," Energy, Elsevier, vol. 284(C).
    6. Qiqin Zhou, 2024. "Explainable AI in Request-for-Quote," Papers 2407.15038, arXiv.org.
    7. Ding, Song & Hu, Lunhu & Pan, Xing & Zuo, Dujun & Sun, Liuwang, 2025. "Assessing human situation awareness reliability considering fatigue and mood using EEG data: A Bayesian neural network-Bayesian network approach," Reliability Engineering and System Safety, Elsevier, vol. 260(C).
    8. Manis, K.T. & Madhavaram, Sreedhar, 2023. "AI-Enabled marketing capabilities and the hierarchy of capabilities: Conceptualization, proposition development, and research avenues," Journal of Business Research, Elsevier, vol. 157(C).
    9. Efrat Taig & Ohad Ben-Shahar, 2019. "Gradient Surfing: A New Deterministic Approach for Low-Dimensional Global Optimization," Journal of Optimization Theory and Applications, Springer, vol. 180(3), pages 855-878, March.
    10. O’Dwyer, Edward & Pan, Indranil & Acha, Salvador & Shah, Nilay, 2019. "Smart energy systems for sustainable smart cities: Current developments, trends and future directions," Applied Energy, Elsevier, vol. 237(C), pages 581-597.
    11. Masafumi Nakano & Akihiko Takahashi & Soichiro Takahashi, 2017. "State Space Approach to Adaptive Fuzzy Modeling: Application to Financial Investment," CIRJE F-Series CIRJE-F-1067, CIRJE, Faculty of Economics, University of Tokyo.
    12. Xu, Wen & Wu, Xianguo & Xiong, Shishu & Li, Tiejun & Liu, Yang, 2025. "Optimizing the sustainable performance of public buildings: A hybrid machine learning algorithm," Energy, Elsevier, vol. 320(C).
    13. Chen-Ying Hung & Ching-Heng Lin & Tsuo-Hung Lan & Giia-Sheun Peng & Chi-Chun Lee, 2019. "Development of an intelligent decision support system for ischemic stroke risk assessment in a population-based electronic health record database," PLOS ONE, Public Library of Science, vol. 14(3), pages 1-16, March.
    14. Paola Perchinunno & Massimo Bilancia & Domenico Vitale, 2021. "A Statistical Analysis of Factors Affecting Higher Education Dropouts," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 156(2), pages 341-362, August.
    15. Cui, Mianshan, 2022. "District heating load prediction algorithm based on bidirectional long short-term memory network model," Energy, Elsevier, vol. 254(PA).
    16. Golsefidi, Atefeh Hemmati & Hüttel, Frederik Boe & Peled, Inon & Samaranayake, Samitha & Pereira, Francisco Câmara, 2023. "A joint machine learning and optimization approach for incremental expansion of electric vehicle charging infrastructure," Transportation Research Part A: Policy and Practice, Elsevier, vol. 178(C).
    17. Xiaojie Geng & Shunchuan Wu & Yanjie Zhang & Junlong Sun & Haiyong Cheng & Zhongxin Zhang & Shijiang Pu, 2023. "Developing hybrid XGBoost model integrated with entropy weight and Bayesian optimization for predicting tunnel squeezing intensity," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 119(1), pages 751-771, October.
    18. Mardia, Kanti V., 2024. "Fisher’s pioneering work on discriminant analysis and its impact on Artificial Intelligence," Journal of Multivariate Analysis, Elsevier, vol. 203(C).
    19. Ya Li & Zheng Guangwen, 2025. "Balancing innovation and accountability: AI’s transformative influence on logistics in G20 nations," Palgrave Communications, Palgrave Macmillan, vol. 12(1), pages 1-12, December.
    20. Pantano, Eleonora & Pizzi, Gabriele, 2020. "Forecasting artificial intelligence on online customer assistance: Evidence from chatbot patents analysis," Journal of Retailing and Consumer Services, Elsevier, vol. 55(C).

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:13:y:2020:i:15:p:3903-:d:392375. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.