IDEAS home Printed from https://ideas.repec.org/a/gam/jeners/v16y2023i10p4159-d1149629.html
   My bibliography  Save this article

Prediction of TOC Content in Organic-Rich Shale Using Machine Learning Algorithms: Comparative Study of Random Forest, Support Vector Machine, and XGBoost

Author

Listed:
  • Jiangtao Sun

    (School of Earth Sciences and Engineering, Xi’an Shiyou University, Xi’an 710065, China)

  • Wei Dang

    (School of Earth Sciences and Engineering, Xi’an Shiyou University, Xi’an 710065, China
    Shaanxi Key Laboratory of Petroleum Accumulation Geology, Xi’an Shiyou University, Xi’an 710065, China)

  • Fengqin Wang

    (School of Earth Sciences and Engineering, Xi’an Shiyou University, Xi’an 710065, China
    Shaanxi Key Laboratory of Petroleum Accumulation Geology, Xi’an Shiyou University, Xi’an 710065, China)

  • Haikuan Nie

    (Petroleum Exploration and Production Research Institute, SINOPEC, Beijing 100083, China)

  • Xiaoliang Wei

    (Exploration and Development Institute of Shengli Oilfield Company, SINOPEC, Dongying 257000, China
    Key Laboratory of Strategy Evaluation for Shale Gas, Ministry of Land and Resources, China University of Geosciences, Beijing 100083, China)

  • Pei Li

    (Petroleum Exploration and Production Research Institute, SINOPEC, Beijing 100083, China)

  • Shaohua Zhang

    (School of Earth Sciences and Engineering, Xi’an Shiyou University, Xi’an 710065, China
    Shaanxi Key Laboratory of Petroleum Accumulation Geology, Xi’an Shiyou University, Xi’an 710065, China)

  • Yubo Feng

    (School of Earth Sciences and Engineering, Xi’an Shiyou University, Xi’an 710065, China)

  • Fei Li

    (School of Earth Sciences and Engineering, Xi’an Shiyou University, Xi’an 710065, China)

Abstract

The total organic carbon (TOC) content of organic-rich shale is a key parameter in screening for potential source rocks and sweet spots of shale oil/gas. Traditional methods of determining the TOC content, such as the geochemical experiments and the empirical mathematical regression method, are either high cost and low-efficiency, or universally non-applicable and low-accuracy. In this study, we propose three machine learning models of random forest (RF), support vector regression (SVR), and XGBoost to predict the TOC content using well logs, and the performance of each model are compared with the traditional empirical methods. First, the decision tree algorithm is used to identify the optimal set of well logs from a total of 15. Then, 816 data points of well logs and the TOC content data collected from five different shale formations are used to train and test these three models. Finally, the accuracy of three models is validated by predicting the unknown TOC content data from a shale oil well. The results show that the RF model provides the best prediction for the TOC content, with R 2 = 0.915, MSE = 0.108, and MAE = 0.252, followed by the XGBoost, while the SVR gives the lowest predictive accuracy. Nevertheless, all three machine learning models outperform the traditional empirical methods such as Schmoker gamma-ray log method, multiple linear regression method and ΔlgR method. Overall, the proposed machine learning models are powerful tools for predicting the TOC content of shale and improving the oil/gas exploration efficiency in a different formation or a different basin.

Suggested Citation

  • Jiangtao Sun & Wei Dang & Fengqin Wang & Haikuan Nie & Xiaoliang Wei & Pei Li & Shaohua Zhang & Yubo Feng & Fei Li, 2023. "Prediction of TOC Content in Organic-Rich Shale Using Machine Learning Algorithms: Comparative Study of Random Forest, Support Vector Machine, and XGBoost," Energies, MDPI, vol. 16(10), pages 1-26, May.
  • Handle: RePEc:gam:jeners:v:16:y:2023:i:10:p:4159-:d:1149629
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/16/10/4159/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/16/10/4159/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Haitao Guo & Yongsheng Wang & Zhongmin Wang, 2016. "Shale Development and China," Natural Resource Management and Policy, in: Yongsheng Wang & William E. Hefley (ed.), The Global Impact of Unconventional Shale Gas Development, pages 131-147, Springer.
    2. Partha Pratim Mandal & Reza Rezaee & Irina Emelyanova, 2021. "Ensemble Learning for Predicting TOC from Well-Logs of the Unconventional Goldwyer Shale," Energies, MDPI, vol. 15(1), pages 1-30, December.
    3. Ahmed Abdulhamid Mahmoud & Salaheldin Elkatatny & Abdulwahab Z. Ali & Mohamed Abouelresh & Abdulazeez Abdulraheem, 2019. "Evaluation of the Total Organic Carbon (TOC) Using Different Artificial Intelligence Techniques," Sustainability, MDPI, vol. 11(20), pages 1-15, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Banglong Pan & Hanming Yu & Hongwei Cheng & Shuhua Du & Shutong Cai & Minle Zhao & Juan Du & Fazhi Xie, 2023. "A CNN–LSTM Machine-Learning Method for Estimating Particulate Organic Carbon from Remote Sensing in Lakes," Sustainability, MDPI, vol. 15(17), pages 1-15, August.
    2. Magdalena Rykała & Małgorzata Grzelak & Łukasz Rykała & Daniela Voicu & Ramona-Monica Stoica, 2023. "Modeling Vehicle Fuel Consumption Using a Low-Cost OBD-II Interface," Energies, MDPI, vol. 16(21), pages 1-23, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tadeusz Kwilosz & Bogdan Filar & Mariusz Miziołek, 2022. "Use of Cluster Analysis to Group Organic Shale Gas Rocks by Hydrocarbon Generation Zones," Energies, MDPI, vol. 15(4), pages 1-14, February.
    2. Miltiadis D. Lytras & Anna Visvizi, 2021. "Artificial Intelligence and Cognitive Computing: Methods, Technologies, Systems, Applications and Policy Making," Sustainability, MDPI, vol. 13(7), pages 1-3, March.
    3. Zhou, Junping & Tian, Shifeng & Zhou, Lei & Xian, Xuefu & Yang, Kang & Jiang, Yongdong & Zhang, Chengpeng & Guo, Yaowen, 2020. "Experimental investigation on the influence of sub- and super-critical CO2 saturation time on the permeability of fractured shale," Energy, Elsevier, vol. 191(C).
    4. Xiaoqian Guo & Qiang Yan & Anjian Wang, 2017. "Assessment of Methods for Forecasting Shale Gas Supply in China Based on Economic Considerations," Energies, MDPI, vol. 10(11), pages 1-14, October.
    5. Zhiyao Zhang & Shang Xu & Qiyang Gou & Qiqi Li, 2022. "Reservoir Characteristics and Resource Potential of Marine Shale in South China: A Review," Energies, MDPI, vol. 15(22), pages 1-21, November.
    6. Mkono, Christopher N. & Chuanbo, Shen & Mulashani, Alvin K. & Mwakipunda, Grant Charles, 2023. "Deep learning integrated approach for hydrocarbon source rock evaluation and geochemical indicators prediction in the Jurassic - Paleogene of the Mandawa basin, SE Tanzania," Energy, Elsevier, vol. 284(C).
    7. Zhou, Zhongbing & Qin, Quande, 2020. "Epistemological dominance and ignorance of the comparative advantages of China's shale gas: Evidence from international scientific journals," Energy Policy, Elsevier, vol. 138(C).
    8. Marcin Kremieniewski, 2022. "Improving the Efficiency of Oil Recovery in Research and Development," Energies, MDPI, vol. 15(12), pages 1-7, June.
    9. Leifu Zhang & Qun Zhao & Sizhong Peng & Zhen Qiu & Congjun Feng & Qin Zhang & Yuman Wang & Dazhong Dong & Shangwen Zhou, 2021. "Paleoenvironment and Organic Matter Accumulation Mechanism of Marine–Continental Transitional Shales: Outcrop Characterizations of the Carboniferous–Permian Strata, Ordos Basin, North China," Energies, MDPI, vol. 14(21), pages 1-19, November.
    10. Wang, Xu & Fang, Hong & Fang, Siran, 2020. "An integrated approach for exploitation block selection of shale gas—based on cloud model and grey relational analysis," Resources Policy, Elsevier, vol. 68(C).
    11. Liuyang Yao & Dangchen Sui & Xiaotong Liu & Hui Fan, 2020. "The Psychological Process of Residents’ Acceptance of Local Shale Gas Exploitation in China," IJERPH, MDPI, vol. 17(18), pages 1-20, September.
    12. Ahmed Abdulhamid Mahmoud & Salaheldin Elkatatny & Dhafer Al Shehri, 2020. "Application of Machine Learning in Evaluation of the Static Young’s Modulus for Sandstone Formations," Sustainability, MDPI, vol. 12(5), pages 1-16, March.
    13. Wang, Yan & Zhong, Dong-Liang & Li, Zheng & Li, Jian-Bo, 2020. "Application of tetra-n-butyl ammonium bromide semi-clathrate hydrate for CO2 capture from unconventional natural gases," Energy, Elsevier, vol. 197(C).
    14. Feng, Qianqian & Qiu, Nansheng & Borjigin, Tenger & Wu, Hang & Zhang, Jiatang & Shen, Baojian & Wang, Jiangshan, 2022. "Tectonic evolution revealed by thermo-kinematic and its effect on shale gas preservation," Energy, Elsevier, vol. 240(C).
    15. Jingqiang Tan & Guolai Li & Ruining Hu & Lei Li & Qiao Lyu & Jeffrey Dick, 2020. "Experimental Investigation of the Impacts of Fracturing Fluid on the Evolution of Fluid Composition and Shale Characteristics: A Case Study of the Niutitang Shale in Hunan Province, South China," Energies, MDPI, vol. 13(13), pages 1-20, June.
    16. Wang, Qiang & Zhan, Lina, 2019. "Assessing the sustainability of the shale gas industry by combining DPSIRM model and RAGA-PP techniques: An empirical analysis of Sichuan and Chongqing, China," Energy, Elsevier, vol. 176(C), pages 353-364.
    17. Zhou, Na & Wu, Qiaosheng & Hu, Xiangping & Xu, Deyi & Wang, Xiaolin, 2020. "Evaluation of Chinese natural gas investment along the Belt and Road Initiative using super slacks-based measurement of efficiency method," Resources Policy, Elsevier, vol. 67(C).
    18. Zhengwei Ma & Yuanjun Pang & Dan Zhang & Yuqi Zhang, 2020. "Measuring the air pollution cost of shale gas development in China," Energy & Environment, , vol. 31(6), pages 1098-1111, September.
    19. Dong, Kangyin & Sun, Renjin & Wu, Jin & Hochman, Gal, 2018. "The growth and development of natural gas supply chains: The case of China and the US," Energy Policy, Elsevier, vol. 123(C), pages 64-71.
    20. Piotr Słomski & Maria Mastalerz & Jacek Szczepański & Arkadiusz Derkowski & Tomasz Topór & Marcin Lutyński, 2020. "Experimental and numerical investigation of CO2–brine–rock interactions in the early Palaeozoic mudstones from the Polish part of the Baltic Basin at simulatedin situ conditions," Greenhouse Gases: Science and Technology, Blackwell Publishing, vol. 10(3), pages 567-590, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:16:y:2023:i:10:p:4159-:d:1149629. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.