IDEAS home Printed from https://ideas.repec.org/a/gam/jjrfmx/v18y2025i7p351-d1686223.html
   My bibliography  Save this article

Sustainable Factor Augmented Machine Learning Models for Crude Oil Return Forecasting

Author

Listed:
  • Lianxu Wang

    (E.T.S. DE Ingenieros Informáticos, Universidad Politécnica de Madrid, Boadilla del Monte, 28660 Madrid, Spain)

  • Xu Chen

    (School of Mathematics and Statistics, University College Dublin, D04 V1W8 Dublin, Ireland)

Abstract

The global crude oil market, known for its pronounced volatility and nonlinear dynamics, plays a pivotal role in shaping economic stability and informing investment strategies. Contrary to traditional research focused on price forecasting, this study emphasizes the more investor-centric task of predicting returns for West Texas Intermediate (WTI) crude oil. By spotlighting returns, it directly addresses critical investor concerns such as asset allocation and risk management. This study applies advanced machine learning models, including XGBoost, random forest, and neural networks to predict crude oil return, and for the first time, incorporates sustainability and external risk variables, which are shown to enhance predictive performance in capturing the non-stationarity and complexity of financial time-series data. To enhance predictive accuracy, we integrate 55 variables across five dimensions: macroeconomic indicators, financial and futures markets, energy markets, momentum factors, and sustainability and external risk. Among these, the rate of change stands out as the most influential predictor. Notably, XGBoost demonstrates a superior performance, surpassing competing models with an impressive 76% accuracy in direction forecasting. The analysis highlights how the significance of various predictors shifted during the COVID-19 pandemic. This underscores the dynamic and adaptive character of crude oil markets under substantial external disruptions. In addition, by incorporating sustainability factors, the study provides deeper insights into the drivers of market behavior, supporting more informed portfolio adjustments, risk management strategies, and policy development aimed at fostering resilience and advancing sustainable energy transitions.

Suggested Citation

  • Lianxu Wang & Xu Chen, 2025. "Sustainable Factor Augmented Machine Learning Models for Crude Oil Return Forecasting," JRFM, MDPI, vol. 18(7), pages 1-27, June.
  • Handle: RePEc:gam:jjrfmx:v:18:y:2025:i:7:p:351-:d:1686223
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1911-8074/18/7/351/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1911-8074/18/7/351/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jjrfmx:v:18:y:2025:i:7:p:351-:d:1686223. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.