IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v382y2025ics0306261925000315.html
   My bibliography  Save this article

Incorporating key features from structured and unstructured data for enhanced carbon trading price forecasting with interpretability analysis

Author

Listed:
  • Jiang, Meiqin
  • Che, Jinxing
  • Li, Shuying
  • Hu, Kun
  • Xu, Yifan

Abstract

Accurate prediction of carbon trading prices is crucial for societal progression, prompting a shift toward harnessing unstructured data sources to improve forecasting accuracy and interpretability. Despite this trend, the utilization of unstructured data like the Baidu search index remains inadequate. Additionally, conventional multivariate forecasting methods face challenges in balancing accuracy with interpretability. To address these limitations, we propose an innovative methodology that integrates both structured and unstructured data streams to enhance carbon trading price forecasting, where an interpretability analysis framework is used to explore the influencing factors of predictive models. Firstly, we conduct feature analysis focusing on structured data including historical carbon price data, key supply-demand factors, and unstructured data sourced from Baidu Index keywords. Besides, we simplify the data pattern through statistical analysis to reduce dimensionality, and refine it further via CEEMDAN decomposition coupled with wavelet threshold denoising for a cleaner dataset. Subsequently, we establish a support vector regression model leveraging the multi-dimensional and small-sample modeling, suitable for this multi-source and mixed structures data. Finally, we introduce a random interference mechanism based on model features, then construct a mathematical framework for the differences in model accuracy loss, providing insights into the model's decision-making mechanisms. This work demonstrates that the data cleaning stage can provide more predictive data for subsequent prediction models, and the proposed model is validated through rigorous experimental design for its superiority in model comparison analysis, multi-step prediction, and interpretability analysis of selected features.

Suggested Citation

  • Jiang, Meiqin & Che, Jinxing & Li, Shuying & Hu, Kun & Xu, Yifan, 2025. "Incorporating key features from structured and unstructured data for enhanced carbon trading price forecasting with interpretability analysis," Applied Energy, Elsevier, vol. 382(C).
  • Handle: RePEc:eee:appene:v:382:y:2025:i:c:s0306261925000315
    DOI: 10.1016/j.apenergy.2025.125301
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261925000315
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2025.125301?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wang, Jujie & Zhuang, Zhenzhen & Gao, Dongming, 2023. "An enhanced hybrid model based on multiple influencing factors and divide-conquer strategy for carbon price prediction," Omega, Elsevier, vol. 120(C).
    2. Long, Huidian & Cao, Yafei, 2024. "A wavelet analysis of the relationship between carbon emissions rights and crude oil prices in China," Resources Policy, Elsevier, vol. 91(C).
    3. Hou, Guolian & Wang, Junjie & Fan, Yuzhen, 2024. "Multistep short-term wind power forecasting model based on secondary decomposition, the kernel principal component analysis, an enhanced arithmetic optimization algorithm, and error correction," Energy, Elsevier, vol. 286(C).
    4. Sun, Wei & Zhang, Junjian, 2022. "A novel carbon price prediction model based on optimized least square support vector machine combining characteristic-scale decomposition and phase space reconstruction," Energy, Elsevier, vol. 253(C).
    5. Wu, Huijuan & Meng, Keqilao & Fan, Daoerji & Zhang, Zhanqiang & Liu, Qing, 2022. "Multistep short-term wind speed forecasting using transformer," Energy, Elsevier, vol. 261(PA).
    6. Xu, Yingying & Dai, Yifan & Guo, Lingling & Chen, Jingjing, 2024. "Leveraging machine learning to forecast carbon returns: Factors from energy markets," Applied Energy, Elsevier, vol. 357(C).
    7. Ding, Lili & Zhang, Rui & Zhao, Xin, 2024. "Forecasting carbon price in China unified carbon market using a novel hybrid method with three-stage algorithm and long short-term memory neural networks," Energy, Elsevier, vol. 288(C).
    8. Liu, Jincheng & Li, Teng, 2024. "Multi-step power forecasting for regional photovoltaic plants based on ITDE-GAT model," Energy, Elsevier, vol. 293(C).
    9. Karijadi, Irene & Chou, Shuo-Yan & Dewabharata, Anindhita, 2023. "Wind power forecasting based on hybrid CEEMDAN-EWT deep learning method," Renewable Energy, Elsevier, vol. 218(C).
    10. Li, Jingmiao & Liu, Dehong, 2023. "Carbon price forecasting based on secondary decomposition and feature screening," Energy, Elsevier, vol. 278(PA).
    11. Zhou, Zhongbao & Lin, Ling & Li, Shuxian, 2018. "International stock market contagion: A CEEMDAN wavelet analysis," Economic Modelling, Elsevier, vol. 72(C), pages 333-352.
    12. Zhang, Yue-Jun & Wei, Yi-Ming, 2010. "An overview of current research on EU ETS: Evidence from its operating mechanism and economic effect," Applied Energy, Elsevier, vol. 87(6), pages 1804-1814, June.
    13. Pang, Junheng & Dong, Sheng, 2023. "A novel multivariable hybrid model to improve short and long-term significant wave height prediction," Applied Energy, Elsevier, vol. 351(C).
    14. Binrong Wu & Zhongrui Wang & Lin Wang, 2024. "Interpretable corn future price forecasting with multivariate time series," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(5), pages 1575-1594, August.
    15. Hongtao Li & Xiaoxuan Li & Shaolong Sun & Zhipeng Huang & Xiaoyan Jia, 2024. "Multivariable forecasting approach of high‐speed railway passenger demand based on residual term of Baidu search index and error correction," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(7), pages 2401-2433, November.
    16. Zhao, Lu-Tao & Zheng, Zhi-Yi & Wei, Yi-Ming, 2023. "Forecasting oil inventory changes with Google trends: A hybrid wavelet decomposer and ARDL-SVR ensemble model," Energy Economics, Elsevier, vol. 120(C).
    17. Zhou, Feite & Huang, Zhehao & Zhang, Changhong, 2022. "Carbon price forecasting based on CEEMDAN and LSTM," Applied Energy, Elsevier, vol. 311(C).
    18. Luo, Jian & Hong, Tao & Gao, Zheming & Fang, Shu-Cherng, 2023. "A robust support vector regression model for electric load forecasting," International Journal of Forecasting, Elsevier, vol. 39(2), pages 1005-1020.
    19. Zhang, Wen & Wu, Zhibin & Zeng, Xiaojun & Zhu, Changhui, 2023. "An ensemble dynamic self-learning model for multiscale carbon price forecasting," Energy, Elsevier, vol. 263(PC).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Jujie & Xu, Shulian & Shu, Shuqin, 2024. "An optimal weight heterogeneous integrated carbon price prediction model based on temporal information extraction and specific comprehensive feature selection," Energy, Elsevier, vol. 312(C).
    2. Yingjie Zhu & Yongfa Chen & Qiuling Hua & Jie Wang & Yinghui Guo & Zhijuan Li & Jiageng Ma & Qi Wei, 2024. "A Hybrid Model for Carbon Price Forecasting Based on Improved Feature Extraction and Non-Linear Integration," Mathematics, MDPI, vol. 12(10), pages 1-26, May.
    3. Bai, Yun & Deng, Shuyun & Pu, Ziqiang & Li, Chuan, 2024. "Carbon price forecasting using leaky integrator echo state networks with the framework of decomposition-reconstruction-integration," Energy, Elsevier, vol. 305(C).
    4. Wu, Han & Du, Pei, 2024. "Dual-stream transformer-attention fusion network for short-term carbon price prediction," Energy, Elsevier, vol. 311(C).
    5. Liu, Shuihan & Li, Mingchen & Yang, Kun & Wei, Yunjie & Wang, Shouyang, 2025. "From forecasting to trading: A multimodal-data-driven approach to reversing carbon market losses," Energy Economics, Elsevier, vol. 144(C).
    6. Tian, Zhirui & Sun, Wenpu & Wu, Chenye, 2025. "MLP-Carbon: A new paradigm integrating multi-frequency and multi-scale techniques for accurate carbon price forecasting," Applied Energy, Elsevier, vol. 383(C).
    7. Beibei Hu & Yunhe Cheng, 2023. "Prediction of Regional Carbon Price in China Based on Secondary Decomposition and Nonlinear Error Correction," Energies, MDPI, vol. 16(11), pages 1-22, May.
    8. Zhao, Yang & Wang, Jianzhou & Wang, Shuai & Zheng, Jingwei & Lv, Mengzheng, 2025. "Using explainable deep learning to improve decision quality: Evidence from carbon trading market," Omega, Elsevier, vol. 133(C).
    9. Zeng, Liling & Hu, Huanling & Song, Qingkui & Zhang, Boting & Lin, Ruibin & Zhang, Dabin, 2024. "A drift-aware dynamic ensemble model with two-stage member selection for carbon price forecasting," Energy, Elsevier, vol. 313(C).
    10. Wang, Yue & Wang, Zhong & Luo, Yuyan, 2024. "A hybrid carbon price forecasting model combining time series clustering and data augmentation," Energy, Elsevier, vol. 308(C).
    11. Geng, Donghan & Zhang, Yongkang & Zhang, Yunlong & Qu, Xingchuang & Li, Longfei, 2025. "A hybrid model based on CapSA-VMD-ResNet-GRU-attention mechanism for ultra-short-term and short-term wind speed prediction," Renewable Energy, Elsevier, vol. 240(C).
    12. Na Fu & Liyan Geng & Junhai Ma & Xue Ding, 2023. "Price, Complexity, and Mathematical Model," Mathematics, MDPI, vol. 11(13), pages 1-30, June.
    13. Wang, Ning & Guo, Ziyu & Shang, Dawei & Li, Keyuyang, 2024. "Carbon trading price forecasting in digitalization social change era using an explainable machine learning approach: The case of China as emerging country evidence," Technological Forecasting and Social Change, Elsevier, vol. 200(C).
    14. Li, Jingmiao & Liu, Dehong, 2023. "Carbon price forecasting based on secondary decomposition and feature screening," Energy, Elsevier, vol. 278(PA).
    15. Xian, Sidong & Feng, Miaomiao & Cheng, Yue, 2023. "Incremental nonlinear trend fuzzy granulation for carbon trading time series forecast," Applied Energy, Elsevier, vol. 352(C).
    16. Yin, Hao & Yin, Yiding & Li, Hanhong & Zhu, Jianbin & Xian, Zikang & Tang, Yanshu & Xiao, Liexi & Rong, Jiayu & Li, Chen & Zhang, Haitao & Xie, Zhifeng & Meng, Anbo, 2025. "Carbon emissions trading price forecasting based on temporal-spatial multidimensional collaborative attention network and segment imbalance regression," Applied Energy, Elsevier, vol. 377(PA).
    17. Xiaolu Wei & Hongbing Ouyang, 2024. "Carbon price prediction based on a scaled PCA approach," PLOS ONE, Public Library of Science, vol. 19(1), pages 1-16, January.
    18. Cao, Jin-Hui & Xie, Chi & Zhou, Yang & Wang, Gang-Jin & Zhu, You, 2025. "Forecasting carbon price: A novel multi-factor spatial-temporal GNN framework integrating Graph WaveNet and self-attention mechanism," Energy Economics, Elsevier, vol. 144(C).
    19. Cheng, Runkun & Yang, Di & Liu, Da & Zhang, Guowei, 2024. "A reconstruction-based secondary decomposition-ensemble framework for wind power forecasting," Energy, Elsevier, vol. 308(C).
    20. Li, Jianfang & Jia, Li & Zhou, Chengyu, 2024. "Probability density function based adaptive ensemble learning with global convergence for wind power prediction," Energy, Elsevier, vol. 312(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:382:y:2025:i:c:s0306261925000315. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.