IDEAS home Printed from https://ideas.repec.org/a/wly/jforec/v42y2023i7p1865-1888.html
   My bibliography  Save this article

Comprehensive commodity price forecasting framework using text mining methods

Author

Listed:
  • Wuyue An
  • Lin Wang
  • Dongfeng Zhang

Abstract

Exploiting advanced and appropriate methods to construct high‐quality features from different types of data becomes crucial in agricultural futures price forecasting. Thus, this study develops a comprehensive commodity price forecasting framework using text mining methods. First, the modal features of the price series are extracted using the proposed Integrated‐EEMD‐VMD‐SE method, and dynamic topic sentiment features are constructed from Weibo texts using the proposed dynamic topic model joint sentiment analysis method. Second, combined with statistical variables, lag order selection and feature selection are performed on these comprehensive factors. Finally, 12 comparative prediction models are designed based on random forest (RF), long short‐term memory (LSTM), and multilayer perceptron (MLP), and empirical analysis is carried out on two cases of pork prices and soybean futures prices. The experimental results show that the proposed prediction framework has high prediction accuracy, and the mean absolute percentage error (MAPE) values are 1.00 and 0.92, respectively. The constructed time series modal features and dynamic topic sentiment features can significantly improve the performance of the prediction model.

Suggested Citation

  • Wuyue An & Lin Wang & Dongfeng Zhang, 2023. "Comprehensive commodity price forecasting framework using text mining methods," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(7), pages 1865-1888, November.
  • Handle: RePEc:wly:jforec:v:42:y:2023:i:7:p:1865-1888
    DOI: 10.1002/for.2985
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/for.2985
    Download Restriction: no

    File URL: https://libkey.io/10.1002/for.2985?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Lv, Sheng-Xiang & Wang, Lin, 2023. "Multivariate wind speed forecasting based on multi-objective feature selection approach and hybrid deep learning model," Energy, Elsevier, vol. 263(PE).
    2. Wu, Binrong & Wang, Lin & Wang, Sirui & Zeng, Yu-Rong, 2021. "Forecasting the U.S. oil markets based on social media information during the COVID-19 pandemic," Energy, Elsevier, vol. 226(C).
    3. Xingrui Jiao & Yuping Song & Yang Kong & Xiaolong Tang, 2022. "Volatility forecasting for crude oil based on text information and deep learning PSO‐LSTM model," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(5), pages 933-944, August.
    4. Wang, Jue & Wang, Zhen & Li, Xiang & Zhou, Hao, 2022. "Artificial bee colony-based combination approach to forecasting agricultural commodity prices," International Journal of Forecasting, Elsevier, vol. 38(1), pages 21-34.
    5. Li, Jianping & Li, Guowen & Liu, Mingxi & Zhu, Xiaoqian & Wei, Lu, 2022. "A novel text-based framework for forecasting agricultural futures using massive online news headlines," International Journal of Forecasting, Elsevier, vol. 38(1), pages 35-50.
    6. Qiang Gao & Xiao Huang & Ke Dong & Zhentao Liang & Jiang Wu, 2022. "Semantic-enhanced topic evolution analysis: a combination of the dynamic topic model and word2vec," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(3), pages 1543-1563, March.
    7. Ga-Ae Ryu & Aziz Nasridinov & HyungChul Rah & Kwan-Hee Yoo, 2020. "Forecasts of the Amount Purchase Pork Meat by Using Structured and Unstructured Big Data," Agriculture, MDPI, vol. 10(1), pages 1-14, January.
    8. Lago, Jesus & De Ridder, Fjo & De Schutter, Bart, 2018. "Forecasting spot electricity prices: Deep learning approaches and empirical comparison of traditional algorithms," Applied Energy, Elsevier, vol. 221(C), pages 386-405.
    9. Massaoudi, Mohamed & Refaat, Shady S. & Chihi, Ines & Trabelsi, Mohamed & Oueslati, Fakhreddine S. & Abu-Rub, Haitham, 2021. "A novel stacked generalization ensemble-based hybrid LGBM-XGB-MLP model for Short-Term Load Forecasting," Energy, Elsevier, vol. 214(C).
    10. Szoplik, Jolanta, 2015. "Forecasting of natural gas consumption with artificial neural networks," Energy, Elsevier, vol. 85(C), pages 208-220.
    11. Liwen Ling & Dabin Zhang & Shanying Chen & Amin W. Mugera, 2020. "Can online search data improve the forecast accuracy of pork price in China?," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(4), pages 671-686, July.
    12. Sun, Na & Zhou, Jianzhong & Chen, Lu & Jia, Benjun & Tayyab, Muhammad & Peng, Tian, 2018. "An adaptive dynamic short-term wind speed forecasting model using secondary decomposition and an improved regularized extreme learning machine," Energy, Elsevier, vol. 165(PB), pages 939-957.
    13. Peng, Lu & Wang, Lin & Xia, De & Gao, Qinglu, 2022. "Effective energy consumption forecasting using empirical wavelet transform and long short-term memory," Energy, Elsevier, vol. 238(PB).
    14. Bai, Yun & Li, Xixi & Yu, Hao & Jia, Suling, 2022. "Crude oil price forecasting incorporating news text," International Journal of Forecasting, Elsevier, vol. 38(1), pages 367-383.
    15. Jingfei Zhang & Zhicheng Zheng & Lijun Zhang & Yaochen Qin & Jieran Duan & Anyi Zhang, 2021. "Influencing Factors of Environmental Risk Perception during the COVID-19 Epidemic in China," IJERPH, MDPI, vol. 18(17), pages 1-21, September.
    16. Wu, Binrong & Wang, Lin & Zeng, Yu-Rong, 2022. "Interpretable wind speed prediction with multivariate time series and temporal fusion transformers," Energy, Elsevier, vol. 252(C).
    17. Wang, Cong & Zhang, Hongli & Fan, Wenhui & Ma, Ping, 2017. "A new chaotic time series hybrid prediction method of wind power based on EEMD-SE and full-parameters continued fraction," Energy, Elsevier, vol. 138(C), pages 977-990.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Carlos J. Delgado & Estefanía Alfaro-Mejía & Vidya Manian & Efrain O’Neill-Carrillo & Fabio Andrade, 2024. "Photovoltaic Power Generation Forecasting with Hidden Markov Model and Long Short-Term Memory in MISO and SISO Configurations," Energies, MDPI, vol. 17(3), pages 1-18, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wuyue An & Lin Wang & Yu‐Rong Zeng, 2023. "Text‐based soybean futures price forecasting: A two‐stage deep learning approach," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(2), pages 312-330, March.
    2. Rui Luo & Jinpei Liu & Piao Wang & Zhifu Tao & Huayou Chen, 2024. "A multisource data‐driven combined forecasting model based on internet search keyword screening method for interval soybean futures price," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 43(2), pages 366-390, March.
    3. Katarzyna Rudnik & Anna Hnydiuk-Stefan & Aneta Kucińska-Landwójtowicz & Łukasz Mach, 2022. "Forecasting Day-Ahead Carbon Price by Modelling Its Determinants Using the PCA-Based Approach," Energies, MDPI, vol. 15(21), pages 1-23, October.
    4. Wang, Delu & Gan, Jun & Mao, Jinqi & Chen, Fan & Yu, Lan, 2023. "Forecasting power demand in China with a CNN-LSTM model including multimodal information," Energy, Elsevier, vol. 263(PE).
    5. Shengxiang Lv & Lin Wang & Sirui Wang, 2023. "A Hybrid Neural Network Model for Short-Term Wind Speed Forecasting," Energies, MDPI, vol. 16(4), pages 1-18, February.
    6. Stefenon, Stefano Frizzo & Seman, Laio Oriel & Aquino, Luiza Scapinello & Coelho, Leandro dos Santos, 2023. "Wavelet-Seq2Seq-LSTM with attention for time series forecasting of level of dams in hydroelectric power plants," Energy, Elsevier, vol. 274(C).
    7. Hao, Jun & Feng, Qianqian & Yuan, Jiaxin & Sun, Xiaolei & Li, Jianping, 2022. "A dynamic ensemble learning with multi-objective optimization for oil prices prediction," Resources Policy, Elsevier, vol. 79(C).
    8. Lv, Sheng-Xiang & Wang, Lin, 2022. "Deep learning combined wind speed forecasting with hybrid time series decomposition and multi-objective parameter optimization," Applied Energy, Elsevier, vol. 311(C).
    9. Marcus Vinicius Santos & Fernando Morgado-Dias & Thiago C. Silva, 2023. "Oil Sector and Sentiment Analysis—A Review," Energies, MDPI, vol. 16(12), pages 1-29, June.
    10. Liyang Tang, 2020. "Application of Nonlinear Autoregressive with Exogenous Input (NARX) neural network in macroeconomic forecasting, national goal setting and global competitiveness assessment," Papers 2005.08735, arXiv.org.
    11. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    12. Tang, Ling & Wang, Haohan & Li, Ling & Yang, Kaitong & Mi, Zhifu, 2020. "Quantitative models in emission trading system research: A literature review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 132(C).
    13. Miguel A. Jaramillo-Morán & Daniel Fernández-Martínez & Agustín García-García & Diego Carmona-Fernández, 2021. "Improving Artificial Intelligence Forecasting Models Performance with Data Preprocessing: European Union Allowance Prices Case Study," Energies, MDPI, vol. 14(23), pages 1-23, November.
    14. Konstantinos Papageorgiou & Elpiniki I. Papageorgiou & Katarzyna Poczeta & Dionysis Bochtis & George Stamoulis, 2020. "Forecasting of Day-Ahead Natural Gas Consumption Demand in Greece Using Adaptive Neuro-Fuzzy Inference System," Energies, MDPI, vol. 13(9), pages 1-32, May.
    15. Chen, Ying & Xu, Xiuqin & Koch, Thorsten, 2020. "Day-ahead high-resolution forecasting of natural gas demand and supply in Germany with a hybrid model," Applied Energy, Elsevier, vol. 262(C).
    16. Shao, Zhen & Zheng, Qingru & Yang, Shanlin & Gao, Fei & Cheng, Manli & Zhang, Qiang & Liu, Chen, 2020. "Modeling and forecasting the electricity clearing price: A novel BELM based pattern classification framework and a comparative analytic study on multi-layer BELM and LSTM," Energy Economics, Elsevier, vol. 86(C).
    17. Yuo-Hsien Shiau & Su-Fen Yang & Rishan Adha & Syamsiyatul Muzayyanah, 2022. "Modeling Industrial Energy Demand in Relation to Subsector Manufacturing Output and Climate Change: Artificial Neural Network Insights," Sustainability, MDPI, vol. 14(5), pages 1-18, March.
    18. Magazzino, Cosimo & Mele, Marco & Schneider, Nicolas, 2021. "A D2C algorithm on the natural gas consumption and economic growth: Challenges faced by Germany and Japan," Energy, Elsevier, vol. 219(C).
    19. Li, Guanzheng & Li, Bin & Li, Chao & Wang, Shuai, 2023. "State-of-health rapid estimation for lithium-ion battery based on an interpretable stacking ensemble model with short-term voltage profiles," Energy, Elsevier, vol. 263(PE).
    20. Zhou, Jianguo & Xu, Zhongtian, 2023. "Optimal sizing design and integrated cost-benefit assessment of stand-alone microgrid system with different energy storage employing chameleon swarm algorithm: A rural case in Northeast China," Renewable Energy, Elsevier, vol. 202(C), pages 1110-1137.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jforec:v:42:y:2023:i:7:p:1865-1888. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www3.interscience.wiley.com/cgi-bin/jhome/2966 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.