IDEAS home Printed from https://ideas.repec.org/a/spr/elcore/v25y2025i1d10.1007_s10660-023-09701-9.html
   My bibliography  Save this article

Forecasting product sales using text mining: a case study in new energy vehicle

Author

Listed:
  • Yi Ding

    (Anhui University)

  • Peng Wu

    (Anhui University
    Anhui University)

  • Jie Zhao

    (Anhui University)

  • Ligang Zhou

    (Anhui University
    Anhui University)

Abstract

This study aims to improve the prediction accuracy of product sales by developing an online review-driven combination forecasting model. The proposed model includes two parts: online reviews and the forecasting model. For online reviews, the sentiment value concerning different product attributes is defined from the sentiment score and the sentiment tendency based on prospect theory. Furthermore, the Mallat pyramid algorithm is used to mitigate the impact of reviews with nonstandard expressions, malicious reviews and spam on the sentiment value of online reviews. A combination forecasting model composed of a backpropagation neural network (BPNN), recurrent neural network (RNN) and long short-term memory (LSTM) neural network is constructed. Taking the sales of BYD-Tang as a case study, some statistical evaluation indicators and the Diebold-Mariano (DM) test indicate the superior performance of our proposed online review-driven combination forecasting model in prediction accuracy.

Suggested Citation

  • Yi Ding & Peng Wu & Jie Zhao & Ligang Zhou, 2025. "Forecasting product sales using text mining: a case study in new energy vehicle," Electronic Commerce Research, Springer, vol. 25(1), pages 495-527, February.
  • Handle: RePEc:spr:elcore:v:25:y:2025:i:1:d:10.1007_s10660-023-09701-9
    DOI: 10.1007/s10660-023-09701-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10660-023-09701-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10660-023-09701-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wang, Jue & Zhou, Hao & Hong, Tao & Li, Xiang & Wang, Shouyang, 2020. "A multi-granularity heterogeneous combination approach to crude oil price forecasting," Energy Economics, Elsevier, vol. 91(C).
    2. Jitendra Kumar Rout & Kim-Kwang Raymond Choo & Amiya Kumar Dash & Sambit Bakshi & Sanjay Kumar Jena & Karen L. Williams, 2018. "A model for sentiment and emotion analysis of unstructured social media text," Electronic Commerce Research, Springer, vol. 18(1), pages 181-199, March.
    3. Ling-Ling Pei & Qin Li, 2019. "Forecasting Quarterly Sales Volume of the New Energy Vehicles Industry in China Using a Data Grouping Approach-Based Nonlinear Grey Bernoulli Model," Sustainability, MDPI, vol. 11(5), pages 1-15, February.
    4. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    5. Hong Pan & Hanxun Zhou, 2020. "Study on convolutional neural network and its application in data mining and sales forecasting for E-commerce," Electronic Commerce Research, Springer, vol. 20(2), pages 297-320, June.
    6. Kolassa, Stephan, 2011. "Combining exponential smoothing forecasts using Akaike weights," International Journal of Forecasting, Elsevier, vol. 27(2), pages 238-251, April.
    7. Kourentzes, Nikolaos & Barrow, Devon & Petropoulos, Fotios, 2019. "Another look at forecast selection and combination: Evidence from forecast pooling," International Journal of Production Economics, Elsevier, vol. 209(C), pages 226-235.
    8. Daniel Kahneman & Amos Tversky, 2013. "Prospect Theory: An Analysis of Decision Under Risk," World Scientific Book Chapters, in: Leonard C MacLean & William T Ziemba (ed.), HANDBOOK OF THE FUNDAMENTALS OF FINANCIAL DECISION MAKING Part I, chapter 6, pages 99-127, World Scientific Publishing Co. Pte. Ltd..
    9. Li, Hongmin & Wang, Jianzhou & Lu, Haiyan & Guo, Zhenhai, 2018. "Research and application of a combined model based on variable weight for short term wind speed forecasting," Renewable Energy, Elsevier, vol. 116(PA), pages 669-684.
    10. Liu, Bingchun & Song, Chengyuan & Wang, Qingshan & Zhang, Xinming & Chen, Jiali, 2022. "Research on regional differences of China's new energy vehicles promotion policies: A perspective of sales volume forecasting," Energy, Elsevier, vol. 248(C).
    11. Francis X. Diebold & Peter Pauly, 1987. "Structural change and the combination of forecasts," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 6(1), pages 21-40.
    12. Hui Shi & Zhongming Ma & Dazhi Chong & Wu He, 2021. "The impact of Facebook on real estate sales," Journal of Management Analytics, Taylor & Francis Journals, vol. 8(1), pages 101-112, January.
    13. Schneider, Matthew J. & Gupta, Sachin, 2016. "Forecasting sales of new and existing products using consumer reviews: A random projections approach," International Journal of Forecasting, Elsevier, vol. 32(2), pages 243-256.
    14. Zhu, Bangzhu & Wei, Yiming, 2013. "Carbon price forecasting with a novel hybrid ARIMA and least squares support vector machines methodology," Omega, Elsevier, vol. 41(3), pages 517-524.
    15. Barrow, Devon K. & Crone, Sven F., 2016. "Cross-validation aggregation for combining autoregressive neural network forecasts," International Journal of Forecasting, Elsevier, vol. 32(4), pages 1120-1137.
    16. Rodrigues, Bruno Dore & Stevenson, Maxwell J., 2013. "Takeover prediction using forecast combinations," International Journal of Forecasting, Elsevier, vol. 29(4), pages 628-641.
    17. Li, Yiman & Peng, Tian & Zhang, Chu & Sun, Wei & Hua, Lei & Ji, Chunlei & Muhammad Shahzad, Nazir, 2022. "Multi-step ahead wind speed forecasting approach coupling maximal overlap discrete wavelet transform, improved grey wolf optimization algorithm and long short-term memory," Renewable Energy, Elsevier, vol. 196(C), pages 1115-1126.
    18. Zhang, Chuan & Tian, Yu-Xin & Fan, Zhi-Ping, 2022. "Forecasting sales using online review and search engine data: A method based on PCA–DSFOA–BPNN," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1005-1024.
    19. Yu, Lean & Liang, Shaodong & Chen, Rongda & Lai, Kin Keung, 2022. "Predicting monthly biofuel production using a hybrid ensemble forecasting methodology," International Journal of Forecasting, Elsevier, vol. 38(1), pages 3-20.
    20. Fan, Zhi-Ping & Che, Yu-Jie & Chen, Zhen-Yu, 2017. "Product sales forecasting using online reviews and historical sales data: A method combining the Bass model and sentiment analysis," Journal of Business Research, Elsevier, vol. 74(C), pages 90-100.
    21. Wang, Jianzhou & Zhu, Suling & Zhang, Wenyu & Lu, Haiyan, 2010. "Combined modeling for electric load forecasting with adaptive particle swarm optimization," Energy, Elsevier, vol. 35(4), pages 1671-1678.
    22. Kolassa, Stephan, 2011. "Combining exponential smoothing forecasts using Akaike weights," International Journal of Forecasting, Elsevier, vol. 27(2), pages 238-251.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    2. Evangelos Spiliotis & Fotios Petropoulos & Vassilios Assimakopoulos, 2023. "On the Disagreement of Forecasting Model Selection Criteria," Forecasting, MDPI, vol. 5(2), pages 1-12, June.
    3. Wang, Xiaoqian & Hyndman, Rob J. & Li, Feng & Kang, Yanfei, 2023. "Forecast combinations: An over 50-year review," International Journal of Forecasting, Elsevier, vol. 39(4), pages 1518-1547.
    4. Makridakis, Spyros & Hyndman, Rob J. & Petropoulos, Fotios, 2020. "Forecasting in social settings: The state of the art," International Journal of Forecasting, Elsevier, vol. 36(1), pages 15-28.
    5. Kourentzes, Nikolaos & Barrow, Devon & Petropoulos, Fotios, 2019. "Another look at forecast selection and combination: Evidence from forecast pooling," International Journal of Production Economics, Elsevier, vol. 209(C), pages 226-235.
    6. Fotios Petropoulos & Enno Siemsen, 2023. "Forecast Selection and Representativeness," Management Science, INFORMS, vol. 69(5), pages 2672-2690, May.
    7. Pietro Giorgio Lovaglio, 2025. "Cross‐Learning With Panel Data Modeling for Stacking and Forecast Time Series Employment in Europe," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 44(2), pages 753-780, March.
    8. WenJie Wang & Qi Xu & Dandan Fan, 2018. "Stein-Rule Combination Forecasting on RFID Based Supply Chain," Asia-Pacific Journal of Operational Research (APJOR), World Scientific Publishing Co. Pte. Ltd., vol. 35(02), pages 1-13, April.
    9. Meira, Erick & Cyrino Oliveira, Fernando Luiz & Jeon, Jooyoung, 2021. "Treating and Pruning: New approaches to forecasting model selection and combination using prediction intervals," International Journal of Forecasting, Elsevier, vol. 37(2), pages 547-568.
    10. Hollyman, Ross & Petropoulos, Fotios & Tipping, Michael E., 2021. "Understanding forecast reconciliation," European Journal of Operational Research, Elsevier, vol. 294(1), pages 149-160.
    11. Carlos Medel, 2017. "Forecasting Chilean inflation with the hybrid new keynesian Phillips curve: globalisation, combination, and accuracy," Journal Economía Chilena (The Chilean Economy), Central Bank of Chile, vol. 20(3), pages 004-050, December.
    12. Kourentzes, Nikolaos & Petropoulos, Fotios & Trapero, Juan R., 2014. "Improving forecasting by estimating time series structural components across multiple frequencies," International Journal of Forecasting, Elsevier, vol. 30(2), pages 291-302.
    13. Pablo Pincheira-Brown & Andrea Bentancor & Nicolás Hardy, 2023. "An Inconvenient Truth about Forecast Combinations," Mathematics, MDPI, vol. 11(18), pages 1-24, September.
    14. Carlos A. Medel, 2018. "Forecasting Inflation with the Hybrid New Keynesian Phillips Curve: A Compact-Scale Global VAR Approach," International Economic Journal, Taylor & Francis Journals, vol. 32(3), pages 331-371, July.
    15. Hao, Jun & Feng, Qianqian & Yuan, Jiaxin & Sun, Xiaolei & Li, Jianping, 2022. "A dynamic ensemble learning with multi-objective optimization for oil prices prediction," Resources Policy, Elsevier, vol. 79(C).
    16. Mingyang Zhang & Heyan Xu & Ning Ma & Xinglin Pan, 2022. "Intelligent Vehicle Sales Prediction Based on Online Public Opinion and Online Search Index," Sustainability, MDPI, vol. 14(16), pages 1-17, August.
    17. Villegas, Marco A. & Pedregal, Diego J., 2019. "Automatic selection of unobserved components models for supply chain forecasting," International Journal of Forecasting, Elsevier, vol. 35(1), pages 157-169.
    18. Athanasopoulos, George & Hyndman, Rob J. & Kourentzes, Nikolaos & Petropoulos, Fotios, 2017. "Forecasting with temporal hierarchies," European Journal of Operational Research, Elsevier, vol. 262(1), pages 60-74.
    19. Markopoulou, Chrysi E. & Skintzi, Vasiliki D. & Refenes, Apostolos-Paul N., 2016. "Realized hedge ratio: Predictability and hedging performance," International Review of Financial Analysis, Elsevier, vol. 45(C), pages 121-133.
    20. Bernard Moeketsi Hlalele, 2022. "A Comparative Analysis Of Holts-Winters’ And Neural Network Prediction Models On Annual Bloemfontein’S Precipitation: Risk Aversion," Big Data In Agriculture (BDA), Zibeline International Publishing, vol. 4(1), pages 17-21, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:elcore:v:25:y:2025:i:1:d:10.1007_s10660-023-09701-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.