IDEAS home Printed from https://ideas.repec.org/a/eee/intfor/v32y2016i2p243-256.html
   My bibliography  Save this article

Forecasting sales of new and existing products using consumer reviews: A random projections approach

Author

Listed:
  • Schneider, Matthew J.
  • Gupta, Sachin

Abstract

We consider the problem of predicting sales of new and existing products using both the numeric and textual data contained in consumer reviews. Many of the extant approaches require considerable manual pre-processing of the textual data, making the methods prohibitively expensive to implement and difficult to scale. In contrast, our approach uses a bag-of-words method that requires minimal pre-processing and parsing, making it efficient and scalable. However, a key implementation challenge with the bag-of-words approach is that the number of predictors can quickly outstrip the number of degrees of freedom available. Furthermore, the method can require impracticably large computational resources. We propose a random projections approach for dealing with the curse-of-dimensionality issue that afflicts bag-of-words models. The random projections approach is computationally simple, flexible and fast, and has desirable statistical properties. We apply the proposed approach to the forecasting of sales at Amazon.com using consumer reviews with an attributes-based regression model. The model is applied to produce of one-week-ahead rolling horizon sales forecasts for existing and newly-introduced tablet computers. The results show that the predictive performance of the proposed approach for both tasks is strong and significantly better than those of either models that ignore the textual content of consumer reviews, or a support vector regression machine with the textual content. Furthermore, the approach is easy to repeat across product categories, and readily scalable to much larger datasets.

Suggested Citation

  • Schneider, Matthew J. & Gupta, Sachin, 2016. "Forecasting sales of new and existing products using consumer reviews: A random projections approach," International Journal of Forecasting, Elsevier, vol. 32(2), pages 243-256.
  • Handle: RePEc:eee:intfor:v:32:y:2016:i:2:p:243-256
    DOI: 10.1016/j.ijforecast.2015.08.005
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0169207015001235
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ijforecast.2015.08.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Judith Chevalier & Austan Goolsbee, 2003. "Measuring Prices and Price Competition Online: Amazon.com and BarnesandNoble.com," Quantitative Marketing and Economics (QME), Springer, vol. 1(2), pages 203-222, June.
    2. De Gooijer, Jan G. & Hyndman, Rob J., 2006. "25 years of time series forecasting," International Journal of Forecasting, Elsevier, vol. 22(3), pages 443-473.
    3. Monic Sun, 2012. "How Does the Variance of Product Ratings Matter?," Management Science, INFORMS, vol. 58(4), pages 696-707, April.
    4. Nikolay Archak & Anindya Ghose & Panagiotis G. Ipeirotis, 2011. "Deriving the Pricing Power of Product Features by Mining Consumer Reviews," Management Science, INFORMS, vol. 57(8), pages 1485-1509, August.
    5. Yubo Chen & Jinhong Xie, 2008. "Online Consumer Review: Word-of-Mouth as a New Element of Marketing Communication Mix," Management Science, INFORMS, vol. 54(3), pages 477-491, March.
    6. Hyndman, Rob J. & Koehler, Anne B., 2006. "Another look at measures of forecast accuracy," International Journal of Forecasting, Elsevier, vol. 22(4), pages 679-688.
    7. Xinxin Li & Lorin M. Hitt, 2008. "Self-Selection and Information Role of Online Product Reviews," Information Systems Research, INFORMS, vol. 19(4), pages 456-474, December.
    8. Erik Brynjolfsson & Yu (Jeffrey) Hu & Michael D. Smith, 2003. "Consumer Surplus in the Digital Economy: Estimating the Value of Increased Product Variety at Online Booksellers," Management Science, INFORMS, vol. 49(11), pages 1580-1596, November.
    9. Yi Zhao & Sha Yang & Vishal Narayan & Ying Zhao, 2013. "Modeling Consumer Learning from Online Product Reviews," Marketing Science, INFORMS, vol. 32(1), pages 153-169, May.
    10. Oded Netzer & Ronen Feldman & Jacob Goldenberg & Moshe Fresko, 2012. "Mine Your Own Business: Market-Structure Surveillance Through Text Mining," Marketing Science, INFORMS, vol. 31(3), pages 521-543, May.
    11. Armstrong, J. Scott & Collopy, Fred, 1992. "Error measures for generalizing about forecasting methods: Empirical comparisons," International Journal of Forecasting, Elsevier, vol. 8(1), pages 69-80, June.
    12. Stephan Kolassa & Wolfgang Schütz, 2007. "Advantages of the MAD/Mean Ratio over the MAPE," Foresight: The International Journal of Applied Forecasting, International Institute of Forecasters, issue 6, pages 40-43, Spring.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Young Kwark & Jianqing Chen & Srinivasan Raghunathan, 2018. "User-Generated Content and Competing Firms’ Product Design," Management Science, INFORMS, vol. 64(10), pages 4608-4628, October.
    2. Kun Chen & Peng Luo & Huaiqing Wang, 2017. "Investigating transitive influences on WOM: from the product network perspective," Electronic Commerce Research, Springer, vol. 17(1), pages 149-167, March.
    3. Bin Gu & Jaehong Park & Prabhudev Konana, 2012. "Research Note ---The Impact of External Word-of-Mouth Sources on Retailer Sales of High-Involvement Products," Information Systems Research, INFORMS, vol. 23(1), pages 182-196, March.
    4. Dominik Gutt & Jürgen Neumann & Steffen Zimmermann & Dennis Kundisch & Jianqing Chen, 2018. "Design of Review Systems - A Strategic Instrument to shape Online Review Behavior and Economic Outcomes," Working Papers Dissertations 42, Paderborn University, Faculty of Business Administration and Economics.
    5. Dominik Gutt, 2018. "In the Eye of the Beholder? Empirically Decomposing Different Economic Implications of the Online Rating Variance," Working Papers Dissertations 40, Paderborn University, Faculty of Business Administration and Economics.
    6. Floyd, Kristopher & Freling, Ryan & Alhoqail, Saad & Cho, Hyun Young & Freling, Traci, 2014. "How Online Product Reviews Affect Retail Sales: A Meta-analysis," Journal of Retailing, Elsevier, vol. 90(2), pages 217-232.
    7. Juan Feng & Xin Li & Xiaoquan (Michael) Zhang, 2019. "Online Product Reviews-Triggered Dynamic Pricing: Theory and Evidence," Information Systems Research, INFORMS, vol. 30(4), pages 1107-1123, December.
    8. Linyi Li & Shyam Gopinath & Stephen J. Carson, 2022. "History Matters: The Impact of Online Customer Reviews Across Product Generations," Management Science, INFORMS, vol. 68(5), pages 3878-3903, May.
    9. Li, Yiming & Li, Gang & Tayi, Giri Kumar & Cheng, T.C.E., 2019. "Omni-channel retailing: Do offline retailers benefit from online reviews?," International Journal of Production Economics, Elsevier, vol. 218(C), pages 43-61.
    10. Young Kwark & Jianqing Chen & Srinivasan Raghunathan, 2014. "Online Product Reviews: Implications for Retailers and Competing Manufacturers," Information Systems Research, INFORMS, vol. 25(1), pages 93-110, March.
    11. Fildes, Robert & Ma, Shaohui & Kolassa, Stephan, 2022. "Retail forecasting: Research and practice," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1283-1318.
    12. Nikolay Archak & Anindya Ghose & Panagiotis G. Ipeirotis, 2011. "Deriving the Pricing Power of Product Features by Mining Consumer Reviews," Management Science, INFORMS, vol. 57(8), pages 1485-1509, August.
    13. Anning Wang & Qiang Zhang & Shuangyao Zhao & Xiaonong Lu & Zhanglin Peng, 2020. "A review-driven customer preference measurement model for product improvement: sentiment-based importance–performance analysis," Information Systems and e-Business Management, Springer, vol. 18(1), pages 61-88, March.
    14. Peiyu Chen & Lorin M. Hitt & Yili Hong & Shinyi Wu, 2021. "Measuring Product Type and Purchase Uncertainty with Online Product Ratings: A Theoretical Model and Empirical Application," Information Systems Research, INFORMS, vol. 32(4), pages 1470-1489, December.
    15. Steffen Zimmermann & Philipp Herrmann & Dennis Kundisch & Barrie R. Nault, 2018. "Decomposing the Variance of Consumer Ratings and the Impact on Price and Demand," Information Systems Research, INFORMS, vol. 29(4), pages 984-1002, December.
    16. Sun, Miao & Chen, Jing & Tian, Ye & Yan, Yufei, 2021. "The impact of online reviews in the presence of customer returns," International Journal of Production Economics, Elsevier, vol. 232(C).
    17. Sungsik Park & Woochoel Shin & Jinhong Xie, 2021. "The Fateful First Consumer Review," Marketing Science, INFORMS, vol. 40(3), pages 481-507, May.
    18. Kim, Sungil & Kim, Heeyoung, 2016. "A new metric of absolute percentage error for intermittent demand forecasts," International Journal of Forecasting, Elsevier, vol. 32(3), pages 669-679.
    19. Qiao, Haike & Su, Qin, 2021. "Distribution channel and licensing strategy choice considering consumer online reviews in a closed-loop supply chain," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 151(C).
    20. Yusong Wang & David Bell, 2015. "Consumer store choice in Asian markets," Marketing Letters, Springer, vol. 26(3), pages 293-308, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:intfor:v:32:y:2016:i:2:p:243-256. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/ijforecast .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.