IDEAS home Printed from https://ideas.repec.org/a/pal/jorapm/v19y2020i5d10.1057_s41272-020-00236-4.html
   My bibliography  Save this article

Interpretable machine learning for demand modeling with high-dimensional data using Gradient Boosting Machines and Shapley values

Author

Listed:
  • Evgeny A. Antipov

    (National Research University Higher School of Economics)

  • Elena B. Pokryshevskaya

    (National Research University Higher School of Economics)

Abstract

Forecasting demand and understanding sales drivers are one of the most important tasks in retail analytics. However, traditionally, linear models and/or models with a small number of predictors have been predominantly used in sales modeling. Taking into account that real-world demand is naturally determined by complex substitution and complementation patterns among a large number of interrelated SKUs, nonlinear effects of prices, promotions, seasonality, as well as many other factors, their lagged values, and interactions, a realistic model has to be able to account for all that. We propose a conceptual model for sales modeling based on standard POS data available to any retailer and generate almost 500 potentially useful predictors of a focal SKU’s sales accordingly. In our comparison of three classes of models, Gradient Boosting Machines outperformed Random Forests and Elastic nets. By using interpretable machine learning methods, we came up with actionable insights related to the importance of various groups of predictors from the conceptual model, as well as demonstrated how helpful it can be for marketing managers to decompose predictions into the effects of individual regressors by using an approximation of Shapley values for feature attribution.

Suggested Citation

  • Evgeny A. Antipov & Elena B. Pokryshevskaya, 2020. "Interpretable machine learning for demand modeling with high-dimensional data using Gradient Boosting Machines and Shapley values," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 19(5), pages 355-364, October.
  • Handle: RePEc:pal:jorapm:v:19:y:2020:i:5:d:10.1057_s41272-020-00236-4
    DOI: 10.1057/s41272-020-00236-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/s41272-020-00236-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1057/s41272-020-00236-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Harald J. van Heerde & Peter S. H. Leeflang & Dick R. Wittink, 2002. "How Promotions Work: Scan Pro-Based Evolutionary Model Building," Schmalenbach Business Review (sbr), LMU Munich School of Management, vol. 54(3), pages 198-220, July.
    2. Harry Haupt & Kathrin Kagerer & Winfried J. Steiner, 2014. "Smooth Quantile‐Based Modeling Of Brand Sales, Price And Promotional Effects From Retail Scanner Panels," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(6), pages 1007-1028, September.
    3. Bradlow, Eric T. & Gangwar, Manish & Kopalle, Praveen & Voleti, Sudhir, 2017. "The Role of Big Data and Predictive Analytics in Retailing," Journal of Retailing, Elsevier, vol. 93(1), pages 79-95.
    4. Evgeniy M. Ozhegov & Daria Teterina, 2018. "The Ensemble Method For Censored Demand Prediction," HSE Working papers WP BRP 200/EC/2018, National Research University Higher School of Economics.
    5. Kusum L. Ailawadi & Bari A. Harlam & Jacques César & David Trounce, 2007. "Practice Prize Report—Quantifying and Improving Promotion Effectiveness at CVS," Marketing Science, INFORMS, vol. 26(4), pages 566-575, 07-08.
    6. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    7. Peter S.H. Leeflang & Harald J. van Heerde & Dick Wittink, 2002. "How Promotions Work: SCAN*PRO-Based Evolutionary Model Building," Yale School of Management Working Papers ysm292, Yale School of Management.
    8. Friedman, Jerome H. & Hastie, Trevor & Tibshirani, Rob, 2010. "Regularization Paths for Generalized Linear Models via Coordinate Descent," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 33(i01).
    9. Harald J. van Heerde & Peter S. H. Leeflang & Dick R. Wittink, 2004. "Decomposing the Sales Promotion Bump with Store Data," Marketing Science, INFORMS, vol. 23(3), pages 317-334, December.
    10. Patrick Bajari & Denis Nekipelov & Stephen P. Ryan & Miaoyu Yang, 2015. "Machine Learning Methods for Demand Estimation," American Economic Review, American Economic Association, vol. 105(5), pages 481-485, May.
    11. Ma, Shaohui & Fildes, Robert, 2017. "A retail store SKU promotions optimization model for category multi-period profit maximization," European Journal of Operational Research, Elsevier, vol. 260(2), pages 680-692.
    12. Ma, Shaohui & Fildes, Robert & Huang, Tao, 2016. "Demand forecasting with high dimensional data: The case of SKU retail sales forecasting with intra- and inter-category promotional information," European Journal of Operational Research, Elsevier, vol. 249(1), pages 245-257.
    13. Andrews, Rick L. & Currim, Imran S. & Leeflang, Peter & Lim, Jooseop, 2008. "Estimating the SCAN⁎PRO model of store sales: HB, FM or just OLS?," International Journal of Research in Marketing, Elsevier, vol. 25(1), pages 22-33.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Fildes, Robert & Kolassa, Stephan & Ma, Shaohui, 2022. "Post-script—Retail forecasting: Research and practice," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1319-1324.
    2. Keun Hee Lee & Mali Abdollahian & Sergei Schreider & Sona Taheri, 2023. "Supply Chain Demand Forecasting and Price Optimisation Models with Substitution Effect," Mathematics, MDPI, vol. 11(11), pages 1-28, May.
    3. Christopher K. Wikle & Abhirup Datta & Bhava Vyasa Hari & Edward L. Boone & Indranil Sahoo & Indulekha Kavila & Stefano Castruccio & Susan J. Simmons & Wesley S. Burr & Won Chang, 2023. "An illustration of model agnostic explainability methods applied to environmental data," Environmetrics, John Wiley & Sons, Ltd., vol. 34(1), February.
    4. Aras, Serkan & Hanifi Van, M., 2022. "An interpretable forecasting framework for energy consumption and CO2 emissions," Applied Energy, Elsevier, vol. 328(C).
    5. Kolassa, Stephan, 2022. "Commentary on the M5 forecasting competition," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1562-1568.
    6. Mitrentsis, Georgios & Lens, Hendrik, 2022. "An interpretable probabilistic model for short-term solar power forecasting using natural gradient boosting," Applied Energy, Elsevier, vol. 309(C).
    7. Dennis W. Campbell & Ruidi Shang, 2022. "Tone at the Bottom: Measuring Corporate Misconduct Risk from the Text of Employee Reviews," Management Science, INFORMS, vol. 68(9), pages 7034-7053, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    2. Ma, Shaohui & Fildes, Robert, 2017. "A retail store SKU promotions optimization model for category multi-period profit maximization," European Journal of Operational Research, Elsevier, vol. 260(2), pages 680-692.
    3. Gür Ali, Özden & Gürlek, Ragıp, 2020. "Automatic Interpretable Retail forecasting with promotional scenarios," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1389-1406.
    4. Philipp Aschersleben & Winfried J. Steiner, 2022. "A semiparametric approach to estimating reference price effects in sales response models," Journal of Business Economics, Springer, vol. 92(4), pages 591-643, May.
    5. Naragain Phumchusri & Warot Kosawanitchakarn & Sirawich Chawanapranee & Sirawish Srimook, 2023. "Evaluating promotional pricing effectiveness using convenience store daily sales data," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 22(5), pages 362-373, October.
    6. Huber, Jakob & Stuckenschmidt, Heiner, 2020. "Daily retail demand forecasting using machine learning with emphasis on calendric special days," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1420-1438.
    7. Weber, Anett & Steiner, Winfried J., 2021. "Modeling price response from retail sales: An empirical comparison of models with different representations of heterogeneity," European Journal of Operational Research, Elsevier, vol. 294(3), pages 843-859.
    8. Fildes, Robert & Ma, Shaohui & Kolassa, Stephan, 2022. "Retail forecasting: Research and practice," International Journal of Forecasting, Elsevier, vol. 38(4), pages 1283-1318.
    9. Hewage, Harsha Chamara & Perera, H. Niles & De Baets, Shari, 2022. "Forecast adjustments during post-promotional periods," European Journal of Operational Research, Elsevier, vol. 300(2), pages 461-472.
    10. Lang, Stefan & Steiner, Winfried J. & Weber, Anett & Wechselberger, Peter, 2015. "Accommodating heterogeneity and nonlinearity in price effects for predicting brand sales and profits," European Journal of Operational Research, Elsevier, vol. 246(1), pages 232-241.
    11. Leeflang, Peter, 2011. "Paving the way for “distinguished marketing”," International Journal of Research in Marketing, Elsevier, vol. 28(2), pages 76-88.
    12. Cui, Hailong & Rajagopalan, Sampath & Ward, Amy R., 2020. "Predicting product return volume using machine learning methods," European Journal of Operational Research, Elsevier, vol. 281(3), pages 612-627.
    13. Fildes, Robert & Ma, Shaohui & Kolassa, Stephan, 2019. "Retail forecasting: research and practice," MPRA Paper 89356, University Library of Munich, Germany.
    14. Parreño-Selva, Josefa & Mas-Ruiz, Francisco J. & Ruiz-Conde, Enar, 2017. "The effects of price promotion on relative virtue and vice food products," International Food and Agribusiness Management Review, International Food and Agribusiness Management Association, vol. 20(5).
    15. Sagaert, Yves R. & Aghezzaf, El-Houssaine & Kourentzes, Nikolaos & Desmet, Bram, 2018. "Tactical sales forecasting using a very large set of macroeconomic indicators," European Journal of Operational Research, Elsevier, vol. 264(2), pages 558-569.
    16. Anett Weber & Winfried J. Steiner & Stefan Lang, 2017. "A comparison of semiparametric and heterogeneous store sales models for optimal category pricing," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 39(2), pages 403-445, March.
    17. Amin, Modhurima Dey & Badruddoza, Syed & McCluskey, Jill J., 2021. "Predicting access to healthful food retailers with machine learning," Food Policy, Elsevier, vol. 99(C).
    18. Guhl, Daniel & Baumgartner, Bernhard & Kneib, Thomas & Steiner, Winfried J., 2018. "Estimating time-varying parameters in brand choice models: A semiparametric approach," International Journal of Research in Marketing, Elsevier, vol. 35(3), pages 394-414.
    19. Ma, Shaohui & Fildes, Robert, 2021. "Retail sales forecasting with meta-learning," European Journal of Operational Research, Elsevier, vol. 288(1), pages 111-128.
    20. Kurt A. Jetta & Erick W. Rengifo, 2009. "Improved Baseline Sales," Fordham Economics Discussion Paper Series dp2009-02, Fordham University, Department of Economics.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:jorapm:v:19:y:2020:i:5:d:10.1057_s41272-020-00236-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.palgrave.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.