IDEAS home Printed from https://ideas.repec.org/p/nbr/nberwo/20955.html
   My bibliography  Save this paper

Demand Estimation with Machine Learning and Model Combination

Author

Listed:
  • Patrick Bajari
  • Denis Nekipelov
  • Stephen P. Ryan
  • Miaoyu Yang

Abstract

We survey and apply several techniques from the statistical and computer science literature to the problem of demand estimation. We derive novel asymptotic properties for several of these models. To improve out-of-sample prediction accuracy and obtain parametric rates of convergence, we propose a method of combining the underlying models via linear regression. Our method has several appealing features: it is robust to a large number of potentially-collinear regressors; it scales easily to very large data sets; the machine learning methods combine model selection and estimation; and the method can flexibly approximate arbitrary non-linear functions, even when the set of regressors is high dimensional and we also allow for fixed effects. We illustrate our method using a standard scanner panel data set to estimate promotional lift and find that our estimates are considerably more accurate in out of sample predictions of demand than some commonly used alternatives. While demand estimation is our motivating application, these methods are likely to be useful in other microeconometric problems.

Suggested Citation

  • Patrick Bajari & Denis Nekipelov & Stephen P. Ryan & Miaoyu Yang, 2015. "Demand Estimation with Machine Learning and Model Combination," NBER Working Papers 20955, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:20955
    Note: IO
    as

    Download full text from publisher

    File URL: http://www.nber.org/papers/w20955.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Nevo, Aviv, 2001. "Measuring Market Power in the Ready-to-Eat Cereal Industry," Econometrica, Econometric Society, vol. 69(2), pages 307-342, March.
    2. Alberto Abadie & Javier Gardeazabal, 2003. "The Economic Costs of Conflict: A Case Study of the Basque Country," American Economic Review, American Economic Association, vol. 93(1), pages 113-132, March.
    3. A. Belloni & D. Chen & V. Chernozhukov & C. Hansen, 2012. "Sparse Models and Methods for Optimal Instruments With an Application to Eminent Domain," Econometrica, Econometric Society, vol. 80(6), pages 2369-2429, November.
    4. A. Belloni & V. Chernozhukov & L. Wang, 2011. "Square-root lasso: pivotal recovery of sparse signals via conic programming," Biometrika, Biometrika Trust, vol. 98(4), pages 791-806.
    5. Matthew Gentzkow & Jesse M. Shapiro, 2010. "What Drives Media Slant? Evidence From U.S. Daily Newspapers," Econometrica, Econometric Society, vol. 78(1), pages 35-71, January.
    6. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    7. Komarova, Tatiana, 2013. "Binary choice models with discrete regressors: Identification and misspecification," Journal of Econometrics, Elsevier, vol. 177(1), pages 14-33.
    8. Berry, Steven & Levinsohn, James & Pakes, Ariel, 1995. "Automobile Prices in Market Equilibrium," Econometrica, Econometric Society, vol. 63(4), pages 841-890, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Krüger, Jens J. & Rhiel, Mathias, 2016. "Determinants of ICT infrastructure: A cross-country statistical analysis," Darmstadt Discussion Papers in Economics 228, Darmstadt University of Technology, Department of Law and Economics.
    2. Evgeniy M. Ozhegov & Alina Ozhegova, 2020. "Regression tree model for prediction of demand with heterogeneity and censorship," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(3), pages 489-500, April.
    3. Evgeniy M. Ozhegov & Daria Teterina, 2018. "The Ensemble Method For Censored Demand Prediction," HSE Working papers WP BRP 200/EC/2018, National Research University Higher School of Economics.
    4. Pierre Dodin & Jingyi Xiao & Yossiri Adulyasak & Neda Etebari Alamdari & Lea Gauthier & Philippe Grangier & Paul Lemaitre & William L. Hamilton, 2023. "Bombardier Aftermarket Demand Forecast with Machine Learning," Interfaces, INFORMS, vol. 53(6), pages 425-445, November.
    5. Erik Nelson & John Fitzgerald & Nathan Tefft, 2019. "The distributional impact of a green payment policy for organic fruit," PLOS ONE, Public Library of Science, vol. 14(2), pages 1-25, February.
    6. Adam N. Smith & Jim E. Griffin, 2023. "Shrinkage priors for high-dimensional demand estimation," Quantitative Marketing and Economics (QME), Springer, vol. 21(1), pages 95-146, March.
    7. Pédussel Wu, Jennifer & Metzger, Martina & Neira, Ignacio Silva & Farroukh, Arafet, 2023. "What determines demand for digital community currencies? OurVillage in Cameroon," IPE Working Papers 209/2023, Berlin School of Economics and Law, Institute for International Political Economy (IPE).
    8. Evgeniy M. Ozhegov & Alina Ozhegova, 2017. "Regression Tree Model for Analysis of Demand with Heterogeneity and Censorship," HSE Working papers WP BRP 174/EC/2017, National Research University Higher School of Economics.
    9. Green, Gareth & Richards, Timothy, 2016. "Interpreting Results of Demand Estimation from Machine Learning Models," 2016 Annual Meeting, July 31-August 2, Boston, Massachusetts 236147, Agricultural and Applied Economics Association.
    10. Adam N. Smith & Peter E. Rossi & Greg M. Allenby, 2019. "Inference for Product Competition and Separable Demand," Marketing Science, INFORMS, vol. 38(4), pages 690-710, July.
    11. Patrick Bajari & Denis Nekipelov & Stephen P. Ryan & Miaoyu Yang, 2015. "Machine Learning Methods for Demand Estimation," American Economic Review, American Economic Association, vol. 105(5), pages 481-485, May.
    12. Raval, Devesh & Rosenbaum, Ted & Wilson, Nathan E., 2021. "How do machine learning algorithms perform in predicting hospital choices? evidence from changing environments," Journal of Health Economics, Elsevier, vol. 78(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhu, Ying, 2013. "Sparse Linear Models and Two-Stage Estimation in High-Dimensional Settings with Possibly Many Endogenous Regressors," MPRA Paper 49846, University Library of Munich, Germany.
    2. A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
    3. Steven T. Berry & Philip A. Haile, 2014. "Identification in Differentiated Products Markets Using Market Level Data," Econometrica, Econometric Society, vol. 82, pages 1749-1797, September.
    4. J. Levin & L. Einav, 2012. "Empirical Industrial Organization: A Progress Report," Voprosy Ekonomiki, NP Voprosy Ekonomiki, issue 1.
    5. Amit Gandhi & Jean-François Houde, 2019. "Measuring Substitution Patterns in Differentiated-Products Industries," NBER Working Papers 26375, National Bureau of Economic Research, Inc.
    6. Matthew Backus & Christopher Conlon & Michael Sinkinson, 2021. "Common Ownership and Competition in the Ready-to-Eat Cereal Industry," NBER Working Papers 28350, National Bureau of Economic Research, Inc.
    7. Achim Ahrens & Christian B. Hansen & Mark E. Schaffer, 2020. "lassopack: Model selection and prediction with regularized regression in Stata," Stata Journal, StataCorp LP, vol. 20(1), pages 176-235, March.
    8. Ziv Bar-Nahum & Israel Finkelshtain & Rico Ihle & Ofir D. Rubin, 2020. "Effects of violent political conflict on the supply, demand and fragmentation of fresh food markets," Food Security: The Science, Sociology and Economics of Food Production and Access to Food, Springer;The International Society for Plant Pathology, vol. 12(3), pages 503-515, June.
    9. Domenico Giannone & Michele Lenza & Giorgio E. Primiceri, 2021. "Economic Predictions With Big Data: The Illusion of Sparsity," Econometrica, Econometric Society, vol. 89(5), pages 2409-2437, September.
    10. Byrne, David P. & Imai, Susumu & Jain, Neelam & Sarafidis, Vasilis, 2022. "Instrument-free identification and estimation of differentiated products models using cost data," Journal of Econometrics, Elsevier, vol. 228(2), pages 278-301.
    11. Bonnet, Céline & Richards, Timothy J., 2016. "Models of Consumer Demand for Differentiated Products," TSE Working Papers 16-741, Toulouse School of Economics (TSE).
    12. Viswanathan, Madhu & Mukherji, Prokriti & Narasimhan, Om & Chandy, Rajesh, 2021. "The performance impact of core component outsourcing: insights from the LCD TV industry," LSE Research Online Documents on Economics 109853, London School of Economics and Political Science, LSE Library.
    13. Liqian Cai & Arnab Bhattacharjee & Roger Calantone & Taps Maiti, 2019. "Variable Selection with Spatially Autoregressive Errors: A Generalized Moments LASSO Estimator," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 81(1), pages 146-200, September.
    14. Donna, Javier D. & Pereira, Pedro & Trindade, Andre & Yoshida, Renan C., 2020. "Direct-to-Consumer Sales by Manufacturers and Bargaining," MPRA Paper 105773, University Library of Munich, Germany.
    15. Kaplow, Louis & Shapiro, Carl, 2007. "Antitrust," Handbook of Law and Economics, in: A. Mitchell Polinsky & Steven Shavell (ed.), Handbook of Law and Economics, edition 1, volume 2, chapter 15, pages 1073-1225, Elsevier.
    16. Alexandre Belloni & Victor Chernozhukov & Kengo Kato, 2019. "Valid Post-Selection Inference in High-Dimensional Approximately Sparse Quantile Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(526), pages 749-758, April.
    17. Ho, Chun-Yu, 2012. "Market structure, welfare, and banking reform in China," Journal of Comparative Economics, Elsevier, vol. 40(2), pages 291-313.
    18. Tovar, Jorge, 2012. "Consumers’ Welfare and Trade Liberalization: Evidence from the Car Industry in Colombia," World Development, Elsevier, vol. 40(4), pages 808-820.
    19. Pereira, Pedro & Ribeiro, Tiago, 2011. "The impact on broadband access to the Internet of the dual ownership of telephone and cable networks," International Journal of Industrial Organization, Elsevier, vol. 29(2), pages 283-293, March.
    20. Allais, Olivier & Etilé, Fabrice & Lecocq, Sébastien, 2015. "Mandatory labels, taxes and market forces: An empirical evaluation of fat policies," Journal of Health Economics, Elsevier, vol. 43(C), pages 27-44.

    More about this item

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nbr:nberwo:20955. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/nberrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.