IDEAS home Printed from https://ideas.repec.org/a/taf/jnlbes/v34y2016i4p661-672.html
   My bibliography  Save this article

A Nonparametric Bayesian Analysis of Heterogenous Treatment Effects in Digital Experimentation

Author

Listed:
  • Matt Taddy
  • Matt Gardner
  • Liyun Chen
  • David Draper

Abstract

Randomized controlled trials play an important role in how Internet companies predict the impact of policy decisions and product changes. In these “digital experiments,” different units (people, devices, products) respond differently to the treatment. This article presents a fast and scalable Bayesian nonparametric analysis of such heterogenous treatment effects and their measurement in relation to observable covariates. New results and algorithms are provided for quantifying the uncertainty associated with treatment effect measurement via both linear projections and nonlinear regression trees (CART and random forests). For linear projections, our inference strategy leads to results that are mostly in agreement with those from the frequentist literature. We find that linear regression adjustment of treatment effect averages (i.e., post-stratification) can provide some variance reduction, but that this reduction will be vanishingly small in the low-signal and large-sample setting of digital experiments. For regression trees, we provide uncertainty quantification for the machine learning algorithms that are commonly applied in tree-fitting. We argue that practitioners should look to ensembles of trees (forests) rather than individual trees in their analysis. The ideas are applied on and illustrated through an example experiment involving 21 million unique users of EBay.com.

Suggested Citation

  • Matt Taddy & Matt Gardner & Liyun Chen & David Draper, 2016. "A Nonparametric Bayesian Analysis of Heterogenous Treatment Effects in Digital Experimentation," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(4), pages 661-672, October.
  • Handle: RePEc:taf:jnlbes:v:34:y:2016:i:4:p:661-672
    DOI: 10.1080/07350015.2016.1172013
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/07350015.2016.1172013
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/07350015.2016.1172013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Chamberlain, Gary & Imbens, Guido W, 2003. "Nonparametric Applications of Bayesian Inference," Journal of Business & Economic Statistics, American Statistical Association, vol. 21(1), pages 12-18, January.
    2. Richard Berk & Emil Pitkin & Lawrence Brown & Andreas Buja & Edward George & Linda Zhao, 2013. "Covariance Adjustments for the Analysis of Randomized Field Experiments," Evaluation Review, , vol. 37(3-4), pages 170-196, June.
    3. Donald B. Rubin, 2005. "Causal Inference Using Potential Outcomes: Design, Modeling, Decisions," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 322-331, March.
    4. Guido W. Imbens, 2004. "Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review," The Review of Economics and Statistics, MIT Press, vol. 86(1), pages 4-29, February.
    5. Taddy, Matthew A. & Kottas, Athanasios, 2010. "A Bayesian Nonparametric Approach to Inference for Quantile Regression," Journal of Business & Economic Statistics, American Statistical Association, vol. 28(3), pages 357-369.
    6. Dale J. Poirier, 2011. "Bayesian Interpretations of Heteroskedastic Consistent Covariance Estimators Using the Informed Bayesian Bootstrap," Econometric Reviews, Taylor & Francis Journals, vol. 30(4), pages 457-468, August.
    7. Imbens,Guido W. & Rubin,Donald B., 2015. "Causal Inference for Statistics, Social, and Biomedical Sciences," Cambridge Books, Cambridge University Press, number 9780521885881.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lechner, Michael, 2018. "Modified Causal Forests for Estimating Heterogeneous Causal Effects," IZA Discussion Papers 12040, Institute of Labor Economics (IZA).
    2. Davide Viviano & Jelena Bradic, 2019. "Synthetic learner: model-free inference on treatments over time," Papers 1904.01490, arXiv.org, revised Aug 2022.
    3. Michael C Knaus & Michael Lechner & Anthony Strittmatter, 2021. "Machine learning estimation of heterogeneous causal effects: Empirical Monte Carlo evidence," The Econometrics Journal, Royal Economic Society, vol. 24(1), pages 134-161.
    4. Philippe Goulet Coulombe, 2020. "The Macroeconomy as a Random Forest," Papers 2006.12724, arXiv.org, revised Mar 2021.
    5. Miller, Steve, 2020. "Causal forest estimation of heterogeneous and time-varying environmental policy effects," Journal of Environmental Economics and Management, Elsevier, vol. 103(C).
    6. Seungwoo Chin & Matthew E. Kahn & Hyungsik Roger Moon, 2020. "Estimating the Gains from New Rail Transit Investment: A Machine Learning Tree Approach," Real Estate Economics, American Real Estate and Urban Economics Association, vol. 48(3), pages 886-914, September.
    7. Qingyuan Zhao & Dylan S. Small & Ashkan Ertefaie, 2022. "Selective inference for effect modification via the lasso," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(2), pages 382-413, April.
    8. Philippe Goulet Coulombe, 2021. "The Macroeconomy as a Random Forest," Working Papers 21-05, Chair in macroeconomics and forecasting, University of Quebec in Montreal's School of Management.
    9. Jaeho Kim & Le Wang, 2019. "Hidden group patterns in democracy developments: Bayesian inference for grouped heterogeneity," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 34(6), pages 1016-1028, September.
    10. Shi, Chengchun & Luo, Shikai & Zhu, Hongtu & Song, Rui, 2021. "An online sequential test for qualitative treatment effects," LSE Research Online Documents on Economics 112521, London School of Economics and Political Science, LSE Library.
    11. Anthony Strittmatter, 2018. "What Is the Value Added by Using Causal Machine Learning Methods in a Welfare Experiment Evaluation?," Papers 1812.06533, arXiv.org, revised Dec 2021.
    12. Gubela, Robin M. & Lessmann, Stefan & Jaroszewicz, Szymon, 2020. "Response transformation and profit decomposition for revenue uplift modeling," European Journal of Operational Research, Elsevier, vol. 283(2), pages 647-661.
    13. Lai Xinglin, 2021. "Modelling hetegeneous treatment effects by quantitle local polynomial decision tree and forest," Papers 2111.15320, arXiv.org, revised Mar 2022.
    14. Huck, Nicolas, 2019. "Large data sets and machine learning: Applications to statistical arbitrage," European Journal of Operational Research, Elsevier, vol. 278(1), pages 330-342.
    15. Martin Cousineau & Vedat Verter & Susan A. Murphy & Joelle Pineau, 2022. "Estimating causal effects with optimization-based methods: A review and empirical comparison," Papers 2203.00097, arXiv.org.
    16. Michael Lechner & Jana Mareckova, 2022. "Modified Causal Forest," Papers 2209.03744, arXiv.org.
    17. Jean-Pierre Dubé & Sanjog Misra, 2017. "Personalized Pricing and Consumer Welfare," NBER Working Papers 23775, National Bureau of Economic Research, Inc.
    18. Guihua Wang & Jun Li & Wallace J. Hopp, 2022. "An Instrumental Variable Forest Approach for Detecting Heterogeneous Treatment Effects in Observational Studies," Management Science, INFORMS, vol. 68(5), pages 3399-3418, May.
    19. Maria Nareklishvili & Nicholas Polson & Vadim Sokolov, 2022. "Feature Selection for Personalized Policy Analysis," Papers 2301.00251, arXiv.org, revised Jul 2023.
    20. Gabriel Okasa, 2022. "Meta-Learners for Estimation of Causal Effects: Finite Sample Cross-Fit Performance," Papers 2201.12692, arXiv.org.
    21. Matthew Gentzkow & Bryan T. Kelly & Matt Taddy, 2017. "Text as Data," NBER Working Papers 23276, National Bureau of Economic Research, Inc.
    22. Strittmatter, Anthony, 2023. "What is the value added by using causal machine learning methods in a welfare experiment evaluation?," Labour Economics, Elsevier, vol. 84(C).
    23. Cousineau, Martin & Verter, Vedat & Murphy, Susan A. & Pineau, Joelle, 2023. "Estimating causal effects with optimization-based methods: A review and empirical comparison," European Journal of Operational Research, Elsevier, vol. 304(2), pages 367-380.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Tenglong Li & Kenneth A. Frank & Mingming Chen, 2024. "A Conceptual Framework for Quantifying the Robustness of a Regression-Based Causal Inference in Observational Study," Mathematics, MDPI, vol. 12(3), pages 1-14, January.
    2. Tang, Chuan & Heintzelman, Martin D. & Holsen, Thomas M., 2018. "Mercury pollution, information, and property values," Journal of Environmental Economics and Management, Elsevier, vol. 92(C), pages 418-432.
    3. Tenglong Li & Kenneth A. Frank, 2020. "The probability of a robust inference for internal validity and its applications in regression models," Papers 2005.12784, arXiv.org.
    4. Hang Miao & Kui Zhao & Zhun Wang & Linbo Jiang & Quanhui Jia & Yanming Fang & Quan Yu, 2020. "Intelligent Credit Limit Management in Consumer Loans Based on Causal Inference," Papers 2007.05188, arXiv.org.
    5. Peter Z. Schochet, "undated". "Statistical Theory for the RCT-YES Software: Design-Based Causal Inference for RCTs," Mathematica Policy Research Reports a0c005c003c242308a92c02dc, Mathematica Policy Research.
    6. Tenglong Li & Ken Frank, 2022. "The probability of a robust inference for internal validity," Sociological Methods & Research, , vol. 51(4), pages 1947-1968, November.
    7. Christoph Breunig & Ruixuan Liu & Zhengfei Yu, 2022. "Double Robust Bayesian Inference on Average Treatment Effects," Papers 2211.16298, arXiv.org, revised Feb 2024.
    8. Tenglong Li & Kenneth A. Frank, 2019. "On the probability of a causal inference is robust for internal validity," Papers 1906.08726, arXiv.org.
    9. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-dimensional econometrics and regularized GMM," CeMMAP working papers CWP35/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    10. Caloffi, Annalisa & Freo, Marzia & Ghinoi, Stefano & Mariani, Marco & Rossi, Federica, 2022. "Assessing the effects of a deliberate policy mix: The case of technology and innovation advisory services and innovation vouchers," Research Policy, Elsevier, vol. 51(6).
    11. Shanike J. Smart & Solomon W. Polachek, 2024. "COVID-19 vaccine and risk-taking," Journal of Risk and Uncertainty, Springer, vol. 68(1), pages 25-49, February.
    12. Florian Gunsilius & Yuliang Xu, 2021. "Matching for causal effects via multimarginal unbalanced optimal transport," Papers 2112.04398, arXiv.org, revised Jul 2022.
    13. Zhengyuan Zhou & Susan Athey & Stefan Wager, 2023. "Offline Multi-Action Policy Learning: Generalization and Optimization," Operations Research, INFORMS, vol. 71(1), pages 148-183, January.
    14. Plamen Nikolov & Hongjian Wang & Kevin Acker, 2020. "Wage premium of Communist Party membership: Evidence from China," Pacific Economic Review, Wiley Blackwell, vol. 25(3), pages 309-338, August.
    15. Daniel Burkhard & Christian P. R. Schmid & Kaspar Wüthrich, 2019. "Financial incentives and physician prescription behavior: Evidence from dispensing regulations," Health Economics, John Wiley & Sons, Ltd., vol. 28(9), pages 1114-1129, September.
    16. Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
    17. Özler, Berk & Çelik, Çiğdem & Cunningham, Scott & Cuevas, P. Facundo & Parisotto, Luca, 2021. "Children on the move: Progressive redistribution of humanitarian cash transfers among refugees," Journal of Development Economics, Elsevier, vol. 153(C).
    18. Guido W. Imbens, 2022. "Causality in Econometrics: Choice vs Chance," Econometrica, Econometric Society, vol. 90(6), pages 2541-2566, November.
    19. Lechner, Michael, 2008. "A note on endogenous control variables in causal studies," Statistics & Probability Letters, Elsevier, vol. 78(2), pages 190-195, February.
    20. Mark Kattenberg & Bas Scheer & Jurre Thiel, 2023. "Causal forests with fixed effects for treatment effect heterogeneity in difference-in-differences," CPB Discussion Paper 452, CPB Netherlands Bureau for Economic Policy Analysis.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlbes:v:34:y:2016:i:4:p:661-672. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UBES20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.