Printed from https://ideas.repec.org/p/nbr/nberwo/25132.html

Matrix Completion Methods for Causal Panel Data Models

Author

Listed:
  • Susan Athey
  • Mohsen Bayati
  • Nikolay Doudchenko
  • Guido Imbens
  • Khashayar Khosravi

Abstract

In this paper we study methods for estimating causal effects in settings with panel data, where a subset of units are exposed to a treatment during a subset of periods, and the goal is estimating counterfactual (untreated) outcomes for the treated unit/period combinations. We develop a class of matrix completion estimators that uses the observed elements of the matrix of control outcomes corresponding to untreated unit/periods to predict the “missing” elements of the matrix, corresponding to treated units/periods. The approach estimates a matrix that well-approximates the original (incomplete) matrix, but has lower complexity according to the nuclear norm for matrices. From a technical perspective, we generalize results from the matrix completion literature by allowing the patterns of missing data to have a time series dependency structure. We also present novel insights concerning the connections between the matrix completion literature, the literature on interactive fixed effects models and the literatures on program evaluation under unconfoundedness and synthetic control methods.
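The idea in the abstract, using the observed control outcomes to fill in the treated unit/period entries while penalizing the nuclear norm, can be illustrated with a minimal sketch. This is not the authors' implementation: it uses a generic iterative singular-value soft-thresholding scheme, omits any fixed effects or covariates, and the function name `soft_impute`, the penalty `lam`, and the simulated data are all assumptions made for illustration.

```python
import numpy as np


def soft_impute(Y, observed, lam, n_iter=500, tol=1e-8):
    """Nuclear-norm-penalized completion of Y (hypothetical helper).

    Y        : (N, T) outcome matrix; values at unobserved entries are ignored.
    observed : (N, T) boolean mask, True for untreated (control) unit/periods.
    lam      : penalty on the singular values (assumed tuning parameter).
    """
    L = np.zeros_like(Y, dtype=float)
    for _ in range(n_iter):
        # Keep observed outcomes; fill "missing" entries with the current estimate.
        filled = np.where(observed, Y, L)
        U, s, Vt = np.linalg.svd(filled, full_matrices=False)
        # Soft-threshold the singular values, which shrinks the nuclear norm.
        s_shrunk = np.maximum(s - lam, 0.0)
        L_new = (U * s_shrunk) @ Vt
        converged = np.linalg.norm(L_new - L) <= tol * max(1.0, np.linalg.norm(L))
        L = L_new
        if converged:
            break
    return L


# Simulated panel (illustrative only): 30 units, 20 periods, rank-2 signal plus noise.
rng = np.random.default_rng(0)
N, T, rank = 30, 20, 2
truth = rng.normal(size=(N, rank)) @ rng.normal(size=(rank, T))
Y = truth + 0.1 * rng.normal(size=(N, T))

# The first 5 units are treated in the last 5 periods, so those control outcomes are missing.
observed = np.ones((N, T), dtype=bool)
observed[:5, 15:] = False

L_hat = soft_impute(Y, observed, lam=1.0)

# Compare imputed counterfactuals with the simulated truth on the treated block.
rmse = np.sqrt(np.mean((L_hat[~observed] - truth[~observed]) ** 2))
baseline = np.sqrt(np.mean(truth[~observed] ** 2))  # error from imputing zeros
```

In an application, the estimated treatment effect for a treated unit/period would be the observed treated outcome minus the corresponding entry of `L_hat`, and `lam` would typically be chosen by cross-validation on held-out control entries.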

Suggested Citation

  • Susan Athey & Mohsen Bayati & Nikolay Doudchenko & Guido Imbens & Khashayar Khosravi, 2018. "Matrix Completion Methods for Causal Panel Data Models," NBER Working Papers 25132, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:25132
    Note: IO LS TWP

    Download full text from publisher

    File URL: http://www.nber.org/papers/w25132.pdf
    Download Restriction: no

    References listed on IDEAS

    1. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    2. Arellano, Manuel & Honore, Bo, 2001. "Panel data models: some recent developments," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 5, chapter 53, pages 3229-3296, Elsevier.
    3. Alberto Abadie & Javier Gardeazabal, 2003. "The Economic Costs of Conflict: A Case Study of the Basque Country," American Economic Review, American Economic Association, vol. 93(1), pages 113-132, March.
    4. Nikolay Doudchenko & Guido W. Imbens, 2016. "Balancing, Regression, Difference-In-Differences and Synthetic Control Methods: A Synthesis," NBER Working Papers 22791, National Bureau of Economic Research, Inc.
    5. Xu, Yiqing, 2017. "Generalized Synthetic Control Method: Causal Inference with Interactive Fixed Effects Models," Political Analysis, Cambridge University Press, vol. 25(1), pages 57-76, January.
    6. repec:hal:pseose:halshs-00849071 is not listed on IDEAS
    7. M. Hashem Pesaran, 2006. "Estimation and Inference in Large Heterogeneous Panels with a Multifactor Error Structure," Econometrica, Econometric Society, vol. 74(4), pages 967-1012, July.
    8. Laurent Gobillon & Thierry Magnac, 2016. "Regional Policy Evaluation: Interactive Fixed Effects and Synthetic Controls," The Review of Economics and Statistics, MIT Press, vol. 98(3), pages 535-551, July.
    9. Athey, Susan & Imbens, Guido W. & Wager, Stefan, 2016. "Efficient Inference of Average Treatment Effects in High Dimensions via Approximate Residual Balancing," Research Papers 3408, Stanford University, Graduate School of Business.
    10. Chamberlain, Gary & Rothschild, Michael, 1983. "Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets," Econometrica, Econometric Society, vol. 51(5), pages 1281-1304, September.
    11. Hyungsik Roger Moon & Martin Weidner, 2015. "Linear Regression for Panel With Unknown Number of Factors as Interactive Fixed Effects," Econometrica, Econometric Society, vol. 83(4), pages 1543-1579, July.
    12. Meyer, Bruce D & Viscusi, W Kip & Durbin, David L, 1995. "Workers' Compensation and Injury Duration: Evidence from a Natural Experiment," American Economic Review, American Economic Association, vol. 85(3), pages 322-340, June.
    13. Ferman, Bruno & Pinto, Cristine Campos de Xavier, 2016. "Revisiting the synthetic control estimator," Textos para discussão 421, FGV EESP - Escola de Economia de São Paulo, Fundação Getulio Vargas (Brazil).
    14. Marianne Bertrand & Esther Duflo & Sendhil Mullainathan, 2004. "How Much Should We Trust Differences-In-Differences Estimates?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 119(1), pages 249-275.
    15. Jushan Bai, 2009. "Panel Data Models With Interactive Fixed Effects," Econometrica, Econometric Society, vol. 77(4), pages 1229-1279, July.
    16. Dukpa Kim & Tatsushi Oka, 2014. "Divorce Law Reforms And Divorce Rates In The Usa: An Interactive Fixed‐Effects Approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(2), pages 231-245, March.
    17. Abadie, Alberto & Diamond, Alexis & Hainmueller, Jens, 2010. "Synthetic Control Methods for Comparative Case Studies: Estimating the Effect of California’s Tobacco Control Program," Journal of the American Statistical Association, American Statistical Association, vol. 105(490), pages 493-505.
    18. Jushan Bai & Serena Ng, 2017. "Principal Components and Regularized Estimation of Factor Models," Papers 1708.08137, arXiv.org, revised Nov 2017.
    19. Athey, Susan & Imbens, Guido W., 2022. "Design-based analysis in Difference-In-Differences settings with staggered adoption," Journal of Econometrics, Elsevier, vol. 226(1), pages 62-79.
    20. Jeffrey M Wooldridge, 2010. "Econometric Analysis of Cross Section and Panel Data," MIT Press Books, The MIT Press, edition 2, volume 1, number 0262232588, December.
    21. Susan Athey & Scott Stern, 2002. "The Impact of Information Technology on Emergency Health Care Outcomes," RAND Journal of Economics, The RAND Corporation, vol. 33(3), pages 399-432, Autumn.
    22. Moon, Hyungsik Roger & Weidner, Martin, 2017. "Dynamic Linear Panel Regression Models With Interactive Fixed Effects," Econometric Theory, Cambridge University Press, vol. 33(1), pages 158-195, February.
    23. Alberto Abadie & Alexis Diamond & Jens Hainmueller, 2015. "Comparative Politics and the Synthetic Control Method," American Journal of Political Science, John Wiley & Sons, vol. 59(2), pages 495-510, February.
    24. Muhammad Jehangir Amjad & Devavrat Shah & Dennis Shen, 2017. "Robust Synthetic Control," Papers 1711.06940, arXiv.org.
    25. Cheng Hsiao & H. Steve Ching & Shui Ki Wan, 2012. "A Panel Data Approach For Program Evaluation: Measuring The Benefits Of Political And Economic Integration Of Hong Kong With Mainland China," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 27(5), pages 705-740, August.
    26. James J. Heckman & Vytlacil, Edward J., 2007. "Econometric Evaluation of Social Programs, Part I: Causal Models, Structural Models and Econometric Policy Evaluation," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 70, Elsevier.
    27. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    28. Guido W. Imbens, 2004. "Nonparametric Estimation of Average Treatment Effects Under Exogeneity: A Review," The Review of Economics and Statistics, MIT Press, vol. 86(1), pages 4-29, February.
    29. Goldberger, Arthur S, 1972. "Structural Equation Methods in the Social Sciences," Econometrica, Econometric Society, vol. 40(6), pages 979-1001, November.
    30. Imbens,Guido W. & Rubin,Donald B., 2015. "Causal Inference for Statistics, Social, and Biomedical Sciences," Cambridge Books, Cambridge University Press, number 9780521885881.
    31. Lee, Nayoung & Moon, Hyungsik Roger & Zhou, Qiankun, 2017. "Many IVs estimation of dynamic panel regression models with measurement error," Journal of Econometrics, Elsevier, vol. 200(2), pages 251-259.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dmitry Arkhangelsky & Guido Imbens, 2023. "Causal Models for Longitudinal and Panel Data: A Survey," Papers 2311.15458, arXiv.org, revised Mar 2024.
    2. Bruno Ferman & Cristine Pinto, 2021. "Synthetic controls with imperfect pretreatment fit," Quantitative Economics, Econometric Society, vol. 12(4), pages 1197-1221, November.
    3. Victor Chernozhukov & Kaspar Wüthrich & Yinchu Zhu, 2021. "An Exact and Robust Conformal Inference Method for Counterfactual and Synthetic Controls," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(536), pages 1849-1864, October.
    4. Victor Chernozhukov & Kaspar Wüthrich & Yinchu Zhu, 2019. "Inference on average treatment effects in aggregate panel data settings," CeMMAP working papers CWP32/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    5. Victor Chernozhukov & Kaspar Wuthrich & Yinchu Zhu, 2018. "A $t$-test for synthetic controls," Papers 1812.10820, arXiv.org, revised Jan 2024.
    6. Laurent Gobillon & Thierry Magnac, 2016. "Regional Policy Evaluation: Interactive Fixed Effects and Synthetic Controls," The Review of Economics and Statistics, MIT Press, vol. 98(3), pages 535-551, July.
    7. repec:hal:pseose:halshs-00849071 is not listed on IDEAS
    8. Keegan Harris & Anish Agarwal & Chara Podimata & Zhiwei Steven Wu, 2022. "Strategyproof Decision-Making in Panel Data Settings and Beyond," Papers 2211.14236, arXiv.org, revised Dec 2023.
    9. Callaway, Brantly & Karami, Sonia, 2023. "Treatment effects in interactive fixed effects models with a small number of time periods," Journal of Econometrics, Elsevier, vol. 233(1), pages 184-208.
    10. Anish Agarwal & Vasilis Syrgkanis, 2022. "Synthetic Blip Effects: Generalizing Synthetic Controls for the Dynamic Treatment Regime," Papers 2210.11003, arXiv.org.
    11. Li, Xingyu & Shen, Yan & Zhou, Qiankun, 2024. "Confidence intervals of treatment effects in panel data models with interactive fixed effects," Journal of Econometrics, Elsevier, vol. 240(1).
    12. Peter Backus & Thien Nguyen, 2021. "The Effect of the Sex Buyer Law on the Market for Sex, Sexual Health and Sexual Violence," Economics Discussion Paper Series 2106, Economics, The University of Manchester.
    13. Bai, Jushan & Wang, Peng, 2024. "Causal inference using factor models," MPRA Paper 120585, University Library of Munich, Germany.
    14. Dmitry Arkhangelsky & Guido Imbens, 2018. "Fixed Effects and the Generalized Mundlak Estimator," Papers 1807.02099, arXiv.org, revised Aug 2023.
    15. Ruofan Xu & Jiti Gao & Tatsushi Oka & Yoon-Jae Whang, 2022. "Estimation of Heterogeneous Treatment Effects Using Quantile Regression with Interactive Fixed Effects," Monash Econometrics and Business Statistics Working Papers 13/22, Monash University, Department of Econometrics and Business Statistics.
    16. Michał Marcin Kobierecki & Michał Pierzgalski, 2022. "Sports Mega-Events and Economic Growth: A Synthetic Control Approach," Journal of Sports Economics, , vol. 23(5), pages 567-597, June.
    17. Wei Shi & Lung-fei Lee, 2018. "The effects of gun control on crimes: a spatial interactive fixed effects approach," Empirical Economics, Springer, vol. 55(1), pages 233-263, August.
    18. Dmitry Arkhangelsky & Susan Athey & David A. Hirshberg & Guido W. Imbens & Stefan Wager, 2021. "Synthetic Difference-in-Differences," American Economic Review, American Economic Association, vol. 111(12), pages 4088-4118, December.
    19. Guido W. Imbens & Davide Viviano, 2023. "Identification and Inference for Synthetic Controls with Confounding," Papers 2312.00955, arXiv.org.
    20. Yinchu Zhu, 2019. "How well can we learn large factor models without assuming strong factors?," Papers 1910.10382, arXiv.org, revised Nov 2019.
    21. Viviano, Davide & Bradic, Jelena, 2023. "Synthetic Learner: Model-free inference on treatments over time," Journal of Econometrics, Elsevier, vol. 234(2), pages 691-713.

    More about this item

    JEL classification:

    • C01 - Mathematical and Quantitative Methods - - General - - - Econometrics
    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models
    • C23 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Models with Panel Data; Spatio-temporal Models
