IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v234y2023i2p512-535.html
   My bibliography  Save this article

Forward-selected panel data approach for program evaluation

Author

Listed:
  • Shi, Zhentao
  • Huang, Jingyi

Abstract

Policy evaluation is central to economic data analysis, but economists mostly work with observational data in view of limited opportunities to carry out controlled experiments. In the potential outcome framework, the panel data approach (Hsiao et al., 2012) constructs the counterfactual by exploiting the correlation between cross-sectional units in panel data. The choice of cross-sectional control units, a key step in its implementation, is nevertheless unresolved in data-rich environments when many possible controls are at the researcher’s disposal. We propose the forward selection method to choose control units, and establish validity of the post-selection inference. Our asymptotic framework allows the number of possible controls to grow much faster than the time dimension. The easy-to-implement algorithms and their theoretical guarantee extend the panel data approach to big data settings.

Suggested Citation

  • Shi, Zhentao & Huang, Jingyi, 2023. "Forward-selected panel data approach for program evaluation," Journal of Econometrics, Elsevier, vol. 234(2), pages 512-535.
  • Handle: RePEc:eee:econom:v:234:y:2023:i:2:p:512-535
    DOI: 10.1016/j.jeconom.2021.04.009
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407621001536
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jeconom.2021.04.009?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Peter C. B. Phillips & Zhentao Shi, 2021. "Boosting: Why You Can Use The Hp Filter," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 62(2), pages 521-570, May.
    2. Zou, Hui, 2006. "The Adaptive Lasso and Its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1418-1429, December.
    3. Domenico Giannone & Michele Lenza & Giorgio E. Primiceri, 2021. "Economic Predictions With Big Data: The Illusion of Sparsity," Econometrica, Econometric Society, vol. 89(5), pages 2409-2437, September.
    4. Jiahua Chen & Zehua Chen, 2008. "Extended Bayesian information criteria for model selection with large model spaces," Biometrika, Biometrika Trust, vol. 95(3), pages 759-771.
    5. Davidson, James, 1994. "Stochastic Limit Theory: An Introduction for Econometricians," OUP Catalogue, Oxford University Press, number 9780198774037, Decembrie.
    6. Koo, Bonsoo & Anderson, Heather M. & Seo, Myung Hwan & Yao, Wenying, 2020. "High-dimensional predictive regression in the presence of cointegration," Journal of Econometrics, Elsevier, vol. 219(2), pages 456-477.
    7. Damian Kozbur, 2017. "Sharp convergence rates for forward regression in high-dimensional sparse linear models," ECON - Working Papers 253, Department of Economics - University of Zurich, revised Apr 2018.
    8. Andrews, Donald W K, 1991. "Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation," Econometrica, Econometric Society, vol. 59(3), pages 817-858, May.
    9. Lan, Xiaohuan & Li, Wei, 2018. "Swiss watch cycles: Evidence of corruption during leadership transition in China," Journal of Comparative Economics, Elsevier, vol. 46(4), pages 1234-1252.
    10. Alberto Abadie & Javier Gardeazabal, 2003. "The Economic Costs of Conflict: A Case Study of the Basque Country," American Economic Review, American Economic Association, vol. 93(1), pages 113-132, March.
    11. Leeb, Hannes & Pötscher, Benedikt M., 2005. "Model Selection And Inference: Facts And Fiction," Econometric Theory, Cambridge University Press, vol. 21(1), pages 21-59, February.
    12. Ouyang, Min & Peng, Yulei, 2015. "The treatment-effect estimation: A case study of the 2008 economic stimulus package of China," Journal of Econometrics, Elsevier, vol. 188(2), pages 545-557.
    13. Stefan Wager & Susan Athey, 2018. "Estimation and Inference of Heterogeneous Treatment Effects using Random Forests," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(523), pages 1228-1242, July.
    14. A. Belloni & D. Chen & V. Chernozhukov & C. Hansen, 2012. "Sparse Models and Methods for Optimal Instruments With an Application to Eminent Domain," Econometrica, Econometric Society, vol. 80(6), pages 2369-2429, November.
    15. Cheng Hsiao & H. Steve Ching & Shui Ki Wan, 2012. "A Panel Data Approach For Program Evaluation: Measuring The Benefits Of Political And Economic Integration Of Hong Kong With Mainland China," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 27(5), pages 705-740, August.
    16. Duflo, Esther & Glennerster, Rachel & Kremer, Michael, 2008. "Using Randomization in Development Economics Research: A Toolkit," Handbook of Development Economics, in: T. Paul Schultz & John A. Strauss (ed.), Handbook of Development Economics, edition 1, volume 4, chapter 61, pages 3895-3962, Elsevier.
    17. Damian Kozbur, 2017. "Testing-Based Forward Model Selection," American Economic Review, American Economic Association, vol. 107(5), pages 266-269, May.
    18. Carvalho, Carlos & Masini, Ricardo & Medeiros, Marcelo C., 2018. "ArCo: An artificial counterfactual approach for high-dimensional panel time-series data," Journal of Econometrics, Elsevier, vol. 207(2), pages 352-380.
    19. Bai, ChongEn & Li, Qi & Ouyang, Min, 2014. "Property taxes and home prices: A tale of two cities," Journal of Econometrics, Elsevier, vol. 180(1), pages 1-15.
    20. Christian Hansen & Damian Kozbur & Sanjog Misra, 2016. "Targeted undersmoothing," ECON - Working Papers 282, Department of Economics - University of Zurich, revised Apr 2018.
    21. Abhijit V. Banerjee & Esther Duflo, 2009. "The Experimental Approach to Development Economics," Annual Review of Economics, Annual Reviews, vol. 1(1), pages 151-178, May.
    22. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    23. Jushan Bai, 2003. "Inferential Theory for Factor Models of Large Dimensions," Econometrica, Econometric Society, vol. 71(1), pages 135-171, January.
    24. Hansheng Wang & Bo Li & Chenlei Leng, 2009. "Shrinkage tuning parameter selection with a diverging number of parameters," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 71(3), pages 671-683, June.
    25. Tingyou Zhou & Liping Zhu & Chen Xu & Runze Li, 2020. "Model-Free Forward Screening Via Cumulative Divergence," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(531), pages 1393-1405, July.
    26. Du, Zaichao & Zhang, Lin, 2015. "Home-purchase restriction, property tax and housing price in China: A counterfactual analysis," Journal of Econometrics, Elsevier, vol. 188(2), pages 558-568.
    27. Kock, Anders Bredahl & Callot, Laurent, 2015. "Oracle inequalities for high dimensional vector autoregressions," Journal of Econometrics, Elsevier, vol. 186(2), pages 325-344.
    28. A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
    29. Li, Kathleen T. & Bell, David R., 2017. "Estimation of average treatment effects with panel data: Asymptotic theory and implementation," Journal of Econometrics, Elsevier, vol. 197(1), pages 65-75.
    30. Newey, Whitney & West, Kenneth, 2014. "A simple, positive semi-definite, heteroscedasticity and autocorrelation consistent covariance matrix," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 33(1), pages 125-132.
    31. Wan, Shui-Ki & Xie, Yimeng & Hsiao, Cheng, 2018. "Panel data approach vs synthetic control method," Economics Letters, Elsevier, vol. 164(C), pages 121-123.
    32. Jushan Bai & Serena Ng, 2009. "Boosting diffusion indices," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 24(4), pages 607-629.
    33. Wang, Hansheng, 2009. "Forward Regression for Ultra-High Dimensional Variable Screening," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1512-1524.
    34. Abadie, Alberto & Diamond, Alexis & Hainmueller, Jens, 2010. "Synthetic Control Methods for Comparative Case Studies: Estimating the Effect of California’s Tobacco Control Program," Journal of the American Statistical Association, American Statistical Association, vol. 105(490), pages 493-505.
    35. Cheng Hsiao & Qiankun Zhou, 2019. "Panel parametric, semiparametric, and nonparametric construction of counterfactuals," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 34(4), pages 463-481, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhentao Shi & Jingyi Huang, 2019. "Forward-Selected Panel Data Approach for Program Evaluation," Papers 1908.05894, arXiv.org, revised Apr 2021.
    2. Carvalho, Carlos & Masini, Ricardo & Medeiros, Marcelo C., 2018. "ArCo: An artificial counterfactual approach for high-dimensional panel time-series data," Journal of Econometrics, Elsevier, vol. 207(2), pages 352-380.
    3. Carlos Viana de Carvalho & Ricardo Masini & Marcelo Cunha Medeiros, 2016. "The perils of Counterfactual Analysis with Integrated Processes," Textos para discussão 654, Department of Economics PUC-Rio (Brazil).
    4. Hongjun Li & Zheng Li & Cheng Hsiao, 2023. "Assessing the impacts of pandemic and the increase in minimum down payment rate on Shanghai housing prices," Empirical Economics, Springer, vol. 64(6), pages 2661-2682, June.
    5. Viviano, Davide & Bradic, Jelena, 2023. "Synthetic Learner: Model-free inference on treatments over time," Journal of Econometrics, Elsevier, vol. 234(2), pages 691-713.
    6. Lee, Ji Hyung & Shi, Zhentao & Gao, Zhan, 2022. "On LASSO for predictive regression," Journal of Econometrics, Elsevier, vol. 229(2), pages 322-349.
    7. Damian Kozbur, 2017. "Testing-Based Forward Model Selection," American Economic Review, American Economic Association, vol. 107(5), pages 266-269, May.
    8. Jianqing Fan & Ricardo Masini & Marcelo C. Medeiros, 2021. "Bridging factor and sparse models," Papers 2102.11341, arXiv.org, revised Sep 2022.
    9. Xingyu Li & Yan Shen & Qiankun Zhou, 2022. "Confidence Intervals of Treatment Effects in Panel Data Models with Interactive Fixed Effects," Papers 2202.12078, arXiv.org.
    10. Li, Kathleen T. & Bell, David R., 2017. "Estimation of average treatment effects with panel data: Asymptotic theory and implementation," Journal of Econometrics, Elsevier, vol. 197(1), pages 65-75.
    11. Michael Funke & Kadri Männasoo & Helery Tasane, 2023. "Regional Economic Impacts of the Øresund Cross-Border Fixed Link: Cui Bono?," CESifo Working Paper Series 10557, CESifo.
    12. Du, Zaichao & Pei, Pei, 2021. "A simple and robust counterfactual impact evaluation," Economics Letters, Elsevier, vol. 207(C).
    13. Ricardo P. Masini & Marcelo C. Medeiros & Eduardo F. Mendes, 2023. "Machine learning advances for time series forecasting," Journal of Economic Surveys, Wiley Blackwell, vol. 37(1), pages 76-111, February.
    14. Bruno Ferman & Cristine Pinto, 2021. "Synthetic controls with imperfect pretreatment fit," Quantitative Economics, Econometric Society, vol. 12(4), pages 1197-1221, November.
    15. Byron Botha & Rulof Burger & Kevin Kotzé & Neil Rankin & Daan Steenkamp, 2023. "Big data forecasting of South African inflation," Empirical Economics, Springer, vol. 65(1), pages 149-188, July.
    16. Shih-Chih Chen & Jianing Hou & De Xiao, 2018. "“One Belt, One Road” Initiative to Stimulate Trade in China: A Counter-Factual Analysis," Sustainability, MDPI, vol. 10(9), pages 1-13, September.
    17. Victor Chernozhukov & Kaspar Wüthrich & Yinchu Zhu, 2019. "Inference on average treatment effects in aggregate panel data settings," CeMMAP working papers CWP32/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    18. Victor Chernozhukov & Kaspar Wuthrich & Yinchu Zhu, 2018. "A $t$-test for synthetic controls," Papers 1812.10820, arXiv.org, revised Jan 2024.
    19. Sayoni Roychowdhury & Indrila Ganguly & Abhik Ghosh, 2021. "Robust Estimation of Average Treatment Effects from Panel Data," Papers 2112.13228, arXiv.org, revised Dec 2022.
    20. Echevarría, Cruz A. & Hasancebi, Serhat & García-Enríquez, Javier, 2022. "Economic Effects of Macao’s Integration with Mainland China: A Causal Inference Study," Journal of Economic Integration, Center for Economic Integration, Sejong University, vol. 37(2), pages 179-215.

    More about this item

    Keywords

    Aggressive algorithm; Average treatment effect; Counterfactual analysis; Post-selection inference;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models
    • C23 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Models with Panel Data; Spatio-temporal Models
    • C38 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Classification Methdos; Cluster Analysis; Principal Components; Factor Analysis
    • D73 - Microeconomics - - Analysis of Collective Decision-Making - - - Bureaucracy; Administrative Processes in Public Organizations; Corruption

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:234:y:2023:i:2:p:512-535. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.