IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v218y2020i2p609-632.html
   My bibliography  Save this article

Simple and reliable estimators of coefficients of interest in a model with high-dimensional confounding effects

Author

Listed:
  • Galbraith, John W.
  • Zinde-Walsh, Victoria

Abstract

Often an investigator is interested in a single parameter or low-dimensional parameter vector (e.g. a treatment effect) in a regression model, rather than in the full set of regression coefficients. There may also be a relatively high-dimensional set of potential explanatory variables other than the effect of interest which, if omitted, could bias the estimate of the parameter of interest. However there may be too many such variables to include all of them without substantial efficiency loss. We suggest a simple, easily computed estimator for this case, using principal components to compute auxiliary regressors from the set of potential controls, and establish the limit properties of the estimator allowing for dependence and heterogeneity as well as increasing dimension of the set of controls. We also provide finite-sample evidence on the performance of the estimator where principal components are selected in a one-dimensional search using an appropriate information criterion as stopping rule for the number of components. The results suggest that the estimator has practical usefulness in small samples.

Suggested Citation

  • Galbraith, John W. & Zinde-Walsh, Victoria, 2020. "Simple and reliable estimators of coefficients of interest in a model with high-dimensional confounding effects," Journal of Econometrics, Elsevier, vol. 218(2), pages 609-632.
  • Handle: RePEc:eee:econom:v:218:y:2020:i:2:p:609-632
    DOI: 10.1016/j.jeconom.2020.04.031
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407620301512
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jeconom.2020.04.031?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Banerjee, Anindya & Marcellino, Massimiliano & Masten, Igor, 2014. "Forecasting with factor-augmented error correction models," International Journal of Forecasting, Elsevier, vol. 30(3), pages 589-612.
    2. Domenico Giannone & Michele Lenza & Giorgio E. Primiceri, 2021. "Economic Predictions With Big Data: The Illusion of Sparsity," Econometrica, Econometric Society, vol. 89(5), pages 2409-2437, September.
    3. Marine Carrasco & Barbara Rossi, 2016. "In-Sample Inference and Forecasting in Misspecified Factor Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(3), pages 313-338, July.
    4. Jushan Bai & Serena Ng, 2002. "Determining the Number of Factors in Approximate Factor Models," Econometrica, Econometric Society, vol. 70(1), pages 191-221, January.
    5. Jean-Marie Dufour, 2003. "Identification, weak instruments, and statistical inference in econometrics," Canadian Journal of Economics, Canadian Economics Association, vol. 36(4), pages 767-808, November.
    6. Cattaneo, Matias D. & Jansson, Michael & Newey, Whitney K., 2018. "Alternative Asymptotics And The Partially Linear Model With Many Regressors," Econometric Theory, Cambridge University Press, vol. 34(2), pages 277-301, April.
    7. Krolzig, Hans-Martin & Hendry, David F., 2001. "Computer automation of general-to-specific model selection procedures," Journal of Economic Dynamics and Control, Elsevier, vol. 25(6-7), pages 831-866, June.
    8. Jean‐Marie Dufour, 2003. "Identification, weak instruments, and statistical inference in econometrics," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 36(4), pages 767-808, November.
    9. Ben S. Bernanke & Jean Boivin & Piotr Eliasz, 2005. "Measuring the Effects of Monetary Policy: A Factor-Augmented Vector Autoregressive (FAVAR) Approach," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 120(1), pages 387-422.
    10. Connor, Gregory & Korajczyk, Robert A., 1986. "Performance measurement with the arbitrage pricing theory : A new framework for analysis," Journal of Financial Economics, Elsevier, vol. 15(3), pages 373-394, March.
    11. Hendry, David F. & Learmer, Edward E. & Poirier, Dale J., 1990. "A Conversation on Econometric Methodology," Econometric Theory, Cambridge University Press, vol. 6(02), pages 171-261, June.
    12. Gorodnichenko, Yuriy & Ng, Serena, 2017. "Level and volatility factors in macroeconomic data," Journal of Monetary Economics, Elsevier, vol. 91(C), pages 52-68.
    13. A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
    14. Alexei Onatski, 2010. "Determining the Number of Factors from Empirical Distribution of Eigenvalues," The Review of Economics and Statistics, MIT Press, vol. 92(4), pages 1004-1016, November.
    15. Galbraith, John W. & Hodgson, Douglas J., 2012. "Dimension reduction and model averaging for estimation of artists' age-valuation profiles," European Economic Review, Elsevier, vol. 56(3), pages 422-435.
    16. Dufour, Jean-Marie & Taamouti, Mohamed, 2007. "Further results on projection-based inference in IV regressions with weak, collinear or missing instruments," Journal of Econometrics, Elsevier, vol. 139(1), pages 133-153, July.
    17. Li, Hongjun & Li, Qi & Shi, Yutang, 2017. "Determining the number of factors when the number of factors can increase with sample size," Journal of Econometrics, Elsevier, vol. 197(1), pages 76-86.
    18. Robinson, Peter M, 1988. "Root- N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
    19. Jan J. J. Groen & George Kapetanios, 2013. "Model Selection Criteria for Factor-Augmented Regressions-super-," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 75(1), pages 37-63, February.
    20. Firmin Doko Tchatoka & Jean‐Marie Dufour, 2014. "Identification‐robust inference for endogeneity parameters in linear structural models," Econometrics Journal, Royal Economic Society, vol. 17(1), pages 165-187, February.
    21. Andrews, Donald W K, 1991. "Heteroskedasticity and Autocorrelation Consistent Covariance Matrix Estimation," Econometrica, Econometric Society, vol. 59(3), pages 817-858, May.
    22. Aaron Fisher & Brian Caffo & Brian Schwartz & Vadim Zipunnikov, 2016. "Fast, Exact Bootstrap Principal Component Analysis for > 1 Million," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 846-860, April.
    23. Jushan Bai & Serena Ng, 2006. "Confidence Intervals for Diffusion Index Forecasts and Inference for Factor-Augmented Regressions," Econometrica, Econometric Society, vol. 74(4), pages 1133-1150, July.
    24. Farebrother, R W, 1972. "Principal Component Estimators and Minimum Mean Square Error Criteria in Regression Analysis," The Review of Economics and Statistics, MIT Press, vol. 54(3), pages 332-336, August.
    25. Bai, Jushan & Ng, Serena, 2013. "Principal components estimation and identification of static factors," Journal of Econometrics, Elsevier, vol. 176(1), pages 18-29.
    26. Jan R. Magnus & J. Durbin, 1999. "Estimation of Regression Coefficients of Interest When Other Regression Coefficients Are of No Interest," Econometrica, Econometric Society, vol. 67(3), pages 639-644, May.
    27. Fama, Eugene F. & French, Kenneth R., 1993. "Common risk factors in the returns on stocks and bonds," Journal of Financial Economics, Elsevier, vol. 33(1), pages 3-56, February.
    28. Stock, James H & Watson, Mark W, 2002. "Macroeconomic Forecasting Using Diffusion Indexes," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(2), pages 147-162, April.
    29. Stock J.H. & Watson M.W., 2002. "Forecasting Using Principal Components From a Large Number of Predictors," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 1167-1179, December.
    30. Donald, S. G. & Newey, W. K., 1994. "Series Estimation of Semilinear Models," Journal of Multivariate Analysis, Elsevier, vol. 50(1), pages 30-40, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Guo, Xu & Li, Runze & Liu, Jingyuan & Zeng, Mudong, 2024. "Reprint: Statistical inference for linear mediation models with high-dimensional mediators and application to studying stock reaction to COVID-19 pandemic," Journal of Econometrics, Elsevier, vol. 239(2).
    2. Guo, Xu & Li, Runze & Liu, Jingyuan & Zeng, Mudong, 2023. "Statistical inference for linear mediation models with high-dimensional mediators and application to studying stock reaction to COVID-19 pandemic," Journal of Econometrics, Elsevier, vol. 235(1), pages 166-179.
    3. Jooyoung Cha & Harold D. Chiang & Yuya Sasaki, 2021. "Inference in high-dimensional regression models without the exact or $L^p$ sparsity," Papers 2108.09520, arXiv.org, revised Dec 2022.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Stock, J.H. & Watson, M.W., 2016. "Dynamic Factor Models, Factor-Augmented Vector Autoregressions, and Structural Vector Autoregressions in Macroeconomics," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 415-525, Elsevier.
    2. Smeekes, Stephan & Wijler, Etienne, 2018. "Macroeconomic forecasting using penalized regression methods," International Journal of Forecasting, Elsevier, vol. 34(3), pages 408-430.
    3. Catherine Doz & Peter Fuleky, 2019. "Dynamic Factor Models," Working Papers 2019-4, University of Hawaii Economic Research Organization, University of Hawaii at Manoa.
    4. Matteo Barigozzi, 2023. "Asymptotic equivalence of Principal Components and Quasi Maximum Likelihood estimators in Large Approximate Factor Models," Papers 2307.09864, arXiv.org, revised Jun 2024.
    5. Liang Chen & Juan J. Dolado & Jesús Gonzalo, 2021. "Quantile Factor Models," Econometrica, Econometric Society, vol. 89(2), pages 875-910, March.
    6. Helmut Lütkepohl, 2014. "Structural Vector Autoregressive Analysis in a Data Rich Environment: A Survey," Discussion Papers of DIW Berlin 1351, DIW Berlin, German Institute for Economic Research.
    7. Karim Barhoumi & Olivier Darné & Laurent Ferrara, 2014. "Dynamic factor models: A review of the literature," OECD Journal: Journal of Business Cycle Measurement and Analysis, OECD Publishing, Centre for International Research on Economic Tendency Surveys, vol. 2013(2), pages 73-107.
    8. Varlam Kutateladze, 2021. "The Kernel Trick for Nonlinear Factor Modeling," Papers 2103.01266, arXiv.org.
    9. repec:cte:wsrepe:23974 is not listed on IDEAS
    10. Bai, Jushan & Ng, Serena, 2019. "Rank regularized estimation of approximate factor models," Journal of Econometrics, Elsevier, vol. 212(1), pages 78-96.
    11. Kutateladze, Varlam, 2022. "The kernel trick for nonlinear factor modeling," International Journal of Forecasting, Elsevier, vol. 38(1), pages 165-177.
    12. Martin Solberger & Erik Spånberg, 2020. "Estimating a Dynamic Factor Model in EViews Using the Kalman Filter and Smoother," Computational Economics, Springer;Society for Computational Economics, vol. 55(3), pages 875-900, March.
    13. Jushan Bai & Serena Ng, 2017. "Principal Components and Regularized Estimation of Factor Models," Papers 1708.08137, arXiv.org, revised Nov 2017.
    14. Chen, Qitong & Hong, Yongmiao & Li, Haiqi, 2024. "Time-varying forecast combination for factor-augmented regressions with smooth structural changes," Journal of Econometrics, Elsevier, vol. 240(1).
    15. Kim, Hyun Hak & Swanson, Norman R., 2018. "Mining big data using parsimonious factor, machine learning, variable selection and shrinkage methods," International Journal of Forecasting, Elsevier, vol. 34(2), pages 339-354.
    16. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    17. Luke Hartigan & James Morley, 2020. "A Factor Model Analysis of the Australian Economy and the Effects of Inflation Targeting," The Economic Record, The Economic Society of Australia, vol. 96(314), pages 271-293, September.
    18. Gagliardini, Patrick & Ossola, Elisa & Scaillet, Olivier, 2019. "A diagnostic criterion for approximate factor structure," Journal of Econometrics, Elsevier, vol. 212(2), pages 503-521.
    19. Mao Takongmo, Charles Olivier & Stevanovic, Dalibor, 2015. "Selection Of The Number Of Factors In Presence Of Structural Instability: A Monte Carlo Study," L'Actualité Economique, Société Canadienne de Science Economique, vol. 91(1-2), pages 177-233, Mars-Juin.
    20. Banerjee, Anindya & Marcellino, Massimiliano & Masten, Igor, 2014. "Forecasting with factor-augmented error correction models," International Journal of Forecasting, Elsevier, vol. 30(3), pages 589-612.
    21. Terence Tai-Leung Chong & Nasha Li & Lin Zou, 2017. "A New Approach to Modeling Sector Stock Returns in China," Chinese Economy, Taylor & Francis Journals, vol. 50(5), pages 305-322, September.

    More about this item

    Keywords

    Confounding; High-dimensional data; Principal components; Subspace consistency; Treatment effect; Wide data;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:econom:v:218:y:2020:i:2:p:609-632. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jeconom .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.