IDEAS home Printed from https://ideas.repec.org/a/eee/econom/v234y2023i1p151-177.html

Most powerful test against a sequence of high dimensional local alternatives

Author

Listed:
  • He, Yi
  • Jaidee, Sombut
  • Gao, Jiti

Abstract

We develop a powerful quadratic test for the overall significance of many covariates in a dense regression model in the presence of nuisance parameters. By equally weighting the sample moments, the test is asymptotically correct in high dimensions even when the number of coefficients is larger than the sample size. Our theory allows a non-parametric error distribution and weakly exogenous nuisance variables, in particular the autoregressors that appear in many applications. Using random matrix theory, we show that the test has the optimal asymptotic testing power among a large class of competitors against local alternatives whose coordinates are dense in the eigenbasis of the high dimensional sample covariance matrix among regressors. The asymptotic results are adaptive to the covariates’ cross-sectional and temporal dependence structure and do not require a limiting spectral law of their sample covariance matrix. In the most general case, the nuisance estimation may play a role in the asymptotic limit and we give a robust modification for these irregular scenarios. Monte Carlo studies suggest that our proposed test has good power against high dimensional dense alternatives for various data generating processes. We apply the test to detect the significance of over one hundred exogenous variables in the FRED-MD database for predicting the monthly growth in the US industrial production index.
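
The idea of a quadratic statistic that weights the sample moments equally can be illustrated with a minimal sketch. It is not the authors' procedure: the abstract does not give the standardization or the asymptotic critical values, so the sketch swaps in a residual-permutation calibration, which is only reasonable for exchangeable errors and a simple nuisance design such as an intercept (it would not be appropriate for autoregressive nuisance variables). The function names `equal_weight_stat` and `permutation_pvalue`, the nuisance matrix `Z`, and all simulation settings are hypothetical.

```python
import numpy as np


def equal_weight_stat(resid, X_resid):
    """Squared norm of the sample moments X'e / n (identity weighting)."""
    g = X_resid.T @ resid / resid.shape[0]  # one sample moment per covariate
    return float(g @ g)                     # every moment gets the same weight


def permutation_pvalue(y, X, Z, n_perm=499, seed=0):
    """Illustrative calibration by permuting nuisance-adjusted residuals.

    ASSUMPTION: this stands in for the paper's asymptotic critical values,
    which are not stated in the abstract.
    """
    rng = np.random.default_rng(seed)
    Q, _ = np.linalg.qr(Z)                  # orthonormal basis of the nuisance space
    resid = y - Q @ (Q.T @ y)               # partial the nuisance variables out of y
    X_resid = X - Q @ (Q.T @ X)             # ... and out of the covariates
    observed = equal_weight_stat(resid, X_resid)
    exceed = sum(
        equal_weight_stat(rng.permutation(resid), X_resid) >= observed
        for _ in range(n_perm)
    )
    return observed, (1 + exceed) / (1 + n_perm)


if __name__ == "__main__":
    # Simulated dense alternative with more coefficients than observations.
    rng = np.random.default_rng(1)
    n, p = 100, 150
    X = rng.standard_normal((n, p))
    Z = np.ones((n, 1))                     # nuisance part: intercept only
    beta = np.full(p, 0.3)                  # every coefficient is non-zero
    y = 1.0 + X @ beta + rng.standard_normal(n)
    stat, pval = permutation_pvalue(y, X, Z)
    print(f"statistic = {stat:.3f}, permutation p-value = {pval:.3f}")
```

Because the statistic uses the identity matrix rather than an inverse sample covariance as the weighting matrix, it stays well defined when the number of covariates exceeds the sample size, which is the regime the abstract targets.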

Suggested Citation

  • He, Yi & Jaidee, Sombut & Gao, Jiti, 2023. "Most powerful test against a sequence of high dimensional local alternatives," Journal of Econometrics, Elsevier, vol. 234(1), pages 151-177.
  • Handle: RePEc:eee:econom:v:234:y:2023:i:1:p:151-177
    DOI: 10.1016/j.jeconom.2021.10.015

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304407621003079
    Download Restriction: Full text for ScienceDirect subscribers only


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yi He & Sombut Jaidee & Jiti Gao, 2020. "Most Powerful Test against High Dimensional Free Alternatives," Monash Econometrics and Business Statistics Working Papers 13/20, Monash University, Department of Econometrics and Business Statistics.
    2. Rui Wang & Xingzhong Xu, 2021. "A Bayesian-motivated test for high-dimensional linear regression models with fixed design matrix," Statistical Papers, Springer, vol. 62(4), pages 1821-1852, August.
    3. Byron Botha & Rulof Burger & Kevin Kotzé & Neil Rankin & Daan Steenkamp, 2023. "Big data forecasting of South African inflation," Empirical Economics, Springer, vol. 65(1), pages 149-188, July.
    4. Natalia Bailey & George Kapetanios & M. Hashem Pesaran, 2021. "Measurement of factor strength: Theory and practice," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(5), pages 587-613, August.
    5. Jianqing Fan & Kunpeng Li & Yuan Liao, 2020. "Recent Developments on Factor Models and its Applications in Econometric Learning," Papers 2009.10103, arXiv.org.
    6. Liu, Yang & Sun, Wei & Hsu, Li & He, Qianchuan, 2022. "Statistical inference for high-dimensional pathway analysis with multiple responses," Computational Statistics & Data Analysis, Elsevier, vol. 169(C).
    7. Ping-Shou Zhong & Tao Hu & Jun Li, 2015. "Tests for Coefficients in High-dimensional Additive Hazard Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(3), pages 649-664, September.
    8. Jianqing Fan & Ricardo Masini & Marcelo C. Medeiros, 2021. "Bridging factor and sparse models," Papers 2102.11341, arXiv.org, revised Sep 2022.
    9. Merlevède, F. & Peligrad, M., 2016. "On the empirical spectral distribution for matrices with long memory and independent rows," Stochastic Processes and their Applications, Elsevier, vol. 126(9), pages 2734-2760.
    10. Sardy, Sylvain & Diaz-Rodriguez, Jairo & Giacobino, Caroline, 2022. "Thresholding tests based on affine LASSO to achieve non-asymptotic nominal level and high power under sparse and dense alternatives in high dimension," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    11. Jiang Dandan & Sun Jianguo, 2017. "Group Tests for High-dimensional Failure Time Data with the Additive Hazards Models," The International Journal of Biostatistics, De Gruyter, vol. 13(1), pages 1-10, May.
    12. Wang, Siyang & Cui, Hengjian, 2015. "A new test for part of high dimensional regression coefficients," Journal of Multivariate Analysis, Elsevier, vol. 137(C), pages 187-203.
    13. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    14. Hyungsik Roger Moon & Martin Weidner, 2015. "Linear Regression for Panel With Unknown Number of Factors as Interactive Fixed Effects," Econometrica, Econometric Society, vol. 83(4), pages 1543-1579, July.
    15. Pan, Guangming, 2010. "Strong convergence of the empirical distribution of eigenvalues of sample covariance matrices with a perturbation matrix," Journal of Multivariate Analysis, Elsevier, vol. 101(6), pages 1330-1338, July.
    16. Ian W. McKeague & Min Qian, 2015. "An Adaptive Resampling Test for Detecting the Presence of Significant Predictors," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1422-1433, December.
    17. Yuan Liao & Xinjie Ma & Andreas Neuhierl & Zhentao Shi, 2023. "Economic Forecasts Using Many Noises," Papers 2312.05593, arXiv.org, revised Dec 2023.
    18. Ledoit, Olivier & Wolf, Michael, 2017. "Numerical implementation of the QuEST function," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 199-223.
    19. Jamshid Namdari & Debashis Paul & Lili Wang, 2021. "High-Dimensional Linear Models: A Random Matrix Perspective," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(2), pages 645-695, August.
    20. Hyungsik Roger Moon & Martin Weidner, 2013. "Linear regression for panel with unknown number of factors as interactive fixed effects," CeMMAP working papers 49/13, Institute for Fiscal Studies.

    More about this item

    Keywords

    High-dimensional linear model; Hypothesis testing; Uniformly powerful test; Nuisance parameter; Random matrix theory;

    JEL classification:

    • C12 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Hypothesis Testing: General
    • C22 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Time-Series Models; Dynamic Quantile Regressions; Dynamic Treatment Effect Models; Diffusion Processes
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
