IDEAS home Printed from https://ideas.repec.org/a/spr/stpapr/v66y2025i3d10.1007_s00362-025-01679-w.html
   My bibliography  Save this article

Tests for high-dimensional partially linear regression models

Author

Listed:
  • Hongwei Shi

    (Beijing Normal University)

  • Weichao Yang

    (Beijing Normal University)

  • Bowen Sun

    (Beijing Normal University)

  • Xu Guo

    (Beijing Normal University)

Abstract

In this paper, we consider the tests for high-dimensional partially linear regression models. The presence of high-dimensional nuisance covariates and the unknown nuisance function makes the inference problem very challenging. We adopt machine learning methods to estimate the unknown nuisance function and introduce quadratic-form test statistics. Interestingly, though the machine learning methods can be very complex, under suitable conditions, we establish the asymptotic normality of our introduced test statistics under the null hypothesis and local alternative hypotheses. We further propose a power-enhanced procedure to improve the performance of test statistics. Two thresholding determination methods are provided for the proposed power-enhanced procedure. We show that the power enhancement procedure is powerful to detect signals under either sparse or dense alternatives and it can still control the type-I error asymptotically under the null hypothesis. Numerical studies are carried out to illustrate the empirical performance of our introduced procedures.

Suggested Citation

  • Hongwei Shi & Weichao Yang & Bowen Sun & Xu Guo, 2025. "Tests for high-dimensional partially linear regression models," Statistical Papers, Springer, vol. 66(3), pages 1-23, April.
  • Handle: RePEc:spr:stpapr:v:66:y:2025:i:3:d:10.1007_s00362-025-01679-w
    DOI: 10.1007/s00362-025-01679-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00362-025-01679-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00362-025-01679-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Rauf Ahmad, M. & Werner, C. & Brunner, E., 2008. "Analysis of high-dimensional repeated measures designs: The one sample case," Computational Statistics & Data Analysis, Elsevier, vol. 53(2), pages 416-427, December.
    2. Zhong, Ping-Shou & Chen, Song Xi, 2011. "Tests for High-Dimensional Regression Coefficients With Factorial Designs," Journal of the American Statistical Association, American Statistical Association, vol. 106(493), pages 260-274.
    3. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    4. Lilun Du & Xu Guo & Wenguang Sun & Changliang Zou, 2023. "False Discovery Rate Control Under General Dependence By Symmetrized Data Aggregation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(541), pages 607-621, January.
    5. Yongshuai Chen & Wenwen Guo & Hengjian Cui, 2024. "On the test of covariance between two high-dimensional random vectors," Statistical Papers, Springer, vol. 65(5), pages 2687-2717, July.
    6. Runze Li & Wei Zhong & Liping Zhu, 2012. "Feature Screening via Distance Correlation Learning," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(499), pages 1129-1139, September.
    7. Rui Wang & Xingzhong Xu, 2021. "A Bayesian-motivated test for high-dimensional linear regression models with fixed design matrix," Statistical Papers, Springer, vol. 62(4), pages 1821-1852, August.
    8. Patric Müller & Sara Geer, 2015. "The Partial Linear Model in High Dimensions," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 580-608, June.
    9. Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2014. "Inference on Treatment Effects after Selection among High-Dimensional Controlsâ€," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(2), pages 608-650.
    10. T. Tony Cai & Weidong Liu & Yin Xia, 2014. "Two-sample test of high dimensional means under dependence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(2), pages 349-372, March.
    11. Brian D. Williamson & Peter B. Gilbert & Marco Carone & Noah Simon, 2021. "Nonparametric variable importance assessment using machine learning techniques," Biometrics, The International Biometric Society, vol. 77(1), pages 9-22, March.
    12. Xianyang Zhang & Guang Cheng, 2017. "Simultaneous Inference for High-Dimensional Linear Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 757-768, April.
    13. Mingxiang Cao & Ziyang Cheng & Kai Xu & Daojiang He, 2024. "A scale-invariant test for linear hypothesis of means in high dimensions," Statistical Papers, Springer, vol. 65(6), pages 3477-3497, August.
    14. Liu, Yan & Zhang, Sanguo & Ma, Shuangge & Zhang, Qingzhao, 2020. "Tests for regression coefficients in high dimensional partially linear models," Statistics & Probability Letters, Elsevier, vol. 163(C).
    15. Stijn Vansteelandt & Oliver Dukes, 2022. "Assumption‐lean inference for generalised linear model parameters," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 657-685, July.
    16. Jinsong Chen & Quefeng Li & Hua Yun Chen, 2023. "Testing generalized linear models with high-dimensional nuisance parameters," Biometrika, Biometrika Trust, vol. 110(1), pages 83-99.
    17. Brian D. Williamson & Peter B. Gilbert & Marco Carone & Noah Simon, 2021. "Rejoinder to “Nonparametric variable importance assessment using machine learning techniques”," Biometrics, The International Biometric Society, vol. 77(1), pages 28-30, March.
    18. Jianqing Fan & Yuan Liao & Jiawei Yao, 2015. "Power Enhancement in High‐Dimensional Cross‐Sectional Tests," Econometrica, Econometric Society, vol. 83(4), pages 1497-1541, July.
    19. Meinshausen, Nicolai & Meier, Lukas & Bühlmann, Peter, 2009. "p-Values for High-Dimensional Regression," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1671-1681.
    20. Jianqing Fan & Shaojun Guo & Ning Hao, 2012. "Variance estimation using refitted cross‐validation in ultrahigh dimensional regression," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 74(1), pages 37-65, January.
    21. Bin Guo & Song Xi Chen, 2016. "Tests for high dimensional generalized linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(5), pages 1079-1102, November.
    22. Ruben Dezeure & Peter Bühlmann & Cun-Hui Zhang, 2017. "High-dimensional simultaneous inference with the bootstrap," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 26(4), pages 685-719, December.
    23. Nilanjan Chatterjee & Raymond J. Carroll, 2005. "Semiparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies," Biometrika, Biometrika Trust, vol. 92(2), pages 399-418, June.
    24. Yu, Xiufan & Yao, Jiawei & Xue, Lingzhou, 2024. "Power enhancement for testing multi-factor asset pricing models via Fisher’s method," Journal of Econometrics, Elsevier, vol. 239(2).
    25. Siyang Wang & Hengjian Cui, 2020. "Test for high dimensional regression coefficients of partially linear models," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 49(17), pages 4091-4116, September.
    26. Yanlin Tang & Huixia Judy Wang & Emre Barut, 2018. "Testing for the presence of significant covariates through conditional marginal regression," Biometrika, Biometrika Trust, vol. 105(1), pages 57-71.
    27. Gongjun Xu & Lifeng Lin & Peng Wei & Wei Pan, 2016. "An adaptive two-sample test for high-dimensional means," Biometrika, Biometrika Trust, vol. 103(3), pages 609-624.
    28. Li, Qi, 2000. "Efficient Estimation of Additive Partially Linear Models," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 41(4), pages 1073-1092, November.
    29. Hardle, Wolfgang & LIang, Hua & Gao, Jiti, 2000. "Partially linear models," MPRA Paper 39562, University Library of Munich, Germany, revised 01 Sep 2000.
    30. Xiufan Yu & Danning Li & Lingzhou Xue & Runze Li, 2023. "Power-Enhanced Simultaneous Test of High-Dimensional Mean Vectors and Covariance Matrices with Application to Gene-Set Testing," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(544), pages 2548-2561, October.
    31. Jingshen Wang & Xuming He & Gongjun Xu, 2020. "Debiased Inference on Treatment Effect in a High-Dimensional Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(529), pages 442-454, January.
    32. Rong Ma & T. Tony Cai & Hongzhe Li, 2021. "Global and Simultaneous Hypothesis Testing for High-Dimensional Logistic Regression Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(534), pages 984-998, April.
    33. Cun-Hui Zhang & Stephanie S. Zhang, 2014. "Confidence intervals for low dimensional parameters in high dimensional linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(1), pages 217-242, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. T. Tony Cai & Zijian Guo & Yin Xia, 2023. "Statistical inference and large-scale multiple testing for high-dimensional regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(4), pages 1135-1171, December.
    2. He, Yi & Jaidee, Sombut & Gao, Jiti, 2023. "Most powerful test against a sequence of high dimensional local alternatives," Journal of Econometrics, Elsevier, vol. 234(1), pages 151-177.
    3. Lan, Wei & Zhong, Ping-Shou & Li, Runze & Wang, Hansheng & Tsai, Chih-Ling, 2016. "Testing a single regression coefficient in high dimensional linear models," Journal of Econometrics, Elsevier, vol. 195(1), pages 154-168.
    4. Alexandre Belloni & Victor Chernozhukov & Denis Chetverikov & Christian Hansen & Kengo Kato, 2018. "High-Dimensional Econometrics and Regularized GMM," Papers 1806.01888, arXiv.org, revised Jun 2018.
    5. Lan, Wei & Ding, Yue & Fang, Zheng & Fang, Kuangnan, 2016. "Testing covariates in high dimension linear regression with latent factors," Journal of Multivariate Analysis, Elsevier, vol. 144(C), pages 25-37.
    6. Agboola, Oluwagbenga David & Yu, Han, 2023. "Neighborhood-based cross fitting approach to treatment effects with high-dimensional data," Computational Statistics & Data Analysis, Elsevier, vol. 186(C).
    7. Dong, Chaohua & Gao, Jiti & Linton, Oliver, 2023. "High dimensional semiparametric moment restriction models," Journal of Econometrics, Elsevier, vol. 232(2), pages 320-345.
    8. Qingliang Fan & Zijian Guo & Ziwei Mei & Cun-Hui Zhang, 2023. "Inference for Nonlinear Endogenous Treatment Effects Accounting for High-Dimensional Covariate Complexity," Papers 2310.08063, arXiv.org, revised Jun 2024.
    9. Alexander Giessing & Jianqing Fan, 2020. "Bootstrapping $\ell_p$-Statistics in High Dimensions," Papers 2006.13099, arXiv.org, revised Aug 2020.
    10. Victor Chernozhukov & Wolfgang K. Hardle & Chen Huang & Weining Wang, 2018. "LASSO-Driven Inference in Time and Space," Papers 1806.05081, arXiv.org, revised May 2020.
    11. Kaspar Wuthrich & Ying Zhu, 2019. "Omitted variable bias of Lasso-based inference methods: A finite sample analysis," Papers 1903.08704, arXiv.org, revised Sep 2021.
    12. Xiaorui Zhu & Yichen Qin & Peng Wang, 2023. "Sparsified Simultaneous Confidence Intervals for High-Dimensional Linear Models," Papers 2307.07574, arXiv.org, revised Jan 2025.
    13. Yi He & Sombut Jaidee & Jiti Gao, 2020. "Most Powerful Test against High Dimensional Free Alternatives," Monash Econometrics and Business Statistics Working Papers 13/20, Monash University, Department of Econometrics and Business Statistics.
    14. Adamek, Robert & Smeekes, Stephan & Wilms, Ines, 2023. "Lasso inference for high-dimensional time series," Journal of Econometrics, Elsevier, vol. 235(2), pages 1114-1143.
    15. Rui Wang & Xingzhong Xu, 2021. "A Bayesian-motivated test for high-dimensional linear regression models with fixed design matrix," Statistical Papers, Springer, vol. 62(4), pages 1821-1852, August.
    16. Maur,Jean-Christophe & Nedeljkovic,Milan & Von Uexkull,Jan Erik, 2022. "FDI and Trade Outcomes at the Industry Level—A Data-Driven Approach," Policy Research Working Paper Series 9901, The World Bank.
    17. Victor Chernozhukov & Denis Chetverikov & Kengo Kato & Yuta Koike, 2022. "High-dimensional Data Bootstrap," Papers 2205.09691, arXiv.org.
    18. Guo, Xu & Li, Runze & Liu, Jingyuan & Zeng, Mudong, 2023. "Statistical inference for linear mediation models with high-dimensional mediators and application to studying stock reaction to COVID-19 pandemic," Journal of Econometrics, Elsevier, vol. 235(1), pages 166-179.
    19. Philipp Bach & Victor Chernozhukov & Malte S. Kurz & Martin Spindler & Sven Klaassen, 2021. "DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R," Papers 2103.09603, arXiv.org, revised Jun 2024.
    20. Yumou Qiu & Jing Tao & Xiao‐Hua Zhou, 2021. "Inference of heterogeneous treatment effects using observational data with high‐dimensional covariates," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(5), pages 1016-1043, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stpapr:v:66:y:2025:i:3:d:10.1007_s00362-025-01679-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.