
Statistically Significant Linear Regression Coefficients Solely Driven By Outliers In Finite-sample Inference

Author

Listed:
  • Felix Reichel

Abstract

In this paper, we investigate the impact of outliers on the statistical significance of coefficients in linear regression. We demonstrate, through numerical simulation using R, that a single outlier can cause an otherwise insignificant coefficient to appear statistically significant. We compare this with robust Huber regression, which reduces the effects of outliers. Afterwards, we approximate the influence of a single outlier on estimated regression coefficients and discuss common diagnostic statistics to detect influential observations in regression (e.g., studentized residuals). Furthermore, we relate this issue to the optional normality assumption in simple linear regression [14], required for exact finite-sample inference but asymptotically justified for large n by the Central Limit Theorem (CLT). We also address the general dangers of relying solely on p-values without performing adequate regression diagnostics. Finally, we provide a brief overview of regression methods and discuss how they relate to the assumptions of the Gauss-Markov theorem.
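The effect described in the abstract can be illustrated with a minimal sketch. This is not the paper's R code: it is a pure-Python illustration on synthetic data chosen here as an assumption (20 points with no trend, plus one hypothetical high-leverage outlier at x = 40, y = 15). The helper names (`wls_line`, `ols_slope_test`, `leverage`, `huber_line`) are made up for this example. The Huber fit uses iteratively reweighted least squares with the conventional tuning constant 1.345 and a simplified MAD-type scale taken about zero; only its slope is compared, no standard error is computed for it.

```python
import math
from statistics import median

def wls_line(x, y, w):
    """Weighted least-squares fit of y = a + b*x; returns (a, b)."""
    sw = sum(w)
    xbar = sum(wi * xi for wi, xi in zip(w, x)) / sw
    ybar = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxx = sum(wi * (xi - xbar) ** 2 for wi, xi in zip(w, x))
    sxy = sum(wi * (xi - xbar) * (yi - ybar) for wi, xi, yi in zip(w, x, y))
    b = sxy / sxx
    return ybar - b * xbar, b

def ols_slope_test(x, y):
    """OLS slope, its standard error, and the t statistic for H0: slope = 0."""
    n = len(x)
    a, b = wls_line(x, y, [1.0] * n)
    xbar = sum(x) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    sse = sum((yi - a - b * xi) ** 2 for xi, yi in zip(x, y))
    se = math.sqrt(sse / (n - 2) / sxx)
    return b, se, b / se

def leverage(x, i):
    """Hat-matrix diagonal h_ii for observation i in simple regression."""
    n = len(x)
    xbar = sum(x) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    return 1.0 / n + (x[i] - xbar) ** 2 / sxx

def huber_line(x, y, c=1.345, iters=50):
    """Huber M-estimate of (a, b) via iteratively reweighted least squares."""
    w = [1.0] * len(x)
    for _ in range(iters):
        a, b = wls_line(x, y, w)
        r = [yi - a - b * xi for xi, yi in zip(x, y)]
        # Simplified robust scale: 1.4826 * median(|residual|); guard against 0.
        scale = 1.4826 * median([abs(ri) for ri in r]) or 1e-12
        k = c * scale
        # Huber weights: 1 inside the threshold, k/|r| outside it.
        w = [1.0 if abs(ri) <= k else k / abs(ri) for ri in r]
    return wls_line(x, y, w)

# Synthetic clean data (assumption): y alternates +1/-1, so there is no trend.
x_clean = [float(i) for i in range(20)]
y_clean = [1.0 if i % 2 == 0 else -1.0 for i in range(20)]

# The same data plus a single hypothetical outlier far out in x and y.
x_out = x_clean + [40.0]
y_out = y_clean + [15.0]

b_clean, se_clean, t_clean = ols_slope_test(x_clean, y_clean)
b_out, se_out, t_out = ols_slope_test(x_out, y_out)

# Two-sided 5% critical values of Student's t from standard tables.
T_CRIT_DF18 = 2.101  # n = 20, df = 18
T_CRIT_DF19 = 2.093  # n = 21, df = 19

h_out = leverage(x_out, len(x_out) - 1)  # leverage of the added point
a_huber, b_huber = huber_line(x_out, y_out)

print(f"clean data:   slope={b_clean:.4f}  t={t_clean:.2f}  "
      f"significant={abs(t_clean) > T_CRIT_DF18}")
print(f"with outlier: slope={b_out:.4f}  t={t_out:.2f}  "
      f"significant={abs(t_out) > T_CRIT_DF19}")
print(f"outlier leverage={h_out:.3f}  (rule of thumb 2p/n={4 / len(x_out):.3f})")
print(f"Huber slope with outlier={b_huber:.4f}")
```

On this constructed data the OLS slope is far from significant without the extra point and exceeds the 5% critical value once it is added, while the leverage of that point is well above the common 2p/n rule of thumb. The Huber fit downweights the point but does not discard it, which is in line with the abstract's wording that robust regression reduces, rather than removes, the effect of outliers.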

Suggested Citation

  • Felix Reichel, 2025. "Statistically Significant Linear Regression Coefficients Solely Driven By Outliers In Finite-sample Inference," Papers 2505.10738, arXiv.org, revised May 2025.
  • Handle: RePEc:arx:papers:2505.10738

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2505.10738
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Koenker, Roger W & Bassett, Gilbert, Jr, 1978. "Regression Quantiles," Econometrica, Econometric Society, vol. 46(1), pages 33-50, January.
    2. Tobias Ejiofor Ugah & Emmanuel Ikechukwu Mba & Micheal Chinonso Eze & Kingsley Chinedu Arum & Ifeoma Christy Mba & Henrietta Ebele Oranye, 2021. "On the Upper Bounds of Test Statistics for a Single Outlier Test in Linear Regression Models," Journal of Applied Mathematics, Hindawi, vol. 2021, pages 1-5, September.
    3. Li, Baibing & Martin, Elaine B. & Morris, A. Julian, 2002. "On principal component analysis in L1," Computational Statistics & Data Analysis, Elsevier, vol. 40(3), pages 471-474, September.
    4. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    5. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chuliá, Helena & Garrón, Ignacio & Uribe, Jorge M., 2024. "Daily growth at risk: Financial or real drivers? The answer is not always the same," International Journal of Forecasting, Elsevier, vol. 40(2), pages 762-776.
    2. Marfè, Roberto & Pénasse, Julien, 2024. "Measuring macroeconomic tail risk," Journal of Financial Economics, Elsevier, vol. 156(C).
    3. Zhang, Ting & Wang, Lei, 2020. "Smoothed empirical likelihood inference and variable selection for quantile regression with nonignorable missing response," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    4. Yu-Zhu Tian & Man-Lai Tang & Wai-Sum Chan & Mao-Zai Tian, 2021. "Bayesian bridge-randomized penalized quantile regression for ordinal longitudinal data, with application to firm’s bond ratings," Computational Statistics, Springer, vol. 36(2), pages 1289-1319, June.
    5. Yu, Dengdeng & Zhang, Li & Mizera, Ivan & Jiang, Bei & Kong, Linglong, 2019. "Sparse wavelet estimation in quantile regression with multiple functional predictors," Computational Statistics & Data Analysis, Elsevier, vol. 136(C), pages 12-29.
    6. Torossian, Léonard & Picheny, Victor & Faivre, Robert & Garivier, Aurélien, 2020. "A review on quantile regression for stochastic computer experiments," Reliability Engineering and System Safety, Elsevier, vol. 201(C).
    7. Jiang, Liewen & Bondell, Howard D. & Wang, Huixia Judy, 2014. "Interquantile shrinkage and variable selection in quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 69(C), pages 208-219.
    8. Masao Ueki, 2024. "Data-Adaptive Multivariate Test for Genomic Studies Using Fused Lasso," Mathematics, MDPI, vol. 12(10), pages 1-16, May.
    9. Ranran Chen & Mai Dao & Keying Ye & Min Wang, 2025. "Bayesian adaptive lasso quantile regression with non-ignorable missing responses," Computational Statistics, Springer, vol. 40(3), pages 1643-1682, March.
    10. Anshul Verma & Orazio Angelini & Tiziana Di Matteo, 2019. "A new set of cluster driven composite development indicators," Papers 1911.11226, arXiv.org, revised Mar 2020.
    11. Yu-Zhu Tian & Man-Lai Tang & Mao-Zai Tian, 2021. "Bayesian joint inference for multivariate quantile regression model with $L_{1/2}$ penalty," Computational Statistics, Springer, vol. 36(4), pages 2967-2994, December.
    12. Alireza Daneshvar & Golalizadeh Mousa, 2023. "Regression shrinkage and selection via least quantile shrinkage and selection operator," PLOS ONE, Public Library of Science, vol. 18(2), pages 1-17, February.
    13. Bousebata, Meryem & Enjolras, Geoffroy & Girard, Stéphane, 2023. "Extreme partial least-squares," Journal of Multivariate Analysis, Elsevier, vol. 194(C).
    14. Bonaccolto, Giovanni & Borri, Nicola & Consiglio, Andrea, 2023. "Breakup and default risks in the great lockdown," Journal of Banking & Finance, Elsevier, vol. 147(C).
    15. Yazhao Lv & Riquan Zhang & Weihua Zhao & Jicai Liu, 2014. "Quantile regression and variable selection for the single-index model," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(7), pages 1565-1577, July.
    16. Bayer, Sebastian, 2018. "Combining Value-at-Risk forecasts using penalized quantile regressions," Econometrics and Statistics, Elsevier, vol. 8(C), pages 56-77.
    17. Jiang, He & Tao, Changqi & Dong, Yao & Xiong, Ren, 2021. "Robust low-rank multiple kernel learning with compound regularization," European Journal of Operational Research, Elsevier, vol. 295(2), pages 634-647.
    18. Tian, Yuzhu & Song, Xinyuan, 2020. "Bayesian bridge-randomized penalized quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 144(C).
    19. Konrad Bogner & Florian Pappenberger & Massimiliano Zappa, 2019. "Machine Learning Techniques for Predicting the Energy Consumption/Production and Its Uncertainties Driven by Meteorological Observations and Forecasts," Sustainability, MDPI, vol. 11(12), pages 1-22, June.
    20. Xu, Qifa & Zhou, Yingying & Jiang, Cuixia & Yu, Keming & Niu, Xufeng, 2016. "A large CVaR-based portfolio selection model with weight constraints," Economic Modelling, Elsevier, vol. 59(C), pages 436-447.
