IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2103.06691.html
   My bibliography  Save this paper

Regression based thresholds in principal loading analysis

Author

Listed:
  • J. O. Bauer
  • B. Drabant

Abstract

Principal loading analysis is a dimension reduction method that discards variables which have only a small distorting effect on the covariance matrix. As a special case, principal loading analysis discards variables that are not correlated with the remaining ones. In multivariate linear regression on the other hand, predictors that are neither correlated with both the remaining predictors nor with the dependent variables have a regression coefficients equal to zero. Hence, if the goal is to select a number of predictors, variables that do not correlate are discarded as it is also done in principal loading analysis. That both methods select the same variables occurs not only for the special case of zero correlation however. We contribute conditions under which both methods share the same variable selection. Further, we extend those conditions to provide a choice for the threshold in principal loading analysis which only follows recommendations based on simulation results so far.

Suggested Citation

  • J. O. Bauer & B. Drabant, 2021. "Regression based thresholds in principal loading analysis," Papers 2103.06691, arXiv.org, revised Mar 2022.
  • Handle: RePEc:arx:papers:2103.06691
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2103.06691
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Bauer, Jan O. & Drabant, Bernhard, 2021. "Principal loading analysis," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    2. Ian T. Jolliffe, 1982. "A Note on the Use of Principal Components in Regression," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 31(3), pages 300-303, November.
    3. Kollo, T. & Neudecker, H., 1993. "Asymptotics of Eigenvalues and Unit-Length Eigenvectors of Sample Variance and Correlation Matrices," Journal of Multivariate Analysis, Elsevier, vol. 47(2), pages 283-300, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bauer, Jan O. & Drabant, Bernhard, 2023. "Regression based thresholds in principal loading analysis," Journal of Multivariate Analysis, Elsevier, vol. 193(C).
    2. Dennis Shen & Peng Ding & Jasjeet Sekhon & Bin Yu, 2022. "Same Root Different Leaves: Time Series and Cross-Sectional Methods in Panel Data," Papers 2207.14481, arXiv.org, revised Oct 2022.
    3. Carlos Moreno-Miranda & Hipatia Palacios & Daniele Rama, 2019. "Small-holders perception of sustainability and chain coordination: evidence from Arriba PDO Cocoa in Western Ecuador," Bio-based and Applied Economics Journal, Italian Association of Agricultural and Applied Economics (AIEAA), vol. 8(3), December.
    4. Fernandez-Haddad, Zaira & Quiroga, Sonia, 2011. "Adaptation Of Mediterranean Crops To Water Pressure In The Ebro Basin: A Water Efficiency Index," 2011 International Congress, August 30-September 2, 2011, Zurich, Switzerland 114358, European Association of Agricultural Economists.
    5. Kawano, Shuichi & Fujisawa, Hironori & Takada, Toyoyuki & Shiroishi, Toshihiko, 2015. "Sparse principal component regression with adaptive loading," Computational Statistics & Data Analysis, Elsevier, vol. 89(C), pages 192-203.
    6. Heni Masruroh & Soemarno Soemarno & Syahrul Kurniawan & Amin Setyo Leksono, 2023. "A Spatial Model of Landslides with A Micro-Topography and Vegetation Approach for Sustainable Land Management in the Volcanic Area," Sustainability, MDPI, vol. 15(4), pages 1-26, February.
    7. Minjung Kyung & Ju-Hyun Park & Ji Yeh Choi, 2022. "Bayesian Mixture Model of Extended Redundancy Analysis," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 946-966, September.
    8. Hugh L. Christensen, 2015. "Algorithmic arbitrage of open-end funds using variational Bayes," International Journal of Financial Engineering (IJFE), World Scientific Publishing Co. Pte. Ltd., vol. 2(04), pages 1-38, December.
    9. Liu, Shuangzhe & Leiva, Víctor & Zhuang, Dan & Ma, Tiefeng & Figueroa-Zúñiga, Jorge I., 2022. "Matrix differential calculus with applications in the multivariate linear model and its diagnostics," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    10. Jiaju Miao & Pawel Polak, 2023. "Online Ensemble of Models for Optimal Predictive Performance with Applications to Sector Rotation Strategy," Papers 2304.09947, arXiv.org.
    11. Mirza Pasic & Halima Hadziahmetovic & Ismira Ahmovic & Mugdim Pasic, 2023. "Principal Component Regression Modeling and Analysis of PM 10 and Meteorological Parameters in Sarajevo with and without Temperature Inversion," Sustainability, MDPI, vol. 15(14), pages 1-22, July.
    12. Cai, Yuezhou & Hanley, Aoife, 2012. "Building BRICS: 2-Stage DEA analysis of R&D efficiency," Kiel Working Papers 1788, Kiel Institute for the World Economy (IfW Kiel).
    13. Travaglini, Guido, 2010. "Supervised Principal Components and Factor Instrumental Variables. An Application to Violent CrimeTrends in the US, 1982-2005," MPRA Paper 22077, University Library of Munich, Germany.
    14. Elkin Castaño & Santiago Gallón, 2017. "A solution for multicollinearity in stochastic frontier production function models," Lecturas de Economía, Universidad de Antioquia, Departamento de Economía, issue 86, pages 9-23, Enero - J.
    15. Mansouri, Majdi & Hajji, Mansour & Trabelsi, Mohamed & Harkat, Mohamed Faouzi & Al-khazraji, Ayman & Livera, Andreas & Nounou, Hazem & Nounou, Mohamed, 2018. "An effective statistical fault detection technique for grid connected photovoltaic systems based on an improved generalized likelihood ratio test," Energy, Elsevier, vol. 159(C), pages 842-856.
    16. Ranjith Vijayakumar & Ji Yeh Choi & Eun Hwa Jung, 2022. "A Unified Neural Network Framework for Extended Redundancy Analysis," Psychometrika, Springer;The Psychometric Society, vol. 87(4), pages 1503-1528, December.
    17. Mishra, Aditya & Dey, Dipak K. & Chen, Yong & Chen, Kun, 2021. "Generalized co-sparse factor regression," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    18. Anish Agarwal & Keegan Harris & Justin Whitehouse & Zhiwei Steven Wu, 2023. "Adaptive Principal Component Regression with Applications to Panel Data," Papers 2307.01357, arXiv.org, revised Oct 2023.
    19. Santiago Velásquez & Juho Kanniainen & Saku Mäkinen & Jaakko Valli, 2018. "Layoff announcements and intra-day market reactions," Review of Managerial Science, Springer, vol. 12(1), pages 203-228, January.
    20. Sandip Garai & Ranjit Kumar Paul & Debopam Rakshit & Md Yeasin & Walid Emam & Yusra Tashkandy & Christophe Chesneau, 2023. "Wavelets in Combination with Stochastic and Machine Learning Models to Predict Agricultural Prices," Mathematics, MDPI, vol. 11(13), pages 1-18, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2103.06691. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.