IDEAS home Printed from https://ideas.repec.org/h/ito/pchaps/256804.html
   My bibliography  Save this book chapter

Determining an Adequate Number of Principal Components

In: Advances in Principal Component Analysis

Author

Listed:
  • Stanley L. Sclove

Abstract

The problem of choosing the number of PCs to retain is analyzed in the context of model selection, using so-called model selection criteria (MSCs). For a prespecified set of models, indexed by k=1,2,...,K, these model selection criteria (MSCs) take the form MSCk=nLLk+anmk, where, for model k,LLk is the maximum log likelihood, mk is the number of independent parameters, and the constant an is an=lnn for BIC and an=2 for AIC. The maximum log likelihood LLk is achieved by using the maximum likelihood estimates (MLEs) of the parameters. In Gaussian models, LLk involves the logarithm of the mean squared error (MSE). The main contribution of this chapter is to show how to best use BIC to choose the number of PCs, and to compare these results to ad hoc procedures that have been used. Findings include the following. These are stated as they apply to the eigenvalues of the correlation matrix, which are between 0 and p and have an average of 1. For considering an additional PCk + 1, with AIC, inclusion of the additional PCk + 1 is justified if the corresponding eigenvalue ?k+1 is greater than exp-2/n. For BIC, the inclusion of an additional PCk + 1 is justified if ?k+1>n1/n, which tends to 1 for large n. Therefore, this is in approximate agreement with the average eigenvalue rule for correlation matrices, stating that one should retain dimensions with eigenvalues larger than 1.

Suggested Citation

  • Stanley L. Sclove, 2022. "Determining an Adequate Number of Principal Components," Chapters, in: Fausto Pedro Garcia Marquez (ed.), Advances in Principal Component Analysis, IntechOpen.
  • Handle: RePEc:ito:pchaps:256804
    DOI: 10.5772/intechopen.104534
    as

    Download full text from publisher

    File URL: https://www.intechopen.com/chapters/81645
    Download Restriction: no

    File URL: https://libkey.io/10.5772/intechopen.104534?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Keywords

    reduction of dimensionality; principal components; model selection criteria; information criteria; AIC; BIC;
    All these keywords.

    JEL classification:

    • C10 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ito:pchaps:256804. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Slobodan Momcilovic (email available below). General contact details of provider: http://www.intechopen.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.