IDEAS home Printed from https://ideas.repec.org/a/spr/jclass/v31y2014i2p242-269.html
   My bibliography  Save this article

Using Generalized Procrustes Analysis for Multiple Imputation in Principal Component Analysis

Author

Listed:
  • Joost Ginkel
  • Pieter Kroonenberg

Abstract

Multiple imputation is one of the most highly recommended procedures for dealing with missing data. However, to date little attention has been paid to methods for combining the results from principal component analyses applied to a multiply imputed data set. In this paper we propose Generalized Procrustes analysis for this purpose, of which its centroid solution can be used as a final estimate for the component loadings. Convex hulls based on the loadings of the imputed data sets can be used to represent the uncertainty due to the missing data. In two simulation studies, the performance of Generalized Procrustes approach is evaluated and compared with other methods. More specifically it is studied how these methods behave when order changes of components and sign reversals of component loadings occur, such as in case of near-equal eigenvalues, or data having almost as many counterindicative items as indicative items. The simulations show that other proposed methods either may run into serious problems or are not able to adequately assess the accuracy due to the presence of missing data. However, when the above situations do not occur, all methods will provide adequate estimates for the PCA loadings. Copyright Classification Society of North America 2014

Suggested Citation

  • Joost Ginkel & Pieter Kroonenberg, 2014. "Using Generalized Procrustes Analysis for Multiple Imputation in Principal Component Analysis," Journal of Classification, Springer;The Classification Society, vol. 31(2), pages 242-269, July.
  • Handle: RePEc:spr:jclass:v:31:y:2014:i:2:p:242-269
    DOI: 10.1007/s00357-014-9154-y
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s00357-014-9154-y
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s00357-014-9154-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Rubin, Donald B, 1986. "Statistical Matching Using File Concatenation with Adjusted Weights and Multiple Imputations," Journal of Business & Economic Statistics, American Statistical Association, vol. 4(1), pages 87-94, January.
    2. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 287-296, July.
    3. Yuan, Yang, 2011. "Multiple Imputation Using SAS Software," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i06).
    4. Bert Green, 1952. "The orthogonal approximation of an oblique structure in factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 17(4), pages 429-440, December.
    5. Haruhiko Ogasawara, 2002. "Concise formulas for the standard errors of component loading estimates," Psychometrika, Springer;The Psychometric Society, vol. 67(2), pages 289-297, June.
    6. Henk Kiers, 1997. "Weighted least squares fitting using ordinary least squares algorithms," Psychometrika, Springer;The Psychometric Society, vol. 62(2), pages 251-266, June.
    7. Robert Jennrich, 1973. "Standard errors for obliquely rotated factor loadings," Psychometrika, Springer;The Psychometric Society, vol. 38(4), pages 593-604, December.
    8. Little, Roderick J A, 1988. "Missing-Data Adjustments in Large Surveys: Reply," Journal of Business & Economic Statistics, American Statistical Association, vol. 6(3), pages 300-301, July.
    9. Jos Berge, 1977. "Orthogonal procrustes rotation for two or more matrices," Psychometrika, Springer;The Psychometric Society, vol. 42(2), pages 267-276, June.
    10. James Lingoes & Ingwer Borg, 1978. "A direct approach to individual differences scaling using increasingly complex transformations," Psychometrika, Springer;The Psychometric Society, vol. 43(4), pages 491-519, December.
    11. Luis Milan & Joe Whittaker, 1995. "Application of the Parametric Bootstrap to Models that Incorporate a Singular Value Decomposition," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 44(1), pages 31-49, March.
    12. Su, Yu-Sung & Gelman, Andrew & Hill, Jennifer & Yajima, Masanao, 2011. "Multiple Imputation with Diagnostics (mi) in R: Opening Windows into the Black Box," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i02).
    13. Claude Archer & Robert Jennrich, 1973. "Standard errors for rotated factor loadings," Psychometrika, Springer;The Psychometric Society, vol. 38(4), pages 581-592, December.
    14. Van Ginkel, Joost R. & Andries Van der Ark, L. & Sijtsma, Klaas & Vermunt, Jeroen K., 2007. "Two-way imputation: A Bayesian method for estimating missing scores in tests and questionnaires, and an accurate approximation," Computational Statistics & Data Analysis, Elsevier, vol. 51(8), pages 4013-4027, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kensuke Okada & Shin-ichi Mayekawa, 2018. "Post-processing of Markov chain Monte Carlo output in Bayesian latent variable models with application to multidimensional scaling," Computational Statistics, Springer, vol. 33(3), pages 1457-1473, September.
    2. Renate S M Buisman & Katharina Pittner & Marieke S Tollenaar & Jolanda Lindenberg & Lisa J M van den Berg & Laura H C G Compier-de Block & Joost R van Ginkel & Lenneke R A Alink & Marian J Bakermans-K, 2020. "Intergenerational transmission of child maltreatment using a multi-informant multi-generation family design," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-23, March.
    3. Sébastien Loisel & Yoshio Takane, 2019. "Comparisons among several methods for handling missing data in principal component analysis (PCA)," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 13(2), pages 495-518, June.
    4. Ann Bostrom & Adam L. Hayes & Katherine M. Crosman, 2019. "Efficacy, Action, and Support for Reducing Climate Change Risks," Risk Analysis, John Wiley & Sons, vol. 39(4), pages 805-828, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gerko Vink & Laurence E. Frank & Jeroen Pannekoek & Stef Buuren, 2014. "Predictive mean matching imputation of semicontinuous variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 61-90, February.
    2. Brownstone, David, 1997. "Multiple Imputation Methodology for Missing Data, Non-Random Response, and Panel Attrition," University of California Transportation Center, Working Papers qt2zd6w6hh, University of California Transportation Center.
    3. Westermeier, Christian & Grabka, Markus M., 2016. "Longitudinal Wealth Data and Multiple Imputation: An Evaluation Study," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 10(3), pages 237-252.
    4. Arif Mamun & David Wittenburg & Noelle Denny-Brown & Michael Levere & David Mann & Rebecca Coughlin & Sarah Croake & Heather Gordon & Denise Hoffman & Rachel Holzwart & Rosalind Keith & Brittany McGil, "undated". "Promoting Opportunity Demonstration: Interim Evaluation Report," Mathematica Policy Research Reports caa99d38a8b14f968ea3438e5, Mathematica Policy Research.
    5. Baltussen, Guido & Swinkels, Laurens & Van Vliet, Pim, 2021. "Global factor premiums," Journal of Financial Economics, Elsevier, vol. 142(3), pages 1128-1154.
    6. Sean Mc Auliffe & Georg U. Thunecke & Georg Wamser, 2023. "The Tax-Elasticity of Tangible Fixed Assets: Evidence from Novel Corporate Tax Data," CESifo Working Paper Series 10628, CESifo.
    7. Leonie C. Steckermeier & Jan Delhey, 2019. "Better for Everyone? Egalitarian Culture and Social Wellbeing in Europe," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 143(3), pages 1075-1108, June.
    8. Saeideh Kamgar & Florian Meinfelder & Ralf Münnich & Hamidreza Navvabpour, 2020. "Estimation within the new integrated system of household surveys in Germany," Statistical Papers, Springer, vol. 61(5), pages 2091-2117, October.
    9. Roderick J. A. Little & Donald B. Rubin, 1989. "The Analysis of Social Science Data with Missing Values," Sociological Methods & Research, , vol. 18(2-3), pages 292-326, November.
    10. Jana Emmenegger & Ralf Münnich & Jannik Schaller, 2022. "Evaluating Data Fusion Methods to Improve Income Modelling," Research Papers in Economics 2022-03, University of Trier, Department of Economics.
    11. Rebecca R. Andridge & Roderick J. A. Little, 2010. "A Review of Hot Deck Imputation for Survey Non‐response," International Statistical Review, International Statistical Institute, vol. 78(1), pages 40-64, April.
    12. Renate S M Buisman & Katharina Pittner & Marieke S Tollenaar & Jolanda Lindenberg & Lisa J M van den Berg & Laura H C G Compier-de Block & Joost R van Ginkel & Lenneke R A Alink & Marian J Bakermans-K, 2020. "Intergenerational transmission of child maltreatment using a multi-informant multi-generation family design," PLOS ONE, Public Library of Science, vol. 15(3), pages 1-23, March.
    13. Chenyang Gu & Roee Gutman, 2017. "Combining item response theory with multiple imputation to equate health assessment questionnaires," Biometrics, The International Biometric Society, vol. 73(3), pages 990-998, September.
    14. Chia-Ning Wang & Roderick Little & Bin Nan & Siobán D. Harlow, 2011. "A Hot-Deck Multiple Imputation Procedure for Gaps in Longitudinal Recurrent Event Histories," Biometrics, The International Biometric Society, vol. 67(4), pages 1573-1582, December.
    15. Adel Bosch & Steven F. Koch, 2021. "Individual and Household Debt: Does Imputation Choice Matter?," Working Papers 202141, University of Pretoria, Department of Economics.
    16. Bennani Dosse, Mohammed & Kiers, Henk A.L. & Ten Berge, Jos M.F., 2011. "Anisotropic generalized Procrustes analysis," Computational Statistics & Data Analysis, Elsevier, vol. 55(5), pages 1961-1968, May.
    17. Matthias von Davier & Youngmi Cho & Tianshu Pan, 2019. "Effects of Discontinue Rules on Psychometric Properties of Test Scores," Psychometrika, Springer;The Psychometric Society, vol. 84(1), pages 147-163, March.
    18. Morris A. Davis & William D. Larson & Stephen D. Oliner & Benjamin Smith, 2019. "Mortgage Risk Since 1990," FHFA Staff Working Papers 19-02, Federal Housing Finance Agency.
    19. Fernandes, Mario & Hilber, Simon & Sturm, Jan-Egbert & Walter, Andreas, 2023. "Closing the gender gap in academia? Evidence from an affirmative action program," Research Policy, Elsevier, vol. 52(9).
    20. Patrick M. Joyce & Donald Malec & Roderick J. A. Little & Aaron Gilary & Alfredo Navarro & Mark E. Asiala, 2014. "Statistical Modeling Methodology for the Voting Rights Act Section 203 Language Assistance Determinations," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(505), pages 36-47, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:jclass:v:31:y:2014:i:2:p:242-269. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.