Fast Bayesian Factor Analysis via Automatic Rotations to Sparsity

Authors

  • Veronika Ročková
  • Edward I. George

Abstract

Rotational post hoc transformations have traditionally played a key role in enhancing the interpretability of factor analysis. Regularization methods also serve to achieve this goal by prioritizing sparse loading matrices. In this work, we bridge these two paradigms with a unifying Bayesian framework. Our approach deploys intermediate factor rotations throughout the learning process, greatly enhancing the effectiveness of sparsity inducing priors. These automatic rotations to sparsity are embedded within a PXL-EM algorithm, a Bayesian variant of parameter-expanded EM for posterior mode detection. By iterating between soft-thresholding of small factor loadings and transformations of the factor basis, we obtain (a) dramatic accelerations, (b) robustness against poor initializations, and (c) better oriented sparse solutions. To avoid the prespecification of the factor cardinality, we extend the loading matrix to have infinitely many columns with the Indian buffet process (IBP) prior. The factor dimensionality is learned from the posterior, which is shown to concentrate on sparse matrices. Our deployment of PXL-EM performs a dynamic posterior exploration, outputting a solution path indexed by a sequence of spike-and-slab priors. For accurate recovery of the factor loadings, we deploy the spike-and-slab LASSO prior, a two-component refinement of the Laplace prior. A companion criterion, motivated as an integral lower bound, is provided to effectively select the best recovery. The potential of the proposed procedure is demonstrated on both simulated and real high-dimensional data, which would render posterior simulation impractical. Supplementary materials for this article are available online.
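
Two ingredients named in the abstract, soft-thresholding of small loadings and the two-component spike-and-slab LASSO prior, can be illustrated with a minimal sketch. This is not the authors' PXL-EM implementation: the function names, the parameter names (`theta`, `lambda0`, `lambda1`), and the fixed threshold are assumptions chosen for illustration, and the prior is written in its commonly stated form as a mixture of a sharply peaked Laplace "spike" and a diffuse Laplace "slab".

```python
import math


def soft_threshold(x: float, threshold: float) -> float:
    """Shrink x toward zero by `threshold`; values within the threshold become 0."""
    if threshold < 0:
        raise ValueError("threshold must be non-negative")
    if x > threshold:
        return x - threshold
    if x < -threshold:
        return x + threshold
    return 0.0


def laplace_density(x: float, rate: float) -> float:
    """Density of a zero-centered Laplace distribution with the given rate."""
    return 0.5 * rate * math.exp(-rate * abs(x))


def spike_and_slab_lasso_density(
    x: float, theta: float, lambda0: float, lambda1: float
) -> float:
    """Two-component Laplace mixture (illustrative parametrization).

    theta   : assumed prior weight on the slab component
    lambda0 : large rate, giving a spike concentrated near zero
    lambda1 : small rate, giving a diffuse slab for large loadings
    """
    slab = laplace_density(x, lambda1)
    spike = laplace_density(x, lambda0)
    return theta * slab + (1.0 - theta) * spike


# Hypothetical loadings: small entries are set exactly to zero,
# large entries are shrunk by the threshold.
loadings = [2.5, -0.3, 0.05, -1.2]
thresholded = [soft_threshold(b, 0.5) for b in loadings]
print(thresholded)
```

In this sketch the threshold is a fixed constant. In a spike-and-slab setting the amount of shrinkage would typically adapt to the size of each loading, so the fixed value here should be read only as a simplification.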

Suggested Citation

  • Veronika Ročková & Edward I. George, 2016. "Fast Bayesian Factor Analysis via Automatic Rotations to Sparsity," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1608-1622, October.
  • Handle: RePEc:taf:jnlasa:v:111:y:2016:i:516:p:1608-1622
    DOI: 10.1080/01621459.2015.1100620

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2015.1100620
    Download Restriction: Access to full text is restricted to subscribers.

    References listed on IDEAS

    1. Veronika Ročková & Edward I. George, 2014. "EMVS: The EM Approach to Bayesian Variable Selection," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 828-846, June.
    2. Geweke, John & Zhou, Guofu, 1996. "Measuring the Pricing Error of the Arbitrage Pricing Theory," Review of Financial Studies, Society for Financial Studies, vol. 9(2), pages 557-587.
    3. Michael E. Tipping & Christopher M. Bishop, 1999. "Probabilistic Principal Component Analysis," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(3), pages 611-622.
    4. Jacob M Zahn & Suresh Poosala & Art B Owen & Donald K Ingram & Ana Lustig & Arnell Carter & Ashani T Weeraratna & Dennis D Taub & Myriam Gorospe & Krystyna Mazan-Mamczarz & Edward G Lakatta & Kenneth , 2007. "AGEMAP: A Gene Expression Database for Aging in Mice," PLOS Genetics, Public Library of Science, vol. 3(11), pages 1-12, November.
    5. A. Bhattacharya & D. B. Dunson, 2011. "Sparse Bayesian infinite factor models," Biometrika, Biometrika Trust, vol. 98(2), pages 291-306.
    6. Carvalho, Carlos M. & Chang, Jeffrey & Lucas, Joseph E. & Nevins, Joseph R. & Wang, Quanli & West, Mike, 2008. "High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics," Journal of the American Statistical Association, American Statistical Association, vol. 103(484), pages 1438-1456.
    7. Henry Kaiser, 1958. "The varimax criterion for analytic rotation in factor analysis," Psychometrika, Springer;The Psychometric Society, vol. 23(3), pages 187-200, September.
    8. Nicholas G. Polson & James G. Scott & Jesse Windle, 2013. "Bayesian Inference for Logistic Models Using Pólya--Gamma Latent Variables," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1339-1349, December.

    Citations

    Cited by:

    1. Natalia Bailey & George Kapetanios & M. Hashem Pesaran, 2021. "Measurement of factor strength: Theory and practice," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(5), pages 587-613, August.
    2. Adrian Quintero & Emmanuel Lesaffre & Geert Verbeke, 2024. "Bayesian Exploratory Factor Analysis via Gibbs Sampling," Journal of Educational and Behavioral Statistics, vol. 49(1), pages 121-142, February.
    3. Dimitris Korobilis & Kenichi Shimizu, 2022. "Bayesian Approaches to Shrinkage and Sparse Estimation," Foundations and Trends(R) in Econometrics, now publishers, vol. 11(4), pages 230-354, June.
    4. Lee, Kwangmin & Lee, Jaeyong, 2023. "Post-processed posteriors for sparse covariances," Journal of Econometrics, Elsevier, vol. 236(1).
    5. Simon Beyeler & Sylvia Kaufmann, 2021. "Reduced‐form factor augmented VAR—Exploiting sparsity to include meaningful factors," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(7), pages 989-1012, November.
    6. Roberto Casarin & Fausto Corradin & Francesco Ravazzolo & Nguyen Domenico Sartore & Wing-Keung Wong, 2020. "A Scoring Rule for Factor and Autoregressive Models Under Misspecification," Advances in Decision Sciences, Asia University, Taiwan, vol. 24(2), pages 66-103, June.
    7. Simon Freyaldenhoven, 2020. "Identification Through Sparsity in Factor Models," Working Papers 20-25, Federal Reserve Bank of Philadelphia.
    8. Javier Maldonado & Esther Ruiz, 2021. "Accurate Confidence Regions for Principal Components Factors," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 83(6), pages 1432-1453, December.
    9. Kaufmann, Sylvia & Schumacher, Christian, 2019. "Bayesian estimation of sparse dynamic factor models with order-independent and ex-post mode identification," Journal of Econometrics, Elsevier, vol. 210(1), pages 116-134.
    10. L Schiavon & A Canale & D B Dunson, 2022. "Generalized infinite factorization models [A latent factor linear mixed model for high-dimensional longitudinal data analysis]," Biometrika, Biometrika Trust, vol. 109(3), pages 817-835.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kaufmann, Sylvia & Schumacher, Christian, 2019. "Bayesian estimation of sparse dynamic factor models with order-independent and ex-post mode identification," Journal of Econometrics, Elsevier, vol. 210(1), pages 116-134.
    2. Conti, Gabriella & Frühwirth-Schnatter, Sylvia & Heckman, James J. & Piatek, Rémi, 2014. "Bayesian exploratory factor analysis," Journal of Econometrics, Elsevier, vol. 183(1), pages 31-57.
    3. Roberta De Vito & Ruggero Bellio & Lorenzo Trippa & Giovanni Parmigiani, 2019. "Multi‐study factor analysis," Biometrics, The International Biometric Society, vol. 75(1), pages 337-346, March.
    4. Simon Beyeler & Sylvia Kaufmann, 2021. "Reduced‐form factor augmented VAR—Exploiting sparsity to include meaningful factors," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(7), pages 989-1012, November.
    5. Dimitris Korobilis & Kenichi Shimizu, 2022. "Bayesian Approaches to Shrinkage and Sparse Estimation," Foundations and Trends(R) in Econometrics, now publishers, vol. 11(4), pages 230-354, June.
    6. Sylvia Kaufmann & Christian Schumacher, 2013. "Bayesian estimation of sparse dynamic factor models with order-independent identification," Working Papers 13.04, Swiss National Bank, Study Center Gerzensee.
    7. Sylvia Fruhwirth-Schnatter & Darjus Hosszejni & Hedibert Freitas Lopes, 2023. "When it counts -- Econometric identification of the basic factor model based on GLT structures," Papers 2301.06354, arXiv.org.
    8. Pantelis Samartsidis & Shaun R. Seaman & Silvia Montagna & André Charlett & Matthew Hickman & Daniela De Angelis, 2020. "A Bayesian multivariate factor analysis model for evaluating an intervention by using observational time series data on multiple outcomes," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(4), pages 1437-1459, October.
    9. repec:bfi:wpaper:2014-014 is not listed on IDEAS
    10. Sylvia Fruhwirth-Schnatter, 2023. "Generalized Cumulative Shrinkage Process Priors with Applications to Sparse Bayesian Factor Analysis," Papers 2303.00473, arXiv.org.
    11. Aßmann, Christian & Boysen-Hogrefe, Jens & Pape, Markus, 2012. "The directional identification problem in Bayesian factor analysis: An ex-post approach," Kiel Working Papers 1799, Kiel Institute for the World Economy (IfW Kiel).
    12. Matthew W. Wheeler, 2019. "Bayesian additive adaptive basis tensor product models for modeling high dimensional surfaces: an application to high‐throughput toxicity testing," Biometrics, The International Biometric Society, vol. 75(1), pages 193-201, March.
    13. Hauber, Philipp, 2022. "Real-time nowcasting with sparse factor models," EconStor Preprints 251551, ZBW - Leibniz Information Centre for Economics.
    14. Aßmann, Christian & Boysen-Hogrefe, Jens & Pape, Markus, 2014. "Bayesian analysis of dynamic factor models: An ex-post approach towards the rotation problem," Kiel Working Papers 1902, Kiel Institute for the World Economy (IfW Kiel).
    15. Philip A. White & Alan E. Gelfand, 2021. "Multivariate functional data modeling with time-varying clustering," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(3), pages 586-602, September.
    16. Crespo Cuaresma, Jesús & Huber, Florian & Onorante, Luca, 2020. "Fragility and the effect of international uncertainty shocks," Journal of International Money and Finance, Elsevier, vol. 108(C).
    17. Simon Beyeler & Sylvia Kaufmann, 2016. "Factor augmented VAR revisited - A sparse dynamic factor model approach," Working Papers 16.08, Swiss National Bank, Study Center Gerzensee.
    18. Aßmann, Christian & Boysen-Hogrefe, Jens & Pape, Markus, 2016. "Bayesian analysis of static and dynamic factor models: An ex-post approach towards the rotation problem," Journal of Econometrics, Elsevier, vol. 192(1), pages 190-206.
    19. Jaejoon Lee & Seongil Jo & Jaeyong Lee, 2022. "Robust sparse Bayesian infinite factor models," Computational Statistics, Springer, vol. 37(5), pages 2693-2715, November.
    20. R Asvat & CA Bisschoff & CJ Botha, 2018. "Factors to Measure the Performance of Private Business Schools in South Africa," Journal of Economics and Behavioral Studies, AMH International, vol. 10(6), pages 50-69.
    21. Uddin, Md Nazir & Gaskins, Jeremy T., 2023. "Shared Bayesian variable shrinkage in multinomial logistic regression," Computational Statistics & Data Analysis, Elsevier, vol. 177(C).
