IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v164y2018icp65-82.html
   My bibliography  Save this article

Bayesian hierarchical robust factor analysis models for partially observed sample-selection data

Author

Listed:
  • Kim, Hea-Jung

Abstract

This paper introduces a class of scale mixtures of normal selection factor (SMNSF) analysis models which are robust against departures from normality and designed to correct sample-selection bias. Various properties of this class of models are established, including a stochastic representation, a distributional hierarchy, and a quantification of sample-selection bias. A hierarchical Bayesian methodology is also developed for estimation purposes. It involves a simple and computationally feasible Markov Chain Monte Carlo algorithm that avoids analytical or numerical derivatives of the log-likelihood function. Results from simulation studies attest to the good finite-sample performance of the new model in terms of sample-selection bias reduction and robustness against outliers. A data illustration is included.

Suggested Citation

  • Kim, Hea-Jung, 2018. "Bayesian hierarchical robust factor analysis models for partially observed sample-selection data," Journal of Multivariate Analysis, Elsevier, vol. 164(C), pages 65-82.
  • Handle: RePEc:eee:jmvana:v:164:y:2018:i:c:p:65-82
    DOI: 10.1016/j.jmva.2017.11.003
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X17300489
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2017.11.003?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kim, Hyoung-Moon & Maadooliat, Mehdi & Arellano-Valle, Reinaldo B. & Genton, Marc G., 2016. "Skewed factor models using selection mechanisms," Journal of Multivariate Analysis, Elsevier, vol. 145(C), pages 162-177.
    2. Yiu-Fai Yung, 1997. "Finite mixtures in confirmatory factor-analysis models," Psychometrika, Springer;The Psychometric Society, vol. 62(3), pages 297-330, September.
    3. Arrazola, María & de Hevia, José, 2008. "Three measures of returns to education: An illustration for the case of Spain," Economics of Education Review, Elsevier, vol. 27(3), pages 266-275, June.
    4. Asim Ansari & Kamel Jedidi & Laurette Dube, 2002. "Heterogeneous factor analysis models: A bayesian approach," Psychometrika, Springer;The Psychometric Society, vol. 67(1), pages 49-77, March.
    5. David J. Spiegelhalter & Nicola G. Best & Bradley P. Carlin & Angelika Van Der Linde, 2002. "Bayesian measures of model complexity and fit," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 583-639, October.
    6. Chib S. & Jeliazkov I., 2001. "Marginal Likelihood From the Metropolis-Hastings Output," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 270-281, March.
    7. Wan-Lun Wang & Tsung-I Lin, 2013. "An efficient ECM algorithm for maximum likelihood estimation in mixtures of t-factor analyzers," Computational Statistics, Springer, vol. 28(2), pages 751-769, April.
    8. Kim, Hea-Jung, 2008. "A class of weighted multivariate normal distributions and its properties," Journal of Multivariate Analysis, Elsevier, vol. 99(8), pages 1758-1771, September.
    9. Ding, Peng, 2014. "Bayesian robust inference of sample selection using selection-t models," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 451-464.
    10. Pison, Greet & Rousseeuw, Peter J. & Filzmoser, Peter & Croux, Christophe, 2003. "Robust factor analysis," Journal of Multivariate Analysis, Elsevier, vol. 84(1), pages 145-172, January.
    11. Angela Montanari & Cinzia Viroli, 2010. "A skew-normal factor model for the analysis of student satisfaction towards university courses," Journal of Applied Statistics, Taylor & Francis Journals, vol. 37(3), pages 473-487.
    12. Heckman, James, 2013. "Sample selection bias as a specification error," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 31(3), pages 129-137.
    13. Tsung-I Lin & Pal Wu & Geoffrey McLachlan & Sharon Lee, 2015. "A robust factor analysis model using the restricted skew- $$t$$ t distribution," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(3), pages 510-531, September.
    14. McLachlan, G.J. & Bean, R.W. & Ben-Tovim Jones, L., 2007. "Extension of the mixture of factor analyzers model to incorporate the multivariate t-distribution," Computational Statistics & Data Analysis, Elsevier, vol. 51(11), pages 5327-5338, July.
    15. Ming‐Hui Chen, 2005. "Computing marginal likelihoods from a single MCMC output," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 59(1), pages 16-29, February.
    16. Sik-Yum Lee & Ye-Mao Xia, 2008. "A Robust Bayesian Approach for Structural Equation Models with Missing Data," Psychometrika, Springer;The Psychometric Society, vol. 73(3), pages 343-364, September.
    17. Yulia V. Marchenko & Marc G. Genton, 2012. "A Heckman Selection- t Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 304-317, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sharon X. Lee & Tsung-I Lin & Geoffrey J. McLachlan, 2021. "Mixtures of factor analyzers with scale mixtures of fundamental skew normal distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 15(2), pages 481-512, June.
    2. Wiemann, Paul F.V. & Klein, Nadja & Kneib, Thomas, 2022. "Correcting for sample selection bias in Bayesian distributional regression models," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    3. Tsung-I Lin & I-An Chen & Wan-Lun Wang, 2023. "A robust factor analysis model based on the canonical fundamental skew-t distribution," Statistical Papers, Springer, vol. 64(2), pages 367-393, April.
    4. Lachos, Victor H. & Prates, Marcos O. & Dey, Dipak K., 2021. "Heckman selection-t model: Parameter estimation via the EM-algorithm," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
    5. Saulo, Helton & Vila, Roberto & Cordeiro, Shayane S. & Leiva, Víctor, 2023. "Bivariate symmetric Heckman models and their characterization," Journal of Multivariate Analysis, Elsevier, vol. 193(C).
    6. Murray, Paula M. & Browne, Ryan P. & McNicholas, Paul D., 2017. "A mixture of SDB skew-t factor analyzers," Econometrics and Statistics, Elsevier, vol. 3(C), pages 160-168.
    7. Wan-Lun Wang & Min Liu & Tsung-I Lin, 2017. "Robust skew-t factor analysis models for handling missing data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 26(4), pages 649-672, November.
    8. Wojtyś, Małgorzata & Marra, Giampiero & Radice, Rosalba, 2018. "Copula based generalized additive models for location, scale and shape with non-random sample selection," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 1-14.
    9. Emmanuel O. Ogundimu & Jane L. Hutton, 2016. "A Sample Selection Model with Skew-normal Distribution," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 172-190, March.
    10. Helton Saulo & Roberto Vila & Shayane S. Cordeiro, 2022. "Symmetric generalized Heckman models," Papers 2206.10054, arXiv.org.
    11. Lin, Tsung-I & McLachlan, Geoffrey J. & Lee, Sharon X., 2016. "Extending mixtures of factor models using the restricted multivariate skew-normal distribution," Journal of Multivariate Analysis, Elsevier, vol. 143(C), pages 398-413.
    12. Kim, Hyoung-Moon & Maadooliat, Mehdi & Arellano-Valle, Reinaldo B. & Genton, Marc G., 2016. "Skewed factor models using selection mechanisms," Journal of Multivariate Analysis, Elsevier, vol. 145(C), pages 162-177.
    13. Hashemi, Farzane & Naderi, Mehrdad & Jamalizadeh, Ahad & Bekker, Andriette, 2021. "A flexible factor analysis based on the class of mean-mixture of normal distributions," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    14. Wang Miao & Peng Ding & Zhi Geng, 2016. "Identifiability of Normal and Normal Mixture Models with Nonignorable Missing Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1673-1683, October.
    15. Peng Ding, 2016. "On the Conditional Distribution of the Multivariate Distribution," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 293-295, July.
    16. Lucia Mateos & Ines Murillo & Maria del Mar Salinas, 2014. "Desajuste educativo y competencias cognitivas: efectos sobre los salarios," Hacienda Pública Española / Review of Public Economics, IEF, vol. 210(3), pages 85-108, September.
    17. Cai, Jing-Heng & Song, Xin-Yuan & Lam, Kwok-Hap & Ip, Edward Hak-Sing, 2011. "A mixture of generalized latent variable models for mixed mode and heterogeneous data," Computational Statistics & Data Analysis, Elsevier, vol. 55(11), pages 2889-2907, November.
    18. Stegeman, Alwin, 2016. "A new method for simultaneous estimation of the factor model parameters, factor scores, and unique parts," Computational Statistics & Data Analysis, Elsevier, vol. 99(C), pages 189-203.
    19. Wongnaa, Camillus Abawiera & Kyei, Afrane Baffour & Apike, Isaac Akurugu & Awunyo-Vitor, Dadson & Dziwornu, Raymond K., 2021. "Perception and Adoption of Artificial Pollination Technology in Cocoa Production: Evidence from Ghana," 2021 Conference, August 17-31, 2021, Virtual 314939, International Association of Agricultural Economists.
    20. Chaofeng Yuan & Wensheng Zhu & Xuming He & Jianhua Guo, 2019. "A mixture factor model with applications to microarray data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(1), pages 60-76, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:164:y:2018:i:c:p:65-82. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.