IDEAS home Printed from https://ideas.repec.org/a/spr/psycho/v77y2012i4p803-826.html
   My bibliography  Save this article

Robust Structural Equation Modeling with Missing Data and Auxiliary Variables

Author

Listed:
  • Ke-Hai Yuan
  • Zhiyong Zhang

Abstract

The paper develops a two-stage robust procedure for structural equation modeling (SEM) and an R package rsem to facilitate the use of the procedure by applied researchers. In the first stage, M-estimates of the saturated mean vector and covariance matrix of all variables are obtained. Those corresponding to the substantive variables are then fitted to the structural model in the second stage. A sandwich-type covariance matrix is used to obtain consistent standard errors (SE) of the structural parameter estimates. Rescaled, adjusted as well as corrected and F-statistics are proposed for overall model evaluation. Using R and EQS, the R package rsem combines the two stages and generates all the test statistics and consistent SEs. Following the robust analysis, multiple model fit indices and standardized solutions are provided in the corresponding output of EQS. An example with open/closed book examination data illustrates the proper use of the package. The method is further applied to the analysis of a data set from the National Longitudinal Survey of Youth 1997 cohort, and results show that the developed procedure not only gives a better endorsement of the substantive models but also yields estimates with uniformly smaller standard errors than the normal-distribution-based maximum likelihood. Copyright The Psychometric Society 2012

Suggested Citation

  • Ke-Hai Yuan & Zhiyong Zhang, 2012. "Robust Structural Equation Modeling with Missing Data and Auxiliary Variables," Psychometrika, Springer;The Psychometric Society, vol. 77(4), pages 803-826, October.
  • Handle: RePEc:spr:psycho:v:77:y:2012:i:4:p:803-826
    DOI: 10.1007/s11336-012-9282-4
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s11336-012-9282-4
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11336-012-9282-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sik-Yum Lee & Ye-Mao Xia, 2006. "Maximum Likelihood Methods in Treating Outliers and Symmetrically Heavy-Tailed Distributions for Nonlinear Structural Equation Models with Missing Data," Psychometrika, Springer;The Psychometric Society, vol. 71(3), pages 565-585, September.
    2. Sik-Yum Lee, 2006. "Bayesian Analysis of Nonlinear Structural Equation Models with Nonignorable Missing Data," Psychometrika, Springer;The Psychometric Society, vol. 71(3), pages 541-564, September.
    3. Ke-Hai Yuan & Linda Marshall & Peter Bentler, 2002. "A unified approach to exploratory factor analysis with missing data, nonnormal data, and in the presence of outliers," Psychometrika, Springer;The Psychometric Society, vol. 67(1), pages 95-121, March.
    4. Roderick J. A. Little, 1988. "Robust Estimation of the Mean and Covariance Matrix from Data with Missing Values," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 37(1), pages 23-38, March.
    5. Yuan, Ke-Hai & Jennrich, Robert I., 1998. "Asymptotics of Estimating Equations under Natural Conditions," Journal of Multivariate Analysis, Elsevier, vol. 65(2), pages 245-260, May.
    6. Liu, Chuanhai, 1997. "ML Estimation of the MultivariatetDistribution and the EM Algorithm," Journal of Multivariate Analysis, Elsevier, vol. 63(2), pages 296-312, November.
    7. Yuan, Ke-Hai & Bentler, Peter M., 1997. "Improving parameter tests in covariance structure analysis," Computational Statistics & Data Analysis, Elsevier, vol. 26(2), pages 177-198, December.
    8. Sik-Yum Lee & Ye-Mao Xia, 2008. "A Robust Bayesian Approach for Structural Equation Models with Missing Data," Psychometrika, Springer;The Psychometric Society, vol. 73(3), pages 343-364, September.
    9. Ke-Hai Yuan & Peter Bentler & Wai Chan, 2004. "Structural equation modeling with heavy tailed distributions," Psychometrika, Springer;The Psychometric Society, vol. 69(3), pages 421-436, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ke-Hai Yuan & Wai Chan & Yubin Tian, 2016. "Expectation-robust algorithm and estimating equations for means and dispersion matrix with missing data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 68(2), pages 329-351, April.
    2. Hemant C. Sashittal & Avan R. Jassawalla & Ruchika Sachdeva, 2023. "The influence of COVID-19 pandemic on consumer–brand relationships: evidence of brand evangelism behaviors," Journal of Brand Management, Palgrave Macmillan, vol. 30(3), pages 245-260, May.
    3. Lingzhi Chen & Ričardas Zitikis, 2020. "Quantifying and analyzing nonlinear relationships with a fresh look at a classical dataset of student scores," Quality & Quantity: International Journal of Methodology, Springer, vol. 54(4), pages 1145-1169, August.
    4. Yuan, Ke-Hai & Savalei, Victoria, 2014. "Consistency, bias and efficiency of the normal-distribution-based MLE: The role of auxiliary variables," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 353-370.
    5. Lu, Zhenqiu (Laura) & Zhang, Zhiyong, 2014. "Robust growth mixture models with non-ignorable missingness: Models, estimation, selection, and application," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 220-240.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ke-Hai Yuan & Wai Chan & Yubin Tian, 2016. "Expectation-robust algorithm and estimating equations for means and dispersion matrix with missing data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 68(2), pages 329-351, April.
    2. Ando, Tomohiro, 2009. "Bayesian factor analysis with fat-tailed factors and its exact marginal likelihood," Journal of Multivariate Analysis, Elsevier, vol. 100(8), pages 1717-1726, September.
    3. Ke-Hai Yuan & Kentaro Hayashi, 2005. "On muthén’s maximum likelihood for two-level covariance structure models," Psychometrika, Springer;The Psychometric Society, vol. 70(1), pages 147-167, March.
    4. Yuan, Ke-Hai, 2009. "Normal distribution based pseudo ML for missing data: With applications to mean and covariance structure analysis," Journal of Multivariate Analysis, Elsevier, vol. 100(9), pages 1900-1918, October.
    5. Yuan, Ke-Hai & Chan, Wai, 2008. "Structural equation modeling with near singular covariance matrices," Computational Statistics & Data Analysis, Elsevier, vol. 52(10), pages 4842-4858, June.
    6. Song, Xin-Yuan & Chen, Fei & Lu, Zhao-Hua, 2013. "A Bayesian semiparametric dynamic two-level structural equation model for analyzing non-normal longitudinal data," Journal of Multivariate Analysis, Elsevier, vol. 121(C), pages 87-108.
    7. Tian, Guo-Liang & Ng, Kai Wang & Tan, Ming, 2008. "EM-type algorithms for computing restricted MLEs in multivariate normal distributions and multivariate t-distributions," Computational Statistics & Data Analysis, Elsevier, vol. 52(10), pages 4768-4778, June.
    8. Kano, Yutaka & Takai, Keiji, 2011. "Analysis of NMAR missing data without specifying missing-data mechanisms in a linear latent variate model," Journal of Multivariate Analysis, Elsevier, vol. 102(9), pages 1241-1255, October.
    9. Xinyuan Song & Yemao Xia & Hongtu Zhu, 2017. "Hidden Markov latent variable models with multivariate longitudinal data," Biometrics, The International Biometric Society, vol. 73(1), pages 313-323, March.
    10. Xia, Ye-Mao & Tang, Nian-Sheng, 2019. "Bayesian analysis for mixture of latent variable hidden Markov models with multivariate longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 132(C), pages 190-211.
    11. Mingan Yang & David Dunson, 2010. "Bayesian Semiparametric Structural Equation Models with Latent Variables," Psychometrika, Springer;The Psychometric Society, vol. 75(4), pages 675-693, December.
    12. Yuan, Ke-Hai & Savalei, Victoria, 2014. "Consistency, bias and efficiency of the normal-distribution-based MLE: The role of auxiliary variables," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 353-370.
    13. Wang, Xiao, 2010. "Wiener processes with random effects for degradation data," Journal of Multivariate Analysis, Elsevier, vol. 101(2), pages 340-351, February.
    14. Jie Jiang & Xinsheng Liu & Keming Yu, 2013. "Maximum likelihood estimation of multinomial probit factor analysis models for multivariate t-distribution," Computational Statistics, Springer, vol. 28(4), pages 1485-1500, August.
    15. Vilca, Filidor & Balakrishnan, N. & Zeller, Camila Borelli, 2014. "A robust extension of the bivariate Birnbaum–Saunders distribution and associated inference," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 418-435.
    16. James C. Slaughter & Amy H. Herring & John M. Thorp, 2009. "A Bayesian Latent Variable Mixture Model for Longitudinal Fetal Growth," Biometrics, The International Biometric Society, vol. 65(4), pages 1233-1242, December.
    17. Jean Jacod & Michael Sørensen, 2018. "A review of asymptotic theory of estimating functions," Statistical Inference for Stochastic Processes, Springer, vol. 21(2), pages 415-434, July.
    18. Chen, Tao & Martin, Elaine & Montague, Gary, 2009. "Robust probabilistic PCA with missing data and contribution analysis for outlier detection," Computational Statistics & Data Analysis, Elsevier, vol. 53(10), pages 3706-3716, August.
    19. Ron Mittelhammer & George Judge, 2009. "A Minimum Power Divergence Class of CDFs and Estimators for the Binary Choice Model," International Econometric Review (IER), Econometric Research Association, vol. 1(1), pages 33-49, April.
    20. R. M. Balan & Ioana Schiopu-Kratina, 2004. "Asymptotic Results with Generalized Estimating Equations for Longitudinal data II," RePAd Working Paper Series lrsp-TRS398, Département des sciences administratives, UQO.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:psycho:v:77:y:2012:i:4:p:803-826. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.