IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v79y2023i2p1187-1200.html
   My bibliography  Save this article

Decomposition of variation of mixed variables by a latent mixed Gaussian copula model

Author

Listed:
  • Yutong Liu
  • Toni Darville
  • Xiaojing Zheng
  • Quefeng Li

Abstract

Many biomedical studies collect data of mixed types of variables from multiple groups of subjects. Some of these studies aim to find the group‐specific and the common variation among all these variables. Even though similar problems have been studied by some previous works, their methods mainly rely on the Pearson correlation, which cannot handle mixed data. To address this issue, we propose a latent mixed Gaussian copula (LMGC) model that can quantify the correlations among binary, ordinal, continuous, and truncated variables in a unified framework. We also provide a tool to decompose the variation into the group‐specific and the common variation over multiple groups via solving a regularized M‐estimation problem. We conduct extensive simulation studies to show the advantage of our proposed method over the Pearson correlation‐based methods. We also demonstrate that by jointly solving the M‐estimation problem over multiple groups, our method is better than decomposing the variation group by group. We also apply our method to a Chlamydia trachomatis genital tract infection study to demonstrate how it can be used to discover informative biomarkers that differentiate patients.

Suggested Citation

  • Yutong Liu & Toni Darville & Xiaojing Zheng & Quefeng Li, 2023. "Decomposition of variation of mixed variables by a latent mixed Gaussian copula model," Biometrics, The International Biometric Society, vol. 79(2), pages 1187-1200, June.
  • Handle: RePEc:bla:biomet:v:79:y:2023:i:2:p:1187-1200
    DOI: 10.1111/biom.13660
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13660
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13660?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Seung C. Ahn & Alex R. Horenstein, 2013. "Eigenvalue Ratio Test for the Number of Factors," Econometrica, Econometric Society, vol. 81(3), pages 1203-1227, May.
    2. Grace Yoon & Raymond J Carroll & Irina Gaynanova, 2020. "Sparse semiparametric canonical correlation analysis for data of mixed types," Biometrika, Biometrika Trust, vol. 107(3), pages 609-625.
    3. Jianqing Fan & Han Liu & Yang Ning & Hui Zou, 2017. "High dimensional semiparametric latent graphical model for mixed data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(2), pages 405-421, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. GUO-FITOUSSI, Liang, 2013. "A Comparison of the Finite Sample Properties of Selection Rules of Factor Numbers in Large Datasets," MPRA Paper 50005, University Library of Munich, Germany.
    2. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    3. Givord, Pauline & Quantin, Simon & Trevien, Corentin, 2018. "A long-term evaluation of the first generation of French urban enterprise zones," Journal of Urban Economics, Elsevier, vol. 105(C), pages 149-161.
    4. Alain-Philippe Fortin & Patrick Gagliardini & O. Scaillet, 2022. "Eigenvalue tests for the number of latent factors in short panels," Swiss Finance Institute Research Paper Series 22-81, Swiss Finance Institute.
    5. Wei, Jie & Chen, Hui, 2020. "Determining the number of factors in approximate factor models by twice K-fold cross validation," Economics Letters, Elsevier, vol. 191(C).
    6. Fan, Jianqing & Liao, Yuan & Shi, Xiaofeng, 2015. "Risks of large portfolios," Journal of Econometrics, Elsevier, vol. 186(2), pages 367-387.
    7. Han Lin Shang, 2023. "Sieve bootstrapping the memory parameter in long-range dependent stationary functional time series," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 107(3), pages 421-441, September.
    8. Yuefeng Han & Rong Chen & Dan Yang & Cun-Hui Zhang, 2020. "Tensor Factor Model Estimation by Iterative Projection," Papers 2006.02611, arXiv.org, revised May 2022.
    9. Luke Hartigan & James Morley, 2020. "A Factor Model Analysis of the Australian Economy and the Effects of Inflation Targeting," The Economic Record, The Economic Society of Australia, vol. 96(314), pages 271-293, September.
    10. Yongfu Huang & Muhammad G. Quibria, 2015. "The global partnership for sustainable development," Natural Resources Forum, Blackwell Publishing, vol. 0(3-4), pages 157-174, August.
    11. Jianqing Fan & Xu Han, 2017. "Estimation of the false discovery proportion with unknown dependence," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(4), pages 1143-1164, September.
    12. Jushan Bai & Serena Ng, 2020. "Simpler Proofs for Approximate Factor Models of Large Dimensions," Papers 2008.00254, arXiv.org.
    13. Hyungsik Roger Moon & Martin Weidner, 2015. "Linear Regression for Panel With Unknown Number of Factors as Interactive Fixed Effects," Econometrica, Econometric Society, vol. 83(4), pages 1543-1579, July.
    14. Liddle, Brantley & Parker, Steven, 2022. "One more for the road: Reconsidering whether OECD gasoline income and price elasticities have changed over time," Energy Economics, Elsevier, vol. 114(C).
    15. Venetis, Ioannis & Ladas, Avgoustinos, 2022. "Co-movement and global factors in sovereign bond yields," MPRA Paper 115801, University Library of Munich, Germany.
    16. Matteo Barigozzi & Marc Hallin, 2023. "Dynamic Factor Models: a Genealogy," Papers 2310.17278, arXiv.org, revised Jan 2024.
    17. Guowei Cui & Vasilis Sarafidis & Takashi Yamagata, 2020. "IV Estimation of Spatial Dynamic Panels with Interactive Effects: Large Sample Theory and an Application on Bank Attitude," Monash Econometrics and Business Statistics Working Papers 11/20, Monash University, Department of Econometrics and Business Statistics.
    18. Oguzhan Cepni & I. Ethem Guney & Norman R. Swanson, 2020. "Forecasting and nowcasting emerging market GDP growth rates: The role of latent global economic policy uncertainty and macroeconomic data surprise factors," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 39(1), pages 18-36, January.
    19. Yoosoon Chang & Yongok Choi & Chang Sik Kim & J. Isaac Miller & Joon Y. Park, 2024. "Common Trends and Country Specific Heterogeneities in Long-Run World Energy Consumption," CAEPR Working Papers 2024-001 Classification-1, Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington.
    20. Zhang, Yixiao & Yu, Cindy L. & Li, Haitao, 2022. "Nowcasting GDP Using Dynamic Factor Model with Unknown Number of Factors and Stochastic Volatility: A Bayesian Approach," Econometrics and Statistics, Elsevier, vol. 24(C), pages 75-93.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:79:y:2023:i:2:p:1187-1200. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.