IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v190y2022ics0047259x22000045.html
   My bibliography  Save this article

A generative approach to modeling data with quantitative and qualitative responses

Author

Listed:
  • Kang, Xiaoning
  • Kang, Lulu
  • Chen, Wei
  • Deng, Xinwei

Abstract

In many scientific areas, data with mixed quantitative and qualitative (QQ) responses are commonly encountered with a large number of predictors. By exploring the association between QQ responses, existing approaches often consider a joint model of QQ responses given the predictor variables. However, the dependency among predictive variables also provides useful information for fitting QQ responses. Hence in this work, we propose a novel generative approach to jointly model the QQ responses by incorporating the dependency information of predictors. The proposed method is computationally efficient and provides accurate parameter estimation under a penalized likelihood framework. Moreover, because of the generative approach framework, the asymptotically theoretical results of the proposed method are established under some regularity conditions. The performance of the proposed method is examined through simulations and real case studies in material science and genetics.

Suggested Citation

  • Kang, Xiaoning & Kang, Lulu & Chen, Wei & Deng, Xinwei, 2022. "A generative approach to modeling data with quantitative and qualitative responses," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
  • Handle: RePEc:eee:jmvana:v:190:y:2022:i:c:s0047259x22000045
    DOI: 10.1016/j.jmva.2022.104952
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X22000045
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2022.104952?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Zhang, Xiao & Boscardin, W. John & Belin, Thomas R. & Wan, Xiaohai & He, Yulei & Zhang, Kui, 2015. "A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values," Journal of Multivariate Analysis, Elsevier, vol. 135(C), pages 43-58.
    2. Dunson, David B., 2003. "Dynamic Latent Trait Models for Multidimensional Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 555-563, January.
    3. Luo, Chongliang & Liang, Jian & Li, Gen & Wang, Fei & Zhang, Changshui & Dey, Dipak K. & Chen, Kun, 2018. "Leveraging mixed and incomplete outcomes via reduced-rank modeling," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 378-394.
    4. Jacob Bien & Robert J. Tibshirani, 2011. "Sparse estimation of a covariance matrix," Biometrika, Biometrika Trust, vol. 98(4), pages 807-820.
    5. Woojoo Lee & Yudi Pawitan, 2014. "Direct Calculation of the Variance of Maximum Penalized Likelihood Estimates via EM Algorithm," The American Statistician, Taylor & Francis Journals, vol. 68(2), pages 93-97, May.
    6. Ming Yuan & Yi Lin, 2007. "Model selection and estimation in the Gaussian graphical model," Biometrika, Biometrika Trust, vol. 94(1), pages 19-35.
    7. Fan J. & Li R., 2001. "Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 1348-1360, December.
    8. Alessandra Guglielmi & Francesca Ieva & Anna Maria Paganoni & Fernardo A. Quintana, 2018. "A semiparametric Bayesian joint model for multiple mixed-type outcomes: an application to acute myocardial infarction," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(2), pages 399-423, June.
    9. Kao, Ming-Hung & Khogeer, Hazar, 2021. "Optimal designs for mixed continuous and binary responses with quantitative and qualitative factors," Journal of Multivariate Analysis, Elsevier, vol. 182(C).
    10. Craiu, V. Radu & Sabeti, Avideh, 2012. "In mixed company: Bayesian inference for bivariate conditional copula models with discrete and continuous outcomes," Journal of Multivariate Analysis, Elsevier, vol. 110(C), pages 106-120.
    11. Quefeng Li & Lexin Li, 2018. "Integrative linear discriminant analysis with guaranteed error rate improvement," Biometrika, Biometrika Trust, vol. 105(4), pages 917-930.
    12. Lam, Clifford & Fan, Jianqing, 2009. "Sparsistency and rates of convergence in large covariance matrix estimation," LSE Research Online Documents on Economics 31540, London School of Economics and Political Science, LSE Library.
    13. Pires, Ana M. & Branco, João A., 2010. "Projection-pursuit approach to robust linear discriminant analysis," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2464-2485, November.
    14. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    15. Lanjia Lin & Dipankar Bandyopadhyay & Stuart R. Lipsitz & Debajyoti Sinha, 2010. "Association Models for Clustered Data with Binary and Continuous Responses," Biometrics, The International Biometric Society, vol. 66(1), pages 287-293, March.
    16. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    17. Z. K. Liu & L. X. Yang & S.-C. Wu & C. Shekhar & J. Jiang & H. F. Yang & Y. Zhang & S.-K. Mo & Z. Hussain & B. Yan & C. Felser & Y. L. Chen, 2016. "Observation of unusual topological surface states in half-Heusler compounds LnPtBi (Ln=Lu, Y)," Nature Communications, Nature, vol. 7(1), pages 1-7, December.
    18. Hansheng Wang & Runze Li & Chih-Ling Tsai, 2007. "Tuning parameter selectors for the smoothly clipped absolute deviation method," Biometrika, Biometrika Trust, vol. 94(3), pages 553-568.
    19. Peirong Xu & Ji Zhu & Lixing Zhu & Yi Li, 2015. "Covariance-enhanced discriminant analysis," Biometrika, Biometrika Trust, vol. 102(1), pages 33-45.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiao Ni & Daowen Zhang & Hao Helen Zhang, 2010. "Variable Selection for Semiparametric Mixed Models in Longitudinal Studies," Biometrics, The International Biometric Society, vol. 66(1), pages 79-88, March.
    2. Benjamin Poignard & Manabu Asai, 2023. "Estimation of high-dimensional vector autoregression via sparse precision matrix," The Econometrics Journal, Royal Economic Society, vol. 26(2), pages 307-326.
    3. Qingliang Fan & Yaqian Wu, 2020. "Endogenous Treatment Effect Estimation with some Invalid and Irrelevant Instruments," Papers 2006.14998, arXiv.org.
    4. Pei Wang & Shunjie Chen & Sijia Yang, 2022. "Recent Advances on Penalized Regression Models for Biological Data," Mathematics, MDPI, vol. 10(19), pages 1-24, October.
    5. Lam, Clifford, 2020. "High-dimensional covariance matrix estimation," LSE Research Online Documents on Economics 101667, London School of Economics and Political Science, LSE Library.
    6. Ping Zeng & Yongyue Wei & Yang Zhao & Jin Liu & Liya Liu & Ruyang Zhang & Jianwei Gou & Shuiping Huang & Feng Chen, 2014. "Variable selection approach for zero-inflated count data via adaptive lasso," Journal of Applied Statistics, Taylor & Francis Journals, vol. 41(4), pages 879-894, April.
    7. Zhang, Yan-Qing & Tian, Guo-Liang & Tang, Nian-Sheng, 2016. "Latent variable selection in structural equation models," Journal of Multivariate Analysis, Elsevier, vol. 152(C), pages 190-205.
    8. Bailey, Natalia & Pesaran, M. Hashem & Smith, L. Vanessa, 2019. "A multiple testing approach to the regularisation of large sample correlation matrices," Journal of Econometrics, Elsevier, vol. 208(2), pages 507-534.
    9. Xuan Liu & Jianbao Chen, 2021. "Variable Selection for the Spatial Autoregressive Model with Autoregressive Disturbances," Mathematics, MDPI, vol. 9(12), pages 1-20, June.
    10. Siwei Xia & Yuehan Yang & Hu Yang, 2022. "Sparse Laplacian Shrinkage with the Graphical Lasso Estimator for Regression Problems," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(1), pages 255-277, March.
    11. Engler David & Li Yi, 2009. "Survival Analysis with High-Dimensional Covariates: An Application in Microarray Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-22, February.
    12. Lin, Huazhen & Peng, Heng, 2013. "Smoothed rank correlation of the linear transformation regression model," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 615-630.
    13. Hirose, Kei & Tateishi, Shohei & Konishi, Sadanori, 2013. "Tuning parameter selection in sparse regression modeling," Computational Statistics & Data Analysis, Elsevier, vol. 59(C), pages 28-40.
    14. Sheng, Ying & Wang, Qihua, 2019. "Simultaneous variable selection and class fusion with penalized distance criterion based classifiers," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 138-152.
    15. Matsui, Hidetoshi, 2014. "Variable and boundary selection for functional data via multiclass logistic regression modeling," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 176-185.
    16. Zhigeng Geng & Sijian Wang & Menggang Yu & Patrick O. Monahan & Victoria Champion & Grace Wahba, 2015. "Group variable selection via convex log-exp-sum penalty with application to a breast cancer survivor study," Biometrics, The International Biometric Society, vol. 71(1), pages 53-62, March.
    17. Yanxin Wang & Qibin Fan & Li Zhu, 2018. "Variable selection and estimation using a continuous approximation to the $$L_0$$ L 0 penalty," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 70(1), pages 191-214, February.
    18. Taha Alshaybawee & Habshah Midi & Rahim Alhamzawi, 2017. "Bayesian elastic net single index quantile regression," Journal of Applied Statistics, Taylor & Francis Journals, vol. 44(5), pages 853-871, April.
    19. Huang, Zhensheng & Pang, Zhen & Lin, Bingqing & Shao, Quanxi, 2014. "Model structure selection in single-index-coefficient regression models," Journal of Multivariate Analysis, Elsevier, vol. 125(C), pages 159-175.
    20. Wang, Tao & Zhu, Lixing, 2011. "Consistent tuning parameter selection in high dimensional sparse linear regression," Journal of Multivariate Analysis, Elsevier, vol. 102(7), pages 1141-1151, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:190:y:2022:i:c:s0047259x22000045. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.