IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v42y2015i4p1065-1077.html

Cluster-Specific Variable Selection for Product Partition Models

Author

Listed:
  • Fernando A. Quintana
  • Peter Müller
  • Ana Luisa Papoila

Abstract

type="main" xml:id="sjos12151-abs-0001"> We propose a random partition model that implements prediction with many candidate covariates and interactions. The model is based on a modified product partition model that includes a regression on covariates by favouring homogeneous clusters in terms of these covariates. Additionally, the model allows for a cluster-specific choice of the covariates that are included in this evaluation of homogeneity. The variable selection is implemented by introducing a set of cluster-specific latent indicators that include or exclude covariates. The proposed model is motivated by an application to predicting mortality in an intensive care unit in Lisboa, Portugal.

Suggested Citation

  • Fernando A. Quintana & Peter Müller & Ana Luisa Papoila, 2015. "Cluster-Specific Variable Selection for Product Partition Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(4), pages 1065-1077, December.
  • Handle: RePEc:bla:scjsta:v:42:y:2015:i:4:p:1065-1077
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1111/sjos.12151
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Peter D. Hoff, 2005. "Subset Clustering of Binary Sequences, with an Application to Genomic Abnormality Data," Biometrics, The International Biometric Society, vol. 61(4), pages 1027-1036, December.
    2. Chung, Yeonseung & Dunson, David B., 2009. "Nonparametric Bayes Conditional Distribution Modeling With Variable Selection," Journal of the American Statistical Association, American Statistical Association, vol. 104(488), pages 1646-1660.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Qi Li & Juan Lin & Jeffrey S. Racine, 2013. "Optimal Bandwidth Selection for Nonparametric Conditional Distribution and Quantile Functions," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 31(1), pages 57-65, January.
    2. Igari, Ryosuke & Hoshino, Takahiro, 2018. "A Bayesian data combination approach for repeated durations under unobserved missing indicators: Application to interpurchase-timing in marketing," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 150-166.
    3. Hao, Meiling & Lin, Yuanyuan & Shen, Guohao & Su, Wen, 2023. "Nonparametric inference on smoothed quantile regression process," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    4. Tsuyoshi Kunihama & David B. Dunson, 2013. "Bayesian Modeling of Temporal Dependence in Large Sparse Contingency Tables," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1324-1338, December.
    5. Andrés F. Barrientos & Alejandro Jara & Fernando A. Quintana, 2017. "Fully Nonparametric Regression for Bounded Data Using Dependent Bernstein Polynomials," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 806-825, April.
    6. Pati, Debdeep & Dunson, David B. & Tokdar, Surya T., 2013. "Posterior consistency in conditional distribution estimation," Journal of Multivariate Analysis, Elsevier, vol. 116(C), pages 456-472.
    7. Ryo Kato & Takahiro Hoshino, 2020. "Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous–discrete covariates," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(3), pages 803-825, June.
    8. Benhuai Xie & Wei Pan & Xiaotong Shen, 2008. "Variable Selection in Penalized Model‐Based Clustering Via Regularization on Grouped Parameters," Biometrics, The International Biometric Society, vol. 64(3), pages 921-930, September.
    9. Jaeeun Yu & Jinsu Park & Taeryon Choi & Masahiro Hashizume & Yoonhee Kim & Yasushi Honda & Yeonseung Chung, 2021. "Nonparametric Bayesian Functional Meta-Regression: Applications in Environmental Epidemiology," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 26(1), pages 45-70, March.
    10. Liverani, Silvia & Hastie, David I. & Azizi, Lamiae & Papathomas, Michail & Richardson, Sylvia, 2015. "PReMiuM: An R Package for Profile Regression Mixture Models Using Dirichlet Processes," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 64(i07).
    11. Shi, Minghui & Dunson, David B., 2011. "Bayesian variable selection via particle stochastic search," Statistics & Probability Letters, Elsevier, vol. 81(2), pages 283-291, February.
    12. Ryo Kato & Takahiro Hoshino, 2018. "Semiparametric Bayes Instrumental Variable Estimation with Many Weak Instruments," Discussion Paper Series DP2018-14, Research Institute for Economics & Business Administration, Kobe University.
    13. Zhenke Wu & Livia Casciola‐Rosen & Antony Rosen & Scott L. Zeger, 2021. "A Bayesian approach to restricted latent class models for scientifically structured clustering of multivariate binary outcomes," Biometrics, The International Biometric Society, vol. 77(4), pages 1431-1444, December.
    14. Villani, Mattias & Kohn, Robert & Nott, David J., 2012. "Generalized smooth finite mixtures," Journal of Econometrics, Elsevier, vol. 171(2), pages 121-133.
    15. Lee, Kuo-Jung & Chen, Ray-Bing & Wu, Ying Nian, 2016. "Bayesian variable selection for finite mixture model of linear regressions," Computational Statistics & Data Analysis, Elsevier, vol. 95(C), pages 1-16.
    16. Miller, Jeffrey W., 2019. "An elementary derivation of the Chinese restaurant process from Sethuraman’s stick-breaking process," Statistics & Probability Letters, Elsevier, vol. 146(C), pages 112-117.
    17. Kemmawadee Preedalikit & Daniel Fernández & Ivy Liu & Louise McMillan & Marta Nai Ruscone & Roy Costilla, 2024. "Row mixture-based clustering with covariates for ordinal responses," Computational Statistics, Springer, vol. 39(5), pages 2511-2555, July.
    18. Takahiro Hoshino, 2013. "Semiparametric Bayesian Estimation for Marginal Parametric Potential Outcome Modeling: Application to Causal Inference," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(504), pages 1189-1204, December.
    19. Norets, Andriy & Pelenis, Justinas, 2012. "Bayesian modeling of joint and conditional distributions," Journal of Econometrics, Elsevier, vol. 168(2), pages 332-346.
    20. Cozzini, Alberto & Jasra, Ajay & Montana, Giovanni & Persing, Adam, 2014. "A Bayesian mixture of lasso regressions with t-errors," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 84-97.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:scjsta:v:42:y:2015:i:4:p:1065-1077. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0303-6898 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.