
Separation measures and the geometry of Bayes factor selection for classification

Authors

  • Jim Q. Smith
  • Paul E. Anderson
  • Silvia Liverani

Abstract

Summary. Conjugacy assumptions are often used in Bayesian selection over a partition because they allow the otherwise unfeasibly large model space to be searched very quickly. The implications of such models can be analysed algebraically. We use the explicit forms of the associated Bayes factors to demonstrate that such methods can be unstable under common settings of the associated hyperparameters. We then prove that the regions of instability can be removed by setting the hyperparameters in an unconventional way. Under this family of assignments we prove that model selection is determined by an implicit separation measure: a function of the hyperparameters and the sufficient statistics of clusters in a given partition. We show that this family of separation measures has plausible properties. The methodology proposed is illustrated through the selection of clusters of longitudinal gene expression profiles.
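
The role of conjugacy in the summary above can be illustrated with a toy sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the model is a univariate Normal with known variance `s2` and a conjugate Normal prior (mean `m0`, variance `t2`) on each cluster mean, and the function names `log_marginal` and `log_bf_merge` are hypothetical. Conjugacy gives each cluster a closed-form marginal likelihood, so the log Bayes factor for merging two clusters depends only on the hyperparameters and on each cluster's size, sum and sum of squares.

```python
import math


def log_marginal(y, m0=0.0, s2=1.0, t2=1.0):
    """Closed-form log marginal likelihood of one cluster.

    Assumed toy model (not the paper's): y_i ~ N(mu, s2) with s2 known,
    and a conjugate prior mu ~ N(m0, t2).
    """
    n = len(y)
    d = [yi - m0 for yi in y]
    ss = sum(di * di for di in d)   # sum of squared deviations from m0
    s = sum(d)                      # sum of deviations from m0
    # Marginally y ~ N(m0 * 1, s2 * I + t2 * 1 1^T); its determinant and
    # inverse have simple closed forms.
    log_det = (n - 1) * math.log(s2) + math.log(s2 + n * t2)
    quad = (ss - t2 / (s2 + n * t2) * s * s) / s2
    return -0.5 * (n * math.log(2 * math.pi) + log_det + quad)


def log_bf_merge(a, b, **hyper):
    """Log Bayes factor: one merged cluster versus two separate clusters."""
    merged = list(a) + list(b)
    return (log_marginal(merged, **hyper)
            - log_marginal(a, **hyper)
            - log_marginal(b, **hyper))


cluster_a = [-0.1, 0.0, 0.1]
cluster_b = [1.9, 2.0, 2.1]
for t2 in (1.0, 1e4):
    print(t2, log_bf_merge(cluster_a, cluster_b, t2=t2))
```

In this toy setting the log Bayes factor is negative for `t2 = 1` (keep the clusters apart) and positive for `t2 = 1e4` (merge them), even though the data are unchanged. This shows only that the selection can hinge on a hyperparameter; it may not be the same instability that the paper analyses.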

Suggested Citation

  • Jim Q. Smith & Paul E. Anderson & Silvia Liverani, 2008. "Separation measures and the geometry of Bayes factor selection for classification," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 957-980, November.
  • Handle: RePEc:bla:jorssb:v:70:y:2008:i:5:p:957-980
    DOI: 10.1111/j.1467-9868.2008.00664.x

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2008.00664.x
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alessandra Guglielmi & Francesca Ieva & Anna M. Paganoni & Fabrizio Ruggeri & Jacopo Soriano, 2014. "Semiparametric Bayesian models for clustering and classification in the presence of unbalanced in-hospital survival," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 63(1), pages 25-46, January.
    2. Guarin, Alexander & Lozano, Ignacio, 2017. "Credit funding and banking fragility: A forecasting model for emerging economies," Emerging Markets Review, Elsevier, vol. 32(C), pages 168-189.
    3. Carmen Fernandez & Eduardo Ley & Mark F. J. Steel, 2001. "Model uncertainty in cross-country growth regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 16(5), pages 563-576.
    4. Lopresti, John, 2016. "Multiproduct firms and product scope adjustment in trade," Journal of International Economics, Elsevier, vol. 100(C), pages 160-173.
    5. Bin Jiang & Anastasios Panagiotelis & George Athanasopoulos & Rob Hyndman & Farshid Vahid, 2016. "Bayesian Rank Selection in Multivariate Regression," Monash Econometrics and Business Statistics Working Papers 6/16, Monash University, Department of Econometrics and Business Statistics.
    6. Pena, Daniel & Redondas, Dolores, 2006. "Bayesian curve estimation by model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 50(3), pages 688-709, February.
    7. Gilles Celeux & Mohammed El Anbari & Jean-Michel Marin & Christian P. Robert, 2010. "Regularization in Regression : Comparing Bayesian and Frequentist Methods in a Poorly Informative Situation," Working Papers 2010-43, Center for Research in Economics and Statistics.
    8. Dimitris Korobilis, 2008. "Forecasting in vector autoregressions with many predictors," Advances in Econometrics, in: Bayesian Econometrics, pages 403-431, Emerald Group Publishing Limited.
    9. Bruno Scarpa & David B. Dunson, 2014. "Enriched Stick-Breaking Processes for Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 647-660, June.
    10. Min Wang & Xiaoqian Sun & Tao Lu, 2015. "Bayesian structured variable selection in linear regression models," Computational Statistics, Springer, vol. 30(1), pages 205-229, March.
    11. Bruno Scarpa & David B. Dunson, 2009. "Bayesian Hierarchical Functional Data Analysis Via Contaminated Informative Priors," Biometrics, The International Biometric Society, vol. 65(3), pages 772-780, September.
    12. Frühwirth-Schnatter, Sylvia & Wagner, Helga, 2010. "Stochastic model specification search for Gaussian and partial non-Gaussian state space models," Journal of Econometrics, Elsevier, vol. 154(1), pages 85-100, January.
    13. Ruggieri, Eric & Lawrence, Charles E., 2012. "On efficient calculations for Bayesian variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1319-1332.
    14. Priya Kedia & Damitri Kundu & Kiranmoy Das, 2023. "A Bayesian variable selection approach to longitudinal quantile regression," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(1), pages 149-168, March.
    15. Eklund, Jana & Karlsson, Sune, 2007. "Computational Efficiency in Bayesian Model and Variable Selection," Working Papers 2007:4, Örebro University, School of Business.
    16. Kelvin Balcombe, 2005. "Model Selection Using Information Criteria and Genetic Algorithms," Computational Economics, Springer;Society for Computational Economics, vol. 25(3), pages 207-228, June.
    17. Daewon Yang & Taeryon Choi & Eric Lavigne & Yeonseung Chung, 2022. "Non‐parametric Bayesian covariate‐dependent multivariate functional clustering: An application to time‐series data for multiple air pollutants," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1521-1542, November.
    18. Cathy Chen & Feng Liu & Richard Gerlach, 2011. "Bayesian subset selection for threshold autoregressive moving-average models," Computational Statistics, Springer, vol. 26(1), pages 1-30, March.
    19. Sweata Sen & Damitri Kundu & Kiranmoy Das, 2023. "Variable selection for categorical response: a comparative study," Computational Statistics, Springer, vol. 38(2), pages 809-826, June.
    20. Ouysse, Rachida & Kohn, Robert, 2010. "Bayesian variable selection and model averaging in the arbitrage pricing theory model," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3249-3268, December.
