IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v70y2008i5p957-980.html

Separation measures and the geometry of Bayes factor selection for classification

Author

Listed:
  • Jim Q. Smith
  • Paul E. Anderson
  • Silvia Liverani

Abstract

Summary. Conjugacy assumptions are often used in Bayesian selection over a partition because they allow the otherwise unfeasibly large model space to be searched very quickly. The implications of such models can be analysed algebraically. We use the explicit forms of the associated Bayes factors to demonstrate that such methods can be unstable under common settings of the associated hyperparameters. We then prove that the regions of instability can be removed by setting the hyperparameters in an unconventional way. Under this family of assignments we prove that model selection is determined by an implicit separation measure: a function of the hyperparameters and the sufficient statistics of clusters in a given partition. We show that this family of separation measures has plausible properties. The methodology proposed is illustrated through the selection of clusters of longitudinal gene expression profiles.

Suggested Citation

  • Jim Q. Smith & Paul E. Anderson & Silvia Liverani, 2008. "Separation measures and the geometry of Bayes factor selection for classification," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(5), pages 957-980, November.
  • Handle: RePEc:bla:jorssb:v:70:y:2008:i:5:p:957-980
    DOI: 10.1111/j.1467-9868.2008.00664.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9868.2008.00664.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9868.2008.00664.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Smith, Michael & Kohn, Robert, 1996. "Nonparametric regression using Bayesian variable selection," Journal of Econometrics, Elsevier, vol. 75(2), pages 317-343, December.
    2. Shubhankar Ray & Bani Mallick, 2006. "Functional clustering by Bayesian wavelet methods," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(2), pages 305-332, April.
    3. Fernandez, Carmen & Ley, Eduardo & Steel, Mark F. J., 2001. "Benchmark priors for Bayesian model averaging," Journal of Econometrics, Elsevier, vol. 100(2), pages 381-427, February.
    4. Heard, Nicholas A. & Holmes, Christopher C. & Stephens, David A., 2006. "A Quantitative Study of Gene Regulation Involved in the Immune Response of Anopheline Mosquitoes: An Application of Bayesian Hierarchical Clustering of Curves," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 18-29, March.
    5. Fernando A. Quintana & Pilar L. Iglesias, 2003. "Bayesian clustering and product partition models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(2), pages 557-574, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Heng-Hui Lue, 2025. "Curve Clustering via Pairwise Directions Estimation," Journal of Classification, Springer;The Classification Society, vol. 42(3), pages 565-595, November.
    2. Alessandra Guglielmi & Francesca Ieva & Anna M. Paganoni & Fabrizio Ruggeri & Jacopo Soriano, 2014. "Semiparametric Bayesian models for clustering and classification in the presence of unbalanced in-hospital survival," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 63(1), pages 25-46, January.
    3. Bin Jiang & Anastasios Panagiotelis & George Athanasopoulos & Rob Hyndman & Farshid Vahid, 2016. "Bayesian Rank Selection in Multivariate Regression," Monash Econometrics and Business Statistics Working Papers 6/16, Monash University, Department of Econometrics and Business Statistics.
    4. Dimitris Korobilis, 2008. "Forecasting in vector autoregressions with many predictors," Advances in Econometrics, in: Bayesian Econometrics, pages 403-431, Emerald Group Publishing Limited.
    5. Bruno Scarpa & David B. Dunson, 2014. "Enriched Stick-Breaking Processes for Functional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 647-660, June.
    6. Ruggieri, Eric & Lawrence, Charles E., 2012. "On efficient calculations for Bayesian variable selection," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1319-1332.
    7. Priya Kedia & Damitri Kundu & Kiranmoy Das, 2023. "A Bayesian variable selection approach to longitudinal quantile regression," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 32(1), pages 149-168, March.
    8. Daewon Yang & Taeryon Choi & Eric Lavigne & Yeonseung Chung, 2022. "Non‐parametric Bayesian covariate‐dependent multivariate functional clustering: An application to time‐series data for multiple air pollutants," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1521-1542, November.
    9. Pena, Daniel & Redondas, Dolores, 2006. "Bayesian curve estimation by model averaging," Computational Statistics & Data Analysis, Elsevier, vol. 50(3), pages 688-709, February.
    10. Zhongnan Jin & Jie Min & Yili Hong & Pang Du & Qingyu Yang, 2024. "Multivariate Functional Clustering with Variable Selection and Application to Sensor Data from Engineering Systems," INFORMS Joural on Data Science, INFORMS, vol. 3(2), pages 203-218, October.
    11. Sweata Sen & Damitri Kundu & Kiranmoy Das, 2023. "Variable selection for categorical response: a comparative study," Computational Statistics, Springer, vol. 38(2), pages 809-826, June.
    12. Ouysse, Rachida & Kohn, Robert, 2010. "Bayesian variable selection and model averaging in the arbitrage pricing theory model," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3249-3268, December.
    13. Carmen Fernandez & Eduardo Ley & Mark F. J. Steel, 2001. "Model uncertainty in cross-country growth regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 16(5), pages 563-576.
    14. Dongik Jang & Hee-Seok Oh & Philippe Naveau, 2017. "Identifying local smoothness for spatially inhomogeneous functions," Computational Statistics, Springer, vol. 32(3), pages 1115-1138, September.
    15. Li Ma, 2015. "Scalable Bayesian Model Averaging Through Local Information Propagation," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(510), pages 795-809, June.
    16. Zhang, Hongmei & Huang, Xianzheng & Han, Shengtong & Rezwan, Faisal I. & Karmaus, Wilfried & Arshad, Hasan & Holloway, John W., 2021. "Gaussian Bayesian network comparisons with graph ordering unknown," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    17. Julien Jacques & Cristian Preda, 2014. "Functional data clustering: a survey," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(3), pages 231-255, September.
    18. Guarin, Alexander & Lozano, Ignacio, 2017. "Credit funding and banking fragility: A forecasting model for emerging economies," Emerging Markets Review, Elsevier, vol. 32(C), pages 168-189.
    19. Lopresti, John, 2016. "Multiproduct firms and product scope adjustment in trade," Journal of International Economics, Elsevier, vol. 100(C), pages 160-173.
    20. Gilles Celeux & Mohammed El Anbari & Jean-Michel Marin & Christian P. Robert, 2010. "Regularization in Regression : Comparing Bayesian and Frequentist Methods in a Poorly Informative Situation," Working Papers 2010-43, Center for Research in Economics and Statistics.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:70:y:2008:i:5:p:957-980. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.