IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v84y2022i5p1640-1665.html
   My bibliography  Save this article

General Bayesian loss function selection and the use of improper models

Author

Listed:
  • Jack Jewson
  • David Rossell

Abstract

Statisticians often face the choice between using probability models or a paradigm defined by minimising a loss function. Both approaches are useful and, if the loss can be re‐cast into a proper probability model, there are many tools to decide which model or loss is more appropriate for the observed data, in the sense of explaining the data's nature. However, when the loss leads to an improper model, there are no principled ways to guide this choice. We address this task by combining the Hyvärinen score, which naturally targets infinitesimal relative probabilities, and general Bayesian updating, which provides a unifying framework for inference on losses and models. Specifically we propose the ℋ$$ \mathscr{H} $$‐score, a general Bayesian selection criterion and prove that it consistently selects the (possibly improper) model closest to the data‐generating truth in Fisher's divergence. We also prove that an associated ℋ$$ \mathscr{H} $$‐posterior consistently learns optimal hyper‐parameters featuring in loss functions, including a challenging tempering parameter in generalised Bayesian inference. As salient examples, we consider robust regression and non‐parametric density estimation where popular loss functions define improper models for the data and hence cannot be dealt with using standard model selection tools. These examples illustrate advantages in robustness‐efficiency trade‐offs and enable Bayesian inference for kernel density estimation, opening a new avenue for Bayesian non‐parametrics.

Suggested Citation

  • Jack Jewson & David Rossell, 2022. "General Bayesian loss function selection and the use of improper models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(5), pages 1640-1665, November.
  • Handle: RePEc:bla:jorssb:v:84:y:2022:i:5:p:1640-1665
    DOI: 10.1111/rssb.12553
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssb.12553
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssb.12553?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Lan Wang & Bo Peng & Jelena Bradic & Runze Li & Yunan Wu, 2020. "Rejoinder to “A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression”," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1726-1729, December.
    2. Peter Hall & J. S. Marron & Amnon Neeman, 2005. "Geometric representation of high dimension, low sample size data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(3), pages 427-444, June.
    3. Marco Riani & Andrea Cerioli & Francesca Torti, 2014. "On consistency factors and efficiency of robust S-estimators," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(2), pages 356-387, June.
    4. Jianqing Fan & Cong Ma & Kaizheng Wang, 2020. "Comment on “A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression”," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1720-1725, December.
    5. Bradley Efron, 2020. "Prediction, Estimation, and Attribution," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 636-655, April.
    6. Chernozhukov, Victor & Hong, Han, 2003. "An MCMC approach to classical estimation," Journal of Econometrics, Elsevier, vol. 115(2), pages 293-346, August.
    7. Bradley Efron, 2020. "Prediction, Estimation, and Attribution," International Statistical Review, International Statistical Institute, vol. 88(S1), pages 28-59, December.
    8. Lan Wang & Bo Peng & Jelena Bradic & Runze Li & Yunan Wu, 2020. "A Tuning-free Robust and Efficient Approach to High-dimensional Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1700-1714, December.
    9. F. Giummolè & V. Mameli & E. Ruli & L. Ventura, 2019. "Objective Bayesian inference with proper scoring rules," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 728-755, September.
    10. Filzmoser, Peter & Maronna, Ricardo & Werner, Mark, 2008. "Outlier identification in high dimensions," Computational Statistics & Data Analysis, Elsevier, vol. 52(3), pages 1694-1711, January.
    11. Vaart,A. W. van der, 2000. "Asymptotic Statistics," Cambridge Books, Cambridge University Press, number 9780521784504.
    12. A. Philip Dawid & Monica Musio & Laura Ventura, 2016. "Minimum Scoring Rule Inference," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 123-138, March.
    13. Stephane Shao & Pierre E. Jacob & Jie Ding & Vahid Tarokh, 2019. "Bayesian Model Comparison with the Hyvärinen Score: Computation and Consistency," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(528), pages 1826-1837, October.
    14. P. G. Bissiri & C. C. Holmes & S. G. Walker, 2016. "A general framework for updating belief distributions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(5), pages 1103-1130, November.
    15. David Rossell & Francisco J. Rubio, 2018. "Tractable Bayesian Variable Selection: Beyond Normality," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(524), pages 1742-1758, October.
    16. Takuo Matsubara & Jeremias Knoblauch & François‐Xavier Briol & Chris J. Oates, 2022. "Robust generalised Bayesian inference for intractable likelihoods," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 997-1022, July.
    17. Xiudi Li & Ali Shojaie, 2020. "Discussion of “A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression”," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1717-1719, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Christis Katsouris, 2023. "High Dimensional Time Series Regression Models: Applications to Statistical Learning Methods," Papers 2308.16192, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Canhong Wen & Zhenduo Li & Ruipeng Dong & Yijin Ni & Wenliang Pan, 2023. "Simultaneous Dimension Reduction and Variable Selection for Multinomial Logistic Regression," INFORMS Journal on Computing, INFORMS, vol. 35(5), pages 1044-1060, September.
    2. Mingyang Ren & Sanguo Zhang & Junhui Wang, 2023. "Consistent estimation of the number of communities via regularized network embedding," Biometrics, The International Biometric Society, vol. 79(3), pages 2404-2416, September.
    3. Yuyang Liu & Pengfei Pi & Shan Luo, 2023. "A semi-parametric approach to feature selection in high-dimensional linear regression models," Computational Statistics, Springer, vol. 38(2), pages 979-1000, June.
    4. Takuo Matsubara & Jeremias Knoblauch & François‐Xavier Briol & Chris J. Oates, 2022. "Robust generalised Bayesian inference for intractable likelihoods," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 997-1022, July.
    5. Benítez-Peña, Sandra & Carrizosa, Emilio & Guerrero, Vanesa & Jiménez-Gamero, M. Dolores & Martín-Barragán, Belén & Molero-Río, Cristina & Ramírez-Cobo, Pepa & Romero Morales, Dolores & Sillero-Denami, 2021. "On sparse ensemble methods: An application to short-term predictions of the evolution of COVID-19," European Journal of Operational Research, Elsevier, vol. 295(2), pages 648-663.
    6. Fabio Canova & Christian Matthes, 2021. "Dealing with misspecification in structural macroeconometric models," Quantitative Economics, Econometric Society, vol. 12(2), pages 313-350, May.
    7. Manski, Charles F., 2023. "Probabilistic prediction for binary treatment choice: With focus on personalized medicine," Journal of Econometrics, Elsevier, vol. 234(2), pages 647-663.
    8. Gael M. Martin & David T. Frazier & Christian P. Robert, 2020. "Computing Bayes: Bayesian Computation from 1763 to the 21st Century," Monash Econometrics and Business Statistics Working Papers 14/20, Monash University, Department of Econometrics and Business Statistics.
    9. Yuan Liao & Anna Simoni, 2012. "Semi-parametric Bayesian Partially Identified Models based on Support Function," Papers 1212.3267, arXiv.org, revised Nov 2013.
    10. Weishampel, Anthony & Staicu, Ana-Maria & Rand, William, 2023. "Classification of social media users with generalized functional data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    11. Toru Kitagawa & Hugo Lopez & Jeff Rowley, 2022. "Stochastic Treatment Choice with Empirical Welfare Updating," Papers 2211.01537, arXiv.org, revised Feb 2023.
    12. Zhichao Liu & Catherine Forbes & Heather Anderson, 2017. "Robust Bayesian exponentially tilted empirical likelihood method," Monash Econometrics and Business Statistics Working Papers 21/17, Monash University, Department of Econometrics and Business Statistics.
    13. Nelson P. Rayl & Nitish R. Sinha, 2022. "Integrating Prediction and Attribution to Classify News," Finance and Economics Discussion Series 2022-042, Board of Governors of the Federal Reserve System (U.S.).
    14. Xiaohong Chen & Timothy M. Christensen & Elie Tamer, 2018. "Monte Carlo Confidence Sets for Identified Sets," Econometrica, Econometric Society, vol. 86(6), pages 1965-2018, November.
    15. Denis A Shah & Erick D De Wolf & Pierce A Paul & Laurence V Madden, 2021. "Accuracy in the prediction of disease epidemics when ensembling simple but highly correlated models," PLOS Computational Biology, Public Library of Science, vol. 17(3), pages 1-23, March.
    16. Ruben Loaiza‐Maya & Gael M. Martin & David T. Frazier, 2021. "Focused Bayesian prediction," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(5), pages 517-543, August.
    17. Xiaohong Chen & Timothy M. Christensen & Keith O'Hara & Elie Tamer, 2016. "MCMC confidence sets for identified sets," CeMMAP working papers 28/16, Institute for Fiscal Studies.
    18. Fabio Canova & Christian Matthes, 2021. "A Composite Likelihood Approach for Dynamic Structural Models," The Economic Journal, Royal Economic Society, vol. 131(638), pages 2447-2477.
    19. Chung, Hee Cheol & Ahn, Jeongyoun, 2021. "Subspace rotations for high-dimensional outlier detection," Journal of Multivariate Analysis, Elsevier, vol. 183(C).
    20. David T. Frazier & Ruben Loaiza-Maya & Gael M. Martin, 2021. "Variational Bayes in State Space Models: Inferential and Predictive Accuracy," Papers 2106.12262, arXiv.org, revised Feb 2022.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:84:y:2022:i:5:p:1640-1665. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.