IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v84y2022i5p1640-1665.html

General Bayesian loss function selection and the use of improper models

Author

Listed:
  • Jack Jewson
  • David Rossell

Abstract

Statisticians often face the choice between using probability models or a paradigm defined by minimising a loss function. Both approaches are useful and, if the loss can be re‐cast into a proper probability model, there are many tools to decide which model or loss is more appropriate for the observed data, in the sense of explaining the data's nature. However, when the loss leads to an improper model, there are no principled ways to guide this choice. We address this task by combining the Hyvärinen score, which naturally targets infinitesimal relative probabilities, and general Bayesian updating, which provides a unifying framework for inference on losses and models. Specifically we propose the ℋ$$ \mathscr{H} $$‐score, a general Bayesian selection criterion and prove that it consistently selects the (possibly improper) model closest to the data‐generating truth in Fisher's divergence. We also prove that an associated ℋ$$ \mathscr{H} $$‐posterior consistently learns optimal hyper‐parameters featuring in loss functions, including a challenging tempering parameter in generalised Bayesian inference. As salient examples, we consider robust regression and non‐parametric density estimation where popular loss functions define improper models for the data and hence cannot be dealt with using standard model selection tools. These examples illustrate advantages in robustness‐efficiency trade‐offs and enable Bayesian inference for kernel density estimation, opening a new avenue for Bayesian non‐parametrics.

Suggested Citation

  • Jack Jewson & David Rossell, 2022. "General Bayesian loss function selection and the use of improper models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(5), pages 1640-1665, November.
  • Handle: RePEc:bla:jorssb:v:84:y:2022:i:5:p:1640-1665
    DOI: 10.1111/rssb.12553
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssb.12553
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssb.12553?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Peter Hall & J. S. Marron & Amnon Neeman, 2005. "Geometric representation of high dimension, low sample size data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(3), pages 427-444, June.
    2. Bradley Efron, 2020. "Prediction, Estimation, and Attribution," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(530), pages 636-655, April.
    3. Lan Wang & Bo Peng & Jelena Bradic & Runze Li & Yunan Wu, 2020. "A Tuning-free Robust and Efficient Approach to High-dimensional Regression," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1700-1714, December.
    4. Filzmoser, Peter & Maronna, Ricardo & Werner, Mark, 2008. "Outlier identification in high dimensions," Computational Statistics & Data Analysis, Elsevier, vol. 52(3), pages 1694-1711, January.
    5. Vaart,A. W. van der, 2000. "Asymptotic Statistics," Cambridge Books, Cambridge University Press, number 9780521784504, August.
    6. Stephane Shao & Pierre E. Jacob & Jie Ding & Vahid Tarokh, 2019. "Bayesian Model Comparison with the Hyvärinen Score: Computation and Consistency," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 114(528), pages 1826-1837, October.
    7. P. G. Bissiri & C. C. Holmes & S. G. Walker, 2016. "A general framework for updating belief distributions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(5), pages 1103-1130, November.
    8. Takuo Matsubara & Jeremias Knoblauch & François‐Xavier Briol & Chris J. Oates, 2022. "Robust generalised Bayesian inference for intractable likelihoods," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 997-1022, July.
    9. Lan Wang & Bo Peng & Jelena Bradic & Runze Li & Yunan Wu, 2020. "Rejoinder to “A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression”," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1726-1729, December.
    10. Marco Riani & Andrea Cerioli & Francesca Torti, 2014. "On consistency factors and efficiency of robust S-estimators," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 23(2), pages 356-387, June.
    11. Jianqing Fan & Cong Ma & Kaizheng Wang, 2020. "Comment on “A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression”," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1720-1725, December.
    12. Bradley Efron, 2020. "Prediction, Estimation, and Attribution," International Statistical Review, International Statistical Institute, vol. 88(S1), pages 28-59, December.
    13. F. Giummolè & V. Mameli & E. Ruli & L. Ventura, 2019. "Objective Bayesian inference with proper scoring rules," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(3), pages 728-755, September.
    14. A. Philip Dawid & Monica Musio & Laura Ventura, 2016. "Minimum Scoring Rule Inference," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 123-138, March.
    15. Chernozhukov, Victor & Hong, Han, 2003. "An MCMC approach to classical estimation," Journal of Econometrics, Elsevier, vol. 115(2), pages 293-346, August.
    16. David Rossell & Francisco J. Rubio, 2018. "Tractable Bayesian Variable Selection: Beyond Normality," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(524), pages 1742-1758, October.
    17. Xiudi Li & Ali Shojaie, 2020. "Discussion of “A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression”," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(532), pages 1717-1719, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Akifumi Okuno, 2024. "Minimizing robust density power-based divergences for general parametric density models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 76(5), pages 851-875, October.
    2. Saroja Kumar Singh & J. F. R. Aguirre & F. R. B. Cruz, 2025. "Estimating changes in traffic intensity for Markovian finite queueing systems: a Bayesian perspective," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 16(7), pages 2467-2479, July.
    3. Mai, The Tien, 2025. "On properties of fractional posterior in generalized reduced-rank regression," Journal of Multivariate Analysis, Elsevier, vol. 210(C).
    4. Lucas Kock & David T. Frazier & Michael Stanley Smith & David J. Nott, 2026. "Bayesian Modular Inference for Copula Models with Potentially Misspecified Marginals," Papers 2603.11457, arXiv.org, revised Apr 2026.
    5. Christis Katsouris, 2023. "High Dimensional Time Series Regression Models: Applications to Statistical Learning Methods," Papers 2308.16192, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Canhong Wen & Zhenduo Li & Ruipeng Dong & Yijin Ni & Wenliang Pan, 2023. "Simultaneous Dimension Reduction and Variable Selection for Multinomial Logistic Regression," INFORMS Journal on Computing, INFORMS, vol. 35(5), pages 1044-1060, September.
    2. Toshio Honda & Wei-Ying Wu, 2026. "Sparse quantile regression via $$\ell _0$$ ℓ 0 -penalty," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 78(1), pages 141-173, February.
    3. Xinyu Fu & Mian Huang & Weixin Yao, 2024. "Semiparametric efficient estimation in high‐dimensional partial linear regression models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 51(3), pages 1259-1287, September.
    4. Xun-Jian Li & Jiajuan Liang & Guo-Liang Tian & Man-Lai Tang & Jianhua Shi, 2025. "Mean regression model for Type I generalized logistic distribution with a QLB algorithm," Statistical Papers, Springer, vol. 66(5), pages 1-42, August.
    5. Mingyang Ren & Sanguo Zhang & Junhui Wang, 2023. "Consistent estimation of the number of communities via regularized network embedding," Biometrics, The International Biometric Society, vol. 79(3), pages 2404-2416, September.
    6. Yu, Ke & Luo, Shan, 2024. "Rank-based sequential feature selection for high-dimensional accelerated failure time models with main and interaction effects," Computational Statistics & Data Analysis, Elsevier, vol. 197(C).
    7. Yuyang Liu & Pengfei Pi & Shan Luo, 2023. "A semi-parametric approach to feature selection in high-dimensional linear regression models," Computational Statistics, Springer, vol. 38(2), pages 979-1000, June.
    8. Dong, Wei & Xu, Chen & Xie, Jinhan & Tang, Niansheng, 2024. "Tuning-free sparse clustering via alternating hard-thresholding," Journal of Multivariate Analysis, Elsevier, vol. 203(C).
    9. Takuo Matsubara & Jeremias Knoblauch & François‐Xavier Briol & Chris J. Oates, 2022. "Robust generalised Bayesian inference for intractable likelihoods," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 997-1022, July.
    10. Manski, Charles F., 2023. "Probabilistic prediction for binary treatment choice: With focus on personalized medicine," Journal of Econometrics, Elsevier, vol. 234(2), pages 647-663.
    11. Yuan Liao & Anna Simoni, 2012. "Semi-parametric Bayesian Partially Identified Models based on Support Function," Papers 1212.3267, arXiv.org, revised Nov 2013.
    12. Weishampel, Anthony & Staicu, Ana-Maria & Rand, William, 2023. "Classification of social media users with generalized functional data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    13. Toru Kitagawa & Hugo Lopez & Jeff Rowley, 2022. "Stochastic Treatment Choice with Empirical Welfare Updating," Papers 2211.01537, arXiv.org, revised Feb 2023.
    14. Xiaohong Chen & Timothy M. Christensen & Elie Tamer, 2018. "Monte Carlo Confidence Sets for Identified Sets," Econometrica, Econometric Society, vol. 86(6), pages 1965-2018, November.
    15. Paolo Libenzio Brignoli & Alessandro Varacca & Cornelis Gardebroek & Paolo Sckokai, 2024. "Machine learning to predict grains futures prices," Agricultural Economics, International Association of Agricultural Economists, vol. 55(3), pages 479-497, May.
    16. Mai, The Tien, 2025. "On properties of fractional posterior in generalized reduced-rank regression," Journal of Multivariate Analysis, Elsevier, vol. 210(C).
    17. Ruben Loaiza‐Maya & Gael M. Martin & David T. Frazier, 2021. "Focused Bayesian prediction," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(5), pages 517-543, August.
    18. Chung, Hee Cheol & Ahn, Jeongyoun, 2021. "Subspace rotations for high-dimensional outlier detection," Journal of Multivariate Analysis, Elsevier, vol. 183(C).
    19. David T. Frazier & Ruben Loaiza-Maya & Gael M. Martin, 2021. "Variational Bayes in State Space Models: Inferential and Predictive Accuracy," Papers 2106.12262, arXiv.org, revised Feb 2022.
    20. Mabit, Stefan E., 2026. "Mixed logit applications in travel behaviour research," Transportation Research Part A: Policy and Practice, Elsevier, vol. 203(C).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:84:y:2022:i:5:p:1640-1665. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.