IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v53y2009i12p4028-4045.html
   My bibliography  Save this article

Estimating Bayes factors via thermodynamic integration and population MCMC

Author

Listed:
  • Calderhead, Ben
  • Girolami, Mark

Abstract

A Bayesian approach to model comparison based on the integrated or marginal likelihood is considered, and applications to linear regression models and nonlinear ordinary differential equation (ODE) models are used as the setting in which to elucidate and further develop existing statistical methodology. The focus is on two methods of marginal likelihood estimation. First, a statistical failure of the widely employed Posterior Harmonic Mean estimator is highlighted. It is demonstrated that there is a systematic bias capable of significantly skewing Bayes factor estimates, which has not previously been highlighted in the literature. Second, a detailed study of the recently proposed Thermodynamic Integral estimator is presented, which characterises the error associated with its discrete form. An experimental study using analytically tractable linear regression models highlights substantial differences with recently published results regarding optimal discretisation. Finally, with the insights gained, it is demonstrated how Population MCMC and thermodynamic integration methods may be elegantly combined to estimate Bayes factors accurately enough to discriminate between nonlinear models based on systems of ODEs, which has important application in describing the behaviour of complex processes arising in a wide variety of research areas, such as Systems Biology, Computational Ecology and Chemical Engineering.

Suggested Citation

  • Calderhead, Ben & Girolami, Mark, 2009. "Estimating Bayes factors via thermodynamic integration and population MCMC," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4028-4045, October.
  • Handle: RePEc:eee:csdana:v:53:y:2009:i:12:p:4028-4045
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(09)00272-2
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Pierre Del Moral & Arnaud Doucet & Ajay Jasra, 2006. "Sequential Monte Carlo samplers," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(3), pages 411-436, June.
    2. N. Friel & A. N. Pettitt, 2008. "Marginal likelihood estimation via power posteriors," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(3), pages 589-607, July.
    3. El Adlouni, Salaheddine & Favre, Anne-Catherine & Bobee, Bernard, 2006. "Comparison of methodologies to assess the convergence of Markov chain Monte Carlo methods," Computational Statistics & Data Analysis, Elsevier, vol. 50(10), pages 2685-2701, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Marco Grzegorczyk & Andrej Aderhold & Dirk Husmeier, 2017. "Targeting Bayes factors with direct-path non-equilibrium thermodynamic integration," Computational Statistics, Springer, vol. 32(2), pages 717-761, June.
    2. Will Penny & Biswa Sengupta, 2016. "Annealed Importance Sampling for Neural Mass Models," PLOS Computational Biology, Public Library of Science, vol. 12(3), pages 1-25, March.
    3. Birgir Hrafnkelsson & Helgi Sigurdarson & Sölvi Rögnvaldsson & Axel Örn Jansson & Rafael Daníel Vias & Sigurdur M. Gardarsson, 2022. "Generalization of the power‐law rating curve using hydrodynamic theory and Bayesian hierarchical modeling," Environmetrics, John Wiley & Sons, Ltd., vol. 33(2), March.
    4. Golchi, Shirin & Campbell, David A., 2016. "Sequentially Constrained Monte Carlo," Computational Statistics & Data Analysis, Elsevier, vol. 97(C), pages 98-113.
    5. Zhou, Yan, 2015. "vSMC: Parallel Sequential Monte Carlo in C++," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 62(i09).
    6. Chris J. Oates & Mark Girolami & Nicolas Chopin, 2017. "Control functionals for Monte Carlo integration," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(3), pages 695-718, June.
    7. Luigi Spezia & Andy Vinten & Roberta Paroli & Marc Stutter, 2021. "An evolutionary Monte Carlo method for the analysis of turbidity high‐frequency time series through Markov switching autoregressive models," Environmetrics, John Wiley & Sons, Ltd., vol. 32(8), December.
    8. Spezia, Luigi, 2020. "Bayesian variable selection in non-homogeneous hidden Markov models through an evolutionary Monte Carlo method," Computational Statistics & Data Analysis, Elsevier, vol. 143(C).
    9. Filippone, Maurizio & Sanguinetti, Guido, 2011. "Approximate inference of the bandwidth in multivariate kernel density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3104-3122, December.
    10. Liu Baisen & Wang Liangliang & Cao Jiguo, 2018. "Bayesian estimation of ordinary differential equation models when the likelihood has multiple local modes," Monte Carlo Methods and Applications, De Gruyter, vol. 24(2), pages 117-127, June.
    11. Rigat, F. & Mira, A., 2012. "Parallel hierarchical sampling: A general-purpose interacting Markov chains Monte Carlo algorithm," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1450-1467.
    12. Loza-Reyes, E. & Hurn, M.A. & Robinson, A., 2014. "Classification of molecular sequence data using Bayesian phylogenetic mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 75(C), pages 81-95.
    13. Eduardo A Aponte & Dario Schöbi & Klaas E Stephan & Jakob Heinzle, 2017. "The Stochastic Early Reaction, Inhibition, and late Action (SERIA) model for antisaccades," PLOS Computational Biology, Public Library of Science, vol. 13(8), pages 1-36, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Luca Martino & Fernando Llorente & Ernesto Curbelo & Javier López-Santiago & Joaquín Míguez, 2021. "Automatic Tempered Posterior Distributions for Bayesian Inversion Problems," Mathematics, MDPI, vol. 9(7), pages 1-17, April.
    2. repec:dau:papers:123456789/5724 is not listed on IDEAS
    3. Drovandi, Christopher C. & McGree, James M. & Pettitt, Anthony N., 2013. "Sequential Monte Carlo for Bayesian sequentially designed experiments for discrete data," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 320-335.
    4. Perrakis, Konstantinos & Ntzoufras, Ioannis & Tsionas, Efthymios G., 2014. "On the use of marginal posteriors in marginal likelihood estimation via importance sampling," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 54-69.
    5. Lefebvre, Geneviève & Steele, Russell & Vandal, Alain C., 2010. "A path sampling identity for computing the Kullback-Leibler and J divergences," Computational Statistics & Data Analysis, Elsevier, vol. 54(7), pages 1719-1731, July.
    6. Mathias Drton & Martyn Plummer, 2017. "A Bayesian information criterion for singular models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(2), pages 323-380, March.
    7. Xing Ju Lee & Christopher C. Drovandi & Anthony N. Pettitt, 2015. "Model choice problems using approximate Bayesian computation with applications to pathogen transmission data sets," Biometrics, The International Biometric Society, vol. 71(1), pages 198-207, March.
    8. S. G. J. Senarathne & C. C. Drovandi & J. M. McGree, 2020. "Bayesian sequential design for Copula models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(2), pages 454-478, June.
    9. Arnaud Dufays, 2016. "Evolutionary Sequential Monte Carlo Samplers for Change-Point Models," Econometrics, MDPI, vol. 4(1), pages 1-33, March.
    10. Jeong Eun Lee & Christian Robert, 2013. "Imortance Sampling Schemes for Evidence Approximation in Mixture Models," Working Papers 2013-42, Center for Research in Economics and Statistics.
    11. James Martin & Ajay Jasra & Emma McCoy, 2013. "Inference for a class of partially observed point process models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 65(3), pages 413-437, June.
    12. Will Penny & Biswa Sengupta, 2016. "Annealed Importance Sampling for Neural Mass Models," PLOS Computational Biology, Public Library of Science, vol. 12(3), pages 1-25, March.
    13. Spezia, L. & Cooksley, S.L. & Brewer, M.J. & Donnelly, D. & Tree, A., 2014. "Modelling species abundance in a river by Negative Binomial hidden Markov models," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 599-614.
    14. Mark Bognanni & John Zito, 2019. "Sequential Bayesian Inference for Vector Autoregressions with Stochastic Volatility," Working Papers 19-29, Federal Reserve Bank of Cleveland.
    15. Vitoratou, Silia & Ntzoufras, Ioannis & Moustaki, Irini, 2016. "Explaining the behavior of joint and marginal Monte Carlo estimators in latent variable models with independence assumptions," LSE Research Online Documents on Economics 57685, London School of Economics and Political Science, LSE Library.
    16. Saifuddin Syed & Alexandre Bouchard‐Côté & George Deligiannidis & Arnaud Doucet, 2022. "Non‐reversible parallel tempering: A scalable highly parallel MCMC scheme," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(2), pages 321-350, April.
    17. Brignone, Riccardo & Gonzato, Luca & Lütkebohmert, Eva, 2023. "Efficient Quasi-Bayesian Estimation of Affine Option Pricing Models Using Risk-Neutral Cumulants," Journal of Banking & Finance, Elsevier, vol. 148(C).
    18. Lee Anthony & Caron Francois & Doucet Arnaud & Holmes Chris, 2012. "Bayesian Sparsity-Path-Analysis of Genetic Association Signal using Generalized t Priors," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(2), pages 1-31, January.
    19. Edward Herbst & Frank Schorfheide, 2014. "Sequential Monte Carlo Sampling For Dsge Models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(7), pages 1073-1098, November.
    20. Spezia, Luigi, 2020. "Bayesian variable selection in non-homogeneous hidden Markov models through an evolutionary Monte Carlo method," Computational Statistics & Data Analysis, Elsevier, vol. 143(C).
    21. Beirne, John & Villafuerte, James & Zhang, Bryan (ed.), 2022. "Fintech and COVID-19: Impacts, Challenges, and Policy Priorities for Asia," ADBI Books, Asian Development Bank Institute, number 29, Décembre.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:53:y:2009:i:12:p:4028-4045. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.