IDEAS home Printed from https://ideas.repec.org/a/bla/jorssb/v81y2019i2p235-269.html
   My bibliography  Save this article

Approximate Bayesian computation with the Wasserstein distance

Author

Listed:
  • Espen Bernton
  • Pierre E. Jacob
  • Mathieu Gerber
  • Christian P. Robert

Abstract

A growing number of generative statistical models do not permit the numerical evaluation of their likelihood functions. Approximate Bayesian computation has become a popular approach to overcome this issue, in which one simulates synthetic data sets given parameters and compares summaries of these data sets with the corresponding observed values. We propose to avoid the use of summaries and the ensuing loss of information by instead using the Wasserstein distance between the empirical distributions of the observed and synthetic data. This generalizes the well‐known approach of using order statistics within approximate Bayesian computation to arbitrary dimensions. We describe how recently developed approximations of the Wasserstein distance allow the method to scale to realistic data sizes, and we propose a new distance based on the Hilbert space filling curve. We provide a theoretical study of the method proposed, describing consistency as the threshold goes to 0 while the observations are kept fixed, and concentration properties as the number of observations grows. Various extensions to time series data are discussed. The approach is illustrated on various examples, including univariate and multivariate g‐and‐k distributions, a toggle switch model from systems biology, a queuing model and a Lévy‐driven stochastic volatility model.

Suggested Citation

  • Espen Bernton & Pierre E. Jacob & Mathieu Gerber & Christian P. Robert, 2019. "Approximate Bayesian computation with the Wasserstein distance," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 235-269, April.
  • Handle: RePEc:bla:jorssb:v:81:y:2019:i:2:p:235-269
    DOI: 10.1111/rssb.12312
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssb.12312
    Download Restriction: no

    File URL: https://libkey.io/10.1111/rssb.12312?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Paul Fearnhead & Dennis Prangle, 2012. "Constructing summary statistics for approximate Bayesian computation: semi-automatic approximate Bayesian computation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 74(3), pages 419-474, June.
    2. Christophe Andrieu & Arnaud Doucet & Roman Holenstein, 2010. "Particle Markov chain Monte Carlo methods," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 72(3), pages 269-342, June.
    3. Anthony Lee & Krzysztof Łatuszyński, 2014. "Variance bounding and geometric ergodicity of Markov chain Monte Carlo kernels for approximate Bayesian computation," Biometrika, Biometrika Trust, vol. 101(3), pages 655-671.
    4. Ole E. Barndorff‐Nielsen & Neil Shephard, 2002. "Econometric analysis of realized volatility and its use in estimating stochastic volatility models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(2), pages 253-280, May.
    5. Wentao Li & Paul Fearnhead, 2018. "On the asymptotic efficiency of approximate Bayesian computation estimators," Biometrika, Biometrika Trust, vol. 105(2), pages 285-299.
    6. N. Chopin & P. E. Jacob & O. Papaspiliopoulos, 2013. "SMC-super-2: an efficient algorithm for sequential analysis of state space models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 75(3), pages 397-426, June.
    7. Aude Geneway & Gabriel Peyré & Marco Cuturi, 2017. "Learning Generative Models with Sinkhorn Divergences," Working Papers 2017-83, Center for Research in Economics and Statistics.
    8. Drovandi, Christopher C. & Pettitt, Anthony N., 2011. "Likelihood-free Bayesian estimation of multivariate quantile distributions," Computational Statistics & Data Analysis, Elsevier, vol. 55(9), pages 2541-2556, September.
    9. D T Frazier & G M Martin & C P Robert & J Rousseau, 2018. "Asymptotic properties of approximate Bayesian computation," Biometrika, Biometrika Trust, vol. 105(3), pages 593-607.
    10. repec:dau:papers:123456789/7305 is not listed on IDEAS
    11. repec:dau:papers:123456789/5724 is not listed on IDEAS
    12. Max Sommerfeld & Axel Munk, 2018. "Inference for empirical Wasserstein distances on finite spaces," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(1), pages 219-238, January.
    13. Bonassi Fernando V. & You Lingchong & West Mike, 2011. "Bayesian Learning from Marginal Data in Bionetwork Models," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 10(1), pages 1-27, October.
    14. Carsten Gottschlich & Dominic Schuhmacher, 2014. "The Shortlist Method for Fast Computation of the Earth Mover's Distance and Finding Optimal Solutions to Transportation Problems," PLOS ONE, Public Library of Science, vol. 9(10), pages 1-10, October.
    15. Bassetti, Federico & Bodini, Antonella & Regazzini, Eugenio, 2006. "On minimum Kantorovich distance estimators," Statistics & Probability Letters, Elsevier, vol. 76(12), pages 1298-1302, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Arbel, Julyan & Crispino, Marta & Girard, Stéphane, 2019. "Dependence properties and Bayesian inference for asymmetric multivariate copulas," Journal of Multivariate Analysis, Elsevier, vol. 174(C).
    2. Pierre-Olivier Goffard & Patrick Laub, 2021. "Approximate Bayesian Computations to fit and compare insurance loss models," Post-Print hal-02891046, HAL.
    3. Joel Dyer & Patrick Cannon & J. Doyne Farmer & Sebastian Schmon, 2022. "Black-box Bayesian inference for economic agent-based models," Papers 2202.00625, arXiv.org.
    4. Goffard, Pierre-Olivier & Laub, Patrick J., 2021. "Approximate Bayesian Computations to fit and compare insurance loss models," Insurance: Mathematics and Economics, Elsevier, vol. 100(C), pages 350-371.
    5. Anthony Ebert & Ritabrata Dutta & Kerrie Mengersen & Antonietta Mira & Fabrizio Ruggeri & Paul Wu, 2021. "Likelihood‐free parameter estimation for dynamic queueing networks: Case study of passenger flow in an international airport terminal," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(3), pages 770-792, June.
    6. Lorenzo Pacchiardi & Pierre Künzli & Marcel Schöngens & Bastien Chopard & Ritabrata Dutta, 2021. "Distance-learning For Approximate Bayesian Computation To Model a Volcanic Eruption," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 288-317, May.
    7. Anthony Ebert & Kerrie Mengersen & Fabrizio Ruggeri & Paul Wu, 2021. "Curve Registration of Functional Data for Approximate Bayesian Computation," Stats, MDPI, vol. 4(3), pages 1-14, September.
    8. Mathias Silva, 2023. "Parametric models of income distributions integrating misreporting and non-response mechanisms," AMSE Working Papers 2311, Aix-Marseille School of Economics, France.
    9. Ninna Vihrs & Jesper Møller & Alan E. Gelfand, 2022. "Approximate Bayesian inference for a spatial point process model exhibiting regularity and random aggregation," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(1), pages 185-210, March.
    10. Jonathan U Harrison & Ruth E Baker, 2020. "An automatic adaptive method to combine summary statistics in approximate Bayesian computation," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-21, August.
    11. Perepolkin, Dmytro & Goodrich, Benjamin & Sahlin, Ullrika, 2023. "The tenets of quantile-based inference in Bayesian models," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    12. Gael M. Martin & David T. Frazier & Christian P. Robert, 2021. "Approximating Bayes in the 21st Century," Monash Econometrics and Business Statistics Working Papers 24/21, Monash University, Department of Econometrics and Business Statistics.
    13. David T. Frazier, 2020. "Robust and Efficient Approximate Bayesian Computation: A Minimum Distance Approach," Papers 2006.14126, arXiv.org.
    14. Perepolkin, Dmytro & Goodrich, Benjamin & Sahlin, Ullrika, 2021. "The tenets of indirect inference in Bayesian models," OSF Preprints enzgs, Center for Open Science.
    15. Cecilia Viscardi & Michele Boreale & Fabio Corradi, 2021. "Weighted approximate Bayesian computation via Sanov’s theorem," Computational Statistics, Springer, vol. 36(4), pages 2719-2753, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ajay Jasra, 2015. "Approximate Bayesian Computation for a Class of Time Series Models," International Statistical Review, International Statistical Institute, vol. 83(3), pages 405-435, December.
    2. Gael M. Martin & David T. Frazier & Christian P. Robert, 2020. "Computing Bayes: Bayesian Computation from 1763 to the 21st Century," Monash Econometrics and Business Statistics Working Papers 14/20, Monash University, Department of Econometrics and Business Statistics.
    3. Piotr Szczepocki, 2020. "Application of iterated filtering to stochastic volatility models based on non-Gaussian Ornstein-Uhlenbeck process," Statistics in Transition New Series, Polish Statistical Association, vol. 21(2), pages 173-187, June.
    4. George Karabatsos, 2023. "Approximate Bayesian computation using asymptotically normal point estimates," Computational Statistics, Springer, vol. 38(2), pages 531-568, June.
    5. Virbickaitė, Audronė & Frey, Christoph & Macedo, Demian N., 2020. "Bayesian sequential stock return prediction through copulas," The Journal of Economic Asymmetries, Elsevier, vol. 22(C).
    6. Henri Pesonen & Umberto Simola & Alvaro Köhn‐Luque & Henri Vuollekoski & Xiaoran Lai & Arnoldo Frigessi & Samuel Kaski & David T. Frazier & Worapree Maneesoonthorn & Gael M. Martin & Jukka Corander, 2023. "ABC of the future," International Statistical Review, International Statistical Institute, vol. 91(2), pages 243-268, August.
    7. Gael M. Martin & David T. Frazier & Christian P. Robert, 2021. "Approximating Bayes in the 21st Century," Monash Econometrics and Business Statistics Working Papers 24/21, Monash University, Department of Econometrics and Business Statistics.
    8. Frazier, David T. & Maneesoonthorn, Worapree & Martin, Gael M. & McCabe, Brendan P.M., 2019. "Approximate Bayesian forecasting," International Journal of Forecasting, Elsevier, vol. 35(2), pages 521-539.
    9. Virbickaitė, Audronė & Ausín, M. Concepción & Galeano, Pedro, 2020. "Copula stochastic volatility in oil returns: Approximate Bayesian computation with volatility prediction," Energy Economics, Elsevier, vol. 92(C).
    10. Lee, Xing Ju & Hainy, Markus & McKeone, James P. & Drovandi, Christopher C. & Pettitt, Anthony N., 2018. "ABC model selection for spatial extremes models applied to South Australian maximum temperature data," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 128-144.
    11. Szczepocki Piotr, 2020. "Application of iterated filtering to stochastic volatility models based on non-Gaussian Ornstein-Uhlenbeck process," Statistics in Transition New Series, Polish Statistical Association, vol. 21(2), pages 173-187, June.
    12. McKinley, Trevelyan J. & Ross, Joshua V. & Deardon, Rob & Cook, Alex R., 2014. "Simulation-based Bayesian inference for epidemic models," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 434-447.
    13. Arnaud Dufays, 2016. "Evolutionary Sequential Monte Carlo Samplers for Change-Point Models," Econometrics, MDPI, vol. 4(1), pages 1-33, March.
    14. Luis Alvarez & Cristine Pinto & Vladimir Ponczek, 2022. "Homophily in preferences or meetings? Identifying and estimating an iterative network formation model," Papers 2201.06694, arXiv.org, revised Mar 2024.
    15. Dassios, Angelos & Qu, Yan & Zhao, Hongbiao, 2018. "Exact simulation for a class of tempered stable," LSE Research Online Documents on Economics 86981, London School of Economics and Political Science, LSE Library.
    16. Li, J. & Nott, D.J. & Fan, Y. & Sisson, S.A., 2017. "Extending approximate Bayesian computation methods to high dimensions via a Gaussian copula model," Computational Statistics & Data Analysis, Elsevier, vol. 106(C), pages 77-89.
    17. Brignone, Riccardo & Gonzato, Luca & Lütkebohmert, Eva, 2023. "Efficient Quasi-Bayesian Estimation of Affine Option Pricing Models Using Risk-Neutral Cumulants," Journal of Banking & Finance, Elsevier, vol. 148(C).
    18. Naoki Awaya & Yasuhiro Omori, 2021. "Particle Rolling MCMC with Double-Block Sampling ," CIRJE F-Series CIRJE-F-1175, CIRJE, Faculty of Economics, University of Tokyo.
    19. Hirokuni Iiboshi & Mototsugu Shintani & Kozo Ueda, 2022. "Estimating a Nonlinear New Keynesian Model with the Zero Lower Bound for Japan," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 54(6), pages 1637-1671, September.
    20. D T Frazier & G M Martin & C P Robert & J Rousseau, 2018. "Asymptotic properties of approximate Bayesian computation," Biometrika, Biometrika Trust, vol. 105(3), pages 593-607.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:jorssb:v:81:y:2019:i:2:p:235-269. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: https://edirc.repec.org/data/rssssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.