IDEAS home Printed from https://ideas.repec.org/a/eee/spapps/v129y2019i12p5278-5311.html
   My bibliography  Save this article

User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient

Author

Listed:
  • Dalalyan, Arnak S.
  • Karagulyan, Avetik

Abstract

In this paper, we study the problem of sampling from a given probability density function that is known to be smooth and strongly log-concave. We analyze several methods of approximate sampling based on discretizations of the (highly overdamped) Langevin diffusion and establish guarantees on its error measured in the Wasserstein-2 distance. Our guarantees improve or extend the state-of-the-art results in three directions. First, we provide an upper bound on the error of the first-order Langevin Monte Carlo (LMC) algorithm with optimized varying step-size. This result has the advantage of being horizon free (we do not need to know in advance the target precision) and to improve by a logarithmic factor the corresponding result for the constant step-size. Second, we study the case where accurate evaluations of the gradient of the log-density are unavailable, but one can have access to approximations of the aforementioned gradient. In such a situation, we consider both deterministic and stochastic approximations of the gradient and provide an upper bound on the sampling error of the first-order LMC that quantifies the impact of the gradient evaluation inaccuracies. Third, we establish upper bounds for two versions of the second-order LMC, which leverage the Hessian of the log-density. We provide non asymptotic guarantees on the sampling error of these second-order LMCs. These guarantees reveal that the second-order LMC algorithms improve on the first-order LMC in ill-conditioned settings.

Suggested Citation

  • Dalalyan, Arnak S. & Karagulyan, Avetik, 2019. "User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient," Stochastic Processes and their Applications, Elsevier, vol. 129(12), pages 5278-5311.
  • Handle: RePEc:eee:spapps:v:129:y:2019:i:12:p:5278-5311
    DOI: 10.1016/j.spa.2019.02.016
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304414918304824
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.spa.2019.02.016?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Jarner, Søren Fiig & Hansen, Ernst, 2000. "Geometric ergodicity of Metropolis algorithms," Stochastic Processes and their Applications, Elsevier, vol. 85(2), pages 341-361, February.
    2. G. O. Roberts & O. Stramer, 2002. "Langevin Diffusions and Metropolis-Hastings Algorithms," Methodology and Computing in Applied Probability, Springer, vol. 4(4), pages 337-357, December.
    3. Aurélien Alfonsi & Benjamin Jourdain & Arturo Kohatsu-Higa, 2014. "Pathwise optimal transport bounds between a one-dimensional diffusion and its Euler scheme," Post-Print hal-00727430, HAL.
    4. O. Stramer & R. L. Tweedie, 1999. "Langevin-Type Models I: Diffusions with Given Stationary Distributions and their Discretizations," Methodology and Computing in Applied Probability, Springer, vol. 1(3), pages 283-306, October.
    5. O. Stramer & R. L. Tweedie, 1999. "Langevin-Type Models II: Self-Targeting Candidates for MCMC Algorithms," Methodology and Computing in Applied Probability, Springer, vol. 1(3), pages 307-328, October.
    6. Arnak S. Dalalyan, 2017. "Theoretical guarantees for approximate sampling from smooth and log-concave densities," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(3), pages 651-676, June.
    7. Arnak Dalalyan, 2017. "Further and stronger analogy between sampling and optimization: Langevin Monte Carlo and gradient descent," Working Papers 2017-21, Center for Research in Economics and Statistics.
    8. Gareth O. Roberts & Jeffrey S. Rosenthal, 1998. "Optimal scaling of discrete approximations to Langevin diffusions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(1), pages 255-268.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Crespo, Marelys & Gadat, Sébastien & Gendre, Xavier, 2023. "Stochastic Langevin Monte Carlo for (weakly) log-concave posterior distributions," TSE Working Papers 23-1398, Toulouse School of Economics (TSE).
    2. Yang, Jun & Roberts, Gareth O. & Rosenthal, Jeffrey S., 2020. "Optimal scaling of random-walk metropolis algorithms on general target distributions," Stochastic Processes and their Applications, Elsevier, vol. 130(10), pages 6094-6132.
    3. Murray Pollock & Paul Fearnhead & Adam M. Johansen & Gareth O. Roberts, 2020. "Quasi‐stationary Monte Carlo and the ScaLE algorithm," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 82(5), pages 1167-1221, December.
    4. Sotirios Sabanis & Ying Zhang, 2020. "A fully data-driven approach to minimizing CVaR for portfolio of assets via SGLD with discontinuous updating," Papers 2007.01672, arXiv.org.
    5. Chau, Huy N. & Rásonyi, Miklós, 2022. "Stochastic Gradient Hamiltonian Monte Carlo for non-convex learning," Stochastic Processes and their Applications, Elsevier, vol. 149(C), pages 341-368.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. O. F. Christensen & J. Møller & R. P. Waagepetersen, 2001. "Geometric Ergodicity of Metropolis-Hastings Algorithms for Conditional Simulation in Generalized Linear Mixed Models," Methodology and Computing in Applied Probability, Springer, vol. 3(3), pages 309-327, September.
    2. G. O. Roberts & O. Stramer, 2002. "Langevin Diffusions and Metropolis-Hastings Algorithms," Methodology and Computing in Applied Probability, Springer, vol. 4(4), pages 337-357, December.
    3. Allassonnière, Stéphanie & Kuhn, Estelle, 2015. "Convergent stochastic Expectation Maximization algorithm with efficient sampling in high dimension. Application to deformable template model estimation," Computational Statistics & Data Analysis, Elsevier, vol. 91(C), pages 4-19.
    4. Samuel Livingstone & Giacomo Zanella, 2022. "The Barker proposal: Combining robustness and efficiency in gradient‐based MCMC," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(2), pages 496-523, April.
    5. Fort, G. & Moulines, E., 2003. "Polynomial ergodicity of Markov transition kernels," Stochastic Processes and their Applications, Elsevier, vol. 103(1), pages 57-99, January.
    6. Arnak S. Dalalyan, 2017. "Theoretical guarantees for approximate sampling from smooth and log-concave densities," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(3), pages 651-676, June.
    7. Ruben Loaiza-Maya & Didier Nibbering & Dan Zhu, 2023. "Hybrid unadjusted Langevin methods for high-dimensional latent variable models," Papers 2306.14445, arXiv.org.
    8. Denis Belomestny & Leonid Iosipoi, 2019. "Fourier transform MCMC, heavy tailed distributions and geometric ergodicity," Papers 1909.00698, arXiv.org, revised Dec 2019.
    9. Fort, Gersende & Moulines, Eric, 2000. "V-Subgeometric ergodicity for a Hastings-Metropolis algorithm," Statistics & Probability Letters, Elsevier, vol. 49(4), pages 401-410, October.
    10. Belomestny, Denis & Iosipoi, Leonid, 2021. "Fourier transform MCMC, heavy-tailed distributions, and geometric ergodicity," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 181(C), pages 351-363.
    11. Yang, Jun & Roberts, Gareth O. & Rosenthal, Jeffrey S., 2020. "Optimal scaling of random-walk metropolis algorithms on general target distributions," Stochastic Processes and their Applications, Elsevier, vol. 130(10), pages 6094-6132.
    12. Dang, Khue-Dung & Quiroz, Matias & Kohn, Robert & Tran, Minh-Ngoc & Villani, Mattias, 2019. "Hamiltonian Monte Carlo with Energy Conserving Subsampling," Working Paper Series 372, Sveriges Riksbank (Central Bank of Sweden).
    13. Kamatani, Kengo, 2020. "Random walk Metropolis algorithm in high dimension with non-Gaussian target distributions," Stochastic Processes and their Applications, Elsevier, vol. 130(1), pages 297-327.
    14. Yves F. Atchadé, 2006. "An Adaptive Version for the Metropolis Adjusted Langevin Algorithm with a Truncated Drift," Methodology and Computing in Applied Probability, Springer, vol. 8(2), pages 235-254, June.
    15. Chau, Huy N. & Rásonyi, Miklós, 2022. "Stochastic Gradient Hamiltonian Monte Carlo for non-convex learning," Stochastic Processes and their Applications, Elsevier, vol. 149(C), pages 341-368.
    16. Vincent Lemaire & Gilles Pag`es & Christian Yeo, 2023. "Swing contract pricing: with and without Neural Networks," Papers 2306.03822, arXiv.org, revised Mar 2024.
    17. Bédard, Mylène, 2017. "Hierarchical models: Local proposal variances for RWM-within-Gibbs and MALA-within-Gibbs," Computational Statistics & Data Analysis, Elsevier, vol. 109(C), pages 231-246.
    18. Jorge I. Figueroa-Zúñiga & Cristian L. Bayes & Víctor Leiva & Shuangzhe Liu, 2022. "Robust beta regression modeling with errors-in-variables: a Bayesian approach and numerical applications," Statistical Papers, Springer, vol. 63(3), pages 919-942, June.
    19. Mamatzakis, Emmanuel C. & Tsionas, Mike G., 2021. "Making inference of British household's happiness efficiency: A Bayesian latent model," European Journal of Operational Research, Elsevier, vol. 294(1), pages 312-326.
    20. Masuda, Hiroki, 2007. "Ergodicity and exponential [beta]-mixing bounds for multidimensional diffusions with jumps," Stochastic Processes and their Applications, Elsevier, vol. 117(1), pages 35-56, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:spapps:v:129:y:2019:i:12:p:5278-5311. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/505572/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.