IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-04943092.html

Stochastic gradient langevin dynamics for (weakly) log-concave posterior distributions

Author

Listed:
  • Marelys Crespo

    (TSE-R - Toulouse School of Economics - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement)

  • Sébastien Gadat

    (TSE-R - Toulouse School of Economics - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement)

  • Xavier Gendre

    (IMT - Institut de Mathématiques de Toulouse UMR5219 - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - INSA Toulouse - Institut National des Sciences Appliquées - Toulouse - INSA - Institut National des Sciences Appliquées - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - UT2J - Université Toulouse - Jean Jaurès - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - UT3 - Université Toulouse III - Paul Sabatier - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - CNRS - Centre National de la Recherche Scientifique)

Abstract

In this paper, we investigate a continuous time version of the Stochastic Langevin Monte Carlo method, introduced in [39], that incorporates a stochastic sampling step inside the traditional overdamped Langevin diffusion. This method is popular in machine learning for sampling posterior distribution. We will pay specific attention in our work to the computational cost in terms of n (the number of observations that produces the posterior distribution), and d (the dimension of the ambient space where the parameter of interest is living). We derive our analysis in the weakly convex framework, which is parameterized with the help of the Kurdyka- Lojasiewicz (KL) inequality, that permits to handle a vanishing curvature settings, which is far less restrictive when compared to the simple strongly convex case. We establish that the final horizon of simulation to obtain an ε approximation (in terms of entropy) is of the order (d log(n)²)(1+r)² [log²(ε−1) + n²d²(1+r) log4(1+r)(n)] with a Poissonian subsampling of parameter n(d log²(n))1+r)−1, where the parameter r is involved in the KL inequality and varies between 0 (strongly convex case) and 1 (limiting Laplace situation).

Suggested Citation

  • Marelys Crespo & Sébastien Gadat & Xavier Gendre, 2024. "Stochastic gradient langevin dynamics for (weakly) log-concave posterior distributions," Post-Print hal-04943092, HAL.
  • Handle: RePEc:hal:journl:hal-04943092
    DOI: 10.1214/24-EJP1235
    Note: View the original document on HAL open archive server: https://hal.science/hal-04943092v1
    as

    Download full text from publisher

    File URL: https://hal.science/hal-04943092v1/document
    Download Restriction: no

    File URL: https://libkey.io/10.1214/24-EJP1235?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gadat, Sébastien & Panloup, Fabien & Pellegrini, C., 2020. "On the cost of Bayesian posterior mean strategy for log-concave models," TSE Working Papers 20-1155, Toulouse School of Economics (TSE), revised Feb 2022.
    2. Dalalyan, Arnak S. & Karagulyan, Avetik, 2019. "User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient," Stochastic Processes and their Applications, Elsevier, vol. 129(12), pages 5278-5311.
    3. Mee Young Park & Trevor Hastie, 2007. "L1‐regularization path algorithm for generalized linear models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(4), pages 659-677, September.
    4. Sébastien Gadat & Ioana Gavra & Laurent Risser, 2018. "How to Calculate the Barycenter of a Weighted Graph," Mathematics of Operations Research, INFORMS, vol. 43(4), pages 1085-1118, November.
    5. Arnak S. Dalalyan, 2017. "Theoretical guarantees for approximate sampling from smooth and log-concave densities," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(3), pages 651-676, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Crespo, Marelys & Gadat, Sébastien & Gendre, Xavier, 2023. "Stochastic Langevin Monte Carlo for (weakly) log-concave posterior distributions," TSE Working Papers 23-1398, Toulouse School of Economics (TSE).
    2. Yang, Jun & Roberts, Gareth O. & Rosenthal, Jeffrey S., 2020. "Optimal scaling of random-walk metropolis algorithms on general target distributions," Stochastic Processes and their Applications, Elsevier, vol. 130(10), pages 6094-6132.
    3. Yu, Lu & Dalalyan, Arnak, 2025. "Parallelized midpoint randomization for Langevin Monte Carlo," Stochastic Processes and their Applications, Elsevier, vol. 190(C).
    4. Chau, Huy N. & Rásonyi, Miklós, 2022. "Stochastic Gradient Hamiltonian Monte Carlo for non-convex learning," Stochastic Processes and their Applications, Elsevier, vol. 149(C), pages 341-368.
    5. Niklas Giesa & Stefan Haufe & Mario Menk & Björn Weiß & Claudia D Spies & Sophie K Piper & Felix Balzer & Sebastian D Boie, 2024. "Predicting postoperative delirium assessed by the Nursing Screening Delirium Scale in the recovery room for non-cardiac surgeries without craniotomy: A retrospective study using a machine learning approach," PLOS Digital Health, Public Library of Science, vol. 3(8), pages 1-21, August.
    6. Wai-Kay Seto & Chun-Fan Lee & Ching-Lung Lai & Philip P C Ip & Daniel Yee-Tak Fong & James Fung & Danny Ka-Ho Wong & Man-Fung Yuen, 2011. "A New Model Using Routinely Available Clinical Parameters to Predict Significant Liver Fibrosis in Chronic Hepatitis B," PLOS ONE, Public Library of Science, vol. 6(8), pages 1-7, August.
    7. Bartosz Uniejewski, 2024. "Regularization for electricity price forecasting," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 34(3), pages 267-286.
    8. Tengyuan Liang & Weijie J. Su, 2019. "Statistical inference for the population landscape via moment‐adjusted stochastic gradients," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 81(2), pages 431-456, April.
    9. Dalalyan, Arnak S. & Karagulyan, Avetik, 2019. "User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient," Stochastic Processes and their Applications, Elsevier, vol. 129(12), pages 5278-5311.
    10. Loaiza-Maya, Rubén & Nibbering, Didier & Zhu, Dan, 2024. "Hybrid unadjusted Langevin methods for high-dimensional latent variable models," Journal of Econometrics, Elsevier, vol. 241(2).
    11. Bian, Yuan & Yi, Grace Y. & He, Wenqing, 2024. "A unified framework of analyzing missing data and variable selection using regularized likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 194(C).
    12. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    13. Zhengyong Jiang & Jeyan Thiayagalingam & Jionglong Su & Jinjun Liang, 2023. "CAD: Clustering And Deep Reinforcement Learning Based Multi-Period Portfolio Management Strategy," Papers 2310.01319, arXiv.org.
    14. Menz, Georg & Schlichting, André & Tang, Wenpin & Wu, Tianqi, 2022. "Ergodicity of the infinite swapping algorithm at low temperature," Stochastic Processes and their Applications, Elsevier, vol. 151(C), pages 519-552.
    15. Antonello D’Ambra & Pietro Amenta & Antonio Lucadamo, 2025. "Multivariate approaches to investigate the home and away behavior of football teams playing football matches," Computational Statistics, Springer, vol. 40(4), pages 1779-1799, April.
    16. Bally, Vlad & Qin, Yifeng, 2024. "Approximation for the invariant measure with applications for jump processes (convergence in total variation distance)," Stochastic Processes and their Applications, Elsevier, vol. 176(C).
    17. Yanis Tazi & Juan E. Arango-Ossa & Yangyu Zhou & Elsa Bernard & Ian Thomas & Amanda Gilkes & Sylvie Freeman & Yoann Pradat & Sean J. Johnson & Robert Hills & Richard Dillon & Max F. Levine & Daniel Le, 2022. "Unified classification and risk-stratification in Acute Myeloid Leukemia," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    18. Arnak Dalalyan, 2017. "Further and stronger analogy between sampling and optimization: Langevin Monte Carlo and gradient descent," Working Papers 2017-21, Center for Research in Economics and Statistics.
    19. Hiroki Saishu & Kota Kudo & Yuichi Takano, 2024. "Cutting-plane algorithm for estimation of sparse Cox proportional hazards models," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(1), pages 57-82, April.
    20. Denis Belomestny & Leonid Iosipoi, 2019. "Fourier transform MCMC, heavy tailed distributions and geometric ergodicity," Papers 1909.00698, arXiv.org, revised Dec 2019.

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-04943092. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.