IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-05495397.html

Nonsmooth implicit differentiation for machine learning and optimization

Author

Listed:
  • Jérôme Bolte

    (TSE-R - Toulouse School of Economics - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement)

  • Tam Le

    (TSE-R - Toulouse School of Economics - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement)

  • Edouard Pauwels

    (IRIT - Institut de recherche en informatique de Toulouse - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - UT2J - Université Toulouse - Jean Jaurès - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - UT3 - Université Toulouse III - Paul Sabatier - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - CNRS - Centre National de la Recherche Scientifique - Toulouse INP - Institut National Polytechnique (Toulouse) - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - TMBI - Toulouse Mind & Brain Institut - UT2J - Université Toulouse - Jean Jaurès - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - UT3 - Université Toulouse III - Paul Sabatier - Comue de Toulouse - Communauté d'universités et établissements de Toulouse, CNRS - Centre National de la Recherche Scientifique)

  • Antonio Silveti Falls

    (TSE-R - Toulouse School of Economics - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement)

Abstract

In view of training increasingly complex learning architectures, we establish a nonsmooth implicit function theorem with an operational calculus. Our result applies to most practical problems (i.e., definable problems) provided that a nonsmooth form of the classical invertibility condition is fulfilled. This approach allows for formal subdifferentiation: for instance, replacing derivatives by Clarke Jacobians in the usual differentiation formulas is fully justified for a wide class of nonsmooth problems. Moreover this calculus is entirely compatible with algorithmic differentiation (e.g., backpropagation). We provide several applications such as training deep equilibrium networks, training neural nets with conic optimization layers, or hyperparameter-tuning for nonsmooth Lasso-type models. To show the sharpness of our assumptions, we present numerical experiments showcasing the extremely pathological gradient dynamics one can encounter when applying implicit algorithmic differentiation without any hypothesis.

Suggested Citation

  • Jérôme Bolte & Tam Le & Edouard Pauwels & Antonio Silveti Falls, 2021. "Nonsmooth implicit differentiation for machine learning and optimization," Post-Print hal-05495397, HAL.
  • Handle: RePEc:hal:journl:hal-05495397
    DOI: 10.48550/arXiv.2106.04350
    Note: View the original document on HAL open archive server: https://hal.science/hal-05495397v1
    as

    Download full text from publisher

    File URL: https://hal.science/hal-05495397v1/document
    Download Restriction: no

    File URL: https://libkey.io/10.48550/arXiv.2106.04350?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Hédy Attouch & Jérôme Bolte & Patrick Redont & Antoine Soubeyran, 2010. "Proximal Alternating Minimization and Projection Methods for Nonconvex Problems: An Approach Based on the Kurdyka-Łojasiewicz Inequality," Mathematics of Operations Research, INFORMS, vol. 35(2), pages 438-457, May.
    2. J. B. Hiriart-Urruty, 1979. "Tangent Cones, Generalized Gradients and Mathematical Programming in Banach Spaces," Mathematics of Operations Research, INFORMS, vol. 4(1), pages 79-97, February.
    3. Michel Benaim & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions II: Applications," Levine's Bibliography 784828000000000098, UCLA Department of Economics.
    4. Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions; Part II: Applications," Working Papers hal-00242974, HAL.
    5. Enzo Busseti & Walaa M. Moursi & Stephen Boyd, 2019. "Solution refinement at regular points of conic problems," Computational Optimization and Applications, Springer, vol. 74(3), pages 627-643, December.
    6. Stephen M. Robinson, 1991. "An Implicit-Function Theorem for a Class of Nonsmooth Functions," Mathematics of Operations Research, INFORMS, vol. 16(2), pages 292-309, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Siriruk, Pavee & Kotekangpoo, Apicha, 2026. "Order quantity optimization model for perishable products under continuous review (Q, r) inventory policy with stochastic demand and positive lead time," Omega, Elsevier, vol. 138(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bolte, Jérôme & Le, Tam & Pauwels, Edouard & Silveti-Falls, Antonio, 2022. "Nonsmooth Implicit Differentiation for Machine Learning and Optimization," TSE Working Papers 22-1314, Toulouse School of Economics (TSE).
    2. Mathieu Faure & Gregory Roth, 2010. "Stochastic Approximations of Set-Valued Dynamical Systems: Convergence with Positive Probability to an Attractor," Mathematics of Operations Research, INFORMS, vol. 35(3), pages 624-640, August.
    3. Giacomo Lanzani, 2025. "Dynamic Concern for Misspecification," Econometrica, Econometric Society, vol. 93(4), pages 1333-1370, July.
    4. Cason, Timothy N. & Friedman, Daniel & Hopkins, Ed, 2010. "Testing the TASP: An experimental investigation of learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 145(6), pages 2309-2331, November.
    5. Akimoto, Youhei & Auger, Anne & Hansen, Nikolaus, 2022. "An ODE method to prove the geometric convergence of adaptive stochastic algorithms," Stochastic Processes and their Applications, Elsevier, vol. 145(C), pages 269-307.
    6. Eunji Lim, 2011. "On the Convergence Rate for Stochastic Approximation in the Nonsmooth Setting," Mathematics of Operations Research, INFORMS, vol. 36(3), pages 527-537, August.
    7. Borkar, Vivek S., 2025. "Stochastic approximation with two time scales: The general case," Stochastic Processes and their Applications, Elsevier, vol. 190(C).
    8. Prasenjit Karmakar & Shalabh Bhatnagar, 2018. "Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning," Mathematics of Operations Research, INFORMS, vol. 43(1), pages 130-151, February.
    9. Saeed Hadikhanloo & Rida Laraki & Panayotis Mertikopoulos & Sylvain Sorin, 2022. "Learning in nonatomic games, part Ⅰ: Finite action spaces and population games," Post-Print hal-03767995, HAL.
    10. Leslie, David S. & Collins, E.J., 2006. "Generalised weakened fictitious play," Games and Economic Behavior, Elsevier, vol. 56(2), pages 285-298, August.
    11. Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2006. "Stochastic Approximations and Differential Inclusions, Part II: Applications," Mathematics of Operations Research, INFORMS, vol. 31(4), pages 673-695, November.
    12. Pascal Bianchi & Walid Hachem, 2016. "Dynamical Behavior of a Stochastic Forward–Backward Algorithm Using Random Monotone Operators," Journal of Optimization Theory and Applications, Springer, vol. 171(1), pages 90-120, October.
    13. Vinayaka G. Yaji & Shalabh Bhatnagar, 2020. "Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise," Mathematics of Operations Research, INFORMS, vol. 45(4), pages 1405-1444, November.
    14. Swenson, Brian & Murray, Ryan & Kar, Soummya, 2020. "Regular potential games," Games and Economic Behavior, Elsevier, vol. 124(C), pages 432-453.
    15. Andriy Zapechelnyuk, 2009. "Limit Behavior of No-regret Dynamics," Discussion Papers 21, Kyiv School of Economics.
    16. Bervoets, Sebastian & Faure, Mathieu, 2020. "Convergence in games with continua of equilibria," Journal of Mathematical Economics, Elsevier, vol. 90(C), pages 25-30.
    17. Leslie, David S. & Perkins, Steven & Xu, Zibo, 2020. "Best-response dynamics in zero-sum stochastic games," Journal of Economic Theory, Elsevier, vol. 189(C).
    18. Bervoets, Sebastian & Faure, Mathieu, 2019. "Stability in games with continua of equilibria," Journal of Economic Theory, Elsevier, vol. 179(C), pages 131-162.
    19. Sylvain Sorin, 2023. "Continuous Time Learning Algorithms in Optimization and Game Theory," Dynamic Games and Applications, Springer, vol. 13(1), pages 3-24, March.
    20. Bolte, Jérôme & Pauwels, Edouard, 2019. "Conservative set valued fields, automatic differentiation, stochastic gradient methods and deep learning," TSE Working Papers 19-1044, Toulouse School of Economics (TSE).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-05495397. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.