IDEAS home Printed from https://ideas.repec.org/a/inm/ormoor/v50y2025i1p40-67.html
   My bibliography  Save this article

Towards Optimal Problem Dependent Generalization Error Bounds in Statistical Learning Theory

Author

Listed:
  • Yunbei Xu

    (Decision, Risk, and Operations Division, Graduate School of Business, Columbia University, New York, New York 10027)

  • Assaf Zeevi

    (Decision, Risk, and Operations Division, Graduate School of Business, Columbia University, New York, New York 10027)

Abstract

We study problem-dependent rates, that is, generalization errors that scale near-optimally with the variance, effective loss, or gradient norms evaluated at the “best hypothesis.” We introduce a principled framework dubbed “uniform localized convergence” and characterize sharp problem-dependent rates for central statistical learning problems. From a methodological viewpoint, our framework resolves several fundamental limitations of existing uniform convergence and localization analysis approaches. It also provides improvements and some level of unification in the study of localized complexities, one-sided uniform inequalities, and sample-based iterative algorithms. In the so-called “slow rate” regime, we provide the first (moment-penalized) estimator that achieves the optimal variance-dependent rate for general “rich” classes; we also establish an improved loss-dependent rate for standard empirical risk minimization. In the “fast rate” regime, we establish finite-sample, problem-dependent bounds that are comparable to precise asymptotics. In addition, we show that iterative algorithms such as gradient descent and first order expectation maximization can achieve optimal generalization error in several representative problems across the areas of nonconvex learning, stochastic optimization, and learning with missing data.

Suggested Citation

  • Yunbei Xu & Assaf Zeevi, 2025. "Towards Optimal Problem Dependent Generalization Error Bounds in Statistical Learning Theory," Mathematics of Operations Research, INFORMS, vol. 50(1), pages 40-67, February.
  • Handle: RePEc:inm:ormoor:v:50:y:2025:i:1:p:40-67
    DOI: 10.1287/moor.2021.0076
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/moor.2021.0076
    Download Restriction: no

    File URL: https://libkey.io/10.1287/moor.2021.0076?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormoor:v:50:y:2025:i:1:p:40-67. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.