Finite-Time High-Probability Bounds for Polyak–Ruppert Averaged Iterates of Linear Stochastic Approximation
Author
Abstract
Suggested Citation
DOI: 10.1287/moor.2022.0179
Download full text from publisher
References listed on IDEAS
- Jalaj Bhandari & Daniel Russo & Raghav Singal, 2021. "A Finite Time Analysis of Temporal Difference Learning with Linear Function Approximation," Operations Research, INFORMS, vol. 69(3), pages 950-973, May.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Shuze Chen & David Simchi-Levi & Chonghuan Wang, 2024. "Improving the Estimation of Lifetime Effects in A/B Testing via Treatment Locality," Papers 2407.19618, arXiv.org, revised Sep 2025.
- Gen Li & Changxiao Cai & Yuxin Chen & Yuting Wei & Yuejie Chi, 2024. "Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis," Operations Research, INFORMS, vol. 72(1), pages 222-236, January.
- Wenlong Mou & Ashwin Pananjady & Martin J. Wainwright, 2023. "Optimal Oracle Inequalities for Projected Fixed-Point Equations, with Applications to Policy Evaluation," Mathematics of Operations Research, INFORMS, vol. 48(4), pages 2308-2336, November.
- Zhu, Jin & Wan, Runzhe & Qi, Zhengling & Luo, Shikai & Shi, Chengchun, 2024. "Robust offline reinforcement learning with heavy-tailed rewards," LSE Research Online Documents on Economics 122740, London School of Economics and Political Science, LSE Library.
More about this item
Keywords
; ; ; ; ;JEL classification:
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormoor:v:50:y:2025:i:2:p:935-964. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.
Printed from https://ideas.repec.org/a/inm/ormoor/v50y2025i2p935-964.html