IDEAS home Printed from https://ideas.repec.org/p/ucr/wpaper/202512.html
   My bibliography  Save this paper

Robust Reinforcement Learning under Diffusion Models for Data with Jumps

Author

Listed:
  • Chenyang Jiang
  • Donggyu Kim

    (Department of Economics, University of California Riverside)

  • Alejandra Quintos
  • Yazhen Wang

Abstract

Reinforcement Learning (RL) has proven effective in solving complex decision-making tasks across various domains, but challenges remain in continuous-time settings, particularly when state dynamics are governed by stochastic differential equations (SDEs) with jump components. In this paper, we address this challenge by introducing the Mean-Square Bipower Variation Error (MSBVE) algorithm, which enhances robustness and convergence in scenarios involving significant stochastic noise and jumps. We first revisit the Mean-Square TD Error (MSTDE) algorithm, commonly used in continuous-time RL, and highlight its limitations in handling jumps in state dynamics. The proposed MSBVE algorithm minimizes the mean-square quadratic variation error, offering improved performance over MSTDE in environments characterized by SDEs with jumps. Simulations and formal proofs demonstrate that the MSBVE algorithm reliably estimates the value function in complex settings, surpassing MSTDE's performance when faced with jump processes. These findings underscore the importance of alternative error metrics to improve the resilience and effectiveness of RL algorithms in continuous-time frameworks.

Suggested Citation

  • Chenyang Jiang & Donggyu Kim & Alejandra Quintos & Yazhen Wang, 2025. "Robust Reinforcement Learning under Diffusion Models for Data with Jumps," Working Papers 202512, University of California at Riverside, Department of Economics.
  • Handle: RePEc:ucr:wpaper:202512
    as

    Download full text from publisher

    File URL: https://economics.ucr.edu/repec/ucr/wpaper/202512.pdf
    File Function: First version, 2025
    Download Restriction: no
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ucr:wpaper:202512. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Kelvin Mac (email available below). General contact details of provider: https://edirc.repec.org/data/deucrus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.