IDEAS home Printed from https://ideas.repec.org/p/bie/wpaper/740.html
   My bibliography  Save this paper

Exploratory Optimal Stopping: A Singular Control Formulation

Author

Listed:
  • Dianetti, Jodi

    (Center for Mathematical Economics, Bielefeld University)

  • Ferrari, Giorgio

    (Center for Mathematical Economics, Bielefeld University)

  • Xu, Renyuan

    (Center for Mathematical Economics, Bielefeld University)

Abstract

This paper explores continuous-time and state-space optimal stopping problems from a reinforcement learning perspective. We begin by formulating the stopping problem using randomized stopping times, where the decision maker’s control is represented by the probability of stopping within a given time—specifically, a bounded, non-decreasing, càdlàg control process. To encourage exploration and facilitate learning, we introduce a regularized version of the problem by penalizing the performance criterion with the cumulative residual entropy of the randomized stopping time. The regularized problem takes the form of an (n + 1)-dimensional degenerate singular stochastic control with finite-fuel. We address this through the dynamic programming principle, which enables us to identify the unique optimal exploratory strategy. For the specific case of a real option problem, we derive a semi-explicit solution to the regularized problem, allowing us to assess the impact of entropy regularization and analyze the vanishing entropy limit. Finally, we propose a reinforcement learning algorithm based on policy iteration. We show both policy improvement and policy convergence results for our proposed algorithm.

Suggested Citation

  • Dianetti, Jodi & Ferrari, Giorgio & Xu, Renyuan, 2025. "Exploratory Optimal Stopping: A Singular Control Formulation," Center for Mathematical Economics Working Papers 740, Center for Mathematical Economics, Bielefeld University.
  • Handle: RePEc:bie:wpaper:740
    as

    Download full text from publisher

    File URL: https://pub.uni-bielefeld.de/download/3006178/3006179
    File Function: First Version, 2024
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bie:wpaper:740. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Bettina Weingarten (email available below). General contact details of provider: https://edirc.repec.org/data/imbiede.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.