Exploratory Optimal Stopping: A Singular Control Formulation

Author

Listed:

Dianetti, Jodi
(Center for Mathematical Economics, Bielefeld University)
Ferrari, Giorgio
(Center for Mathematical Economics, Bielefeld University)
Xu, Renyuan
(Center for Mathematical Economics, Bielefeld University)

Abstract

This paper studies optimal stopping problems in continuous time and continuous state space from a reinforcement learning perspective. We begin by formulating the stopping problem using randomized stopping times, where the decision maker's control is the probability of stopping within a given time, represented by a bounded, non-decreasing, càdlàg control process. To encourage exploration and facilitate learning, we introduce a regularized version of the problem by penalizing the performance criterion with the cumulative residual entropy of the randomized stopping time. The regularized problem takes the form of an (n + 1)-dimensional degenerate singular stochastic control problem with finite fuel. We address this through the dynamic programming principle, which enables us to identify the unique optimal exploratory strategy. For the specific case of a real option problem, we derive a semi-explicit solution to the regularized problem, allowing us to assess the impact of entropy regularization and analyze the vanishing entropy limit. Finally, we propose a reinforcement learning algorithm based on policy iteration and show both policy improvement and policy convergence results for it.
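
As a rough illustration of the regularizer named in the abstract, the sketch below numerically evaluates the standard cumulative residual entropy, -∫ S(t) log S(t) dt, where S(t) is the probability of not having stopped by time t (one minus the stopping probability). This is the textbook definition only; the paper's own penalty may include discounting or other weights not reproduced here. The function name `cumulative_residual_entropy`, the truncation horizon `t_max`, and the two example stopping rules are assumptions made for illustration and are not taken from the paper.

```python
import math


def cumulative_residual_entropy(survival, t_max, n_steps=100_000):
    """Trapezoid-rule approximation of -∫_0^{t_max} S(t) log S(t) dt.

    `survival(t)` is the probability of not having stopped by time t.
    The horizon is truncated at `t_max` (an assumption of this sketch).
    """
    def integrand(t):
        s = survival(t)
        # Convention: 0 * log(0) = 0.
        return -s * math.log(s) if s > 0.0 else 0.0

    dt = t_max / n_steps
    total = 0.5 * (integrand(0.0) + integrand(t_max))
    for k in range(1, n_steps):
        total += integrand(k * dt)
    return total * dt


# Hypothetical randomized rule: stop at an exponential time with this rate.
rate = 2.0
cre_randomized = cumulative_residual_entropy(
    lambda t: math.exp(-rate * t), t_max=40.0
)

# Hypothetical non-randomized rule: stop with certainty at time 1.
cre_deterministic = cumulative_residual_entropy(
    lambda t: 1.0 if t < 1.0 else 0.0, t_max=5.0
)

print(cre_randomized)     # close to 1 / rate = 0.5
print(cre_deterministic)  # zero: no randomization, no entropy
```

A stopping rule that never randomizes has survival probability equal to 0 or 1 at every time, so its cumulative residual entropy vanishes; only genuinely randomized rules carry positive entropy, which is the quantity an exploration-encouraging regularizer of this kind acts on.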

Suggested Citation

Dianetti, Jodi & Ferrari, Giorgio & Xu, Renyuan, 2025. "Exploratory Optimal Stopping: A Singular Control Formulation," Center for Mathematical Economics Working Papers 740, Center for Mathematical Economics, Bielefeld University.

Handle: RePEc:bie:wpaper:740

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Jodi Dianetti & Giorgio Ferrari & Renyuan Xu, 2024. "Exploratory Optimal Stopping: A Singular Control Formulation," Papers 2408.09335, arXiv.org, revised Oct 2024.
Giorgio Ferrari & Hanwu Li & Frank Riedel, 2020. "A Knightian Irreversible Investment Problem," Papers 2003.14359, arXiv.org, revised Apr 2020.
- Ferrari, Giorgio & Li, Hanwu & Riedel, Frank, 2020. "A Knightian Irreversible Investment Problem," Center for Mathematical Economics Working Papers 634, Center for Mathematical Economics, Bielefeld University.
Andrea Bovo & Tiziano De Angelis & Jan Palczewski, 2023. "Zero-sum stopper vs. singular-controller games with constrained control directions," Papers 2306.05113, arXiv.org, revised Feb 2024.
Andrea Bovo & Tiziano De Angelis & Jan Palczewski, 2023. "Stopper vs. singular-controller games with degenerate diffusions," Papers 2312.00613, arXiv.org, revised Jul 2024.
de Angelis, Tiziano & Ferrari, Giorgio, 2014. "A Stochastic Reversible Investment Problem on a Finite-Time Horizon: Free Boundary Analysis," Center for Mathematical Economics Working Papers 477, Center for Mathematical Economics, Bielefeld University.
Dianetti, Jodi & Ferrari, Giorgio, 2023. "Multidimensional singular control and related Skorokhod problem: Sufficient conditions for the characterization of optimal controls," Stochastic Processes and their Applications, Elsevier, vol. 162(C), pages 547-592.
Junkee Jeon & Geonwoo Kim, 2020. "An Integral Equation Approach to the Irreversible Investment Problem with a Finite Horizon," Mathematics, MDPI, vol. 8(11), pages 1-10, November.
Ferrari, Giorgio, 2025. "On the Optimal Management of Public Debt: a Singular Stochastic Control Problem," Center for Mathematical Economics Working Papers 709, Center for Mathematical Economics, Bielefeld University.
Aïd, René & Basei, Matteo & Ferrari, Giorgio, 2023. "A Stationary Mean-Field Equilibrium Model of Irreversible Investment in a Two-Regime Economy," Center for Mathematical Economics Working Papers 679, Center for Mathematical Economics, Bielefeld University.
René Aid & Matteo Basei & Giorgio Ferrari, 2023. "A Stationary Mean-Field Equilibrium Model of Irreversible Investment in a Two-Regime Economy," Papers 2305.00541, arXiv.org.
Dianetti, Jodi & Ferrari, Giorgio, 2021. "Multidimensional Singular Control and Related Skorokhod Problem: Sufficient Conditions for the Characterization of Optimal Controls," Center for Mathematical Economics Working Papers 645, Center for Mathematical Economics, Bielefeld University.
Salvatore Federico & Mauro Rosestolato & Elisa Tacconi, 2018. "Irreversible investment with fixed adjustment costs: a stochastic impulse control approach," Papers 1801.04491, arXiv.org, revised Feb 2019.
Kexin Chen & Kyunghyun Park & Hoi Ying Wong, 2024. "Robust dividend policy: Equivalence of Epstein-Zin and Maenhout preferences," Papers 2406.12305, arXiv.org, revised Oct 2025.
Ferrari, Giorgio & Riedel, Frank & Steg, Jan-Henrik, 2016. "Continuous-Time Public Good Contribution under Uncertainty," Center for Mathematical Economics Working Papers 485, Center for Mathematical Economics, Bielefeld University.
Steg, Jan-Henrik, 2018. "Preemptive investment under uncertainty," Games and Economic Behavior, Elsevier, vol. 110(C), pages 90-119.
- Jan-Henrik Steg, 2015. "Preemptive Investment under Uncertainty," Papers 1511.03863, arXiv.org, revised May 2016.
- Steg, Jan-Henrik, 2015. "Preemptive Investment under Uncertainty," Center for Mathematical Economics Working Papers 549, Center for Mathematical Economics, Bielefeld University.
Dammann, Felix & Ferrari, Giorgio, 2022. "Optimal Execution with Multiplicative Price Impact and Incomplete Information on the Return," Center for Mathematical Economics Working Papers 663, Center for Mathematical Economics, Bielefeld University.
Georgiadis, George & Kim, Youngsoo & Kwon, H. Dharma, 2022. "The absence of attrition in a war of attrition under complete information," Games and Economic Behavior, Elsevier, vol. 131(C), pages 171-185.
Peter Bank & Yan Dolinsky, 2018. "Continuous-time Duality for Super-replication with Transient Price Impact," Papers 1808.09807, arXiv.org, revised May 2019.
Chiarolla, Maria B. & Ferrari, Giorgio & Stabile, Gabriele, 2015. "Optimal dynamic procurement policies for a storable commodity with Lévy prices and convex holding costs," European Journal of Operational Research, Elsevier, vol. 247(3), pages 847-858.
- Maria B. Chiarolla & Giorgio Ferrari & Gabriele Stabile, 2014. "Optimal Dynamic Procurement Policies for a Storable Commodity with Lévy Prices and Convex Holding Costs," Papers 1409.0665, arXiv.org, revised Jun 2015.
Joffrey Derchu & Philippe Guillot & Thibaut Mastrolia & Mathieu Rosenbaum, 2020. "AHEAD : Ad-Hoc Electronic Auction Design," Papers 2010.02827, arXiv.org.
