Solving the optimal stopping problem with reinforcement learning: an application in financial option exercise

My bibliography Save this paper

Solving the optimal stopping problem with reinforcement learning: an application in financial option exercise

Author

Listed:

Leonardo Kanashiro Felizardo
Elia Matsumoto
Emilio Del-Moral-Hernandez

Registered:

Abstract

The optimal stopping problem is a category of decision problems with a specific constrained configuration. It is relevant to various real-world applications such as finance and management. To solve the optimal stopping problem, state-of-the-art algorithms in dynamic programming, such as the least-squares Monte Carlo (LSMC), are employed. This type of algorithm relies on path simulations using only the last price of the underlying asset as a state representation. Also, the LSMC was thinking for option valuation where risk-neutral probabilities can be employed to account for uncertainty. However, the general optimal stopping problem goals may not fit the requirements of the LSMC showing auto-correlated prices. We employ a data-driven method that uses Monte Carlo simulation to train and test artificial neural networks (ANN) to solve the optimal stopping problem. Using ANN to solve decision problems is not entirely new. We propose a different architecture that uses convolutional neural networks (CNN) to deal with the dimensionality problem that arises when we transform the whole history of prices into a Markovian state. We present experiments that indicate that our proposed architecture improves results over the previous implementations under specific simulated time series function sets. Lastly, we employ our proposed method to compare the optimal exercise of the financial options problem with the LSMC algorithm. Our experiments show that our method can capture more accurate exercise opportunities when compared to the LSMC. We have outstandingly higher (above 974\% improvement) expected payoff from these exercise policies under the many Monte Carlo simulations that used the real-world return database on the out-of-sample (test) data.

Suggested Citation

Leonardo Kanashiro Felizardo & Elia Matsumoto & Emilio Del-Moral-Hernandez, 2022. "Solving the optimal stopping problem with reinforcement learning: an application in financial option exercise," Papers 2208.00765, arXiv.org.

Handle: RePEc:arx:papers:2208.00765

Download full text from publisher

References listed on IDEAS

Longstaff, Francis A & Schwartz, Eduardo S, 2001. "Valuing American Options by Simulation: A Simple Least-Squares Approach," The Review of Financial Studies, Society for Financial Studies, vol. 14(1), pages 113-147.
Bulan, Laarni & Mayer, Christopher & Somerville, C. Tsuriel, 2009. "Irreversible investment, real options, and competition: Evidence from real estate development," Journal of Urban Economics, Elsevier, vol. 65(3), pages 237-251, May.
- Laarni Bulan & Christopher Mayer & C. Tsuriel Somerville, "undated". "Irreversible Investment, Real Options, and Competition: Evidence from Real Estate Development," Zell/Lurie Center Working Papers 391, Wharton School Samuel Zell and Robert Lurie Real Estate Center, University of Pennsylvania.
- Laarni Bulan & Christopher J. Mayer & C. Tsuriel Somerville, 2006. "Irreversible Investment, Real Options, and Competition: Evidence from Real Estate Development," NBER Working Papers 12486, National Bureau of Economic Research, Inc.
Tze Leung Lai & Tiong Wee Lim, 2004. "Exercise Regions And Efficient Valuation Of American Lookback Options," Mathematical Finance, Wiley Blackwell, vol. 14(2), pages 249-269, April.
Dirk Hackbarth & Erwan Morellec, 2008. "Stock Returns in Mergers and Acquisitions," Journal of Finance, American Finance Association, vol. 63(3), pages 1213-1252, June.
- Dirk Hackbarth & Erwan Morellec, 2006. "Stock Returns in Mergers and Acquisitions," Swiss Finance Institute Research Paper Series 06-01, Swiss Finance Institute.
Carriere, Jacques F., 1996. "Valuation of the early-exercise price for options using simulations and nonparametric regression," Insurance: Mathematics and Economics, Elsevier, vol. 19(1), pages 19-30, December.
Gonzalo Cortazar & Eduardo S. Schwartz & Marcelo Salinas, 1998. "Evaluating Environmental Investments: A Real Options Approach," Management Science, INFORMS, vol. 44(8), pages 1059-1070, August.
Stephane Villeneuve, 1999. "Exercise regions of American options on several assets," Finance and Stochastics, Springer, vol. 3(3), pages 295-322.
Lander, Diane M. & Pinches, George E., 1998. "Challenges to the Practical Implementation of Modeling and Valuing Real Options," The Quarterly Review of Economics and Finance, Elsevier, vol. 38(3, Part 2), pages 537-567.
Li, Yong, 2008. "Duration analysis of venture capital staging: A real options perspective," Journal of Business Venturing, Elsevier, vol. 23(5), pages 497-512, September.
Kamrad, Bardia & Lele, Shreevardhan S. & Siddique, Akhtar & Thomas, Robert J., 2005. "Innovation diffusion uncertainty, advertising and pricing policies," European Journal of Operational Research, Elsevier, vol. 164(3), pages 829-850, August.
Ludovic Goudenège & Andrea Molent & Antonino Zanette, 2020. "Machine learning for pricing American options in high-dimensional Markovian and non-Markovian models," Quantitative Finance, Taylor & Francis Journals, vol. 20(4), pages 573-591, April.
Pennings, Enrico & Lint, Onno, 1997. "The option value of advanced R & D," European Journal of Operational Research, Elsevier, vol. 103(1), pages 83-94, November.
Nadarajah, Selvaprabu & Margot, François & Secomandi, Nicola, 2017. "Comparison of least squares Monte Carlo methods with applications to energy real options," European Journal of Operational Research, Elsevier, vol. 256(1), pages 196-204.
Broadie, Mark & Glasserman, Paul, 1997. "Pricing American-style securities using simulation," Journal of Economic Dynamics and Control, Elsevier, vol. 21(8-9), pages 1323-1352, June.
Longstaff, Francis A & Schwartz, Eduardo S, 2001. "Valuing American Options by Simulation: A Simple Least-Squares Approach," University of California at Los Angeles, Anderson Graduate School of Management qt43n1k4jb, Anderson Graduate School of Management, UCLA.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Bradley Sturt, 2021. "A nonparametric algorithm for optimal stopping based on robust optimization," Papers 2103.03300, arXiv.org, revised Mar 2023.
Juri Hinz & Tanya Tarnopolskaya & Jeremy Yee, 2020. "Efficient algorithms of pathwise dynamic programming for decision optimization in mining operations," Annals of Operations Research, Springer, vol. 286(1), pages 583-615, March.
Mark Broadie & Jerome B. Detemple, 2004. "ANNIVERSARY ARTICLE: Option Pricing: Valuation Models and Applications," Management Science, INFORMS, vol. 50(9), pages 1145-1177, September.
repec:hum:wpaper:sfb649dp2006-051 is not listed on IDEAS
Denis Belomestny & Grigori Milstein & Vladimir Spokoiny, 2009. "Regression methods in pricing American and Bermudan options using consumption processes," Quantitative Finance, Taylor & Francis Journals, vol. 9(3), pages 315-327.
- Belomestny, Denis & Milstein, Grigori N. & Spokoiny, Vladimir, 2006. "Regression methods in pricing American and Bermudan options using consumption processes," SFB 649 Discussion Papers 2006-051, Humboldt University Berlin, Collaborative Research Center 649: Economic Risk.
Nicholas Davey & Nicolas Langrené & Wen Chen & Jonathan R. Rhodes & Simon Dunstall & Saman Halgamuge, 2023. "Designing higher value roads to preserve species at risk by optimally controlling traffic flow," Annals of Operations Research, Springer, vol. 320(2), pages 663-693, January.
Lars Stentoft, 2004. "Convergence of the Least Squares Monte Carlo Approach to American Option Valuation," Management Science, INFORMS, vol. 50(9), pages 1193-1203, September.
Garcia, Diego, 2003. "Convergence and Biases of Monte Carlo estimates of American option prices using a parametric exercise rule," Journal of Economic Dynamics and Control, Elsevier, vol. 27(10), pages 1855-1879, August.
A. -S. Chen & P. -F. Shen, 2003. "Computational complexity analysis of least-squares Monte Carlo (LSM) for pricing US derivatives," Applied Economics Letters, Taylor & Francis Journals, vol. 10(4), pages 223-229.
Ammann, Manuel & Kind, Axel & Wilde, Christian, 2008. "Simulation-based pricing of convertible bonds," Journal of Empirical Finance, Elsevier, vol. 15(2), pages 310-331, March.
- Manuel Ammann & Axel Kind & Christian Wilde, 2005. "Simulation-Based Pricing of Convertible Bonds," Finance 0507015, University Library of Munich, Germany.
Ascione, Giacomo & Mehrdoust, Farshid & Orlando, Giuseppe & Samimi, Oldouz, 2023. "Foreign Exchange Options on Heston-CIR Model Under Lévy Process Framework," Applied Mathematics and Computation, Elsevier, vol. 446(C).
Calypso Herrera & Louis Paulot, 2014. "Parallel American Monte Carlo," Papers 1404.1180, arXiv.org.
Pascal Létourneau & Lars Stentoft, 2019. "Bootstrapping the Early Exercise Boundary in the Least-Squares Monte Carlo Method," JRFM, MDPI, vol. 12(4), pages 1-21, December.
M. Martin Boyer & Lars Stentoft, 2017. "Yes We Can (Price Derivatives on Survivor Indices)," Risk Management and Insurance Review, American Risk and Insurance Association, vol. 20(1), pages 37-62, March.
Manuel Moreno & Javier Navas, 2003. "On the Robustness of Least-Squares Monte Carlo (LSM) for Pricing American Derivatives," Review of Derivatives Research, Springer, vol. 6(2), pages 107-128, May.
- Manuel Moreno & Javier R. Navas, 2001. "On the robustness of least-squares Monte Carlo (LSM) for pricing American derivatives," Economics Working Papers 543, Department of Economics and Business, Universitat Pompeu Fabra.
Lee, Sangmin & Boomsma, Trine Krogh, 2022. "An approximate dynamic programming algorithm for short-term electric vehicle fleet operation under uncertainty," Applied Energy, Elsevier, vol. 325(C).
Martin B. Haugh & Leonid Kogan, 2004. "Pricing American Options: A Duality Approach," Operations Research, INFORMS, vol. 52(2), pages 258-270, April.
Lars Stentoft, 2013. "American option pricing using simulation with an application to the GARCH model," Chapters, in: Adrian R. Bell & Chris Brooks & Marcel Prokopczuk (ed.), Handbook of Research Methods and Applications in Empirical Finance, chapter 5, pages 114-147, Edward Elgar Publishing.
Mark Broadie & Menghui Cao, 2008. "Improved lower and upper bound algorithms for pricing American options by simulation," Quantitative Finance, Taylor & Francis Journals, vol. 8(8), pages 845-861.
Zbigniew Palmowski & Tomasz Serafin, 2020. "A Note on Simulation Pricing of π -Options," Risks, MDPI, vol. 8(3), pages 1-19, August.
Gagliardini, Patrick & Ronchetti, Diego, 2013. "Semi-parametric estimation of American option prices," Journal of Econometrics, Elsevier, vol. 173(1), pages 57-82.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2022-09-05 (Big Data)
NEP-CMP-2022-09-05 (Computational Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2208.00765. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Solving the optimal stopping problem with reinforcement learning: an application in financial option exercise

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data