Commodities Trading through Deep Policy Gradient Methods

My bibliography Save this paper

Commodities Trading through Deep Policy Gradient Methods

Author

Listed:

Jonas Hanetho

Registered:

Abstract

Algorithmic trading has gained attention due to its potential for generating superior returns. This paper investigates the effectiveness of deep reinforcement learning (DRL) methods in algorithmic commodities trading. It formulates the commodities trading problem as a continuous, discrete-time stochastic dynamical system. The proposed system employs a novel time-discretization scheme that adapts to market volatility, enhancing the statistical properties of subsampled financial time series. To optimize transaction-cost- and risk-sensitive trading agents, two policy gradient algorithms, namely actor-based and actor-critic-based approaches, are introduced. These agents utilize CNNs and LSTMs as parametric function approximators to map historical price observations to market positions.Backtesting on front-month natural gas futures demonstrates that DRL models increase the Sharpe ratio by $83\%$ compared to the buy-and-hold baseline. Additionally, the risk profile of the agents can be customized through a hyperparameter that regulates risk sensitivity in the reward function during the optimization process. The actor-based models outperform the actor-critic-based models, while the CNN-based models show a slight performance advantage over the LSTM-based models.

Suggested Citation

Jonas Hanetho, 2023. "Commodities Trading through Deep Policy Gradient Methods," Papers 2309.00630, arXiv.org.

Handle: RePEc:arx:papers:2309.00630

Download full text from publisher

References listed on IDEAS

Benoit Mandelbrot & Howard M. Taylor, 1967. "On the Distribution of Stock Price Differences," Operations Research, INFORMS, vol. 15(6), pages 1057-1062, December.
Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
Clark, Peter K, 1973. "A Subordinated Stochastic Process Model with Finite Variance for Speculative Prices," Econometrica, Econometric Society, vol. 41(1), pages 135-155, January.
David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
Jonas Hanetho, 2023. "Deep Policy Gradient Methods in Commodity Markets," Papers 2308.01910, arXiv.org.
Fischer, Thomas G., 2018. "Reinforcement learning in financial markets - a survey," FAU Discussion Papers in Economics 12/2018, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Jonas Hanetho, 2023. "Deep Policy Gradient Methods in Commodity Markets," Papers 2308.01910, arXiv.org.
Scalas, Enrico & Kaizoji, Taisei & Kirchler, Michael & Huber, Jürgen & Tedeschi, Alessandra, 2006. "Waiting times between orders and trades in double-auction markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 366(C), pages 463-471.
- Enrico Scalas & Taisei Kaizoji & Michael Kirchler & Juergen Huber & Alessandra Tedeschi, 2006. "Waiting times between orders and trades in double-auction markets," Papers physics/0608273, arXiv.org.
Charl Maree & Christian W. Omlin, 2022. "Balancing Profit, Risk, and Sustainability for Portfolio Management," Papers 2207.02134, arXiv.org.
J. Doyne Farmer & Laszlo Gillemot & Fabrizio Lillo & Szabolcs Mike & Anindya Sen, 2004. "What really causes large price changes?," Quantitative Finance, Taylor & Francis Journals, vol. 4(4), pages 383-397.
- J. Doyne Farmer & Laszlo Gillemot & Fabrizio Lillo & Szabolcs Mike & Anindya Sen, 2003. "What really causes large price changes?," Papers cond-mat/0312703, arXiv.org, revised Apr 2004.
Saswat Patra & Malay Bhattacharyya, 2021. "Does volume really matter? A risk management perspective using cross‐country evidence," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 26(1), pages 118-135, January.
Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2016. "Rock around the clock: An agent-based model of low- and high-frequency trading," Journal of Evolutionary Economics, Springer, vol. 26(1), pages 49-76, March.
- Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2014. "Rock around the Clock: An Agent-Based Model of Low- and High-Frequency Trading," GREDEG Working Papers 2014-21, Groupe de REcherche en Droit, Economie, Gestion (GREDEG CNRS), Université Côte d'Azur, France.
- Giorgio Fagiolo & Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini, 2015. "Rock around the Clock: An Agent-Based Model of Low- and High-Frequency Trading," SciencePo Working papers Main hal-03411703, HAL.
- Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2014. "Rock around the clock: an agent-based model of low- and high-frequency trading," SciencePo Working papers Main hal-01070542, HAL.
- Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2016. "Rock around the Clock : An agent-based model of low- and high-frequency trading," Post-Print hal-01512863, HAL.
- Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgo Fagiolo, 2014. "Rock around the clock :An agent-based model of low-and high frequency trading," Documents de Travail de l'OFCE 2014-03, Observatoire Francais des Conjonctures Economiques (OFCE).
- Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2014. "Rock around the clock: An agent-based model of low- and high-frequency trading," Post-Print hal-01515227, HAL.
- Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2014. "Rock around the Clock: An Agent-Based Model of Low- and High-Frequency Trading," LEM Papers Series 2014/03, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
- Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2014. "Rock around the Clock: An Agent-Based Model of Low- and High-Frequency Trading," Papers 1402.2046, arXiv.org.
- Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2014. "Rock around the Clock: An Agent-Based Model of Low- and High-Frequency Trading," Working Papers 02/2014, University of Verona, Department of Economics.
- Giorgio Fagiolo & Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini, 2015. "Rock around the Clock: An Agent-Based Model of Low- and High-Frequency Trading," Post-Print hal-03411703, HAL.
- Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2014. "Rock around the clock: an agent-based model of low- and high-frequency trading," Working Papers hal-01070542, HAL.
Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
Laura Eslava & Fernando Baltazar-Larios & Bor Reynoso, 2022. "Maximum Likelihood Estimation for a Markov-Modulated Jump-Diffusion Model," Papers 2211.17220, arXiv.org.
Kashyap, Ravi, 2019. "The perfect marriage and much more: Combining dimension reduction, distance measures and covariance," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
Meenakshi Venkateswaran & B. Wade Brorsen & Joyce A. Hall, 1993. "The distribution of standardized futures price changes," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 13(3), pages 279-298, May.
- Venkateswaran, Meenakshi & Brorsen, B. Wade & Hall, Joyce A., "undated". "The Distribution Of Standardized Futures Price Changes," 1988 Annual Meeting, August 1-3, Knoxville, Tennessee 270288, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
Xin Ling, 2017. "Normality of stock returns with event time clocks," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 57, pages 277-298, April.
Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
Michele Caraglio & Fulvio Baldovin & Attilio L. Stella, 2021. "How Fast Does the Clock of Finance Run?—A Time-Definition Enforcing Stationarity and Quantifying Overnight Duration," JRFM, MDPI, vol. 14(8), pages 1-15, August.
Robert J. Elliott & Carlton-James U. Osakwe, 2006. "Option Pricing for Pure Jump Processes with Markov Switching Compensators," Finance and Stochastics, Springer, vol. 10(2), pages 250-275, April.
Adrian Millea, 2021. "Deep Reinforcement Learning for Trading—A Critical Survey," Data, MDPI, vol. 6(11), pages 1-25, November.
Kyoung-hun Bae & Albert S. Kyle & Eun Jung Lee & Anna Obizhaeva, 2016. "Invariance of buy-sell switching points," Working Papers w0232, New Economic School (NES).
Scalas, Enrico, 2006. "The application of continuous-time random walks in finance and economics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 362(2), pages 225-239.
Iwao Maeda & David deGraw & Michiharu Kitano & Hiroyasu Matsushima & Hiroki Sakaji & Kiyoshi Izumi & Atsuo Kato, 2020. "Deep Reinforcement Learning in Agent Based Financial Market Simulation," JRFM, MDPI, vol. 13(4), pages 1-17, April.
Sandrine Jacob Leal & Mauro Napoletano, 2017. "Market Stability vs. Market Resilience: Regulatory Policies Experiments in an Agent-Based Model with Low- and High-Frequency Trading," Post-Print hal-01768876, HAL.
Ma, Cong & Nan, Shijing, 2024. "Dynamic graph reinforcement learning algorithm for portfolio management: A novel time–frequency correlated model," Finance Research Letters, Elsevier, vol. 63(C).

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2023-10-16 (Big Data)
NEP-CMP-2023-10-16 (Computational Economics)
NEP-GER-2023-10-16 (German Papers)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2309.00630. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Commodities Trading through Deep Policy Gradient Methods

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data