IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2309.00630.html
   My bibliography  Save this paper

Commodities Trading through Deep Policy Gradient Methods

Author

Listed:
  • Jonas Hanetho

Abstract

Algorithmic trading has gained attention due to its potential for generating superior returns. This paper investigates the effectiveness of deep reinforcement learning (DRL) methods in algorithmic commodities trading. It formulates the commodities trading problem as a continuous, discrete-time stochastic dynamical system. The proposed system employs a novel time-discretization scheme that adapts to market volatility, enhancing the statistical properties of subsampled financial time series. To optimize transaction-cost- and risk-sensitive trading agents, two policy gradient algorithms, namely actor-based and actor-critic-based approaches, are introduced. These agents utilize CNNs and LSTMs as parametric function approximators to map historical price observations to market positions.Backtesting on front-month natural gas futures demonstrates that DRL models increase the Sharpe ratio by $83\%$ compared to the buy-and-hold baseline. Additionally, the risk profile of the agents can be customized through a hyperparameter that regulates risk sensitivity in the reward function during the optimization process. The actor-based models outperform the actor-critic-based models, while the CNN-based models show a slight performance advantage over the LSTM-based models.

Suggested Citation

  • Jonas Hanetho, 2023. "Commodities Trading through Deep Policy Gradient Methods," Papers 2309.00630, arXiv.org.
  • Handle: RePEc:arx:papers:2309.00630
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2309.00630
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Benoit Mandelbrot & Howard M. Taylor, 1967. "On the Distribution of Stock Price Differences," Operations Research, INFORMS, vol. 15(6), pages 1057-1062, December.
    2. Zhengyao Jiang & Dixing Xu & Jinjun Liang, 2017. "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem," Papers 1706.10059, arXiv.org, revised Jul 2017.
    3. Clark, Peter K, 1973. "A Subordinated Stochastic Process Model with Finite Variance for Speculative Prices," Econometrica, Econometric Society, vol. 41(1), pages 135-155, January.
    4. David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
    5. Jonas Hanetho, 2023. "Deep Policy Gradient Methods in Commodity Markets," Papers 2308.01910, arXiv.org.
    6. Fischer, Thomas G., 2018. "Reinforcement learning in financial markets - a survey," FAU Discussion Papers in Economics 12/2018, Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jonas Hanetho, 2023. "Deep Policy Gradient Methods in Commodity Markets," Papers 2308.01910, arXiv.org.
    2. Scalas, Enrico & Kaizoji, Taisei & Kirchler, Michael & Huber, Jürgen & Tedeschi, Alessandra, 2006. "Waiting times between orders and trades in double-auction markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 366(C), pages 463-471.
    3. Panov, Vladimir, 2019. "Some properties of the one-dimensional subordinated stable model," Statistics & Probability Letters, Elsevier, vol. 146(C), pages 80-84.
    4. Abootaleb Shirvani & Yuan Hu & Svetlozar T. Rachev & Frank J. Fabozzi, 2019. "Option Pricing with Mixed Levy Subordinated Price Process and Implied Probability Weighting Function," Papers 1910.05902, arXiv.org, revised Apr 2020.
    5. Sandrine Jacob Leal & Mauro Napoletano & Andrea Roventini & Giorgio Fagiolo, 2016. "Rock around the clock: An agent-based model of low- and high-frequency trading," Journal of Evolutionary Economics, Springer, vol. 26(1), pages 49-76, March.
    6. Ghysels, E. & Harvey, A. & Renault, E., 1995. "Stochastic Volatility," Papers 95.400, Toulouse - GREMAQ.
    7. Charl Maree & Christian W. Omlin, 2022. "Balancing Profit, Risk, and Sustainability for Portfolio Management," Papers 2207.02134, arXiv.org.
    8. J. Doyne Farmer & Laszlo Gillemot & Fabrizio Lillo & Szabolcs Mike & Anindya Sen, 2004. "What really causes large price changes?," Quantitative Finance, Taylor & Francis Journals, vol. 4(4), pages 383-397.
    9. Gabriele La Spada & J. Doyne Farmer & Fabrizio Lillo, 2010. "Tick size and price diffusion," Papers 1009.2329, arXiv.org, revised Oct 2010.
    10. Saswat Patra & Malay Bhattacharyya, 2021. "Does volume really matter? A risk management perspective using cross‐country evidence," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 26(1), pages 118-135, January.
    11. Ben Hambly & Renyuan Xu & Huining Yang, 2021. "Recent Advances in Reinforcement Learning in Finance," Papers 2112.04553, arXiv.org, revised Feb 2023.
    12. Joann Jasiak, 2003. "First‐Order Autoregressive Processes with Heterogeneous Persistence," Journal of Time Series Analysis, Wiley Blackwell, vol. 24(3), pages 283-309, May.
    13. Parker, Edgar, 2016. "Flash Crashes: The Role of Information Processing Based Subordination and the Cauchy Distribution in Market Instability," MPRA Paper 80039, University Library of Munich, Germany.
    14. Laura Eslava & Fernando Baltazar-Larios & Bor Reynoso, 2022. "Maximum Likelihood Estimation for a Markov-Modulated Jump-Diffusion Model," Papers 2211.17220, arXiv.org.
    15. Kashyap, Ravi, 2019. "The perfect marriage and much more: Combining dimension reduction, distance measures and covariance," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
    16. Aldrich, Eric M. & Heckenbach, Indra & Laughlin, Gregory, 2016. "A compound duration model for high-frequency asset returns," Journal of Empirical Finance, Elsevier, vol. 39(PA), pages 105-128.
    17. Ales Kresta & Tomas Tichy, 2012. "International Equity Portfolio Risk Modeling: The Case of the NIG Model and Ordinary Copula Functions," Czech Journal of Economics and Finance (Finance a uver), Charles University Prague, Faculty of Social Sciences, vol. 62(2), pages 141-161, May.
    18. Chamil W SENARATHNE & Wei JIANGUO, 2020. "Testing for Heteroskedastic Mixture of Ordinary Least Squares Errors," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(2), pages 73-91, July.
    19. Kyoung-hun Bae & Albert S. Kyle & Eun Jung Lee & Anna Obizhaeva, 2016. "Invariance of buy-sell switching points," Working Papers w0232, Center for Economic and Financial Research (CEFIR).
    20. Meenakshi Venkateswaran & B. Wade Brorsen & Joyce A. Hall, 1993. "The distribution of standardized futures price changes," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 13(3), pages 279-298, May.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2309.00630. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.