A Multi-Scaling Reinforcement Learning Trading System Based on Multi-Scaling Convolutional Neural Networks

Author

Yuling Huang
(School of Computer Science and Engineering, Macau University of Science and Technology, Macao, China)
Kai Cui
(School of Computer Science and Engineering, Macau University of Science and Technology, Macao, China)
Yunlin Song
(Department of Engineering Science, Faculty of Innovation Engineering, Macau University of Science and Technology, Macao, China)
Zongren Chen
(School of Computer Science and Engineering, Macau University of Science and Technology, Macao, China)

Abstract

Advances in machine learning have increased interest in applying deep reinforcement learning to investment decision-making. However, existing approaches often rely solely on single-scaling daily data and neglect multi-scaling information, such as weekly or monthly data. To address this limitation, we propose a multi-scaling convolutional neural network for reinforcement learning-based stock trading, termed multi-scaling convolutional neural network SARSA (state, action, reward, state, action). The method uses a multi-scaling convolutional neural network to extract multi-scaling features of daily and weekly financial data automatically: a convolutional neural network with several filter sizes performs a multi-scaling extraction of temporal features. Multi-scaling feature mining allows agents to operate over longer time scales, identifying low stock positions on the weekly line and avoiding daily fluctuations during continuous declines. This mimics the human approach of considering information at varying temporal and spatial scales during stock trading. We further enhance the network's robustness by adding an average pooling layer to the backbone convolutional neural network, which reduces overfitting. SARSA, as an on-policy reinforcement learning method, generates dynamic trading strategies that combine information across different time scales while avoiding dangerous strategies. We evaluate the proposed method on four real-world datasets (Dow Jones, NASDAQ, General Electric, and Apple) spanning 1 January 2007 to 31 December 2020, and it achieves higher profits than several baseline methods. In addition, various comparative and ablation tests show that the proposed multi-scaling module yields better results than the single-scaling module.
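
The two mechanisms named in the abstract, filters of several sizes applied to daily and weekly series followed by average pooling, and the on-policy SARSA update, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes five trading days per week, uses fixed averaging kernels as a stand-in for learned convolution weights, and replaces the neural network with a tabular Q function. All function names and parameters are hypothetical.

```python
def weekly_from_daily(daily, days_per_week=5):
    """Assumption: a 'week' is each full block of 5 trading days;
    the weekly value is the last daily close in the block."""
    last = len(daily) - days_per_week + 1
    return [daily[i + days_per_week - 1] for i in range(0, last, days_per_week)]


def conv1d(series, kernel):
    """Valid (no padding) 1-D convolution of a series with a kernel."""
    k = len(kernel)
    if len(series) < k:
        raise ValueError("series is shorter than the kernel")
    return [
        sum(series[i + j] * kernel[j] for j in range(k))
        for i in range(len(series) - k + 1)
    ]


def avg_pool(values):
    """Global average pooling over one feature map."""
    return sum(values) / len(values)


def multi_scale_features(daily, kernel_sizes=(2, 3)):
    """One pooled feature per (time scale, filter size) pair.
    Fixed averaging kernels stand in for weights a CNN would learn."""
    weekly = weekly_from_daily(daily)
    features = []
    for series in (daily, weekly):
        for k in kernel_sizes:
            kernel = [1.0 / k] * k
            features.append(avg_pool(conv1d(series, kernel)))
    return features


def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy SARSA step: the target uses the action a_next that the
    agent actually takes in s_next, not the greedy maximum."""
    td_target = r + gamma * q[(s_next, a_next)]
    q[(s, a)] += alpha * (td_target - q[(s, a)])
    return q[(s, a)]


# Usage: 20 synthetic daily closes give 4 weekly closes and 4 features.
prices = [float(p) for p in range(1, 21)]
features = multi_scale_features(prices)

# Usage: one SARSA step on a toy two-entry Q table.
q_table = {("s0", "buy"): 0.0, ("s1", "hold"): 1.0}
sarsa_update(q_table, "s0", "buy", 1.0, "s1", "hold", alpha=0.5)
```

Because the SARSA target depends on the action actually chosen next, exploratory losses feed back into the value estimates, which is the usual reason an on-policy learner tends toward more cautious strategies.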

Suggested Citation

Yuling Huang & Kai Cui & Yunlin Song & Zongren Chen, 2023. "A Multi-Scaling Reinforcement Learning Trading System Based on Multi-Scaling Convolutional Neural Networks," Mathematics, MDPI, vol. 11(11), pages 1-19, May.

Handle: RePEc:gam:jmathe:v:11:y:2023:i:11:p:2467-:d:1157319
