IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1904.12887.html
   My bibliography  Save this paper

Curriculum Learning in Deep Neural Networks for Financial Forecasting

Author

Listed:
  • Allison Koenecke
  • Amita Gajewar

Abstract

For any financial organization, computing accurate quarterly forecasts for various products is one of the most critical operations. As the granularity at which forecasts are needed increases, traditional statistical time series models may not scale well. We apply deep neural networks in the forecasting domain by experimenting with techniques from Natural Language Processing (Encoder-Decoder LSTMs) and Computer Vision (Dilated CNNs), as well as incorporating transfer learning. A novel contribution of this paper is the application of curriculum learning to neural network models built for time series forecasting. We illustrate the performance of our models using Microsoft's revenue data corresponding to Enterprise, and Small, Medium & Corporate products, spanning approximately 60 regions across the globe for 8 different business segments, and totaling in the order of tens of billions of USD. We compare our models' performance to the ensemble model of traditional statistics and machine learning techniques currently used by Microsoft Finance. With this in-production model as a baseline, our experiments yield an approximately 30% improvement in overall accuracy on test data. We find that our curriculum learning LSTM-based model performs best, showing that it is reasonable to implement our proposed methods without overfitting on medium-sized data.

Suggested Citation

  • Allison Koenecke & Amita Gajewar, 2019. "Curriculum Learning in Deep Neural Networks for Financial Forecasting," Papers 1904.12887, arXiv.org, revised Jul 2019.
  • Handle: RePEc:arx:papers:1904.12887
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1904.12887
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Justin Sirignano & Rama Cont, 2018. "Universal features of price formation in financial markets: perspectives from Deep Learning," Papers 1803.06917, arXiv.org.
    2. Thiyanga S Talagala & Rob J Hyndman & George Athanasopoulos, 2018. "Meta-learning how to forecast time series," Monash Econometrics and Business Statistics Working Papers 6/18, Monash University, Department of Econometrics and Business Statistics.
    3. Justin Sirignano & Rama Cont, 2018. "Universal features of price formation in financial markets: perspectives from Deep Learning," Working Papers hal-01754054, HAL.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ivan Peñaloza & Pablo Padilla, 2022. "A Pricing Method in a Constrained Market with Differential Informational Frameworks," Computational Economics, Springer;Society for Computational Economics, vol. 60(3), pages 1055-1100, October.
    2. Ymir Makinen & Juho Kanniainen & Moncef Gabbouj & Alexandros Iosifidis, 2018. "Forecasting of Jump Arrivals in Stock Prices: New Attention-based Network Architecture using Limit Order Book Data," Papers 1810.10845, arXiv.org.
    3. Zihao Zhang & Stefan Zohren & Stephen Roberts, 2018. "DeepLOB: Deep Convolutional Neural Networks for Limit Order Books," Papers 1808.03668, arXiv.org, revised Jan 2020.
    4. Eduard Silantyev, 2019. "Order flow analysis of cryptocurrency markets," Digital Finance, Springer, vol. 1(1), pages 191-218, November.
    5. Christa Cuchiero & Wahid Khosrawi & Josef Teichmann, 2020. "A Generative Adversarial Network Approach to Calibration of Local Stochastic Volatility Models," Risks, MDPI, vol. 8(4), pages 1-31, September.
    6. Omer Berat Sezer & Mehmet Ugur Gudelek & Ahmet Murat Ozbayoglu, 2019. "Financial Time Series Forecasting with Deep Learning : A Systematic Literature Review: 2005-2019," Papers 1911.13288, arXiv.org.
    7. Takuya Shintate & Lukáš Pichl, 2019. "Trend Prediction Classification for High Frequency Bitcoin Time Series with Deep Learning," JRFM, MDPI, vol. 12(1), pages 1-15, January.
    8. Qi Zhao, 2020. "A Deep Learning Framework for Predicting Digital Asset Price Movement from Trade-by-trade Data," Papers 2010.07404, arXiv.org.
    9. Baptiste Barreau & Laurent Carlier & Damien Challet, 2019. "Deep Prediction of Investor Interest: a Supervised Clustering Approach," Papers 1909.05289, arXiv.org, revised Feb 2021.
    10. Bryan Lim & Stefan Zohren & Stephen Roberts, 2019. "Enhancing Time Series Momentum Strategies Using Deep Neural Networks," Papers 1904.04912, arXiv.org, revised Sep 2020.
    11. Paul Bilokon & Yitao Qiu, 2023. "Transformers versus LSTMs for electronic trading," Papers 2309.11400, arXiv.org.
    12. Kieran Wood & Samuel Kessler & Stephen J. Roberts & Stefan Zohren, 2023. "Few-Shot Learning Patterns in Financial Time-Series for Trend-Following Strategies," Papers 2310.10500, arXiv.org, revised Mar 2024.
    13. Dimitrios Nikolaidis & Michalis Doumpos, 2022. "Credit Scoring with Drift Adaptation Using Local Regions of Competence," SN Operations Research Forum, Springer, vol. 3(4), pages 1-28, December.
    14. Arthur le Calvez & Dave Cliff, 2018. "Deep Learning can Replicate Adaptive Traders in a Limit-Order-Book Financial Market," Papers 1811.02880, arXiv.org.
    15. Hyungjun Park & Min Kyu Sim & Dong Gu Choi, 2019. "An intelligent financial portfolio trading strategy using deep Q-learning," Papers 1907.03665, arXiv.org, revised Nov 2019.
    16. Ymir Mäkinen & Juho Kanniainen & Moncef Gabbouj & Alexandros Iosifidis, 2019. "Forecasting jump arrivals in stock prices: new attention-based network architecture using limit order book data," Quantitative Finance, Taylor & Francis Journals, vol. 19(12), pages 2033-2050, December.
    17. Parley Ruogu Yang, 2021. "Forecasting high-frequency financial time series: an adaptive learning approach with the order book data," Papers 2103.00264, arXiv.org.
    18. Baptiste Barreau & Laurent Carlier & Damien Challet, 2019. "Deep Prediction of Investor Interest: a Supervised Clustering Approach," Papers 1909.05289, arXiv.org, revised Feb 2021.
    19. Fan Fang & Waichung Chung & Carmine Ventre & Michail Basios & Leslie Kanthan & Lingbo Li & Fan Wu, 2020. "Ascertaining price formation in cryptocurrency markets with DeepLearning," Papers 2003.00803, arXiv.org.
    20. Ahmet Murat Ozbayoglu & Mehmet Ugur Gudelek & Omer Berat Sezer, 2020. "Deep Learning for Financial Applications : A Survey," Papers 2002.05786, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1904.12887. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.