Benchmark dataset for mid‐price forecasting of limit order book data with machine learning methods

Author

Listed:
  • Adamantios Ntakaris
  • Martin Magris
  • Juho Kanniainen
  • Moncef Gabbouj
  • Alexandros Iosifidis

Abstract

Managing the prediction of metrics in high‐frequency financial markets is a challenging task. An efficient approach is to monitor the dynamics of a limit order book to identify the information edge. This paper describes the first publicly available benchmark dataset of high‐frequency limit order markets for mid‐price prediction. We extracted normalized data representations of time series data for five stocks from the Nasdaq Nordic stock market for a time period of 10 consecutive days, leading to a dataset of ∼4,000,000 time series samples in total. A day‐based anchored cross‐validation experimental protocol is also provided that can be used as a benchmark for comparing the performance of state‐of‐the‐art methodologies. The performance of baseline approaches is also provided to facilitate experimental comparisons. We expect that such a large‐scale dataset can serve as a testbed for devising novel solutions of expert systems for high‐frequency limit order book data analysis.
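
The abstract does not spell out the day-based anchored cross-validation protocol. The sketch below shows one common reading of "anchored": the training window always starts at the first day and grows by one day per fold, and the next day is held out for testing. The function name `anchored_day_splits` and the 1-based day numbering are assumptions made for illustration, not details taken from the paper.

```python
from typing import Iterator, List, Tuple


def anchored_day_splits(n_days: int) -> Iterator[Tuple[List[int], int]]:
    """Yield (train_days, test_day) pairs with an expanding training window.

    Assumption: "anchored" means every fold trains on days 1..k and
    tests on day k+1, so the start of the training set never moves.
    """
    if n_days < 2:
        raise ValueError("need at least two days to form one fold")
    for test_day in range(2, n_days + 1):
        # Train on every day strictly before the test day.
        yield list(range(1, test_day)), test_day


# The abstract mentions 10 consecutive days; under this reading that gives 9 folds.
folds = list(anchored_day_splits(10))
for train_days, test_day in folds:
    print(f"train on days {train_days[0]}-{train_days[-1]}, test on day {test_day}")
```

Splitting by whole days rather than by individual samples keeps each test day strictly later than its training data, which avoids look-ahead leakage in time series evaluation.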

Suggested Citation

  • Adamantios Ntakaris & Martin Magris & Juho Kanniainen & Moncef Gabbouj & Alexandros Iosifidis, 2018. "Benchmark dataset for mid‐price forecasting of limit order book data with machine learning methods," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 37(8), pages 852-866, December.
  • Handle: RePEc:wly:jforec:v:37:y:2018:i:8:p:852-866
    DOI: 10.1002/for.2543

    Download full text from publisher

    File URL: https://doi.org/10.1002/for.2543
    Download Restriction: no

    File URL: https://libkey.io/10.1002/for.2543?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

    Cited by:

    1. Yufei Wu & Mahmoud Mahfouz & Daniele Magazzeni & Manuela Veloso, 2021. "Towards Robust Representation of Limit Orders Books for Deep Learning Models," Papers 2110.05479, arXiv.org, revised Dec 2022.
    2. Erdinc Akyildirim & Oguzhan Cepni & Shaen Corbet & Gazi Salah Uddin, 2023. "Forecasting mid-price movement of Bitcoin futures using machine learning," Annals of Operations Research, Springer, vol. 330(1), pages 553-584, November.
    3. Bangzhu Zhu & Shunxin Ye & Ping Wang & Julien Chevallier & Yi‐Ming Wei, 2022. "Forecasting carbon price using a multi‐objective least squares support vector machine with mixture kernels," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(1), pages 100-117, January.
    4. Zijian Shi & Yu Chen & John Cartlidge, 2021. "The LOB Recreation Model: Predicting the Limit Order Book from TAQ History Using an Ordinary Differential Equation Recurrent Neural Network," Papers 2103.01670, arXiv.org.
    5. Adamantios Ntakaris & Giorgio Mirone & Juho Kanniainen & Moncef Gabbouj & Alexandros Iosifidis, 2019. "Feature Engineering for Mid-Price Prediction with Deep Learning," Papers 1904.05384, arXiv.org, revised Jun 2019.
    6. Matteo Prata & Giuseppe Masi & Leonardo Berti & Viviana Arrigoni & Andrea Coletta & Irene Cannistraci & Svitlana Vyetrenko & Paola Velardi & Novella Bartolini, 2023. "LOB-Based Deep Learning Models for Stock Price Trend Prediction: A Benchmark Study," Papers 2308.01915, arXiv.org, revised Sep 2023.
    7. Zihao Zhang & Bryan Lim & Stefan Zohren, 2021. "Deep Learning for Market by Order Data," Papers 2102.08811, arXiv.org, revised Jul 2021.
    8. Rakshit Jha & Mattijs De Paepe & Samuel Holt & James West & Shaun Ng, 2020. "Deep Learning for Digital Asset Limit Order Books," Papers 2010.01241, arXiv.org.
    9. Zijian Shi & John Cartlidge, 2021. "The Limit Order Book Recreation Model (LOBRM): An Extended Analysis," Papers 2107.00534, arXiv.org.
    10. Myles Sjogren & Timothy DeLise, 2021. "General Compound Hawkes Processes for Mid-Price Prediction," Papers 2110.07075, arXiv.org.
    11. Mostafa Shabani & Martin Magris & George Tzagkarakis & Juho Kanniainen & Alexandros Iosifidis, 2022. "Predicting the State of Synchronization of Financial Time Series using Cross Recurrence Plots," Papers 2210.14605, arXiv.org, revised Nov 2022.
    12. Lorenzo Lucchese & Mikko Pakkanen & Almut Veraart, 2022. "The Short-Term Predictability of Returns in Order Book Markets: a Deep Learning Perspective," Papers 2211.13777, arXiv.org, revised Oct 2023.
    13. Martin Magris & Mostafa Shabani & Alexandros Iosifidis, 2022. "Bayesian Bilinear Neural Network for Predicting the Mid-price Dynamics in Limit-Order Book Markets," Papers 2203.03613, arXiv.org, revised Jan 2023.
    14. James Wallbridge, 2020. "Transformers for Limit Order Books," Papers 2003.00130, arXiv.org.
    15. Hong Guo & Jianwu Lin & Fanlin Huang, 2023. "Market Making with Deep Reinforcement Learning from Limit Order Books," Papers 2305.15821, arXiv.org.
    16. Zihao Zhang & Stefan Zohren, 2021. "Multi-Horizon Forecasting for Limit Order Books: Novel Deep Learning Approaches and Hardware Acceleration using Intelligent Processing Units," Papers 2105.10430, arXiv.org, revised Aug 2021.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:jforec:v:37:y:2018:i:8:p:852-866. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www3.interscience.wiley.com/cgi-bin/jhome/2966 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.