Printed from https://ideas.repec.org/p/arx/papers/1904.05384.html

Feature Engineering for Mid-Price Prediction with Deep Learning

Author

Listed:
  • Adamantios Ntakaris
  • Giorgio Mirone
  • Juho Kanniainen
  • Moncef Gabbouj
  • Alexandros Iosifidis

Abstract

Predicting mid-price movements from limit order book (LOB) data is challenging because of the complexity and dynamics of the LOB. So far, there have been very few attempts to extract relevant features from LOB data. In this paper, we address this problem by designing a new set of handcrafted features and performing an extensive experimental evaluation on both liquid and illiquid stocks. More specifically, we implement a new set of econometric features that capture statistical properties of the underlying securities for the task of mid-price prediction. Moreover, we develop a new experimental protocol for online learning that treats the task as a multi-objective optimization problem and predicts (i) the direction of the next price movement and (ii) the number of order book events that occur until the change takes place. To predict the mid-price movement, the features are fed into nine different deep learning models based on multi-layer perceptrons (MLP), convolutional neural networks (CNN), and long short-term memory (LSTM) neural networks. The performance of the proposed method is then evaluated on liquid and illiquid stocks, taken from TotalView-ITCH US and Nordic data, respectively. For some stocks, the results suggest that the right choice of feature set and model can lead to successful prediction of how long it takes for the stock price to move.
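The two prediction targets named in the abstract can be illustrated with a minimal sketch. It assumes the mid-price is the average of the best bid and best ask at each order book event, and that the targets are read off the first subsequent event at which the mid-price differs. The function names, the 0 / -1 sentinel values, and the toy data are hypothetical. The abstract does not specify the paper's exact labeling protocol, so this may differ from what the authors implemented.

```python
import numpy as np


def mid_price(best_bid, best_ask):
    """Mid-price at each event: average of best bid and best ask (assumed definition)."""
    bid = np.asarray(best_bid, dtype=float)
    ask = np.asarray(best_ask, dtype=float)
    return (bid + ask) / 2.0


def next_move_labels(mid):
    """For each event t, return (direction, events_until_change).

    direction[t] is +1 if the next mid-price change after t is upward,
    -1 if downward, and 0 if no further change occurs in the sample.
    events_until_change[t] counts events from t to that change, or is -1
    when no further change is observed (hypothetical sentinel values).
    """
    mid = np.asarray(mid, dtype=float)
    n = len(mid)
    direction = np.zeros(n, dtype=int)
    horizon = np.full(n, -1, dtype=int)
    next_dir, next_idx = 0, -1
    # Walk backwards so each event reuses the answer found for its successor.
    for t in range(n - 2, -1, -1):
        step = mid[t + 1] - mid[t]
        if step != 0:
            next_dir, next_idx = (1 if step > 0 else -1), t + 1
        # When step == 0, the next change seen from t is the one seen from t + 1.
        if next_idx != -1:
            direction[t] = next_dir
            horizon[t] = next_idx - t
    return direction, horizon


# Toy order book snapshots in integer ticks (made-up data, not from the paper).
bids = [99, 99, 100, 100, 99, 99]
asks = [101, 101, 101, 101, 101, 101]
direction, horizon = next_move_labels(mid_price(bids, asks))
print(direction, horizon)
```

Each event thus carries one classification target (direction) and one count target (events until the change), which is the pair a multi-objective model would be trained on. Events with no later change in the sample would normally be dropped before training.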

Suggested Citation

  • Adamantios Ntakaris & Giorgio Mirone & Juho Kanniainen & Moncef Gabbouj & Alexandros Iosifidis, 2019. "Feature Engineering for Mid-Price Prediction with Deep Learning," Papers 1904.05384, arXiv.org, revised Jun 2019.
  • Handle: RePEc:arx:papers:1904.05384

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1904.05384
    File Function: Latest version
    Download Restriction: no



