IDEAS home Printed from https://ideas.repec.org/a/spr/fininn/v5y2019i1d10.1186_s40854-019-0138-0.html
   My bibliography  Save this article

Predicting the daily return direction of the stock market using hybrid machine learning algorithms

Author

Listed:
  • Xiao Zhong

    (Graduate School of Management, Clark University)

  • David Enke

    (Missouri University of Science and Technology)

Abstract

Big data analytic techniques associated with machine learning algorithms are playing an increasingly important role in various application fields, including stock market investment. However, few studies have focused on forecasting daily stock market returns, especially when using powerful machine learning techniques, such as deep neural networks (DNNs), to perform the analyses. DNNs employ various deep learning algorithms based on the combination of network structure, activation function, and model parameters, with their performance depending on the format of the data representation. This paper presents a comprehensive big data analytics process to predict the daily return direction of the SPDR S&P 500 ETF (ticker symbol: SPY) based on 60 financial and economic features. DNNs and traditional artificial neural networks (ANNs) are then deployed over the entire preprocessed but untransformed dataset, along with two datasets transformed via principal component analysis (PCA), to predict the daily direction of future stock market index returns. While controlling for overfitting, a pattern for the classification accuracy of the DNNs is detected and demonstrated as the number of the hidden layers increases gradually from 12 to 1000. Moreover, a set of hypothesis testing procedures are implemented on the classification, and the simulation results show that the DNNs using two PCA-represented datasets give significantly higher classification accuracy than those using the entire untransformed dataset, as well as several other hybrid machine learning algorithms. In addition, the trading strategies guided by the DNN classification process based on PCA-represented data perform slightly better than the others tested, including in a comparison against two standard benchmarks.

Suggested Citation

  • Xiao Zhong & David Enke, 2019. "Predicting the daily return direction of the stock market using hybrid machine learning algorithms," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 5(1), pages 1-20, December.
  • Handle: RePEc:spr:fininn:v:5:y:2019:i:1:d:10.1186_s40854-019-0138-0
    DOI: 10.1186/s40854-019-0138-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1186/s40854-019-0138-0
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1186/s40854-019-0138-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Huang, Yan & Kou, Gang & Peng, Yi, 2017. "Nonlinear manifold learning for early warnings in financial markets," European Journal of Operational Research, Elsevier, vol. 258(2), pages 692-702.
    2. Zhang, Guoqiang & Eddy Patuwo, B. & Y. Hu, Michael, 1998. "Forecasting with artificial neural networks:: The state of the art," International Journal of Forecasting, Elsevier, vol. 14(1), pages 35-62, March.
    3. Sarat Chandra Nayak & Bijan Bihari Misra, 2018. "Estimating stock closing indices using a GA-weighted condensed polynomial neural network," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 4(1), pages 1-22, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jichang Dong & Wei Dai & Ying Liu & Lean Yu & Jie Wang, 2019. "Forecasting Chinese Stock Market Prices using Baidu Search Index with a Learning-Based Data Collection Method," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 18(05), pages 1605-1629, September.
    2. Contessi, Silvio & De Pace, Pierangelo & Guidolin, Massimo, 2020. "Mildly explosive dynamics in U.S. fixed income markets," European Journal of Operational Research, Elsevier, vol. 287(2), pages 712-724.
    3. Balkin, Sandy, 2001. "On Forecasting Exchange Rates Using Neural Networks: P.H. Franses and P.V. Homelen, 1998, Applied Financial Economics, 8, 589-596," International Journal of Forecasting, Elsevier, vol. 17(1), pages 139-140.
    4. Barrow, Devon & Kourentzes, Nikolaos, 2018. "The impact of special days in call arrivals forecasting: A neural network approach to modelling special days," European Journal of Operational Research, Elsevier, vol. 264(3), pages 967-977.
    5. Daniel Buncic, 2012. "Understanding forecast failure of ESTAR models of real exchange rates," Empirical Economics, Springer, vol. 43(1), pages 399-426, August.
    6. Apostolos Ampountolas & Titus Nyarko Nde & Paresh Date & Corina Constantinescu, 2021. "A Machine Learning Approach for Micro-Credit Scoring," Risks, MDPI, vol. 9(3), pages 1-20, March.
    7. Ebrahimpour, Reza & Nikoo, Hossein & Masoudnia, Saeed & Yousefi, Mohammad Reza & Ghaemi, Mohammad Sajjad, 2011. "Mixture of MLP-experts for trend forecasting of time series: A case study of the Tehran stock exchange," International Journal of Forecasting, Elsevier, vol. 27(3), pages 804-816, July.
    8. Hewamalage, Hansika & Bergmeir, Christoph & Bandara, Kasun, 2021. "Recurrent Neural Networks for Time Series Forecasting: Current status and future directions," International Journal of Forecasting, Elsevier, vol. 37(1), pages 388-427.
    9. Leung, Philip C.M. & Lee, Eric W.M., 2013. "Estimation of electrical power consumption in subway station design by intelligent approach," Applied Energy, Elsevier, vol. 101(C), pages 634-643.
    10. Donya Rahmani & Saeed Heravi & Hossein Hassani & Mansi Ghodsi, 2016. "Forecasting time series with structural breaks with Singular Spectrum Analysis, using a general form of recurrent formula," Papers 1605.02188, arXiv.org.
    11. Wei Sun & Yujun He & Hong Chang, 2015. "Forecasting Fossil Fuel Energy Consumption for Power Generation Using QHSA-Based LSSVM Model," Energies, MDPI, vol. 8(2), pages 1-21, January.
    12. Saman, Corina, 2011. "Scenarios of the Romanian GDP Evolution With Neural Models," Journal for Economic Forecasting, Institute for Economic Forecasting, vol. 0(4), pages 129-140, December.
    13. Ghiassi, M. & Saidane, H. & Zimbra, D.K., 2005. "A dynamic artificial neural network model for forecasting time series events," International Journal of Forecasting, Elsevier, vol. 21(2), pages 341-362.
    14. Barrow, Devon K., 2016. "Forecasting intraday call arrivals using the seasonal moving average method," Journal of Business Research, Elsevier, vol. 69(12), pages 6088-6096.
    15. Changshi Liu & Gang Kou & Yi Peng & Fawaz E. Alsaadi, 2019. "Location-Routing Problem for Relief Distribution in the Early Post-Earthquake Stage from the Perspective of Fairness," Sustainability, MDPI, vol. 11(12), pages 1-16, June.
    16. Jani, D.B. & Mishra, Manish & Sahoo, P.K., 2017. "Application of artificial neural network for predicting performance of solid desiccant cooling systems – A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 80(C), pages 352-366.
    17. Oscar Claveria & Salvador Torra, 2013. "“Forecasting Business surveys indicators: neural networks vs. time series models”," AQR Working Papers 201312, University of Barcelona, Regional Quantitative Analysis Group, revised Nov 2013.
    18. Bento, P.M.R. & Pombo, J.A.N. & Calado, M.R.A. & Mariano, S.J.P.S., 2018. "A bat optimized neural network and wavelet transform approach for short-term price forecasting," Applied Energy, Elsevier, vol. 210(C), pages 88-97.
    19. Nataša Glišović & Miloš Milenković & Nebojša Bojović & Libor Švadlenka & Zoran Avramović, 2016. "A hybrid model for forecasting the volume of passenger flows on Serbian railways," Operational Research, Springer, vol. 16(2), pages 271-285, July.
    20. Timothy Praditia & Thilo Walser & Sergey Oladyshkin & Wolfgang Nowak, 2020. "Improving Thermochemical Energy Storage Dynamics Forecast with Physics-Inspired Neural Network Architecture," Energies, MDPI, vol. 13(15), pages 1-26, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:fininn:v:5:y:2019:i:1:d:10.1186_s40854-019-0138-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.