IDEAS home Printed from https://ideas.repec.org/a/spr/fininn/v9y2023i1d10.1186_s40854-022-00441-7.html
   My bibliography  Save this article

Survey of feature selection and extraction techniques for stock market prediction

Author

Listed:
  • Htet Htet Htun

    (University of Groningen)

  • Michael Biehl

    (University of Groningen)

  • Nicolai Petkov

    (University of Groningen)

Abstract

In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based methods used in stock market forecasting. However, no survey study has explored feature selection and extraction techniques for stock market forecasting. This survey presents a detailed analysis of 32 research works that use a combination of feature study and ML approaches in various stock market applications. We conduct a systematic search for articles in the Scopus and Web of Science databases for the years 2011–2022. We review a variety of feature selection and feature extraction approaches that have been successfully applied in the stock market analyses presented in the articles. We also describe the combination of feature analysis techniques and ML methods and evaluate their performance. Moreover, we present other survey articles, stock market input and output data, and analyses based on various factors. We find that correlation criteria, random forest, principal component analysis, and autoencoder are the most widely used feature selection and extraction techniques with the best prediction accuracy for various stock market applications.

Suggested Citation

  • Htet Htet Htun & Michael Biehl & Nicolai Petkov, 2023. "Survey of feature selection and extraction techniques for stock market prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-25, December.
  • Handle: RePEc:spr:fininn:v:9:y:2023:i:1:d:10.1186_s40854-022-00441-7
    DOI: 10.1186/s40854-022-00441-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1186/s40854-022-00441-7
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1186/s40854-022-00441-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dharmaraja Selvamuthu & Vineet Kumar & Abhishek Mishra, 2019. "Indian stock market prediction using artificial neural networks on tick data," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 5(1), pages 1-12, December.
    2. Jaideep Singh & Matloob Khushi, 2021. "Feature Learning for Stock Price Prediction Shows a Significant Role of Analyst Rating," Papers 2103.09106, arXiv.org.
    3. Basak, Suryoday & Kar, Saibal & Saha, Snehanshu & Khaidem, Luckyson & Dey, Sudeepa Roy, 2019. "Predicting the direction of stock market prices using tree-based classifiers," The North American Journal of Economics and Finance, Elsevier, vol. 47(C), pages 552-567.
    4. Ozgur Ican & Taha Bugra Celik, 2017. "Stock Market Prediction Performance of Neural Networks: A Literature Review," International Journal of Economics and Finance, Canadian Center of Science and Education, vol. 9(11), pages 100-108, November.
    5. Burton G. Malkiel, 2003. "The Efficient Market Hypothesis and Its Critics," Working Papers 111, Princeton University, Department of Economics, Center for Economic Policy Studies..
    6. Burton G. Malkiel, 2003. "The Efficient Market Hypothesis and Its Critics," Working Papers 111, Princeton University, Department of Economics, Center for Economic Policy Studies..
    7. repec:pri:cepsud:91malkiel is not listed on IDEAS
    8. Lin, Qi, 2018. "Technical analysis and stock return predictability: An aligned approach," Journal of Financial Markets, Elsevier, vol. 38(C), pages 103-123.
    9. Jeffrey E. Jarrett & Janne Schilling, 2008. "Daily variation and predicting stock market returns for the frankfurter börse (stock market)," Journal of Business Economics and Management, Taylor & Francis Journals, vol. 9(3), pages 189-198, March.
    10. Burton G. Malkiel, 2003. "The Efficient Market Hypothesis and Its Critics," Journal of Economic Perspectives, American Economic Association, vol. 17(1), pages 59-82, Winter.
    11. Perry Sadorsky, 2021. "A Random Forests Approach to Predicting Clean Energy Stock Prices," JRFM, MDPI, vol. 14(2), pages 1-20, January.
    12. Hakan Gunduz, 2021. "An efficient stock market prediction model using hybrid feature reduction method based on variational autoencoders and recursive feature elimination," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-24, December.
    13. Dev Shah & Haruna Isah & Farhana Zulkernine, 2019. "Stock Market Analysis: A Review and Taxonomy of Prediction Techniques," IJFS, MDPI, vol. 7(2), pages 1-22, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Petr Sokerin & Kristian Kuznetsov & Elizaveta Makhneva & Alexey Zaytsev, 2023. "Portfolio Selection via Topological Data Analysis," Papers 2308.07944, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Perry Sadorsky, 2021. "A Random Forests Approach to Predicting Clean Energy Stock Prices," JRFM, MDPI, vol. 14(2), pages 1-20, January.
    2. Dushmanta Kumar Padhi & Neelamadhab Padhy & Akash Kumar Bhoi & Jana Shafi & Muhammad Fazal Ijaz, 2021. "A Fusion Framework for Forecasting Financial Market Direction Using Enhanced Ensemble Models and Technical Indicators," Mathematics, MDPI, vol. 9(21), pages 1-31, October.
    3. Saqib Farid & Rubeena Tashfeen & Tahseen Mohsan & Arsal Burhan, 2023. "Forecasting stock prices using a data mining method: Evidence from emerging market," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 28(2), pages 1911-1917, April.
    4. Yang, Yanlin & Hu, Xuemei & Jiang, Huifeng, 2022. "Group penalized logistic regressions predict up and down trends for stock prices," The North American Journal of Economics and Finance, Elsevier, vol. 59(C).
    5. L.J. Basson & Sune Ferreira-Schenk & Zandri Dickason-Koekemoer, 2022. "Fractal Dimension Option Hedging Strategy Implementation During Turbulent Market Conditions in Developing and Developed Countries," International Journal of Economics and Financial Issues, Econjournals, vol. 12(2), pages 84-95, March.
    6. Dimingo, Roselyn & Muteba Mwamba, John W. & Bonga-Bonga, Lumengo, 2021. "Prediction of Stock Market Direction: Application of Machine Learning Models," Economia Internazionale / International Economics, Camera di Commercio Industria Artigianato Agricoltura di Genova, vol. 74(4), pages 499-536.
    7. Ignacio Escanuela Romana & Clara Escanuela Nieves, 2023. "A spectral approach to stock market performance," Papers 2305.05762, arXiv.org.
    8. David M. Ritzwoller & Joseph P. Romano, 2019. "Uncertainty in the Hot Hand Fallacy: Detecting Streaky Alternatives to Random Bernoulli Sequences," Papers 1908.01406, arXiv.org, revised Apr 2021.
    9. Jitka Veselá & Alžběta Zíková, 2022. "Are the Czech, Polish, German and Dutch markets taking a random walk? [Konají český, polský, německý a nizozemský trh náhodnou procházku?]," Český finanční a účetní časopis, Prague University of Economics and Business, vol. 2022(2), pages 19-38.
    10. Muchnik, Lev & Bunde, Armin & Havlin, Shlomo, 2009. "Long term memory in extreme returns of financial time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(19), pages 4145-4150.
    11. John Sabelhaus, 2005. "Alternative Methods for Projecting Equity Returns: Implications for Evaluating Social Security Reform Proposals," Risk Management and Insurance Review, American Risk and Insurance Association, vol. 8(1), pages 43-63, March.
    12. Cristi Spulbar & Ramona Birau & Lucian Florin Spulbar, 2021. "A Critical Survey on Efficient Market Hypothesis (EMH), Adaptive Market Hypothesis (AMH) and Fractal Markets Hypothesis (FMH) Considering Their Implication on Stock Markets Behavior," Ovidius University Annals, Economic Sciences Series, Ovidius University of Constantza, Faculty of Economic Sciences, vol. 0(2), pages 1161-1165, December.
    13. Stephen Bell & John Quiggin, 2006. "Asset Price Instability and Policy Responses: The Legacy of Liberalization," Journal of Economic Issues, Taylor & Francis Journals, vol. 40(3), pages 629-649, September.
    14. Mahata, Ajit & Rai, Anish & Nurujjaman, Md. & Prakash, Om, 2021. "Modeling and analysis of the effect of COVID-19 on the stock price: V and L-shape recovery," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 574(C).
    15. Abootaleb Shirvani & Svetlozar T. Rachev & Frank J. Fabozzi, 2019. "A Rational Finance Explanation of the Stock Predictability Puzzle," Papers 1911.02194, arXiv.org.
    16. Stöckl, Thomas & Huber, Jürgen & Kirchler, Michael & Lindner, Florian, 2015. "Hot hand and gambler's fallacy in teams: Evidence from investment experiments," Journal of Economic Behavior & Organization, Elsevier, vol. 117(C), pages 327-339.
    17. Rešovský, Marcel & Gróf, Marek & Horváth, Denis & Gazda, Vladimír, 2014. "Analysis of the Lead-Lag Relationship on South Africa capital market," MPRA Paper 57309, University Library of Munich, Germany.
    18. Diniz-Maganini, Natalia & Diniz, Eduardo H. & Rasheed, Abdul A., 2021. "Bitcoin’s price efficiency and safe haven properties during the COVID-19 pandemic: A comparison," Research in International Business and Finance, Elsevier, vol. 58(C).
    19. Bradly Alicea, 2014. "Contextual and Structural Representations of Market-mediated Economic Value," Papers 1403.7021, arXiv.org.
    20. Svitlana Galeshchuk, 2017. "Technological bias at the exchange rate market," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 24(2-3), pages 80-86, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:fininn:v:9:y:2023:i:1:d:10.1186_s40854-022-00441-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.