IDEAS home Printed from https://ideas.repec.org/a/spr/fininn/v9y2023i1d10.1186_s40854-022-00441-7.html
   My bibliography  Save this article

Survey of feature selection and extraction techniques for stock market prediction

Author

Listed:
  • Htet Htet Htun

    (University of Groningen)

  • Michael Biehl

    (University of Groningen)

  • Nicolai Petkov

    (University of Groningen)

Abstract

In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based methods used in stock market forecasting. However, no survey study has explored feature selection and extraction techniques for stock market forecasting. This survey presents a detailed analysis of 32 research works that use a combination of feature study and ML approaches in various stock market applications. We conduct a systematic search for articles in the Scopus and Web of Science databases for the years 2011–2022. We review a variety of feature selection and feature extraction approaches that have been successfully applied in the stock market analyses presented in the articles. We also describe the combination of feature analysis techniques and ML methods and evaluate their performance. Moreover, we present other survey articles, stock market input and output data, and analyses based on various factors. We find that correlation criteria, random forest, principal component analysis, and autoencoder are the most widely used feature selection and extraction techniques with the best prediction accuracy for various stock market applications.

Suggested Citation

  • Htet Htet Htun & Michael Biehl & Nicolai Petkov, 2023. "Survey of feature selection and extraction techniques for stock market prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-25, December.
  • Handle: RePEc:spr:fininn:v:9:y:2023:i:1:d:10.1186_s40854-022-00441-7
    DOI: 10.1186/s40854-022-00441-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1186/s40854-022-00441-7
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1186/s40854-022-00441-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dharmaraja Selvamuthu & Vineet Kumar & Abhishek Mishra, 2019. "Indian stock market prediction using artificial neural networks on tick data," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 5(1), pages 1-12, December.
    2. Jaideep Singh & Matloob Khushi, 2021. "Feature Learning for Stock Price Prediction Shows a Significant Role of Analyst Rating," Papers 2103.09106, arXiv.org.
    3. Basak, Suryoday & Kar, Saibal & Saha, Snehanshu & Khaidem, Luckyson & Dey, Sudeepa Roy, 2019. "Predicting the direction of stock market prices using tree-based classifiers," The North American Journal of Economics and Finance, Elsevier, vol. 47(C), pages 552-567.
    4. Ozgur Ican & Taha Bugra Celik, 2017. "Stock Market Prediction Performance of Neural Networks: A Literature Review," International Journal of Economics and Finance, Canadian Center of Science and Education, vol. 9(11), pages 100-108, November.
    5. Burton G. Malkiel, 2003. "The Efficient Market Hypothesis and Its Critics," Working Papers 111, Princeton University, Department of Economics, Center for Economic Policy Studies..
    6. Burton G. Malkiel, 2003. "The Efficient Market Hypothesis and Its Critics," Working Papers 111, Princeton University, Department of Economics, Center for Economic Policy Studies..
    7. repec:pri:cepsud:91malkiel is not listed on IDEAS
    8. Lin, Qi, 2018. "Technical analysis and stock return predictability: An aligned approach," Journal of Financial Markets, Elsevier, vol. 38(C), pages 103-123.
    9. Jeffrey E. Jarrett & Janne Schilling, 2008. "Daily variation and predicting stock market returns for the frankfurter börse (stock market)," Journal of Business Economics and Management, Taylor & Francis Journals, vol. 9(3), pages 189-198, March.
    10. Burton G. Malkiel, 2003. "The Efficient Market Hypothesis and Its Critics," Journal of Economic Perspectives, American Economic Association, vol. 17(1), pages 59-82, Winter.
    11. Perry Sadorsky, 2021. "A Random Forests Approach to Predicting Clean Energy Stock Prices," JRFM, MDPI, vol. 14(2), pages 1-20, January.
    12. Hakan Gunduz, 2021. "An efficient stock market prediction model using hybrid feature reduction method based on variational autoencoders and recursive feature elimination," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 7(1), pages 1-24, December.
    13. Dev Shah & Haruna Isah & Farhana Zulkernine, 2019. "Stock Market Analysis: A Review and Taxonomy of Prediction Techniques," IJFS, MDPI, vol. 7(2), pages 1-22, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Petr Sokerin & Kristian Kuznetsov & Elizaveta Makhneva & Alexey Zaytsev, 2023. "Portfolio Selection via Topological Data Analysis," Papers 2308.07944, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Saqib Farid & Rubeena Tashfeen & Tahseen Mohsan & Arsal Burhan, 2023. "Forecasting stock prices using a data mining method: Evidence from emerging market," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 28(2), pages 1911-1917, April.
    2. Perry Sadorsky, 2021. "A Random Forests Approach to Predicting Clean Energy Stock Prices," JRFM, MDPI, vol. 14(2), pages 1-20, January.
    3. Dushmanta Kumar Padhi & Neelamadhab Padhy & Akash Kumar Bhoi & Jana Shafi & Muhammad Fazal Ijaz, 2021. "A Fusion Framework for Forecasting Financial Market Direction Using Enhanced Ensemble Models and Technical Indicators," Mathematics, MDPI, vol. 9(21), pages 1-31, October.
    4. Yang, Yanlin & Hu, Xuemei & Jiang, Huifeng, 2022. "Group penalized logistic regressions predict up and down trends for stock prices," The North American Journal of Economics and Finance, Elsevier, vol. 59(C).
    5. Ignacio Escanuela Romana & Clara Escanuela Nieves, 2023. "A spectral approach to stock market performance," Papers 2305.05762, arXiv.org.
    6. L.J. Basson & Sune Ferreira-Schenk & Zandri Dickason-Koekemoer, 2022. "Fractal Dimension Option Hedging Strategy Implementation During Turbulent Market Conditions in Developing and Developed Countries," International Journal of Economics and Financial Issues, Econjournals, vol. 12(2), pages 84-95, March.
    7. Dimingo, Roselyn & Muteba Mwamba, John W. & Bonga-Bonga, Lumengo, 2021. "Prediction of Stock Market Direction: Application of Machine Learning Models," Economia Internazionale / International Economics, Camera di Commercio Industria Artigianato Agricoltura di Genova, vol. 74(4), pages 499-536.
    8. David M. Ritzwoller & Joseph P. Romano, 2019. "Uncertainty in the Hot Hand Fallacy: Detecting Streaky Alternatives to Random Bernoulli Sequences," Papers 1908.01406, arXiv.org, revised Apr 2021.
    9. Chia-Lin Chang & Jukka Ilomäki & Hannu Laurila & Michael McAleer, 2018. "Long Run Returns Predictability and Volatility with Moving Averages," Risks, MDPI, vol. 6(4), pages 1-18, September.
    10. Bell, Peter N, 2013. "New Testing Procedures to Assess Market Efficiency with Trading Rules," MPRA Paper 46701, University Library of Munich, Germany.
    11. Jitka Veselá & Alžběta Zíková, 2022. "Are the Czech, Polish, German and Dutch markets taking a random walk? [Konají český, polský, německý a nizozemský trh náhodnou procházku?]," Český finanční a účetní časopis, Prague University of Economics and Business, vol. 2022(2), pages 19-38.
    12. Muchnik, Lev & Bunde, Armin & Havlin, Shlomo, 2009. "Long term memory in extreme returns of financial time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(19), pages 4145-4150.
    13. Nathan Jensen, 2007. "International institutions and market expectations: Stock price responses to the WTO ruling on the 2002 U.S. steel tariffs," The Review of International Organizations, Springer, vol. 2(3), pages 261-280, September.
    14. Ishani Chaudhuri & Parthajit Kayal, 2022. "Predicting Power of Ticker Search Volume in Indian Stock Market," Working Papers 2022-214, Madras School of Economics,Chennai,India.
    15. Ghada A. Altarawneh & Ahmad B. Hassanat & Ahmad S. Tarawneh & Ahmad Abadleh & Malek Alrashidi & Mansoor Alghamdi, 2022. "Stock Price Forecasting for Jordan Insurance Companies Amid the COVID-19 Pandemic Utilizing Off-the-Shelf Technical Analysis Methods," Economies, MDPI, vol. 10(2), pages 1-18, February.
    16. John Sabelhaus, 2005. "Alternative Methods for Projecting Equity Returns: Implications for Evaluating Social Security Reform Proposals," Risk Management and Insurance Review, American Risk and Insurance Association, vol. 8(1), pages 43-63, March.
    17. Cristi Spulbar & Ramona Birau & Lucian Florin Spulbar, 2021. "A Critical Survey on Efficient Market Hypothesis (EMH), Adaptive Market Hypothesis (AMH) and Fractal Markets Hypothesis (FMH) Considering Their Implication on Stock Markets Behavior," Ovidius University Annals, Economic Sciences Series, Ovidius University of Constantza, Faculty of Economic Sciences, vol. 0(2), pages 1161-1165, December.
    18. Stephen Bell & John Quiggin, 2006. "Asset Price Instability and Policy Responses: The Legacy of Liberalization," Journal of Economic Issues, Taylor & Francis Journals, vol. 40(3), pages 629-649, September.
    19. Paolo Cremonesi & Chiara Francalanci & Alessandro Poli & Roberto Pagano & Luca Mazzoni & Alberto Maggioni & Mehdi Elahi, 2018. "Social Network based Short-Term Stock Trading System," Papers 1801.05295, arXiv.org.
    20. Park, Cheol-Ho & Irwin, Scott H., 2004. "The Profitability Of Technical Trading Rules In Us Futures Markets: A Data Snooping Free Test," 2004 Conference, April 19-20, 2004, St. Louis, Missouri 19011, NCR-134 Conference on Applied Commodity Price Analysis, Forecasting, and Market Risk Management.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:fininn:v:9:y:2023:i:1:d:10.1186_s40854-022-00441-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.