IDEAS home Printed from https://ideas.repec.org/a/wly/isacfm/v29y2022i4p219-241.html
   My bibliography  Save this article

Application and performance of data mining techniques in stock market: A review

Author

Listed:
  • Jasleen Kaur
  • Khushdeep Dharni

Abstract

Prediction and the stock market go hand in hand. Due to the inherent limitations of traditional forecasting methods and the pursuit to uncover the hidden patterns in stock market data, stock market prediction using data mining techniques has caught the fancy of academicians, researchers, and investors. Based on a systematic review of more than 143 research studies spanning 25 years, the present paper brings to light the major issues concerning forecasting of stock markets based on data mining techniques, such as usage of data mining techniques in the stock market, input data types, single versus hybrid techniques, instruments and stock markets researched, types of software and algorithms used, measures of forecast accuracy, and performance of various data mining techniques. Emerging patterns related to various dimensions have been critically analyzed by highlighting the existing limitations and suggesting future research paradigms. This analysis can be useful for academicians, researchers and investors looking for futuristic directions in a given research domain.

Suggested Citation

  • Jasleen Kaur & Khushdeep Dharni, 2022. "Application and performance of data mining techniques in stock market: A review," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 29(4), pages 219-241, October.
  • Handle: RePEc:wly:isacfm:v:29:y:2022:i:4:p:219-241
    DOI: 10.1002/isaf.1518
    as

    Download full text from publisher

    File URL: https://doi.org/10.1002/isaf.1518
    Download Restriction: no

    File URL: https://libkey.io/10.1002/isaf.1518?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Ezgi Erturk & Ebru Akcapinar Sezer, 2016. "Software fault prediction using Mamdani type fuzzy inference system," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 8(1), pages 14-28.
    2. Fernando Perez-cruz & Julio Afonso-rodriguez & Javier Giner, 2003. "Estimating GARCH models using support vector machines," Quantitative Finance, Taylor & Francis Journals, vol. 3(3), pages 163-172.
    3. Hong, Harrison & Torous, Walter & Valkanov, Rossen, 2007. "Do industries lead stock markets?," Journal of Financial Economics, Elsevier, vol. 83(2), pages 367-396, February.
    4. Geet Lahoti & Dilip Kumar Pratihar, 2017. "Recurrent neural networks to model input-output relationships of metal inert gas (MIG) welding process," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 9(3), pages 248-282.
    5. Mojtaba Sedighi & Hossein Jahangirnia & Mohsen Gharakhani & Saeed Farahani Fard, 2019. "A Novel Hybrid Model for Stock Price Forecasting Based on Metaheuristics and Support Vector Machine," Data, MDPI, vol. 4(2), pages 1-28, May.
    6. Jana Tuckova & Martin Sramka, 2012. "ANN application in emotional speech analysis," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 4(3), pages 256-276.
    7. Christopher J. Neely & Paul A. Weller, 2011. "Technical analysis in the foreign exchange market," Working Papers 2011-001, Federal Reserve Bank of St. Louis.
    8. Kunta Ramu & V. Ravi, 2009. "Privacy preservation in data mining using hybrid perturbation methods: an application to bankruptcy prediction in banks," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 1(4), pages 313-331.
    9. Singhal, Shelly & Choudhary, Sangita & Biswal, Pratap Chandra, 2019. "Return and volatility linkages among International crude oil price, gold price, exchange rate and stock markets: Evidence from Mexico," Resources Policy, Elsevier, vol. 60(C), pages 255-261.
    10. Deepak Gupta & Mahardhika Pratama & Zhenyuan Ma & Jun Li & Mukesh Prasad, 2019. "Financial time series forecasting using twin support vector regression," PLOS ONE, Public Library of Science, vol. 14(3), pages 1-27, March.
    11. Jenni L. Bettman & Stephen J. Sault & Emma L. Schultz, 2009. "Fundamental and technical analysis: substitutes or complements?," Accounting and Finance, Accounting and Finance Association of Australia and New Zealand, vol. 49(1), pages 21-36, March.
    12. Barin N. Nag & Chaodong Han & Dong-qing Yao, 2015. "Information enhancement in data mining: a study in data reduction," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 7(1), pages 3-20.
    13. Jingtao Yao & Chew Lim Tan & Hean-Lee Poh, 1999. "Neural Networks For Technical Analysis: A Study On Klci," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 2(02), pages 221-241.
    14. Gallegati, Marco, 2008. "Wavelet analysis of stock returns and aggregate economic activity," Computational Statistics & Data Analysis, Elsevier, vol. 52(6), pages 3061-3074, February.
    15. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," The Review of Financial Studies, Society for Financial Studies, vol. 33(5), pages 2223-2273.
    16. S.M. Monzurur Rahman & F.A. Siddiky & Uma Shrestha, 2011. "A statistical data mining approach in bacteriology for bacterial identification," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 3(2), pages 117-142.
    17. ?enol Emir & Hasan Din?er & Mehpare Timor, 2012. "A Stock Selection Model Based on Fundamental and Technical Analysis Variables by Using Artificial Neural Networks and Support Vector Machines," Review of Economics & Finance, Better Advances Press, Canada, vol. 2, pages 106-122, August.
    18. Bicchetti, David & Maystre, Nicolas Maystre, 2013. "The synchronized and long-lasting structural change on commodity markets: Evidence from high frequency data," Algorithmic Finance, IOS Press, vol. 2(3-4), pages 233-239.
    19. Mohammed Alweshah & Hasan Rashaideh & Abdelaziz I. Hammouri & Hanadi Tayyeb & Mohammed Ababneh, 2017. "Solving time series classification problems using support vector machine and neural network," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 9(3), pages 237-247.
    20. Taylor, Mark P. & Allen, Helen, 1992. "The use of technical analysis in the foreign exchange market," Journal of International Money and Finance, Elsevier, vol. 11(3), pages 304-314, June.
    21. Phichhang Ou & Hengshan Wang, 2009. "Prediction of Stock Market Index Movement by Ten Data Mining Techniques," Modern Applied Science, Canadian Center of Science and Education, vol. 3(12), pages 1-28, December.
    22. Shihao Gu & Bryan Kelly & Dacheng Xiu, 2020. "Empirical Asset Pricing via Machine Learning," Review of Finance, European Finance Association, vol. 33(5), pages 2223-2273.
    23. Weiwen Yang & Linchi Kwok, 2017. "Improving blog spam filters via machine learning," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 9(2), pages 99-121.
    24. Dwiti Krishna Bebarta & Birendra Biswal & P.K. Dash, 2012. "Comparative study of stock market forecasting using different functional link artificial neural networks," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 4(4), pages 398-427.
    25. Giorgos Tigas & Panagiotis Lefakis & Konstantinos Ioannou & Athanasios Hasekioglou, 2013. "Evaluation of artificial neural networks as a model for forecasting consumption of wood products," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 5(1), pages 38-48.
    26. Nafis Ahmad & Tomohisha Tanaka & Yoshio Saito, 2014. "Machining parameter optimisation by genetic algorithm and artificial neural network," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 6(3), pages 261-274.
    27. Pai, Ping-Feng & Lin, Chih-Sheng, 2005. "A hybrid ARIMA and support vector machines model in stock price forecasting," Omega, Elsevier, vol. 33(6), pages 497-505, December.
    28. Zoran Vojinovic & Vojislav Kecman & Rainer Seidel, 2001. "A data mining approach to financial time series modelling and forecasting," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 10(4), pages 225-239, December.
    29. Quinn Taylor & Christophe Giraud-Carrier & Charles D. Knutson, 2010. "Applications of data mining in software engineering," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 2(3), pages 243-257.
    30. Wun-Hua Chen & Jen-Ying Shih & Soushan Wu, 2006. "Comparison of support-vector machines and back propagation neural networks in forecasting the six major Asian stock markets," International Journal of Electronic Finance, Inderscience Enterprises Ltd, vol. 1(1), pages 49-67.
    31. M. Saimurugan & K.I. Ramachandran, 2014. "A comparative study of sound and vibration signals in detection of rotating machine faults using support vector machine and independent component analysis," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 6(2), pages 188-204.
    32. Mohamed Essaied Hamrita & Abdelkader Trifi, 2011. "The Relationship between Interest Rate, Exchange Rate and Stock Price: A Wavelet Analysis," International Journal of Economics and Financial Issues, Econjournals, vol. 1(4), pages 220-228.
    33. Tay, Francis E. H. & Cao, Lijuan, 2001. "Application of support vector machines in financial time series forecasting," Omega, Elsevier, vol. 29(4), pages 309-317, August.
    34. Armstrong, J. Scott & Collopy, Fred, 1992. "Error measures for generalizing about forecasting methods: Empirical comparisons," International Journal of Forecasting, Elsevier, vol. 8(1), pages 69-80, June.
    35. Dudyala Anil Kumar & V. Ravi, 2008. "Predicting credit card customer churn in banks using data mining," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 1(1), pages 4-28.
    36. Paramjeet & V. Ravi, 2011. "Bacterial foraging trained wavelet neural networks: application to bankruptcy prediction in banks," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 3(3), pages 261-280.
    37. Y.R. Ramesh Kumar & A. Govardhan & R.B.V. Subramanyam, 2013. "Predicting intraday prices in stock market transactions using similarity profiled temporal associations," International Journal of Data Analysis Techniques and Strategies, Inderscience Enterprises Ltd, vol. 5(3), pages 303-322.
    38. Dev Shah & Haruna Isah & Farhana Zulkernine, 2019. "Stock Market Analysis: A Review and Taxonomy of Prediction Techniques," IJFS, MDPI, vol. 7(2), pages 1-22, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gaeithry Manoharam & Mohd Shareduwan Mohd Kasihmuddin & Siti Noor Farwina Mohamad Anwar Antony & Nurul Atiqah Romli & Nur ‘Afifah Rusdi & Suad Abdeen & Mohd. Asyraf Mansor, 2023. "Log-Linear-Based Logic Mining with Multi-Discrete Hopfield Neural Network," Mathematics, MDPI, vol. 11(9), pages 1-30, April.
    2. Margrét Vilborg Bjarnadóttir & Louiqa Raschid, 2023. "Modeling Financial Products and Their Supply Chains," INFORMS Joural on Data Science, INFORMS, vol. 2(2), pages 138-160, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nawaf Almaskati, 2022. "Machine learning in finance: Major applications, issues, metrics, and future trends," International Journal of Financial Engineering (IJFE), World Scientific Publishing Co. Pte. Ltd., vol. 9(03), pages 1-32, September.
    2. Manolis Maragoudakis & Dimitrios Serpanos, 2016. "Exploiting Financial News and Social Media Opinions for Stock Market Analysis using MCMC Bayesian Inference," Computational Economics, Springer;Society for Computational Economics, vol. 47(4), pages 589-622, April.
    3. Kamaladdin Fataliyev & Aneesh Chivukula & Mukesh Prasad & Wei Liu, 2021. "Stock Market Analysis with Text Data: A Review," Papers 2106.12985, arXiv.org, revised Jul 2021.
    4. Smith, Simon C., 2021. "International stock return predictability," International Review of Financial Analysis, Elsevier, vol. 78(C).
    5. Thierry Warin & Aleksandar Stojkov, 2021. "Machine Learning in Finance: A Metadata-Based Systematic Review of the Literature," JRFM, MDPI, vol. 14(7), pages 1-31, July.
    6. Jasleen Kaur & Khushdeep Dharni, 2022. "Assessing efficacy of association rules for predicting global stock indices," DECISION: Official Journal of the Indian Institute of Management Calcutta, Springer;Indian Institute of Management Calcutta, vol. 49(3), pages 329-339, September.
    7. ?enol Emir & Hasan Din?er & Mehpare Timor, 2012. "A Stock Selection Model Based on Fundamental and Technical Analysis Variables by Using Artificial Neural Networks and Support Vector Machines," Review of Economics & Finance, Better Advances Press, Canada, vol. 2, pages 106-122, August.
    8. I. Marta Miranda García & María‐Jesús Segovia‐Vargas & Usue Mori & José A. Lozano, 2023. "Early prediction of Ibex 35 movements," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(5), pages 1150-1166, August.
    9. Lukas Ryll & Sebastian Seidens, 2019. "Evaluating the Performance of Machine Learning Algorithms in Financial Market Forecasting: A Comprehensive Survey," Papers 1906.07786, arXiv.org, revised Jul 2019.
    10. Gradojevic, Nikola & Kukolj, Dragan & Adcock, Robert & Djakovic, Vladimir, 2023. "Forecasting Bitcoin with technical analysis: A not-so-random forest?," International Journal of Forecasting, Elsevier, vol. 39(1), pages 1-17.
    11. Duan, Wen-Qi & Stanley, H. Eugene, 2011. "Cross-correlation and the predictability of financial return series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(2), pages 290-296.
    12. Smith, Simon C. & Timmermann, Allan, 2022. "Have risk premia vanished?," Journal of Financial Economics, Elsevier, vol. 145(2), pages 553-576.
    13. Dhanya Jothimani & Ravi Shankar & Surendra S. Yadav, 2016. "Discrete Wavelet Transform-Based Prediction of Stock Index: A Study on National Stock Exchange Fifty Index," Papers 1605.07278, arXiv.org.
    14. Bakalli, Gaetan & Guerrier, Stéphane & Scaillet, Olivier, 2023. "A penalized two-pass regression to predict stock returns with time-varying risk premia," Journal of Econometrics, Elsevier, vol. 237(2).
    15. Philippe Goulet Coulombe & Maxime Leroux & Dalibor Stevanovic & Stéphane Surprenant, 2022. "How is machine learning useful for macroeconomic forecasting?," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(5), pages 920-964, August.
    16. Wang, Yudong & Hao, Xianfeng, 2022. "Forecasting the real prices of crude oil: A robust weighted least squares approach," Energy Economics, Elsevier, vol. 116(C).
    17. Tobias Götze & Marc Gürtler & Eileen Witowski, 2020. "Improving CAT bond pricing models via machine learning," Journal of Asset Management, Palgrave Macmillan, vol. 21(5), pages 428-446, September.
    18. Wen, Danyan & Liu, Li & Wang, Yudong & Zhang, Yaojie, 2022. "Forecasting crude oil market returns: Enhanced moving average technical indicators," Resources Policy, Elsevier, vol. 76(C).
    19. Zhu, Haibin & Bai, Lu & He, Lidan & Liu, Zhi, 2023. "Forecasting realized volatility with machine learning: Panel data perspective," Journal of Empirical Finance, Elsevier, vol. 73(C), pages 251-271.
    20. Eghbal Rahimikia & Stefan Zohren & Ser-Huang Poon, 2021. "Realised Volatility Forecasting: Machine Learning via Financial Word Embedding," Papers 2108.00480, arXiv.org, revised Mar 2023.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wly:isacfm:v:29:y:2022:i:4:p:219-241. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.interscience.wiley.com/jpages/1099-1174/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.