IDEAS home Printed from https://ideas.repec.org/a/ibn/masjnl/v3y2009i12p28.html

Prediction of Stock Market Index Movement by Ten Data Mining Techniques

Author

Listed:
  • Phichhang Ou
  • Hengshan Wang

Abstract

Ability to predict direction of stock/index price accurately is crucial for market dealers or investors to maximize their profits. Data mining techniques have been successfully shown to generate high forecasting accuracy of stock price movement. Nowadays, in stead of a single method, traders need to use various forecasting techniques to gain multiple signals and more information about the future of the markets. In this paper, ten different techniques of data mining are discussed and applied to predict price movement of Hang Seng index of Hong Kong stock market. The approaches include Linear discriminant analysis (LDA), Quadratic discriminant analysis (QDA), K-nearest neighbor classification, Naïve Bayes based on kernel estimation, Logit model, Tree based classification, neural network, Bayesian classification with Gaussian process, Support vector machine (SVM) and Least squares support vector machine (LS-SVM). Experimental results show that the SVM and LS-SVM generate superior predictive performances among the other models. Specifically, SVM is better than LS-SVM for in-sample prediction but LS-SVM is, in turn, better than the SVM for the out-of-sample forecasts in term of hit rate and error rate criteria.

Suggested Citation

  • Phichhang Ou & Hengshan Wang, 2009. "Prediction of Stock Market Index Movement by Ten Data Mining Techniques," Modern Applied Science, Canadian Center of Science and Education, vol. 3(12), pages 1-28, December.
  • Handle: RePEc:ibn:masjnl:v:3:y:2009:i:12:p:28
    as

    Download full text from publisher

    File URL: https://ccsenet.org/journal/index.php/mas/article/download/4586/3925
    Download Restriction: no

    File URL: https://ccsenet.org/journal/index.php/mas/article/view/4586
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Yangru Wu & Hua Zhang, 1997. "Forward premiums as unbiased predictors of future currency depreciation: a non-parametric analysis," Journal of International Money and Finance, Elsevier, vol. 16(4), pages 609-623, August.
    2. Tay, Francis E. H. & Cao, Lijuan, 2001. "Application of support vector machines in financial time series forecasting," Omega, Elsevier, vol. 29(4), pages 309-317, August.
    3. Karatzoglou, Alexandros & Meyer, David & Hornik, Kurt, 2006. "Support Vector Machines in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 15(i09).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jasleen Kaur & Khushdeep Dharni, 2022. "Application and performance of data mining techniques in stock market: A review," Intelligent Systems in Accounting, Finance and Management, John Wiley & Sons, Ltd., vol. 29(4), pages 219-241, October.
    2. Görkem Ataman & Serpil Kahraman, 2022. "Comparing Decision Trees and Association Rules for Stock Market Expectations in BIST100 and BIST30," Scientific Annals of Economics and Business (continues Analele Stiintifice), Alexandru Ioan Cuza University, Faculty of Economics and Business Administration, vol. 69(3), pages 459-475, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Andrey Zahariev & Mikhail Zveryаkov & Stoyan Prodanov & Galina Zaharieva & Petko Angelov & Silvia Zarkova & Mariana Petrova, 2020. "Debt management evaluation through Support Vector Machines: on the example of Italy and Greece," Entrepreneurship and Sustainability Issues, VsI Entrepreneurship and Sustainability Center, vol. 7(3), pages 2382-2393, March.
    2. Ślepaczuk Robert & Zenkova Maryna, 2018. "Robustness of Support Vector Machines in Algorithmic Trading on Cryptocurrency Market," Central European Economic Journal, Sciendo, vol. 5(52), pages 186-205, January.
    3. Wang, Chao & Lim, Ming K & Zhao, Longfeng & Tseng, Ming-Lang & Chien, Chen-Fu & Lev, Benjamin, 2020. "The evolution of Omega-The International Journal of Management Science over the past 40 years: A bibliometric overview," Omega, Elsevier, vol. 93(C).
    4. Fethi, Meryem Duygun & Pasiouras, Fotios, 2010. "Assessing bank efficiency and performance with operational research and artificial intelligence techniques: A survey," European Journal of Operational Research, Elsevier, vol. 204(2), pages 189-198, July.
    5. Deng, S. & Yeh, Tsung-Han, 2011. "Using least squares support vector machines for the airframe structures manufacturing cost estimation," International Journal of Production Economics, Elsevier, vol. 131(2), pages 701-708, June.
    6. Samia Zaoui & Clovis Foguem & Dieudonné Tchuente & Samuel Fosso-Wamba & Bernard Kamsu-Foguem, 2023. "The Viability of Supply Chains with Interpretable Learning Systems: The Case of COVID-19 Vaccine Deliveries," Global Journal of Flexible Systems Management, Springer;Global Institute of Flexible Systems Management, vol. 24(4), pages 633-657, December.
    7. Grossmann, Axel & Lee, Allissa A. & Simpson, Marc W., 2014. "Forward premium anomaly of the British pound and the euro," International Review of Financial Analysis, Elsevier, vol. 34(C), pages 140-156.
    8. Yanqin Bai & Xin Yan, 2016. "Conic Relaxations for Semi-supervised Support Vector Machines," Journal of Optimization Theory and Applications, Springer, vol. 169(1), pages 299-313, April.
    9. Akın, Melda, 2015. "A novel approach to model selection in tourism demand modeling," Tourism Management, Elsevier, vol. 48(C), pages 64-72.
    10. M. Tanveer & T. Rajani & R. Rastogi & Y. H. Shao & M. A. Ganaie, 2024. "Comprehensive review on twin support vector machines," Annals of Operations Research, Springer, vol. 339(3), pages 1223-1268, August.
    11. ?enol Emir & Hasan Din?er & Mehpare Timor, 2012. "A Stock Selection Model Based on Fundamental and Technical Analysis Variables by Using Artificial Neural Networks and Support Vector Machines," Review of Economics & Finance, Better Advances Press, Canada, vol. 2, pages 106-122, August.
    12. I. Marta Miranda García & María‐Jesús Segovia‐Vargas & Usue Mori & José A. Lozano, 2023. "Early prediction of Ibex 35 movements," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(5), pages 1150-1166, August.
    13. Kim, Hong Sik & Sohn, So Young, 2010. "Support vector machines for default prediction of SMEs based on technology credit," European Journal of Operational Research, Elsevier, vol. 201(3), pages 838-846, March.
    14. Hakob GRIGORYAN, 2016. "A Stock Market Prediction Method Based on Support Vector Machines (SVM) and Independent Component Analysis (ICA)," Database Systems Journal, Academy of Economic Studies - Bucharest, Romania, vol. 7(1), pages 12-21, August.
    15. Bundzel, Marek & Kasanický, Tomáš & Pinčák, Richard, 2016. "Using string invariants for prediction searching for optimal parameters," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 444(C), pages 680-688.
    16. Kyoung-jae Kim & Kichun Lee & Hyunchul Ahn, 2018. "Predicting Corporate Financial Sustainability Using Novel Business Analytics," Sustainability, MDPI, vol. 11(1), pages 1-17, December.
    17. Paolo Sorino & Maria Gabriella Caruso & Giovanni Misciagna & Caterina Bonfiglio & Angelo Campanella & Antonella Mirizzi & Isabella Franco & Antonella Bianco & Claudia Buongiorno & Rosalba Liuzzi & Ann, 2020. "Selecting the best machine learning algorithm to support the diagnosis of Non-Alcoholic Fatty Liver Disease: A meta learner study," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-15, October.
    18. Adam Korniejczuk & Robert 'Slepaczuk, 2024. "Statistical arbitrage in multi-pair trading strategy based on graph clustering algorithms in US equities market," Papers 2406.10695, arXiv.org.
    19. Mingyue Qiu & Yu Song, 2016. "Predicting the Direction of Stock Market Index Movement Using an Optimized Artificial Neural Network Model," PLOS ONE, Public Library of Science, vol. 11(5), pages 1-11, May.
    20. Mojtaba Sedighi & Majid Mohammadi & Saeed Farahani Fard & Mehdi Sedighi, 2019. "The Nexus between Stock Returns of Oil Companies and Oil Price Fluctuations after Heavy Oil Upgrading: Toward Theoretical Progress," Economies, MDPI, vol. 7(3), pages 1-17, July.

    More about this item

    JEL classification:

    • R00 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General - - - General
    • Z0 - Other Special Topics - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ibn:masjnl:v:3:y:2009:i:12:p:28. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Canadian Center of Science and Education (email available below). General contact details of provider: https://edirc.repec.org/data/cepflch.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.