IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0287754.html
   My bibliography  Save this article

Prediction of stock price movement using an improved NSGA-II-RF algorithm with a three-stage feature engineering process

Author

Listed:
  • Xiaohua Zeng
  • Jieping Cai
  • Changzhou Liang
  • Chiping Yuan

Abstract

Prediction of stock price has been a hot topic in artificial intelligence field. Computational intelligent methods such as machine learning or deep learning are explored in the prediction system in recent years. However, making accurate predictions of stock price direction is still a big challenge because stock prices are affected by nonlinear, nonstationary, and high dimensional features. In previous works, feature engineering was overlooked. How to select the optimal feature sets that affect stock price is a prominent solution. Hence, our motivation for this article is to propose an improved many-objective optimization algorithm integrating random forest (I-NSGA-II-RF) algorithm with a three-stage feature engineering process in order to decrease the computational complexity and improve the accuracy of prediction system. Maximizing accuracy and minimizing the optimal solution set are the optimization directions of the model in this study. The integrated information initialization population of two filtered feature selection methods is used to optimize the I-NSGA-II algorithm, using multiple chromosome hybrid coding to synchronously select features and optimize model parameters. Finally, the selected feature subset and parameters are input to the RF for training, prediction, and iterative optimization. Experimental results show that the I-NSGA-II-RF algorithm has the highest average accuracy, the smallest optimal solution set, and the shortest running time compared to the unmodified multi-objective feature selection algorithm and the single target feature selection algorithm. Compared to the deep learning model, this model has interpretability, higher accuracy, and less running time.

Suggested Citation

  • Xiaohua Zeng & Jieping Cai & Changzhou Liang & Chiping Yuan, 2023. "Prediction of stock price movement using an improved NSGA-II-RF algorithm with a three-stage feature engineering process," PLOS ONE, Public Library of Science, vol. 18(6), pages 1-30, June.
  • Handle: RePEc:plo:pone00:0287754
    DOI: 10.1371/journal.pone.0287754
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0287754
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0287754&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0287754?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Basak, Suryoday & Kar, Saibal & Saha, Snehanshu & Khaidem, Luckyson & Dey, Sudeepa Roy, 2019. "Predicting the direction of stock market prices using tree-based classifiers," The North American Journal of Economics and Finance, Elsevier, vol. 47(C), pages 552-567.
    2. Zuo, Wei & Wang, Zijie & E, Jiaqiang & Li, Qingqing & Cheng, Qianju & Wu, Yinkun & Zhou, Kun, 2023. "Numerical investigations on the performance of a hydrogen-fueled micro planar combustor with tube outlet for thermophotovoltaic applications," Energy, Elsevier, vol. 263(PC).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Baoqiang Zhan & Shu Zhang & Helen S. Du & Xiaoguang Yang, 2022. "Exploring Statistical Arbitrage Opportunities Using Machine Learning Strategy," Computational Economics, Springer;Society for Computational Economics, vol. 60(3), pages 861-882, October.
    2. Zhao, He & Zhao, Dan & Sun, Dakun & Semlitsch, Bernhard, 2024. "Electrical power, energy efficiency, NO and CO emissions investigations of an ammonia/methane-fueled micro-thermal photovoltaic system with a reduced chemical reaction mechanism," Energy, Elsevier, vol. 305(C).
    3. Ahmad Kianrad & Mohadeseh Najafi Arani & Karim Hasani & Masoumeh Zargar & Eila Erfani & Amir Razmjou, 2024. "Investigating the impact of company announcements on stock prices: an application of machine learning on Australian lithium market," Mineral Economics, Springer;Raw Materials Group (RMG);Luleå University of Technology, vol. 37(1), pages 163-172, March.
    4. Henriques, Irene & Sadorsky, Perry, 2023. "Forecasting rare earth stock prices with machine learning," Resources Policy, Elsevier, vol. 86(PA).
    5. Dai, Churong & Zuo, Wei & Li, Qingqing & Zhou, Kun & Huang, Yuhan & Zhang, Guangde & E, Jiaqiang, 2024. "Energy conversion efficiency improvement studies on the hydrogen-fueled micro planar combustor with multi-baffles for thermophotovoltaic applications," Energy, Elsevier, vol. 313(C).
    6. Wang, Jianzhou & Lv, Mengzheng & Wang, Shuai & Gao, Jialu & Zhao, Yang & Wang, Qiangqiang, 2024. "Can multi-period auto-portfolio systems improve returns? Evidence from Chinese and U.S. stock markets," International Review of Financial Analysis, Elsevier, vol. 95(PB).
    7. Saqib Farid & Rubeena Tashfeen & Tahseen Mohsan & Arsal Burhan, 2023. "Forecasting stock prices using a data mining method: Evidence from emerging market," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 28(2), pages 1911-1917, April.
    8. Zhou, Zhongbao & Gao, Meng & Liu, Qing & Xiao, Helu, 2020. "Forecasting stock price movements with multiple data sources: Evidence from stock market in China," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 542(C).
    9. Barboza, Flavio & Altman, Edward, 2024. "Predicting financial distress in Latin American companies: A comparative analysis of logistic regression and random forest models," The North American Journal of Economics and Finance, Elsevier, vol. 72(C).
    10. Htet Htet Htun & Michael Biehl & Nicolai Petkov, 2024. "Forecasting relative returns for S&P 500 stocks using machine learning," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 10(1), pages 1-16, December.
    11. Şirin Özlem & Omer Faruk Tan, 2022. "Predicting cash holdings using supervised machine learning algorithms," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 8(1), pages 1-19, December.
    12. Vitor Azevedo & Christopher Hoegner, 2023. "Enhancing stock market anomalies with machine learning," Review of Quantitative Finance and Accounting, Springer, vol. 60(1), pages 195-230, January.
    13. Htet Htet Htun & Michael Biehl & Nicolai Petkov, 2023. "Survey of feature selection and extraction techniques for stock market prediction," Financial Innovation, Springer;Southwestern University of Finance and Economics, vol. 9(1), pages 1-25, December.
    14. Han Gui, 2024. "Machine learning in weekly movement prediction," Papers 2407.09831, arXiv.org.
    15. Yizhe Xu & Tom H. Greene & Adam P. Bress & Brandon K. Bellows & Yue Zhang & Zugui Zhang & Paul Kolm & William S. Weintraub & Andrew S. Moran & Jincheng Shen, 2022. "An Efficient Approach for Optimizing the Cost-effective Individualized Treatment Rule Using Conditional Random Forest," Papers 2204.10971, arXiv.org.
    16. Xu, Yingying & Dai, Yifan & Guo, Lingling & Chen, Jingjing, 2024. "Leveraging machine learning to forecast carbon returns: Factors from energy markets," Applied Energy, Elsevier, vol. 357(C).
    17. He, Ziqiang & You, Jingxiang & Kang, Dugang & Zou, Qunfeng & Zhang, Wenxiang & Zhang, Zhien, 2024. "Overall numerical simulation of chemical-thermal-electric conversion for an all-in-one thermoelectric generator based on micro scale combustion," Energy, Elsevier, vol. 292(C).
    18. Zhao, He & Zhao, Dan & Becker, Sid & Rong, Hui & Zhao, Xiaohuan, 2023. "Entropy generation and improved thermal performance investigation on a hydrogen-fuelled double-channel microcombustor with Y-shaped internal fins," Energy, Elsevier, vol. 283(C).
    19. Yang, Yanlin & Hu, Xuemei & Jiang, Huifeng, 2022. "Group penalized logistic regressions predict up and down trends for stock prices," The North American Journal of Economics and Finance, Elsevier, vol. 59(C).
    20. Mercadier, Mathieu & Lardy, Jean-Pierre, 2019. "Credit spread approximation and improvement using random forest regression," European Journal of Operational Research, Elsevier, vol. 277(1), pages 351-365.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0287754. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.