IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2510.16636.html
   My bibliography  Save this paper

A three-step machine learning approach to predict market bubbles with financial news

Author

Listed:
  • Abraham Atsiwo

Abstract

This study presents a three-step machine learning framework to predict bubbles in the S&P 500 stock market by combining financial news sentiment with macroeconomic indicators. Building on traditional econometric approaches, the proposed approach predicts bubble formation by integrating textual and quantitative data sources. In the first step, bubble periods in the S&P 500 index are identified using a right-tailed unit root test, a widely recognized real-time bubble detection method. The second step extracts sentiment features from large-scale financial news articles using natural language processing (NLP) techniques, which capture investors' expectations and behavioral patterns. In the final step, ensemble learning methods are applied to predict bubble occurrences based on high sentiment-based and macroeconomic predictors. Model performance is evaluated through k-fold cross-validation and compared against benchmark machine learning algorithms. Empirical results indicate that the proposed three-step ensemble approach significantly improves predictive accuracy and robustness, providing valuable early warning insights for investors, regulators, and policymakers in mitigating systemic financial risks.

Suggested Citation

  • Abraham Atsiwo, 2025. "A three-step machine learning approach to predict market bubbles with financial news," Papers 2510.16636, arXiv.org.
  • Handle: RePEc:arx:papers:2510.16636
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2510.16636
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Peter C. B. Phillips & Shuping Shi & Jun Yu, 2015. "Testing For Multiple Bubbles: Historical Episodes Of Exuberance And Collapse In The S&P 500," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 56(4), pages 1043-1078, November.
    2. Peter C. B. Phillips & Yangru Wu & Jun Yu, 2011. "EXPLOSIVE BEHAVIOR IN THE 1990s NASDAQ: WHEN DID EXUBERANCE ESCALATE ASSET VALUES?," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 52(1), pages 201-226, February.
    3. Peter C. B. Phillips & Shuping Shi & Jun Yu, 2015. "Testing For Multiple Bubbles: Limit Theory Of Real‐Time Detectors," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 56(4), pages 1079-1134, November.
    4. Anders Johansen & Didier Sornette, 2010. "Shocks, Crashes and Bubbles in Financial Markets," Brussels Economic Review, ULB -- Universite Libre de Bruxelles, vol. 53(2), pages 201-253.
    5. George A. Akerlof & Olivier Blanchard & David Romer & Joseph E. Stiglitz (ed.), 2014. "What Have We Learned? Macroeconomic Policy After the Crisis," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262027348, December.
    6. Peter C. B. Phillips & Shuping Shi & Jun Yu, 2015. "Testing For Multiple Bubbles: Limit Theory Of Real‐Time Detectors," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 56, pages 1079-1134, November.
    7. Peter C. B. Phillips & Shuping Shi & Jun Yu, 2015. "Testing For Multiple Bubbles: Historical Episodes Of Exuberance And Collapse In The S&P 500," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 56, pages 1043-1078, November.
    8. Galbraith, JohnW. & Zinde-Walsh, Victoria, 1999. "On the distributions of Augmented Dickey-Fuller statistics in processes with moving average components," Journal of Econometrics, Elsevier, vol. 93(1), pages 25-47, November.
    9. Anders Johansen & Didier Sornette, 1999. "Log-periodic power law bubbles in Latin-American and Asian markets and correlated anti-bubbles in Western stock markets: An empirical study," Finance 9907004, University Library of Munich, Germany.
    10. Bates, David S., 1996. "Dollar jump fears, 1984-1992: distributional abnormalities implicit in currency futures options," Journal of International Money and Finance, Elsevier, vol. 15(1), pages 65-93, February.
    11. Evans, George W, 1991. "Pitfalls in Testing for Explosive Bubbles in Asset Prices," American Economic Review, American Economic Association, vol. 81(4), pages 922-930, September.
    12. Yoosoon Chang & Joon Park, 2002. "On The Asymptotics Of Adf Tests For Unit Roots," Econometric Reviews, Taylor & Francis Journals, vol. 21(4), pages 431-447.
    13. Kamaladdin Fataliyev & Aneesh Chivukula & Mukesh Prasad & Wei Liu, 2021. "Stock Market Analysis with Text Data: A Review," Papers 2106.12985, arXiv.org, revised Jul 2021.
    14. Abraham Atsiwo & Andrey Sarantsev, 2024. "Capital Asset Pricing Model with Size Factor and Normalizing by Volatility Index," Papers 2411.19444, arXiv.org, revised Dec 2024.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Adrian Fernández-Pérez & Marta Gómez-Puig & Simón Sosvilla-Rivero, 2025. "El Clasico of Housing: Bubbles in Madrid and Barcelona’s Real Estate Markets," Documentos de Trabajo del ICAE 2025-03, Universidad Complutense de Madrid, Facultad de Ciencias Económicas y Empresariales, Instituto Complutense de Análisis Económico.
    2. Wang, Xichen & Yan, Ji (Karena) & Yan, Cheng & Gozgor, Giray, 2021. "Emerging stock market exuberance and international short-term flows," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 75(C).
    3. Janusz Sobieraj & Dominik Metelski, 2021. "Testing Housing Markets for Episodes of Exuberance: Evidence from Different Polish Cities," JRFM, MDPI, vol. 14(9), pages 1-29, September.
    4. Caravello, Tomas E. & Psaradakis, Zacharias & Sola, Martin, 2023. "Rational bubbles: Too many to be true?," Journal of Economic Dynamics and Control, Elsevier, vol. 151(C).
    5. Cincinelli, Peter & Pellini, Elisabetta, 2025. "The role of geopolitical and climate risk in driving uncertainty in European electricity markets," Energy Economics, Elsevier, vol. 144(C).
    6. Coppola, Anna & Urga, Giovanni & Varaldo, Alessandro, 2025. "Asset class liquidity risk indicators. Timing the risk in the European and US equity and bond markets," Journal of Financial Stability, Elsevier, vol. 76(C).
    7. Podhorsky, Andrea, 2024. "Bursting the bitcoin bubble: Do market prices reflect fundamental bitcoin value?," International Review of Financial Analysis, Elsevier, vol. 93(C).
    8. Fan, John Hua & Fernandez-Perez, Adrian & Indriawan, Ivan & Todorova, Neda, 2024. "When Chinese mania meets global frenzy: Commodity price bubbles," Journal of Commodity Markets, Elsevier, vol. 36(C).
    9. Ariza, Juan & Ferrer, Román, 2025. "Explosiveness in the renewable energy equity sector: International evidence," The North American Journal of Economics and Finance, Elsevier, vol. 76(C).
    10. Li, Yi & Zhang, Wei & Urquhart, Andrew & Wang, Pengfei, 2022. "The role of media coverage in the bubble formation: Evidence from the Bitcoin market," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 80(C).
    11. Cervera, Ignacio & Figuerola-Ferretti, Isabel, 2024. "Credit risk and bubble behavior of credit default swaps in the corporate energy sector," International Review of Economics & Finance, Elsevier, vol. 89(PA), pages 702-731.
    12. Lui, Yiu Lim & Phillips, Peter C.B. & Yu, Jun, 2024. "Robust testing for explosive behavior with strongly dependent errors," Journal of Econometrics, Elsevier, vol. 238(2).
    13. Kong, Xiaolin & Ma, Chaoqun & Ren, Yi-Shuai & Baltas, Konstantinos & Narayan, Seema, 2024. "A comparative analysis of the price explosiveness in Bitcoin and forked coins," Finance Research Letters, Elsevier, vol. 61(C).
    14. Koulmas, Pavlos & Konstantakis, Konstantinos N. & Michaelides, Panayotis G. & Karadimitropoulou, Aikaterini & Karkalakos, Sotiris, 2024. "Energy firms in China towards resilience: A dynamic quantile connectedness approach," Energy Economics, Elsevier, vol. 139(C).
    15. Esteve, Vicente & Prats, María A., 2023. "Testing explosive bubbles with time-varying volatility: The case of Spanish public debt," Finance Research Letters, Elsevier, vol. 51(C).
    16. Aktham Maghyereh & Hussein Abdoh, 2022. "Bubble contagion effect between the main precious metals," Studies in Economics and Finance, Emerald Group Publishing Limited, vol. 40(1), pages 43-63, March.
    17. Hurn, Stan & Shi, Shuping & Wang, Ben, 2022. "Housing networks and driving forces," Journal of Banking & Finance, Elsevier, vol. 134(C).
    18. Michaelides, Panayotis G. & Tsionas, Efthymios G. & Konstantakis, Konstantinos N., 2016. "Non-linearities in financial bubbles: Theory and Bayesian evidence from S&P500," Journal of Financial Stability, Elsevier, vol. 24(C), pages 61-70.
    19. Yang, Hui & Ferrer, Román, 2023. "Explosive behavior in the Chinese stock market: A sectoral analysis," Pacific-Basin Finance Journal, Elsevier, vol. 81(C).
    20. Shuping Shi & Arafat Rahman & Ben Zhe Wang, 2020. "Australian Housing Market Booms: Fundamentals or Speculation?☆," The Economic Record, The Economic Society of Australia, vol. 96(315), pages 381-401, December.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2510.16636. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.