IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2502.16023.html

Contrastive Similarity Learning for Market Forecasting: The ContraSim Framework

Author

Listed:
  • Nicholas Vinden
  • Raeid Saqur
  • Zining Zhu
  • Frank Rudzicz

Abstract

We introduce the Contrastive Similarity Space Embedding Algorithm (ContraSim), a novel framework for uncovering the global semantic relationships between daily financial headlines and market movements. ContraSim operates in two key stages: (I) Weighted Headline Augmentation, which generates augmented financial headlines along with a semantic fine-grained similarity score, and (II) Weighted Self-Supervised Contrastive Learning (WSSCL), an extended version of classical self-supervised contrastive learning that uses the similarity metric to create a refined weighted embedding space. This embedding space clusters semantically similar headlines together, facilitating deeper market insights. Empirical results demonstrate that integrating ContraSim features into financial forecasting tasks improves classification accuracy from WSJ headlines by 7%. Moreover, leveraging an information density analysis, we find that the similarity spaces constructed by ContraSim intrinsically cluster days with homogeneous market movement directions, indicating that ContraSim captures market dynamics independent of ground truth labels. Additionally, ContraSim enables the identification of historical news days that closely resemble the headlines of the current day, providing analysts with actionable insights to predict market trends by referencing analogous past events.

Suggested Citation

  • Nicholas Vinden & Raeid Saqur & Zining Zhu & Frank Rudzicz, 2025. "Contrastive Similarity Learning for Market Forecasting: The ContraSim Framework," Papers 2502.16023, arXiv.org.
  • Handle: RePEc:arx:papers:2502.16023
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2502.16023
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sims, Christopher A, 1980. "Macroeconomics and Reality," Econometrica, Econometric Society, vol. 48(1), pages 1-48, January.
    2. Robert Engle & Clive Granger, 2015. "Co-integration and error correction: Representation, estimation, and testing," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 39(3), pages 106-135.
    3. Raeid Saqur, 2024. "What Teaches Robots to Walk, Teaches Them to Trade too -- Regime Adaptive Execution using Informed Data and LLMs," Papers 2406.15508, arXiv.org.
    4. Raeid Saqur & Ken Kato & Nicholas Vinden & Frank Rudzicz, 2024. "NIFTY Financial News Headlines Dataset," Papers 2405.09747, arXiv.org.
    5. Bollerslev, Tim, 1986. "Generalized autoregressive conditional heteroskedasticity," Journal of Econometrics, Elsevier, vol. 31(3), pages 307-327, April.
    6. Hamilton, James D, 1989. "A New Approach to the Economic Analysis of Nonstationary Time Series and the Business Cycle," Econometrica, Econometric Society, vol. 57(2), pages 357-384, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Diebold, F.X. & Kilian, L. & Nerlove, Marc, 2006. "Time Series Analysis," Working Papers 28556, University of Maryland, Department of Agricultural and Resource Economics.
    2. Claudio Morana, 2014. "Factor Vector Autoregressive Estimation of Heteroskedastic Persistent and Non Persistent Processes Subject to Structural Breaks," Working Papers 273, University of Milano-Bicocca, Department of Economics, revised May 2014.
    3. Taufiq Supriadi & Kurniawan Tjakrawala & Nyoman Adhi Suryadnyana & Juska Meidy Enyke Sjam & Rochman Marota, 2025. "Fraud Prevention in the Public Sector: The Role of Internal Audit," Economic Studies journal, Bulgarian Academy of Sciences - Economic Research Institute, issue 3, pages 170-183.
    4. Pami Dua & Nishita Raje & Satyananda Sahoo, 2004. "Interest Rate Modeling and Forecasting in India," Occasional papers 3, Centre for Development Economics, Delhi School of Economics.
    5. David Greasley & Les Oxley, 2010. "Cliometrics And Time Series Econometrics: Some Theory And Applications," Journal of Economic Surveys, Wiley Blackwell, vol. 24(5), pages 970-1042, December.
    6. Bolós, V.J. & Benítez, R. & Ferrer, R. & Jammazi, R., 2017. "The windowed scalogram difference: A novel wavelet tool for comparing time series," Applied Mathematics and Computation, Elsevier, vol. 312(C), pages 49-65.
    7. Adrian C. Darnell, 1994. "A Dictionary Of Econometrics," Books, Edward Elgar Publishing, number 118, June.
    8. Pinar Unal, 2025. "The Interrelation Between the Carbon Trading Systems and Energy Markets and Economic Outlook: A Comparative Analysis Using VECM and ARDL," Economic Studies journal, Bulgarian Academy of Sciences - Economic Research Institute, issue 3, pages 145-169.
    9. Diebold, Francis X & Rudebusch, Glenn D, 1996. "Measuring Business Cycles: A Modern Perspective," The Review of Economics and Statistics, MIT Press, vol. 78(1), pages 67-77, February.
    10. Dominique Guegan & Patrick Rakotomarolahy, 2010. "Alternative methods for forecasting GDP," Post-Print halshs-00511979, HAL.
    11. Zhou, Kaile & Li, Yiwen, 2019. "Influencing factors and fluctuation characteristics of China’s carbon emission trading price," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 524(C), pages 459-474.
    12. Stefan Sauer & Klaus Wohlrabe, 2020. "ifo Handbuch der Konjunkturumfragen," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 88.
    13. Apostolos Serletis & Libo Xu, 2024. "Inflation uncertainty," Empirical Economics, Springer, vol. 66(5), pages 1903-1920, May.
    14. Christopher L. Gilbert & Duo Qin, 2007. "Representation in Econometrics: A Historical Perspective," Working Papers 583, Queen Mary University of London, School of Economics and Finance.
    15. Kerry Patterson & Michael A. Thornton, 2013. "A review of econometric concepts and methods for empirical macroeconomics," Chapters, in: Nigar Hashimzade & Michael A. Thornton (ed.), Handbook of Research Methods and Applications in Empirical Macroeconomics, chapter 2, pages 4-42, Edward Elgar Publishing.
    16. Kocenda, Evzen, 1998. "Exchange rate in transition," MPRA Paper 32030, University Library of Munich, Germany.
    17. Nobel Prize Committee, 2003. "Time-series Econometrics: Cointegration and Autoregressive Conditional Heteroskedasticity," Nobel Prize in Economics documents 2003-1, Nobel Prize Committee.
    18. John D. Levendis, 2018. "Time Series Econometrics," Springer Texts in Business and Economics, Springer, number 978-3-319-98282-3, December.
    19. Francis X. Diebold, 1998. "The Past, Present, and Future of Macroeconomic Forecasting," Journal of Economic Perspectives, American Economic Association, vol. 12(2), pages 175-192, Spring.
    20. Alain Monfort, 1992. "Quelques développements récents des méthodes macroéconométriques," L'Actualité Economique, Société Canadienne de Science Economique, vol. 68(1), pages 305-324.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2502.16023. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.