IDEAS home Printed from https://ideas.repec.org/h/wsi/wschap/9789811281747_0003.html
   My bibliography  Save this book chapter

Mathematics of Embeddings: Spillover of Polarities over Financial Texts

In: REVIEWS IN MODERN QUANTITATIVE FINANCE

Author

Listed:
  • Mengda Li
  • Charles-Albert Lehalle

Abstract

In this chapter, we perform a mathematical analysis of the word2vec model. This sheds light on how the decision to use such a model makes implicit assumptions on the structure of the language. Beside, under Markovian assumptions that we discuss, we provide a very clear theoretical understanding of the formation of embeddings and, in particular, the way it captures what we call frequentist synonyms. These assumptions allow to conduct an explicit analysis of the loss function commonly used by these NLP techniques that asymptotically reaches a cross-entropy between the language model and the underlying true generative model.Moreover, we produce synthetic corpora with different levels of structures and show empirically how the word2vec algorithm succeed, or not, to learn them. It leads us to empirically assess the capability of such models to capture structures on a corpus of around 42 millions of financial news covering 12 years. And, we rely on the Loughran–McDonald Sentiment Polarity Word Lists and we show that embeddings are exposed to mixing terms with opposite polarity because of the way they treat antonyms as frequentist synonyms. Besides, we study the non-stationarity of such a financial corpus that has surprisingly not be documented in the literature.

Suggested Citation

  • Mengda Li & Charles-Albert Lehalle, 2024. "Mathematics of Embeddings: Spillover of Polarities over Financial Texts," World Scientific Book Chapters, in: Andrey Itkin (ed.), REVIEWS IN MODERN QUANTITATIVE FINANCE, chapter 3, pages 151-188, World Scientific Publishing Co. Pte. Ltd..
  • Handle: RePEc:wsi:wschap:9789811281747_0003
    as

    Download full text from publisher

    File URL: https://www.worldscientific.com/doi/pdf/10.1142/9789811281747_0003
    Download Restriction: Ebook Access is available upon purchase.

    File URL: https://www.worldscientific.com/doi/abs/10.1142/9789811281747_0003
    Download Restriction: Ebook Access is available upon purchase.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    More about this item

    Keywords

    Quantitative Finance; Financial Engineering; Mathematical Finance; Computational Finance; Computational Methods; Computational Problems; Pricing of Securities; Trading; Market Microstructures; Risk Theory; Queuing Theory; Asset Management Technique; Liability Management Technique; Risk Measures; Solvency; Financial Instability; Fintech; Cryptocurrencies; Financial Machine Learning; Artificial Intelligence; Fintech; Quantum Computing; Distributed Ledgers; Econophysics;
    All these keywords.

    JEL classification:

    • C02 - Mathematical and Quantitative Methods - - General - - - Mathematical Economics
    • C - Mathematical and Quantitative Methods
    • C6 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling
    • C61 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Optimization Techniques; Programming Models; Dynamic Analysis

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:wschap:9789811281747_0003. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscientific.com/page/worldscibooks .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.