IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2111.13519.html
   My bibliography  Save this paper

Graph Auto-Encoders for Financial Clustering

Author

Listed:
  • Edward Turner

Abstract

Deep learning has shown remarkable results on Euclidean data (e.g. audio, images, text) however this type of data is limited in the amount of relational information it can hold. In mathematics we can model more general relational data in a graph structure while retaining Euclidean data as associated node or edge features. Due to the ubiquity of graph data, and its ability to hold multiple dimensions of information, graph deep learning has become a fast emerging field. We look at applying and optimising graph deep learning on a finance graph to produce more informed clusters of companies. Having clusters produced from multiple streams of data can be highly useful in quantitative finance; not only does it allow clusters to be tailored to the specific task but the culmination of multiple streams allows for cross source pattern recognition that would have otherwise gone unnoticed. This can provide financial institutions with an edge over competitors which is crucial in the heavily optimised world of trading. In this paper we use news co-occurrence and stock price for our data combination. We optimise our model to achieve an average testing precision of 78% and find a clear improvement in clustering capabilities when dual data sources are used; cluster purity rises from 32% for just vertex data and 42% for just edge data to 64% when both are used in comparisons to ground-truth Bloomberg clusters. The framework we provide utilises unsupervised learning which we view as key for future work due to the volume of unlabelled data in financial markets.

Suggested Citation

  • Edward Turner, 2021. "Graph Auto-Encoders for Financial Clustering," Papers 2111.13519, arXiv.org, revised Dec 2021.
  • Handle: RePEc:arx:papers:2111.13519
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2111.13519
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Daiki Matsunaga & Toyotaro Suzumura & Toshihiro Takahashi, 2019. "Exploring Graph Neural Networks for Stock Market Predictions with Rolling Window Analysis," Papers 1909.10660, arXiv.org, revised Nov 2019.
    2. Xingchen Wan & Jie Yang & Slavi Marinov & Jan-Peter Calliess & Stefan Zohren & Xiaowen Dong, 2020. "Sentiment Correlation in Financial News Networks and Associated Market Movements," Papers 2011.06430, arXiv.org, revised Feb 2021.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dragos Gorduza & Xiaowen Dong & Stefan Zohren, 2022. "Understanding stock market instability via graph auto-encoders," Papers 2212.04974, arXiv.org.
    2. Gianluca Anese & Marco Corazza & Michele Costola & Loriana Pelizzon, 2023. "Impact of public news sentiment on stock market index return and volatility," Computational Management Science, Springer, vol. 20(1), pages 1-36, December.
    3. Rian Dolphin & Barry Smyth & Ruihai Dong, 2022. "A Multimodal Embedding-Based Approach to Industry Classification in Financial Markets," Papers 2211.06378, arXiv.org.
    4. Qinkai Chen, 2021. "Stock Movement Prediction with Financial News using Contextualized Embedding from BERT," Papers 2107.08721, arXiv.org.
    5. Costola, Michele & Hinz, Oliver & Nofer, Michael & Pelizzon, Loriana, 2023. "Machine learning sentiment analysis, COVID-19 news and stock market reactions," Research in International Business and Finance, Elsevier, vol. 64(C).
    6. Pei-Fen Tsai & Cheng-Han Gao & Shyan-Ming Yuan, 2023. "Stock Selection Using Machine Learning Based on Financial Ratios," Mathematics, MDPI, vol. 11(23), pages 1-18, November.
    7. Mittal, Varun & Schaposnik, Laura, 2022. "Housing market forecasts via stock market indicators," MPRA Paper 115009, University Library of Munich, Germany.
    8. Semen Budennyy & Alexey Kazakov & Elizaveta Kovtun & Leonid Zhukov, 2022. "New drugs and stock market: how to predict pharma market reaction to clinical trial announcements," Papers 2208.07248, arXiv.org, revised Aug 2022.
    9. Andrieş, Alin Marius & Ongena, Steven & Sprincean, Nicu & Tunaru, Radu, 2022. "Risk spillovers and interconnectedness between systemically important institutions," Journal of Financial Stability, Elsevier, vol. 58(C).
    10. Aurthur Vimalachandran Thomas Jayachandran, 2022. "The financial crash of 2020 and the retail trader’s boon: a correlation between sentiment and technical analysis," SN Business & Economics, Springer, vol. 2(6), pages 1-8, June.
    11. Wentao Xu & Weiqing Liu & Lewen Wang & Yingce Xia & Jiang Bian & Jian Yin & Tie-Yan Liu, 2021. "HIST: A Graph-based Framework for Stock Trend Forecasting via Mining Concept-Oriented Shared Information," Papers 2110.13716, arXiv.org, revised Jan 2022.
    12. Rohitash Chandra & Yixuan He, 2021. "Bayesian neural networks for stock price forecasting before and during COVID-19 pandemic," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-32, July.
    13. Alireza Jafari & Saman Haratizadeh, 2022. "NETpred: Network-based modeling and prediction of multiple connected market indices," Papers 2212.05916, arXiv.org.
    14. Xianchao Wu, 2020. "Event-Driven Learning of Systematic Behaviours in Stock Markets," Papers 2010.15586, arXiv.org.
    15. Jianian Wang & Sheng Zhang & Yanghua Xiao & Rui Song, 2021. "A Review on Graph Neural Network Methods in Financial Applications," Papers 2111.15367, arXiv.org, revised Apr 2022.
    16. Shawn McCarthy & Gita Alaghband, 2023. "Enhancing Financial Market Analysis and Prediction with Emotion Corpora and News Co-Occurrence Network," JRFM, MDPI, vol. 16(4), pages 1-19, April.
    17. Yuanrong Wang & Tomaso Aste, 2022. "Sparsification and Filtering for Spatial-temporal GNN in Multivariate Time-series," Papers 2203.03991, arXiv.org.
    18. Xiaohong Shen & Gaoshan Wang & Yue Wang & Alfred Peris, 2021. "The Influence of Research Reports on Stock Returns: The Mediating Effect of Machine-Learning-Based Investor Sentiment," Discrete Dynamics in Nature and Society, Hindawi, vol. 2021, pages 1-14, December.
    19. Loureiro, Maria L. & Alló, Maria & Coello, Pablo, 2022. "Hot in Twitter: Assessing the emotional impacts of wildfires with sentiment analysis," Ecological Economics, Elsevier, vol. 200(C).
    20. Wentao Xu & Weiqing Liu & Chang Xu & Jiang Bian & Jian Yin & Tie-Yan Liu, 2021. "REST: Relational Event-driven Stock Trend Forecasting," Papers 2102.07372, arXiv.org, revised Feb 2021.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2111.13519. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.