IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2106.12987.html
   My bibliography  Save this paper

Fund2Vec: Mutual Funds Similarity using Graph Learning

Author

Listed:
  • Vipul Satone
  • Dhruv Desai
  • Dhagash Mehta

Abstract

Identifying similar mutual funds with respect to the underlying portfolios has found many applications in financial services ranging from fund recommender systems, competitors analysis, portfolio analytics, marketing and sales, etc. The traditional methods are either qualitative, and hence prone to biases and often not reproducible, or, are known not to capture all the nuances (non-linearities) among the portfolios from the raw data. We propose a radically new approach to identify similar funds based on the weighted bipartite network representation of funds and their underlying assets data using a sophisticated machine learning method called Node2Vec which learns an embedded low-dimensional representation of the network. We call the embedding \emph{Fund2Vec}. Ours is the first ever study of the weighted bipartite network representation of the funds-assets network in its original form that identifies structural similarity among portfolios as opposed to merely portfolio overlaps.

Suggested Citation

  • Vipul Satone & Dhruv Desai & Dhagash Mehta, 2021. "Fund2Vec: Mutual Funds Similarity using Graph Learning," Papers 2106.12987, arXiv.org.
  • Handle: RePEc:arx:papers:2106.12987
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2106.12987
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Athanasios Orphanides, "undated". "Compensation Incentives and Risk Taking Behavior: Evidence from Mutual Funds," Finance and Economics Discussion Series 1996-21, Board of Governors of the Federal Reserve System (U.S.), revised 10 Dec 2019.
    2. Fan Cai & Nhien-An Le-Khac & Tahar Kechadi, 2016. "Clustering Approaches for Financial Data Analysis: a Survey," Papers 1609.08520, arXiv.org.
    3. Lin, Li & Guo, Xin-Yu, 2019. "Identifying fragility for the stock market: Perspective from the portfolio overlaps network," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 62(C), pages 132-151.
    4. Luis M. Varela & Giulia Rotundo, 2016. "Complex Network Analysis and Nonlinear Dynamics," Lecture Notes in Economics and Mathematical Systems, in: Pasquale Commendatore & Mariano Matilla-García & Luis M. Varela & Jose S. Cánovas (ed.), Complex Networks and Dynamics, pages 3-25, Springer.
    5. Moreno, David & Marco, Paulina & Olmeda, Ignacio, 2006. "Self-organizing maps could improve the classification of Spanish mutual funds," European Journal of Operational Research, Elsevier, vol. 174(2), pages 1039-1054, October.
    6. Edwin J. Elton & Martin J. Gruber & Christopher R. Blake, 2003. "Incentive Fees and Mutual Funds," Journal of Finance, American Finance Association, vol. 58(2), pages 779-804, April.
    7. Dhagash Mehta & Dhruv Desai & Jithin Pradeep, 2020. "Machine Learning Fund Categorizations," Papers 2006.00123, arXiv.org.
    8. Jaime F. Lavin & Mauricio A. Valle & Nicolás S. Magner, 2019. "Modeling Overlapped Mutual Funds’ Portfolios: A Bipartite Network Approach," Complexity, Hindawi, vol. 2019, pages 1-20, July.
    9. Anna, Petrenko, 2016. "Мaркування готової продукції як складова частина інформаційного забезпечення маркетингової діяльності підприємств овочепродуктового підкомплексу," Agricultural and Resource Economics: International Scientific E-Journal, Agricultural and Resource Economics: International Scientific E-Journal, vol. 2(1), March.
    10. Brown, Stephen J. & Goetzmann, William N., 1997. "Mutual fund styles," Journal of Financial Economics, Elsevier, vol. 43(3), pages 373-399, March.
    11. Kim, Moon & Shukla, Ravi & Tomas, Michael, 2000. "Mutual fund objective misclassification," Journal of Economics and Business, Elsevier, vol. 52(4), pages 309-323.
    12. Danilo Delpini & Stefano Battiston & Guido Caldarelli & Massimo Riccaboni, 2019. "Systemic risk from investment similarities," PLOS ONE, Public Library of Science, vol. 14(5), pages 1-15, May.
    13. Anna Maria D’Arcangelis & Giulia Rotundo, 2016. "Complex Networks in Finance," Lecture Notes in Economics and Mathematical Systems, in: Pasquale Commendatore & Mariano Matilla-García & Luis M. Varela & Jose S. Cánovas (ed.), Complex Networks and Dynamics, pages 209-235, Springer.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dhruv Desai & Ashmita Dhiman & Tushar Sharma & Deepika Sharma & Dhagash Mehta & Stefano Pasquali, 2023. "Quantifying Outlierness of Funds from their Categories using Supervised Similarity," Papers 2308.06882, arXiv.org.
    2. Rian Dolphin & Barry Smyth & Ruihai Dong, 2022. "A Multimodal Embedding-Based Approach to Industry Classification in Financial Markets," Papers 2211.06378, arXiv.org.
    3. Bhaskarjit Sarmah & Nayana Nair & Dhagash Mehta & Stefano Pasquali, 2022. "Learning Embedded Representation of the Stock Correlation Matrix using Graph Machine Learning," Papers 2207.07183, arXiv.org.
    4. Vadim Zlotnikov & Jiayu Liu & Igor Halperin & Fei He & Lisa Huang, 2023. "Model-Free Market Risk Hedging Using Crowding Networks," Papers 2306.08105, arXiv.org.
    5. Alejandro Rodriguez Dominguez, 2022. "Portfolio Optimization based on Neural Networks Sensitivities from Assets Dynamics respect Common Drivers," Papers 2202.08921, arXiv.org, revised Dec 2022.
    6. Jerinsh Jeyapaulraj & Dhruv Desai & Peter Chu & Dhagash Mehta & Stefano Pasquali & Philip Sommer, 2022. "Supervised similarity learning for corporate bonds using Random Forest proximities," Papers 2207.04368, arXiv.org, revised Oct 2022.
    7. Dimitrios Vamvourellis & Mate Attila Toth & Dhruv Desai & Dhagash Mehta & Stefano Pasquali, 2022. "Learning Mutual Fund Categorization using Natural Language Processing," Papers 2207.04959, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dhagash Mehta & Dhruv Desai & Jithin Pradeep, 2020. "Machine Learning Fund Categorizations," Papers 2006.00123, arXiv.org.
    2. Dhruv Desai & Ashmita Dhiman & Tushar Sharma & Deepika Sharma & Dhagash Mehta & Stefano Pasquali, 2023. "Quantifying Outlierness of Funds from their Categories using Supervised Similarity," Papers 2308.06882, arXiv.org.
    3. Sensoy, Berk A., 2009. "Performance evaluation and self-designated benchmark indexes in the mutual fund industry," Journal of Financial Economics, Elsevier, vol. 92(1), pages 25-39, April.
    4. repec:onb:oenbwp:y:2005:i:9:b:1 is not listed on IDEAS
    5. Bhaskarjit Sarmah & Nayana Nair & Dhagash Mehta & Stefano Pasquali, 2022. "Learning Embedded Representation of the Stock Correlation Matrix using Graph Machine Learning," Papers 2207.07183, arXiv.org.
    6. Victor DeMiguel & Javier Gil-Bazo & Francisco J. Nogales & André A. P. Santos, 2021. "Can Machine Learning Help to Select Portfolios of Mutual Funds?," Working Papers 1245, Barcelona School of Economics.
    7. D’Arcangelis, Anna Maria & Levantesi, Susanna & Rotundo, Giulia, 2021. "A complex networks approach to pension funds," Journal of Business Research, Elsevier, vol. 129(C), pages 687-702.
    8. Alexander Kempf & Peer Osthoff, 2008. "SRI Funds: Nomen est Omen," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 35(9-10), pages 1276-1294.
    9. Irina Bezhentseva Mateus & Cesario Mateus & Natasa Todorovic, 2019. "Benchmark-adjusted performance of US equity mutual funds and the issue of prospectus benchmarks," Journal of Asset Management, Palgrave Macmillan, vol. 20(1), pages 15-30, February.
    10. Moreno, David & Marco, Paulina & Olmeda, Ignacio, 2006. "Self-organizing maps could improve the classification of Spanish mutual funds," European Journal of Operational Research, Elsevier, vol. 174(2), pages 1039-1054, October.
    11. Peng, Hongfeng & Zhang, Zhenqi & Goodell, John W. & Li, Mingsheng, 2023. "Socially responsible investing: Is it for real or just for show?," International Review of Financial Analysis, Elsevier, vol. 86(C).
    12. Alexander Kempf & Peer Osthoff, 2008. "SRI Funds: Nomen est Omen," Journal of Business Finance & Accounting, Wiley Blackwell, vol. 35(9‐10), pages 1276-1294, November.
    13. Kurniawan, Meinanda & How, Janice & Verhoeven, Peter, 2016. "Fund governance and style drift," Pacific-Basin Finance Journal, Elsevier, vol. 40(PA), pages 59-72.
    14. Dimitrios Vamvourellis & Mate Attila Toth & Dhruv Desai & Dhagash Mehta & Stefano Pasquali, 2022. "Learning Mutual Fund Categorization using Natural Language Processing," Papers 2207.04959, arXiv.org.
    15. George J. Jiang & Bing Liang & Huacheng Zhang, 2022. "Hedge Fund Manager Skill and Style-Shifting," Management Science, INFORMS, vol. 68(3), pages 2284-2307, March.
    16. Mateus, Irina B. & Mateus, Cesario & Todorovic, Natasa, 2019. "Review of new trends in the literature on factor models and mutual fund performance," International Review of Financial Analysis, Elsevier, vol. 63(C), pages 344-354.
    17. Chua, Angeline Kim Pei & Tam, On Kit, 2020. "The shrouded business of style drift in active mutual funds," Journal of Corporate Finance, Elsevier, vol. 64(C).
    18. Deetz, Marcus & Poddig, Thorsten & Varmaz, Armin, 2009. "Klassifizierung von Hedge-Fonds durch das k-means Clustering von Self-Organizing Maps: eine renditebasierte Analyse zur Selbsteinstufungsgüte und Stiländerungsproblematik [Classifying Hedge Funds u," MPRA Paper 16939, University Library of Munich, Germany.
    19. Stephen Brown & William Goetzmann & Bing Liang & Christopher Schwarz, 2008. "Mandatory Disclosure and Operational Risk: Evidence from Hedge Fund Registration," Journal of Finance, American Finance Association, vol. 63(6), pages 2785-2815, December.
    20. Vivian Welch & Christine M. Mathew & Panteha Babelmorad & Yanfei Li & Elizabeth T. Ghogomu & Johan Borg & Monserrat Conde & Elizabeth Kristjansson & Anne Lyddiatt & Sue Marcus & Jason W. Nickerson & K, 2021. "Health, social care and technological interventions to improve functional ability of older adults living at home: An evidence and gap map," Campbell Systematic Reviews, John Wiley & Sons, vol. 17(3), September.
    21. Persson, Petra & Qiu, Xinyao & Rossin-Slater, Maya, 2021. "Family Spillover Effects of Marginal Diagnoses: The Case of ADHD," IZA Discussion Papers 14020, Institute of Labor Economics (IZA).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2106.12987. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.