IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2309.15552.html
   My bibliography  Save this paper

Startup success prediction and VC portfolio simulation using CrunchBase data

Author

Listed:
  • Mark Potanin
  • Andrey Chertok
  • Konstantin Zorin
  • Cyril Shtabtsovsky

Abstract

Predicting startup success presents a formidable challenge due to the inherently volatile landscape of the entrepreneurial ecosystem. The advent of extensive databases like Crunchbase jointly with available open data enables the application of machine learning and artificial intelligence for more accurate predictive analytics. This paper focuses on startups at their Series B and Series C investment stages, aiming to predict key success milestones such as achieving an Initial Public Offering (IPO), attaining unicorn status, or executing a successful Merger and Acquisition (M\&A). We introduce novel deep learning model for predicting startup success, integrating a variety of factors such as funding metrics, founder features, industry category. A distinctive feature of our research is the use of a comprehensive backtesting algorithm designed to simulate the venture capital investment process. This simulation allows for a robust evaluation of our model's performance against historical data, providing actionable insights into its practical utility in real-world investment contexts. Evaluating our model on Crunchbase's, we achieved a 14 times capital growth and successfully identified on B round high-potential startups including Revolut, DigitalOcean, Klarna, Github and others. Our empirical findings illuminate the importance of incorporating diverse feature sets in enhancing the model's predictive accuracy. In summary, our work demonstrates the considerable promise of deep learning models and alternative unstructured data in predicting startup success and sets the stage for future advancements in this research area.

Suggested Citation

  • Mark Potanin & Andrey Chertok & Konstantin Zorin & Cyril Shtabtsovsky, 2023. "Startup success prediction and VC portfolio simulation using CrunchBase data," Papers 2309.15552, arXiv.org.
  • Handle: RePEc:arx:papers:2309.15552
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2309.15552
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Tumasjan, Andranik & Braun, Reiner & Stolz, Barbara, 2021. "Twitter sentiment as a weak signal in venture capital financing," Journal of Business Venturing, Elsevier, vol. 36(2).
    2. Xin Wang & Kai Zong & Cuicui Luo, 2022. "Credit risk detection based on machine learning algorithms," International Journal of Financial Services Management, Inderscience Enterprises Ltd, vol. 11(3), pages 183-189.
    3. Antretter, Torben & Blohm, Ivo & Grichnik, Dietmar & Wincent, Joakim, 2019. "Predicting new venture survival: A Twitter-based machine learning approach to measuring online legitimacy," Journal of Business Venturing Insights, Elsevier, vol. 11(C), pages 1-1.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Seigner, Benedikt David Christian & Milanov, Hana & Lundmark, Erik & Shepherd, Dean A., 2023. "Tweeting like Elon? Provocative language, new-venture status, and audience engagement on social media," Journal of Business Venturing, Elsevier, vol. 38(2).
    2. Rogojan Luana Cristina & Croicu Andreea Elena & Iancu Laura Andreea, 2023. "Modern Approaches in Credit Risk Modeling: A Literature Review," Proceedings of the International Conference on Business Excellence, Sciendo, vol. 17(1), pages 1617-1627, July.
    3. Massimo G. Colombo & Benedetta Montanaro & Silvio Vismara, 2023. "What drives the valuation of entrepreneurial ventures? A map to navigate the literature and research directions," Small Business Economics, Springer, vol. 61(1), pages 59-84, June.
    4. Colak, Gonul & Fu, Mengchuan & Hasan, Iftekhar, 2022. "On modeling IPO failure risk," Economic Modelling, Elsevier, vol. 109(C).
    5. Tanja Verster & Erika Fourie, 2023. "The Changing Landscape of Financial Credit Risk Models," IJFS, MDPI, vol. 11(3), pages 1-15, August.
    6. Lucas, David S. & Park, U. David, 2023. "The nature and origins of social venture mission: An exploratory study of political ideology and moral foundations," Journal of Business Venturing, Elsevier, vol. 38(2).
    7. Khanindra Ch. Das, 2023. "What Affects Startup Acquisition in Emerging Economy? Evidence from India," Journal of Emerging Market Finance, Institute for Financial Management and Research, vol. 22(2), pages 111-134, June.
    8. Onur Bayar & Emre Kesici, 2024. "The impact of social media on venture capital financing: evidence from Twitter interactions," Review of Quantitative Finance and Accounting, Springer, vol. 62(1), pages 195-224, January.
    9. Sahab Zandi & Kamesh Korangi & Mar'ia 'Oskarsd'ottir & Christophe Mues & Cristi'an Bravo, 2024. "Attention-based Dynamic Multilayer Graph Neural Networks for Loan Default Prediction," Papers 2402.00299, arXiv.org.
    10. Johnson, Nicholas E. & Short, Jeremy C. & Chandler, Jeffrey A. & Jordan, Samantha L., 2022. "Introducing the contentpreneur: Making the case for research on content creation-based online platforms," Journal of Business Venturing Insights, Elsevier, vol. 18(C).
    11. Daniel Blaseg & Lars Hornuf, 2024. "Playing the Business Angel: The Impact of Well-Known Business Angels on Venture Performance," Entrepreneurship Theory and Practice, , vol. 48(1), pages 171-204, January.
    12. Ashish Vazirani & Titas Bhattacharjee, 2021. "Entrepreneurial Finance in the Twenty-first Century, a Review of Factors Influencing Venture Capitalist’s Decision," Journal of Entrepreneurship and Innovation in Emerging Economies, Entrepreneurship Development Institute of India, vol. 30(2), pages 306-335, September.
    13. Ruey‐Jer “Bryan” Jean & Daekwan Kim, 2021. "Signalling Strategies of Exporters on Internet Business‐to‐Business Platforms," Journal of Management Studies, Wiley Blackwell, vol. 58(7), pages 1869-1898, November.
    14. Ruling Zhang & Zengrui Tian & Killian J. McCarthy & Xiao Wang & Kun Zhang, 2023. "Application of machine learning techniques to predict entrepreneurial firm valuation," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 42(2), pages 402-417, March.
    15. Jung, Sang Hoon & Jeong, Yong Jin, 2020. "Twitter data analytical methodology development for prediction of start-up firms’ social media marketing level," Technology in Society, Elsevier, vol. 63(C).
    16. Ungerer, Christina & Reuther, Kevin & Baltes, Guido, 2021. "The lingering living dead phenomenon: Distorting venture survival studies?," Journal of Business Venturing Insights, Elsevier, vol. 16(C).
    17. Malyy, Maksim & Tekic, Zeljko & Podladchikova, Tatiana, 2021. "The value of big data for analyzing growth dynamics of technology-based new ventures," Technological Forecasting and Social Change, Elsevier, vol. 169(C).
    18. Tumasjan, Andranik & Braun, Reiner & Stolz, Barbara, 2021. "Twitter sentiment as a weak signal in venture capital financing," Journal of Business Venturing, Elsevier, vol. 36(2).
    19. Anton van Dyk & Gary van Vuuren, 2023. "Measurement and Calibration of Regulatory Credit Risk Asset Correlations," JRFM, MDPI, vol. 16(9), pages 1-19, September.
    20. Andrea Ancona & Matteo Cinelli & Giovanna Ferraro & Antonio Iovanella, 2023. "Network-based principles of entrepreneurial ecosystems: a case study of a start-up network," Small Business Economics, Springer, vol. 61(4), pages 1497-1514, December.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2309.15552. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.