IDEAS home Printed from https://ideas.repec.org/a/kap/sbusec/v60y2023i2d10.1007_s11187-022-00609-6.html
   My bibliography  Save this article

Topic-based classification and identification of global trends for startup companies

Author

Listed:
  • Ivan Savin

    (Universitat Autònoma de Barcelona
    Ural Federal University)

  • Kristina Chukavina

    (Ural Federal University)

  • Andrey Pushkarev

    (Ural Federal University)

Abstract

To foresee global economic trends, one needs to understand the present startup companies that soon may become new market leaders. In this paper, we explore textual descriptions of more than 250 thousand startups in the Crunchbase database. We analyze the 2009–2019 period by using topic modeling. We propose a novel classification of startup companies free from expert bias that contains 38 topics and quantifies the weight of each of these topics for all the startups. Taking the year of establishment and geographical location of the startups into account, we measure which topics were increasing or decreasing their share over time, and which of them were predominantly present in Europe, North America, or other regions. We find that the share of startups focused on data analytics, social platforms, and financial transfers, and time management has risen, while an opposite trend is observed for mobile gaming, online news, and online social networks as well as legal and professional services. We also identify strong regional differences in topic distribution, suggesting certain concentration of the startups. For example, sustainable agriculture is presented stronger in South America and Africa, while pharmaceutics, in North America and Europe. Furthermore, we explore which pairs of topics tend to co-occur more often together, quantify how multisectoral the startups are, and which startup classes attract more investments. Finally, we compare our classification to the one existing in the Crunchbase database, demonstrating how we improve it.

Suggested Citation

  • Ivan Savin & Kristina Chukavina & Andrey Pushkarev, 2023. "Topic-based classification and identification of global trends for startup companies," Small Business Economics, Springer, vol. 60(2), pages 659-689, February.
  • Handle: RePEc:kap:sbusec:v:60:y:2023:i:2:d:10.1007_s11187-022-00609-6
    DOI: 10.1007/s11187-022-00609-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11187-022-00609-6
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s11187-022-00609-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lüdering Jochen & Winker Peter, 2016. "Forward or Backward Looking? The Economic Discourse and the Observed Reality," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 236(4), pages 483-515, August.
    2. Reinartz, Werner & Wiegand, Nico & Imschloss, Monika, 2019. "The impact of digital transformation on the retailing value chain," International Journal of Research in Marketing, Elsevier, vol. 36(3), pages 350-366.
    3. Donghyun Choi & Bomi Song, 2018. "Exploring Technological Trends in Logistics: Topic Modeling-Based Patent Analysis," Sustainability, MDPI, vol. 10(8), pages 1-26, August.
    4. Helen Bollaert & Gaël Leboeuf & Armin Schwienbacher, 2020. "The narcissism of crowdfunding entrepreneurs," Small Business Economics, Springer, vol. 55(1), pages 57-76, June.
    5. Suominen, Arho & Toivanen, Hannes & Seppänen, Marko, 2017. "Firms' knowledge profiles: Mapping patent data with unsupervised learning," Technological Forecasting and Social Change, Elsevier, vol. 115(C), pages 131-142.
    6. Bongsug (Kevin) Chae & Eunhye (Olivia) Park, 2018. "Corporate Social Responsibility (CSR): A Survey of Topics and Trends Using Twitter Data and Topic Modeling," Sustainability, MDPI, vol. 10(7), pages 1-20, June.
    7. Savin, Ivan & Ott, Ingrid & Konop, Chris, 2022. "Tracing the evolution of service robotics: Insights from a topic modeling approach," Technological Forecasting and Social Change, Elsevier, vol. 174(C).
    8. Daniel Ratzinger & Kevin Amess & Andrew Greenman & Simon Mosey, 2018. "The impact of digital start-up founders’ higher education on reaching equity investment milestones," The Journal of Technology Transfer, Springer, vol. 43(3), pages 760-778, June.
    9. Sarah Kaplan & Keyvan Vakili, 2015. "The double-edged sword of recombination in breakthrough innovation," Strategic Management Journal, Wiley Blackwell, vol. 36(10), pages 1435-1457, October.
    10. Jermain C. Kaminski & Christian Hopp, 2020. "Predicting outcomes in crowdfunding campaigns with textual, visual, and linguistic signals," Small Business Economics, Springer, vol. 55(3), pages 627-649, October.
    11. Endre Tvinnereim & Kjersti Fløttum, 2015. "Explaining topic prevalence in answers to open-ended survey questions about climate change," Nature Climate Change, Nature, vol. 5(8), pages 744-747, August.
    12. Joern H. Block & Massimo G. Colombo & Douglas J. Cumming & Silvio Vismara, 2018. "New players in entrepreneurial finance and why they are there," Small Business Economics, Springer, vol. 50(2), pages 239-250, February.
    13. David Scott Hunter & Ajay Saini & Tauhid Zaman, 2017. "Picking Winners: A Data Driven Approach to Evaluating the Quality of Startup Companies," Papers 1706.04229, arXiv.org, revised Jul 2018.
    14. Uwe Cantner & Ivan Savin & Simone Vannuccini, 2019. "Replicator dynamics in value chains: explaining some puzzles of market selection," Industrial and Corporate Change, Oxford University Press and the Associazione ICC, vol. 28(3), pages 589-611.
    15. Jian, Sisi & Liu, Wei & Wang, Xiaolei & Yang, Hai & Waller, S. Travis, 2020. "On integrating carsharing and parking sharing services," Transportation Research Part B: Methodological, Elsevier, vol. 142(C), pages 19-44.
    16. Angela Ambrosino & Mario Cedrini & John B. Davis & Stefano Fiori & Marco Guerzoni & Massimiliano Nuccio, 2018. "What topic modeling could reveal about the evolution of economics," Journal of Economic Methodology, Taylor & Francis Journals, vol. 25(4), pages 329-348, October.
    17. Ivan Savin & Oleg Mariev & Andrey Pushkarev, 2019. "Survival of the Fittest? Measuring the Strength of Market Selection on the Example of the Urals Federal District," HSE Economic Journal, National Research University Higher School of Economics, vol. 23(1), pages 90-117.
    18. de Bellis, Emanuel & Venkataramani Johar, Gita, 2020. "Autonomous Shopping Systems: Identifying and Overcoming Barriers to Consumer Adoption," Journal of Retailing, Elsevier, vol. 96(1), pages 74-87.
    19. Ivan Savin & Stefan Drews & Sara Maestre-Andrés & Jeroen Bergh, 2020. "Public views on carbon taxation and its fairness: a computational-linguistics analysis," Climatic Change, Springer, vol. 162(4), pages 2107-2138, October.
    20. Jean-Michel Dalle & Matthijs den Besten & Carlo Menon, 2017. "Using Crunchbase for economic and managerial research," OECD Science, Technology and Industry Working Papers 2017/08, OECD Publishing.
    21. Ed Saiedi & Anders Broström & Felipe Ruiz, 2021. "Global drivers of cryptocurrency infrastructure adoption," Small Business Economics, Springer, vol. 57(1), pages 353-406, June.
    22. Endre Tvinnereim & Xiaozi Liu & Eric M. Jamelske, 2017. "Public perceptions of air pollution and climate change: different manifestations, similar causes, and concerns," Climatic Change, Springer, vol. 140(3), pages 399-412, February.
    23. Venugopalan, Subhashini & Rai, Varun, 2015. "Topic based classification and pattern identification in patents," Technological Forecasting and Social Change, Elsevier, vol. 94(C), pages 236-250.
    24. Francesca De Battisti & Alfio Ferrara & Silvia Salini, 2015. "A decade of research in statistics: a topic model approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(2), pages 413-433, May.
    25. Yan, Yingchen & Zhao, Ruiqing & Liu, Zhibing, 2018. "Strategic introduction of the marketplace channel under spillovers from online to offline sales," European Journal of Operational Research, Elsevier, vol. 267(1), pages 65-77.
    26. Max W. Callaghan & Jan C. Minx & Piers M. Forster, 2020. "A topography of climate change research," Nature Climate Change, Nature, vol. 10(2), pages 118-123, February.
    27. Christian Haddad & Lars Hornuf, 2019. "The emergence of the global fintech market: economic and technological determinants," Small Business Economics, Springer, vol. 53(1), pages 81-105, June.
    28. Oliver Alexy & Joern Block & Philipp Sandner & Anne Ter Wal, 2012. "Social capital of venture capitalists and start-up funding," Small Business Economics, Springer, vol. 39(4), pages 835-851, November.
    29. Savin, Ivan & Drews, Stefan & van den Bergh, Jeroen, 2021. "Free associations of citizens and scientists with economic and green growth: A computational-linguistics analysis," Ecological Economics, Elsevier, vol. 180(C).
    30. Mauri Laukkanen, 2000. "Exploring alternative approaches in high-level entrepreneurship education: creating micromechanisms for endogenous regional growth," Entrepreneurship & Regional Development, Taylor & Francis Journals, vol. 12(1), pages 25-47, January.
    31. Theodor Florian Cojoianu & Gordon L. Clark & Andreas G. F. Hoepner & Vladimir Pažitka & Dariusz Wójcik, 2021. "Fin vs. tech: are trust and knowledge creation key ingredients in fintech start-up emergence and financing?," Small Business Economics, Springer, vol. 57(4), pages 1715-1731, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Savin, Ivan & Ott, Ingrid & Konop, Chris, 2022. "Tracing the evolution of service robotics: Insights from a topic modeling approach," Technological Forecasting and Social Change, Elsevier, vol. 174(C).
    2. Bongini, Paola & Osborne, Francesco & Pedrazzoli, Alessia & Rossolini, Monica, 2022. "A topic modelling analysis of white papers in security token offerings: Which topic matters for funding?," Technological Forecasting and Social Change, Elsevier, vol. 184(C).
    3. Mohamed M. Mostafa, 2023. "A one-hundred-year structural topic modeling analysis of the knowledge structure of international management research," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3905-3935, August.
    4. Levy, Daniel & Mayer, Tamir & Raviv, Alon, 2022. "Economists in the 2008 financial crisis: Slow to see, fast to act," Journal of Financial Stability, Elsevier, vol. 60(C).
    5. Carolin Bock & Christian Hackober, 2020. "Unicorns—what drives multibillion-dollar valuations?," Business Research, Springer;German Academic Association for Business Research, vol. 13(3), pages 949-984, November.
    6. Montobbio, Fabio & Staccioli, Jacopo & Virgillito, Maria Enrica & Vivarelli, Marco, 2022. "Robots and the origin of their labour-saving impact," Technological Forecasting and Social Change, Elsevier, vol. 174(C).
    7. Tali Hadasa Blank & Abraham Carmeli, 2021. "Does founding team composition influence external investment? The role of founding team prior experience and founder CEO," The Journal of Technology Transfer, Springer, vol. 46(6), pages 1869-1888, December.
    8. Savin, Ivan & Drews, Stefan & van den Bergh, Jeroen, 2021. "Free associations of citizens and scientists with economic and green growth: A computational-linguistics analysis," Ecological Economics, Elsevier, vol. 180(C).
    9. Camilla Salvatore & Silvia Biffignandi & Annamaria Bianchi, 2022. "Corporate Social Responsibility Activities Through Twitter: From Topic Model Analysis to Indexes Measuring Communication Characteristics," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 164(3), pages 1217-1248, December.
    10. Eliza Nichifor & Adrian Trifan & Elena Mihaela Nechifor, 2021. "Artificial Intelligence in Electronic Commerce: Basic Chatbots and Consumer Journey," The AMFITEATRU ECONOMIC journal, Academy of Economic Studies - Bucharest, Romania, vol. 23(56), pages 1-87, February.
    11. Fabrice Hervé & Armin Schwienbacher, 2018. "Crowdfunding And Innovation," Journal of Economic Surveys, Wiley Blackwell, vol. 32(5), pages 1514-1530, December.
    12. Pacelli, Vincenzo & Miglietta, Federica & Foglia, Matteo, 2022. "The extreme risk connectedness of the new financial system: European evidence," International Review of Financial Analysis, Elsevier, vol. 84(C).
    13. Alsagr, Naif & Cumming, Douglas J. & Davis, Justin G. & Sewaid, Ahmed, 2023. "Geopolitical risk and crowdfunding performance," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 85(C).
    14. Ghaffari, Mohsen & Aliahmadi, Alireza & Khalkhali, Abolfazl & Zakery, Amir & Daim, Tugrul U. & Yalcin, Haydar, 2023. "Topic-based technology mapping using patent data analysis: A case study of vehicle tires," Technological Forecasting and Social Change, Elsevier, vol. 193(C).
    15. Ivan Savin & Maria Novitskaya, 2023. "Data-driven definitions of gazelle companies that rule out chance: application for Russia and Spain," Eurasian Business Review, Springer;Eurasia Business and Economics Society, vol. 13(3), pages 507-542, September.
    16. Ghlamallah, Ezzedine & Alexakis, Christos & Dowling, Michael & Piepenbrink, Anke, 2021. "The topics of Islamic economics and finance research," International Review of Economics & Finance, Elsevier, vol. 75(C), pages 145-160.
    17. Savin, I., 2020. "Studying market selection in Russia and abroad: Measurement problems, national specificity and stimulating methods," Journal of the New Economic Association, New Economic Association, vol. 48(4), pages 197-204.
    18. Ed Saiedi & Anders Broström & Felipe Ruiz, 2021. "Global drivers of cryptocurrency infrastructure adoption," Small Business Economics, Springer, vol. 57(1), pages 353-406, June.
    19. Alaassar, Ahmad & Mention, Anne-Laure & Aas, Tor Helge, 2020. "Exploring how social interactions influence regulators and innovators: The case of regulatory sandboxes," Technological Forecasting and Social Change, Elsevier, vol. 160(C).
    20. Jessica Birkholz & Jutta Günther & Mariia Shkolnykova, 2021. "Using Topic Modeling in Innovation Studies: The Case of a Small Innovation System under Conditions of Pandemic Related Change," Bremen Papers on Economics & Innovation 2101, University of Bremen, Faculty of Business Studies and Economics.

    More about this item

    Keywords

    Crunchbase; Machine learning; Natural language processing; Investments; Entrepreneurship;
    All these keywords.

    JEL classification:

    • M13 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Business Administration - - - New Firms; Startups
    • C6 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling
    • F23 - International Economics - - International Factor Movements and International Business - - - Multinational Firms; International Business
    • L26 - Industrial Organization - - Firm Objectives, Organization, and Behavior - - - Entrepreneurship

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:sbusec:v:60:y:2023:i:2:d:10.1007_s11187-022-00609-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.