IDEAS home Printed from https://ideas.repec.org/p/ehl/lserod/115565.html
   My bibliography  Save this paper

Modeling clusters from the ground up: a web data approach

Author

Listed:
  • Stich, Christoph
  • Tranos, Emmanouil
  • Nathan, Max

Abstract

This paper proposes a new methodological framework to identify economic clusters over space and time. We employ a unique open source dataset of geolocated and archived business webpages and interrogate them using Natural Language Processing to build bottom-up classifications of economic activities. We validate our method on an iconic UK tech cluster – Shoreditch, East London. We benchmark our results against existing case studies and administrative data, replicating the main features of the cluster and providing fresh insights. As well as overcoming limitations in conventional industrial classification, our method addresses some of the spatial and temporal limitations of the clustering literature.

Suggested Citation

  • Stich, Christoph & Tranos, Emmanouil & Nathan, Max, 2023. "Modeling clusters from the ground up: a web data approach," LSE Research Online Documents on Economics 115565, London School of Economics and Political Science, LSE Library.
  • Handle: RePEc:ehl:lserod:115565
    as

    Download full text from publisher

    File URL: http://eprints.lse.ac.uk/115565/
    File Function: Open access version.
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. William R. Kerr & Scott Duke Kominers, 2015. "Agglomerative Forces and Cluster Shapes," The Review of Economics and Statistics, MIT Press, vol. 97(4), pages 877-899, October.
    2. Jaehyuk Park & Ian Wood & Elise Jing & Azadeh Nematzadeh & Souvik Ghosh & Michael Conover & Yong-Yeol Ahn, 2019. "Global labor flow network reveals the hierarchical organization and dynamics of geo-industrial clusters in the world economy," Papers 1902.04613, arXiv.org, revised Mar 2019.
    3. Frank Neffke & Martin Henning & Ron Boschma, 2011. "How Do Regions Diversify over Time? Industry Relatedness and the Development of New Growth Paths in Regions," Economic Geography, Taylor & Francis Journals, vol. 87(3), pages 237-265, July.
    4. Ron Boschma & Koen Frenken, 2011. "The emerging empirics of evolutionary economic geography," Journal of Economic Geography, Oxford University Press, vol. 11(2), pages 295-307, March.
    5. Gernot Grabher & Oliver Ibert, 2014. "Distance as asset? Knowledge collaboration in hybrid virtual communities," Journal of Economic Geography, Oxford University Press, vol. 14(1), pages 97-123, January.
    6. John R. Baldwin & W. Mark Brown & David L. Rigby, 2010. "Agglomeration Economies: Microdata Panel Estimates From Canadian Manufacturing," Journal of Regional Science, Wiley Blackwell, vol. 50(5), pages 915-934, December.
    7. Gilles Duranton & William R. Kerr, 2015. "The Logic of Agglomeration," Harvard Business School Working Papers 16-037, Harvard Business School.
    8. Koen Frenken & Elena Cefis & Erik Stam, 2020. "Industrial Dynamics and Clusters: A Survey," Regional Studies, Taylor & Francis Journals, vol. 49(1), pages 10-27, July.
    9. Andrea Caragliu & Laura de Dominicis & Henri L.F. de Groot, 2016. "Both Marshall and Jacobs were Right!," Economic Geography, Taylor & Francis Journals, vol. 92(1), pages 87-111, January.
    10. Pierre-Alexandre Balland & Ron Boschma & Koen Frenken, 2015. "Proximity and Innovation: From Statics to Dynamics," Regional Studies, Taylor & Francis Journals, vol. 49(6), pages 907-920, June.
    11. Gilles Duranton & Henry G. Overman, 2005. "Testing for Localization Using Micro-Geographic Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 72(4), pages 1077-1106.
    12. Anne Ter Wal & Ron Boschma, 2011. "Co-evolution of Firms, Industries and Networks in Space," Regional Studies, Taylor & Francis Journals, vol. 45(7), pages 919-933.
    13. Ron Boschma & Simona Iammarino, 2009. "Related Variety, Trade Linkages, and Regional Growth in Italy," Economic Geography, Clark University, vol. 85(3), pages 289-311, July.
    14. Allen J. Scott, 1997. "The Cultural Economy of Cities," International Journal of Urban and Regional Research, Wiley Blackwell, vol. 21(2), pages 323-339, June.
    15. Max Nathan & Emma Vandore & Georgina Voss, 2019. "Spatial Imaginaries and Tech Cities: Place-branding East London’s digital economy," Journal of Economic Geography, Oxford University Press, vol. 19(2), pages 409-432.
    16. Hernández, Blanca & Jiménez, Julio & Martín, M. José, 2009. "Key website factors in e-business strategy," International Journal of Information Management, Elsevier, vol. 29(5), pages 362-371.
    17. Jan Kinne & Janna Axenbeck, 2020. "Web mining for innovation ecosystem mapping: a framework and a large-scale pilot study," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2011-2041, December.
    18. Ron Martin & Peter Sunley, 2006. "Path dependence and regional economic evolution," Journal of Economic Geography, Oxford University Press, vol. 6(4), pages 395-437, August.
    19. J. Vernon Henderson, 2001. "Marshall's Scale Economies," Working Papers 2001-46, Brown University, Department of Economics.
    20. Peter J. Taylor & Ben Derudder & James Faulconbridge & Michael Hoyler & Pengfei Ni, 2014. "Advanced Producer Service Firms as Strategic Networks, Global Cities as Strategic Places," Economic Geography, Clark University, vol. 90(3), pages 267-291, July.
    21. Michael E. Martin & Nadine Schuurman, 2020. "Social Media Big Data Acquisition and Analysis for Qualitative GIScience: Challenges and Opportunities," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 110(5), pages 1335-1352, September.
    22. Michael E. Martin & Nadine Schuurman, 2017. "Area-Based Topic Modeling and Visualization of Social Media for Qualitative GIS," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 107(5), pages 1028-1039, September.
    23. Sanjay K. Arora & Jan Youtie & Philip Shapira & Lidan Gao & TingTing Ma, 2013. "Entry strategies in an emerging technology: a pilot web-based study of graphene firms," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(3), pages 1189-1207, June.
    24. Oecd, 2013. "Measuring the Internet Economy: A Contribution to the Research Agenda," OECD Digital Economy Papers 226, OECD Publishing.
    25. Ellison, Glenn & Glaeser, Edward L, 1997. "Geographic Concentration in U.S. Manufacturing Industries: A Dartboard Approach," Journal of Political Economy, University of Chicago Press, vol. 105(5), pages 889-927, October.
    26. Aaron Chatterji & Edward Glaeser & William Kerr, 2014. "Clusters of Entrepreneurship and Innovation," Innovation Policy and the Economy, University of Chicago Press, vol. 14(1), pages 129-166.
    27. Ron Boschma & Dirk Fornahl, 2011. "Cluster Evolution and a Roadmap for Future Research," Regional Studies, Taylor & Francis Journals, vol. 45(10), pages 1295-1298, November.
    28. Savvas Papagiannidis & Eric W.K. See-To & Dimitris Assimakopoulos & Yang Yang, 2018. "Identifying industrial clusters with a novel big-data methodology : Are SIC codes (not) fit for purpose in the Internet age?," Post-Print hal-02312006, HAL.
    29. Timothy Sturgeon & Johannes Van Biesebroeck & Gary Gereffi, 2008. "Value chains, networks and clusters: reframing the global automotive industry," Journal of Economic Geography, Oxford University Press, vol. 8(3), pages 297-321, May.
    30. Matthew A Zook, 2000. "The Web of Production: The Economic Geography of Commercial Internet Content Production in the United States," Environment and Planning A, , vol. 32(3), pages 411-426, March.
    31. Wagner, Alfred, 1891. "Marshall's Principles of Economics," History of Economic Thought Articles, McMaster University Archive for the History of Economic Thought, vol. 5, pages 319-338.
    32. Peter J. Taylor & Ben Derudder & James Faulconbridge & Michael Hoyler & Pengfei Ni, 2014. "Advanced Producer Service Firms as Strategic Networks, Global Cities as Strategic Places," Economic Geography, Taylor & Francis Journals, vol. 90(3), pages 267-291, July.
    33. Allen John Scott, 2014. "Beyond the Creative City: Cognitive--Cultural Capitalism and the New Urbanism," Regional Studies, Taylor & Francis Journals, vol. 48(4), pages 565-578, April.
    34. Henry Wai-chung Yeung & Neil Coe, 2015. "Toward a Dynamic Theory of Global Production Networks," Economic Geography, Taylor & Francis Journals, vol. 91(1), pages 29-58, January.
    35. Matthew Gentzkow & Bryan Kelly & Matt Taddy, 2019. "Text as Data," Journal of Economic Literature, American Economic Association, vol. 57(3), pages 535-574, September.
    36. Abdullah Gök & Alec Waterworth & Philip Shapira, 2015. "Use of web mining in studying innovation," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 653-671, January.
    37. Ron Martin & Peter Sunley, 2011. "Conceptualizing Cluster Evolution: Beyond the Life Cycle Model?," Regional Studies, Taylor & Francis Journals, vol. 45(10), pages 1299-1318, November.
    38. Gilles Duranton, 2011. "California Dreamin': The Feeble Case for Cluster Policies," Review of Economic Analysis, Digital Initiatives at the University of Waterloo Library, vol. 3(1), pages 3-45, July.
    39. Juliana Martins, 2015. "The Extended Workplace in a Creative Cluster: Exploring Space(s) of Digital Work in Silicon Roundabout," Journal of Urban Design, Taylor & Francis Journals, vol. 20(1), pages 125-145, February.
    40. Yingjie Hu & Chengbin Deng & Zhou Zhou, 2019. "A Semantic and Sentiment Analysis on Online Neighborhood Reviews for Understanding the Perceptions of People toward Their Living Environments," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 109(4), pages 1052-1073, July.
    41. Glenn Ellison & Edward L. Glaeser & William R. Kerr, 2010. "What Causes Industry Agglomeration? Evidence from Coagglomeration Patterns," American Economic Review, American Economic Association, vol. 100(3), pages 1195-1213, June.
    42. Li, Yin & Arora, Sanjay & Youtie, Jan & Shapira, Philip, 2018. "Using web mining to explore Triple Helix influences on growth in small and mid-size firms," Technovation, Elsevier, vol. 76, pages 3-14.
    43. Chris Hamnett, 2003. "Gentrification and the Middle-class Remaking of Inner London, 1961-2001," Urban Studies, Urban Studies Journal Limited, vol. 40(12), pages 2401-2426, November.
    44. Henderson, J. Vernon, 2003. "Marshall's scale economies," Journal of Urban Economics, Elsevier, vol. 53(1), pages 1-28, January.
    45. Ron Martin & Peter Sunley, 2003. "Deconstructing clusters: chaotic concept or policy panacea?," Journal of Economic Geography, Oxford University Press, vol. 3(1), pages 5-35, January.
    46. Nathan, Max & Rosso, Anna, 2015. "Mapping digital businesses with big data: Some early findings from the UK," Research Policy, Elsevier, vol. 44(9), pages 1714-1733.
    47. Catini, Roberto & Karamshuk, Dmytro & Penner, Orion & Riccaboni, Massimo, 2015. "Identifying geographic clusters: A network analytic approach," Research Policy, Elsevier, vol. 44(9), pages 1749-1762.
    48. Henry Wai-chung Yeung & Neil M. Coe, 2015. "Toward a Dynamic Theory of Global Production Networks," Economic Geography, Clark University, vol. 91(1), pages 29-58, January.
    49. Vernon Henderson, J., 2007. "Understanding knowledge spillovers," Regional Science and Urban Economics, Elsevier, vol. 37(4), pages 497-508, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Carlino, Gerald & Kerr, William R., 2015. "Agglomeration and Innovation," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 349-404, Elsevier.
    2. Margarida Madaleno & Max Nathan & Henry Overman & Sevrin Waights, 2018. "Incubators, accelerators and regional economic development," CEP Discussion Papers dp1575, Centre for Economic Performance, LSE.
    3. Combes, Pierre-Philippe & Gobillon, Laurent, 2015. "The Empirics of Agglomeration Economies," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 247-348, Elsevier.
    4. Kristian Behrens, 2016. "Agglomeration and clusters: Tools and insights from coagglomeration patterns," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 49(4), pages 1293-1339, November.
    5. repec:hal:spmain:info:hdl:2441/1kv8mtgl748r0ahh12air9erdc is not listed on IDEAS
    6. Wang, Liang & Tan, Justin & Li, Wan, 2018. "The impacts of spatial positioning on regional new venture creation and firm mortality over the industry life cycle," Journal of Business Research, Elsevier, vol. 86(C), pages 41-52.
    7. Nathan, Max, 2022. "Does light touch cluster policy work? Evaluating the tech city programme," Research Policy, Elsevier, vol. 51(9).
    8. Xiwei Zhu & Ye Liu & Ming He & Deming Luo & Yiyun Wu, 2019. "Entrepreneurship and industrial clusters: evidence from China industrial census," Small Business Economics, Springer, vol. 52(3), pages 595-616, March.
    9. Delgado, Mercedes & Porter, Michael E. & Stern, Scott, 2014. "Clusters, convergence, and economic performance," Research Policy, Elsevier, vol. 43(10), pages 1785-1799.
    10. Lu, Ren & Ruan, Min & Reve, Torger, 2016. "Cluster and co-located cluster effects: An empirical study of six Chinese city regions," Research Policy, Elsevier, vol. 45(10), pages 1984-1995.
    11. Bartelme, Dominick & Ziv, Oren, 2023. "JUE Insight: Firms and industry agglomeration," Journal of Urban Economics, Elsevier, vol. 133(C).
    12. William R. Kerr & Frederic Robert-Nicoud, 2020. "Tech Clusters," Journal of Economic Perspectives, American Economic Association, vol. 34(3), pages 50-76, Summer.
    13. Margarida Madaleno & Max Nathan & Henry Overman & Sevrin Waights, 2022. "Incubators, accelerators and urban economic development," Urban Studies, Urban Studies Journal Limited, vol. 59(2), pages 281-300, February.
    14. Emma Howard & Carol Newman & Finn Tarp, 2016. "Measuring industry coagglomeration and identifying the driving forces," Journal of Economic Geography, Oxford University Press, vol. 16(5), pages 1055-1078.
    15. John Rand & Finn Tarp & Neda Trifković & Helge Zille, 2019. "Industrial agglomeration in Myanmar," WIDER Working Paper Series wp-2019-3, World Institute for Development Economic Research (UNU-WIDER).
    16. Ehrl, Philipp, 2013. "Agglomeration economies with consistent productivity estimates," Regional Science and Urban Economics, Elsevier, vol. 43(5), pages 751-763.
    17. Carlino, Gerald & Kerr, William R., 2015. "Agglomeration and Innovation," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 349-404, Elsevier.
    18. repec:zbw:bofrdp:2015_027 is not listed on IDEAS
    19. Matthias Firgo & Peter Mayerhofer, 2015. "Wissens-Spillovers und regionale Entwicklung - welche strukturpolitische Ausrichtung optimiert des Wachstum?," Working Paper Reihe der AK Wien - Materialien zu Wirtschaft und Gesellschaft 144, Kammer für Arbeiter und Angestellte für Wien, Abteilung Wirtschaftswissenschaft und Statistik.
    20. Giulia Faggio & Olmo Silva & William C Strange, 2020. "Tales of the city: what do agglomeration cases tell us about agglomeration in general? [The anchor tenant hypothesis: exploring the role of large, local, R&D-intensive firms in regional innovation ," Journal of Economic Geography, Oxford University Press, vol. 20(5), pages 1117-1143.
    21. Martin, Philippe & Mayer, Thierry & Mayneris, Florian, 2011. "Spatial concentration and plant-level productivity in France," Journal of Urban Economics, Elsevier, vol. 69(2), pages 182-195, March.
    22. Mercedes Delgado & Michael E. Porter & Scott Stern, 2016. "Defining clusters of related industries," Journal of Economic Geography, Oxford University Press, vol. 16(1), pages 1-38.

    More about this item

    Keywords

    cities; clusters; machine learning; technology industry; onsumer Data Research Centre (CDRC) and Engineering and Physical Sciences Research Council (ESRC;
    All these keywords.

    JEL classification:

    • L86 - Industrial Organization - - Industry Studies: Services - - - Information and Internet Services; Computer Software
    • C50 - Mathematical and Quantitative Methods - - Econometric Modeling - - - General
    • O31 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Innovation and Invention: Processes and Incentives
    • R12 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General Regional Economics - - - Size and Spatial Distributions of Regional Economic Activity; Interregional Trade (economic geography)

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:115565. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.