IDEAS home Printed from https://ideas.repec.org/a/kap/jgeosy/v17y2015i4p333-351.html
   My bibliography  Save this article

Optimizing distance-based methods for large data sets

Author

Listed:
  • Tobias Scholl
  • Thomas Brenner

Abstract

Distance-based methods for measuring spatial concentration of industries have received an increasing popularity in the spatial econometrics community. However, a limiting factor for using these methods is their computational complexity since both their memory requirements and running times are in $${\mathcal {O}}(n^2)$$ O ( n 2 ) . In this paper, we present an algorithm with constant memory requirements and shorter running time, enabling distance-based methods to deal with large data sets. We discuss three recent distance-based methods in spatial econometrics: the D&O-Index by Duranton and Overman (Rev Econ Stud 72(4):1077–1106, 2005 ), the M-function by Marcon and Puech (J Econ Geogr 10(5):745–762, 2010 ) and the Cluster-Index by Scholl and Brenner (Reg Stud (ahead-of-print):1–15, 2014 ). Finally, we present an alternative calculation for the latter index that allows the use of data sets with millions of firms. Copyright Springer-Verlag Berlin Heidelberg 2015

Suggested Citation

  • Tobias Scholl & Thomas Brenner, 2015. "Optimizing distance-based methods for large data sets," Journal of Geographical Systems, Springer, vol. 17(4), pages 333-351, October.
  • Handle: RePEc:kap:jgeosy:v:17:y:2015:i:4:p:333-351
    DOI: 10.1007/s10109-015-0219-1
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s10109-015-0219-1
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10109-015-0219-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stefania Vitali & Mauro Napoletano & Giorgio Fagiolo, 2013. "Spatial Localization in Manufacturing: A Cross-Country Analysis," Regional Studies, Taylor & Francis Journals, vol. 47(9), pages 1534-1554, October.
    2. Gilles Duranton & Henry G. Overman, 2005. "Testing for Localization Using Micro-Geographic Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 72(4), pages 1077-1106.
    3. A. J. Baddeley & J. Møller & R. Waagepetersen, 2000. "Non‐ and semi‐parametric estimation of interaction in inhomogeneous point patterns," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 54(3), pages 329-350, November.
    4. Glenn Ellison & Edward L. Glaeser & William R. Kerr, 2010. "What Causes Industry Agglomeration? Evidence from Coagglomeration Patterns," American Economic Review, American Economic Association, vol. 100(3), pages 1195-1213, June.
    5. Hyun-Ju Koh & Nadine Riedel, 2014. "Assessing the Localization Pattern of German Manufacturing and Service Industries: A Distance-based Approach," Regional Studies, Taylor & Francis Journals, vol. 48(5), pages 823-843, May.
    6. Eric Marcon & Florence Puech, 2003. "Evaluating the Geographic Concentration of Industries Using Distance-Based Methods," Post-Print halshs-00372646, HAL.
    7. Eric Marcon & Florence Puech, 2010. "Measures of the geographic concentration of industries: improving distance-based methods," Journal of Economic Geography, Oxford University Press, vol. 10(5), pages 745-762, September.
    8. Eric Marcon & Florence Puech, 2003. "Evaluating the Geographic Concentration of Industries Using Distance-Based Methods," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-00372646, HAL.
    9. Harvey J. Miller, 2010. "The Data Avalanche Is Here. Shouldn’T We Be Digging?," Journal of Regional Science, Wiley Blackwell, vol. 50(1), pages 181-201, February.
    10. Eric Marcon & Florence Puech, 2012. "A typology of distance-based measures of spatial concentration," Working Papers halshs-00679993, HAL.
    11. Barlet, M. & Briant, A. & Crusson, L., 2013. "Location patterns of service industries in France: A distance-based approach," Regional Science and Urban Economics, Elsevier, vol. 43(2), pages 338-351.
    12. Giuseppe Espa & Giuseppe Arbia & Diego Giuliani, 2010. "Measuring industrial agglomeration with inhomogeneous K-function: the case of ICT firms in Milan (Italy)," Department of Economics Working Papers 1014, Department of Economics, University of Trento, Italia.
    13. Reinhold Kosfeld & Hans-Friedrich Eckey & Jørgen Lauridsen, 2011. "Spatial point pattern analysis and industry concentration," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 47(2), pages 311-328, October.
    14. Eric Marcon & Florence Puech, 2003. "Evaluating the geographic concentration of industries using distance-based methods," Journal of Economic Geography, Oxford University Press, vol. 3(4), pages 409-428, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Marcon, Eric & Puech, Florence, 2017. "A typology of distance-based measures of spatial concentration," Regional Science and Urban Economics, Elsevier, vol. 62(C), pages 56-67.
    2. Katarina Weßling & Nora Bechler, 2019. "Where do regional influences matter? The impact of socio-spatial indicators on transitions from secondary school to university [Wo hat Region einen Einfluss? Effekte sozialräumlicher Indikatoren au," Review of Regional Research: Jahrbuch für Regionalwissenschaft, Springer;Gesellschaft für Regionalforschung (GfR), vol. 39(2), pages 163-188, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marcon, Eric & Puech, Florence, 2017. "A typology of distance-based measures of spatial concentration," Regional Science and Urban Economics, Elsevier, vol. 62(C), pages 56-67.
    2. Gabriel Lang & Eric Marcon & Florence Puech, 2020. "Distance-based measures of spatial concentration: introducing a relative density function," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 64(2), pages 243-265, April.
    3. Marta R. Casanova & Vicente Orts & José M. Albert, 2017. "Sectoral scope and colocalisation of Spanish manufacturing industries," Journal of Geographical Systems, Springer, vol. 19(1), pages 65-92, January.
    4. Behrens, Kristian & Bougna, Théophile, 2015. "An anatomy of the geographical concentration of Canadian manufacturing industries," Regional Science and Urban Economics, Elsevier, vol. 51(C), pages 47-69.
    5. Tobias Scholl & Thomas Brenner, 2013. "Optimizing Distance-Based Methods for Big Data Analysis," Working Papers on Innovation and Space 2013-09, Philipps University Marburg, Department of Geography.
    6. Eric Marcon & Florence Puech, 2012. "A typology of distance-based measures of spatial concentration," Working Papers halshs-00679993, HAL.
    7. Gibbons, Steve & Overman, Henry G. & Patacchini, Eleonora, 2015. "Spatial Methods," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 115-168, Elsevier.
    8. Jean Dubé & Cédric Brunelle, 2014. "Dots to dots: a general methodology to build local indicators using spatial micro-data," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 53(1), pages 245-272, August.
    9. Hiroyasu Inoue & Kentaro Nakajima & Yukiko Umeno Saito, 2019. "Localization of collaborations in knowledge creation," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 62(1), pages 119-140, February.
    10. Ana Isabel Moreno-Monroy & Gustavo Adolfo García Cruz, 2016. "Intra-Metropolitan Agglomeration of Formal and Informal Manufacturing Activity: Evidence from Cali, Colombia," Tijdschrift voor Economische en Sociale Geografie, Royal Dutch Geographical Society KNAG, vol. 107(4), pages 389-406, September.
    11. Nakajima, Kentaro & Saito, Yukiko Umeno & Uesugi, Iichiro, 2012. "Measuring economic localization: Evidence from Japanese firm-level data," Journal of the Japanese and International Economies, Elsevier, vol. 26(2), pages 201-220.
    12. Wolfgang Dauth & Michaela Fuchs & Anne Otto, 2018. "Long‐run processes of geographical concentration and dispersion: Evidence from Germany," Papers in Regional Science, Wiley Blackwell, vol. 97(3), pages 569-593, August.
    13. Brad Humphreys & Josh Matti, 2018. "The Spatial Distribution of Urban Consumer Service Firms: Evidence from Yelp Reviews," Working Papers 18-03, Department of Economics, West Virginia University.
    14. Giuseppe Espa & Giuseppe Arbia & Diego Giuliani, 2013. "Conditional versus unconditional industrial agglomeration: disentangling spatial dependence and spatial heterogeneity in the analysis of ICT firms’ distribution in Milan," Journal of Geographical Systems, Springer, vol. 15(1), pages 31-50, January.
    15. Robert Schweizog & Alan Collins, 2015. "A Simple Location Index Plus Some Maps and no Apologies: Back to Basics on the Development of Links Between Economic Integration and Spatial Concentration of Industries," Tijdschrift voor Economische en Sociale Geografie, Royal Dutch Geographical Society KNAG, vol. 106(1), pages 17-35, February.
    16. Florent Bonneu & Christine Thomas-Agnan, 2015. "Measuring and Testing Spatial Mass Concentration with Micro-geographic Data," Spatial Economic Analysis, Taylor & Francis Journals, vol. 10(3), pages 289-316, September.
    17. Giulio Cainelli & Roberto Ganau & Yuting Jiang, 2020. "Detecting space–time agglomeration processes over the Great Recession using firm-level micro-geographic data," Journal of Geographical Systems, Springer, vol. 22(4), pages 419-445, October.
    18. William R. Kerr & Scott Duke Kominers, 2015. "Agglomerative Forces and Cluster Shapes," The Review of Economics and Statistics, MIT Press, vol. 97(4), pages 877-899, October.
    19. Di Giacinto, Valter & Pagnini, Marcello, 2011. "Local and global agglomeration patterns: Two econometrics-based indicators," Regional Science and Urban Economics, Elsevier, vol. 41(3), pages 266-280, May.
    20. Bartelme, Dominick & Ziv, Oren, 2023. "JUE Insight: Firms and industry agglomeration," Journal of Urban Economics, Elsevier, vol. 133(C).

    More about this item

    Keywords

    Spatial concentration; Duranton–Overman index; Big data analysis; MAUP; Distance-based measures; C21; C60; R12;
    All these keywords.

    JEL classification:

    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models
    • C60 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - General
    • R12 - Urban, Rural, Regional, Real Estate, and Transportation Economics - - General Regional Economics - - - Size and Spatial Distributions of Regional Economic Activity; Interregional Trade (economic geography)

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:jgeosy:v:17:y:2015:i:4:p:333-351. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.