IDEAS home Printed from https://ideas.repec.org/a/sae/envirb/v47y2020i3p489-507.html
   My bibliography  Save this article

Distance metric choice can both reduce and induce collinearity in geographically weighted regression

Author

Listed:
  • Alexis Comber

    (University of Leeds, UK)

  • Khanh Chi

    (GeoViet Consulting Co. Ltd, Vietnam)

  • Man Q Huy

    (Vietnam National University, Vietnam)

  • Quan Nguyen

    (National University of Civil Engineering, Vietnam)

  • Binbin Lu

    (Wuhan University, China)

  • Hoang H Phe

    (Vinaconex R&D, Vietnam)

  • Paul Harris

Abstract

This paper explores the impact of different distance metrics on collinearity in local regression models such as geographically weighted regression. Using a case study of house price data collected in Hà Nội, Vietnam, and by fully varying both power and rotation parameters to create different Minkowski distances, the analysis shows that local collinearity can be both negatively and positively affected by distance metric choice. The Minkowski distance that maximised collinearity in a geographically weighted regression was approximate to a Manhattan distance with (power =  0.70 ) with a rotation of 30°, and that which minimised collinearity was parameterised with power  = 0.05 and a rotation of 70 °. The results indicate that distance metric choice can provide a useful extra tuning component to address local collinearity issues in spatially varying coefficient modelling and that understanding the interaction of distance metric and collinearity can provide insight into the nature and structure of the data relationships. The discussion considers first, the exploration and selection of different distance metrics to minimise collinearity as an alternative to localised ridge regression, lasso and elastic net approaches. Second, it discusses the how distance metric choice could extend the methods that additionally optimise local model fit (lasso and elastic net) by selecting a distance metric that further helped minimise local collinearity. Third, it identifies the need to investigate the relationship between kernel bandwidth, distance metrics and collinearity as an area of further work.

Suggested Citation

  • Alexis Comber & Khanh Chi & Man Q Huy & Quan Nguyen & Binbin Lu & Hoang H Phe & Paul Harris, 2020. "Distance metric choice can both reduce and induce collinearity in geographically weighted regression," Environment and Planning B, , vol. 47(3), pages 489-507, March.
  • Handle: RePEc:sae:envirb:v:47:y:2020:i:3:p:489-507
    DOI: 10.1177/2399808318784017
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/2399808318784017
    Download Restriction: no

    File URL: https://libkey.io/10.1177/2399808318784017?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. David Wheeler & Michael Tiefelsdorf, 2005. "Multicollinearity and correlation among local regression coefficients in geographically weighted regression," Journal of Geographical Systems, Springer, vol. 7(2), pages 161-187, June.
    2. ,, 2002. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 18(4), pages 1007-1017, August.
    3. Antonio Páez & Takashi Uchida & Kazuaki Miyamoto, 2002. "A General Framework for Estimation and Inference of Geographically Weighted Regression Models: 2. Spatial Association and Model Specification Tests," Environment and Planning A, , vol. 34(5), pages 883-904, May.
    4. ,, 2002. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 18(6), pages 1461-1465, December.
    5. ,, 2002. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 18(1), pages 193-194, February.
    6. A. Stewart Fotheringham & Wenbai Yang & Wei Kang, 2017. "Multiscale Geographically Weighted Regression (MGWR)," Annals of the American Association of Geographers, Taylor & Francis Journals, vol. 107(6), pages 1247-1265, November.
    7. ,, 2002. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 18(5), pages 1273-1289, October.
    8. Antonio Páez & Takashi Uchida & Kazuaki Miyamoto, 2002. "A General Framework for Estimation and Inference of Geographically Weighted Regression Models: 1. Location-Specific Kernel Bandwidths and a Test for Locational Heterogeneity," Environment and Planning A, , vol. 34(4), pages 733-754, April.
    9. Hui Zou & Trevor Hastie, 2005. "Addendum: Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(5), pages 768-768, November.
    10. Steven Farber & Antonio Páez, 2007. "A systematic investigation of cross-validation in GWR model estimation: empirical analysis and Monte Carlo simulations," Journal of Geographical Systems, Springer, vol. 9(4), pages 371-396, December.
    11. Gollini, Isabella & Lu, Binbin & Charlton, Martin & Brunsdon, Christopher & Harris, Paul, 2015. "GWmodel: An R Package for Exploring Spatial Heterogeneity Using Geographically Weighted Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 63(i17).
    12. Gelfand A.E. & Kim H-J. & Sirmans C.F. & Banerjee S., 2003. "Spatial Modeling With Spatially Varying Coefficient Processes," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 387-396, January.
    13. ,, 2002. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 18(3), pages 819-821, June.
    14. ,, 2002. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 18(2), pages 541-545, April.
    15. Hui Zou & Trevor Hastie, 2005. "Regularization and variable selection via the elastic net," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 67(2), pages 301-320, April.
    16. M. Bárcena & P. Menéndez & M. Palacios & F. Tusell, 2014. "Alleviating the effect of collinearity in geographically weighted regression," Journal of Geographical Systems, Springer, vol. 16(4), pages 441-466, October.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alexis Comber & Paul Harris, 2018. "Geographically weighted elastic net logistic regression," Journal of Geographical Systems, Springer, vol. 20(4), pages 317-341, October.
    2. Claudia García-García & Catalina B. García-García & Román Salmerón, 2021. "Confronting collinearity in environmental regression models: evidence from world data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(3), pages 895-926, September.
    3. Cambier, Adrien & Chardy, Matthieu & Figueiredo, Rosa & Ouorou, Adam & Poss, Michael, 2022. "Optimizing subscriber migrations for a telecommunication operator in uncertain context," European Journal of Operational Research, Elsevier, vol. 298(1), pages 308-321.
    4. Libura, Marek, 2007. "On the adjustment problem for linear programs," European Journal of Operational Research, Elsevier, vol. 183(1), pages 125-134, November.
    5. Christophe Loussouarn & Carine Franc & Yann Videau & Julien Mousquès, 2021. "Can General Practitioners Be More Productive? The Impact of Teamwork and Cooperation with Nurses on GP Activities," Health Economics, John Wiley & Sons, Ltd., vol. 30(3), pages 680-698, March.
    6. Tschakert, Petra, 2016. "Shifting Discourses of Vilification and the Taming of Unruly Mining Landscapes in Ghana," World Development, Elsevier, vol. 86(C), pages 123-132.
    7. María-Consuelo Casabán & Rafael Company & Lucas Jódar, 2020. "Non-Gaussian Quadrature Integral Transform Solution of Parabolic Models with a Finite Degree of Randomness," Mathematics, MDPI, vol. 8(7), pages 1-16, July.
    8. Isabelle Boutron & Peter John & David J. Torgerson, 2010. "Reporting Methodological Items in Randomized Experiments in Political Science," The ANNALS of the American Academy of Political and Social Science, , vol. 628(1), pages 112-131, March.
    9. Ben Slimane, Faten & Padilla Angulo, Laura, 2019. "Strategic change and corporate governance: Evidence from the stock exchange industry," Journal of Business Research, Elsevier, vol. 103(C), pages 206-218.
    10. Bossert, Walter & Derks, Jean & Peters, Hans, 2005. "Efficiency in uncertain cooperative games," Mathematical Social Sciences, Elsevier, vol. 50(1), pages 12-23, July.
    11. Weijun Xie & Yanfeng Ouyang & Sze Chun Wong, 2016. "Reliable Location-Routing Design Under Probabilistic Facility Disruptions," Transportation Science, INFORMS, vol. 50(3), pages 1128-1138, August.
    12. Sin-Yu Ho & N.M. Odhiambo, 2018. "Analysing the macroeconomic drivers of stock market development in the Philippines," Cogent Economics & Finance, Taylor & Francis Journals, vol. 6(1), pages 1451265-145, January.
    13. Natalia Nikolaevna Natocheeva* & Yuri Alexandrovich Rovensky & Yuri Yuryevich Rusanov & Tatiana Viktorovna Belyanchikova & Anna Anatolevna Staurskaya, 2018. "Optimizing Variability of Approaches to Regulatory Financing of Higher Education Services," The Journal of Social Sciences Research, Academic Research Publishing Group, pages 221-227:3.
    14. Philip Arestis & Howard Stein, 2005. "An Institutional Perspective to Finance and Development as an Alternative to Financial Liberalisation," International Review of Applied Economics, Taylor & Francis Journals, vol. 19(4), pages 381-398.
    15. Sahar Validi & Arijit Bhattacharya & P. J. Byrne, 2020. "Sustainable distribution system design: a two-phase DoE-guided meta-heuristic solution approach for a three-echelon bi-objective AHP-integrated location-routing model," Annals of Operations Research, Springer, vol. 290(1), pages 191-222, July.
    16. Cabada, Alberto & Fernández-Gómez, Carlos, 2015. "Constant sign solutions of two-point fourth order problems," Applied Mathematics and Computation, Elsevier, vol. 263(C), pages 122-133.
    17. Andy Hall, 2005. "Capacity development for agricultural biotechnology in developing countries: an innovation systems view of what it is and how to develop it," Journal of International Development, John Wiley & Sons, Ltd., vol. 17(5), pages 611-630.
    18. Athinoula A. Kosti & Simon Colreavy-Donnelly & Fabio Caraffini & Zacharias A. Anastassi, 2020. "Efficient Computation of the Nonlinear Schrödinger Equation with Time-Dependent Coefficients," Mathematics, MDPI, vol. 8(3), pages 1-12, March.
    19. Bruno Frey, 2005. "Problems with Publishing: Existing State and Solutions," European Journal of Law and Economics, Springer, vol. 19(2), pages 173-190, April.
    20. Lan, Heng-you, 2021. "Approximation-solvability of population biology systems based on p-Laplacian elliptic inequalities with demicontinuous strongly pseudo-contractive operators," Chaos, Solitons & Fractals, Elsevier, vol. 150(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:envirb:v:47:y:2020:i:3:p:489-507. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.