IDEAS home Printed from https://ideas.repec.org/a/pal/palcom/v10y2023i1d10.1057_s41599-023-01548-7.html
   My bibliography  Save this article

What dictates income in New York City? SHAP analysis of income estimation based on Socio-economic and Spatial Information Gaussian Processes (SSIG)

Author

Listed:
  • Ruiqiao Bai

    (The University of Hong Kong)

  • Jacqueline C. K. Lam

    (The University of Hong Kong)

  • Victor O. K. Li

    (The University of Hong Kong)

Abstract

Income inequality presents a key challenge to urban sustainability across the developed economies. Traditionally, accurate high granularity income data are generally obtained from field surveys. However, due to privacy considerations, field subjects are hesitant to provide accurate personal income data. A Socio-economic & Spatial-Information-GP (SSIG) model is thereby developed to estimate district-based high granularity income for New York City (NYC). As compared to the state-of-the-art Gaussian Processes (GP) income estimation model based entirely on spatial information, SSIG incorporates socio-economic domain-specific knowledge into a GP model. For SSIG to be explainable, SHapley Additive exPlanations (SHAP) analysis is undertaken to evaluate the relative contribution of various key individual socio-economic variables to district-based per-capita and median household income in NYC. Differentiating from traditional income inequality studies based predominantly on linear or log-linear regression model, SSIG presents a novel income-based model architecture, capable of modelling complex non-linear relationships. In parallel, SHAP analysis serves an effective analytical tool for identifying the key attributes to income inequality. Results have shown that SSIG surpasses other state-of-the-art baselines in estimation accuracy, as far as per-capita and median household income estimation at the Tract-level and the ZIP-level in NYC are concerned. SHAP results have indicated that having a bachelor or a postgraduate degree can accurately predict income in NYC, despite that between-district income inequality due to Sex/Race remains prevalent. SHAP has further confirmed that between-district income gap is more associated with Race than Sex. Furthermore, ablation study shows that socio-economic information is more predictive of income at the ZIP-level, relative to the spatial information. This study carries significant implications for policy-making in a developed context. To promote urban economic sustainability in NYC, policymakers can attend to the growing income disparity (income inequality) contributed by Sex and Race, while giving more higher education opportunities to residents in the lower-income districts, as the estimated per-capita income is more sensitive to the proportion of adults ≥25 holding a bachelor’s degree. Finally, interpretative SHAP analysis is useful for investigating the relative contribution of socio-economic inputs to any predicted outputs in future machine-learning-driven socio-economic analyses.

Suggested Citation

  • Ruiqiao Bai & Jacqueline C. K. Lam & Victor O. K. Li, 2023. "What dictates income in New York City? SHAP analysis of income estimation based on Socio-economic and Spatial Information Gaussian Processes (SSIG)," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-14, December.
  • Handle: RePEc:pal:palcom:v:10:y:2023:i:1:d:10.1057_s41599-023-01548-7
    DOI: 10.1057/s41599-023-01548-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/s41599-023-01548-7
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1057/s41599-023-01548-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Edward L. Glaeser & Scott Duke Kominers & Michael Luca & Nikhil Naik, 2018. "Big Data And Big Cities: The Promises And Limitations Of Improved Measures Of Urban Life," Economic Inquiry, Western Economic Association International, vol. 56(1), pages 114-137, January.
    2. Jones, Patricia, 2001. "Are educated workers really more productive?," Journal of Development Economics, Elsevier, vol. 64(1), pages 57-79, February.
    3. George Psacharopoulos & Harry Anthony Patrinos, 2004. "Returns to investment in education: a further update," Education Economics, Taylor & Francis Journals, vol. 12(2), pages 111-134.
    4. Stan Lipovetsky & Michael Conklin, 2001. "Analysis of regression in game theory approach," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 17(4), pages 319-330, October.
    5. Lasse Eika & Magne Mogstad & Basit Zafar, 2019. "Educational Assortative Mating and Household Income Inequality," Journal of Political Economy, University of Chicago Press, vol. 127(6), pages 2795-2835.
    6. R. Ebrahimi & S. Choobchian & H. Farhadian & I. Goli & E. Farmandeh & H. Azadi, 2022. "Investigating the effect of vocational education and training on rural women’s empowerment," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-11, December.
    7. Welch, F, 1970. "Education in Production," Journal of Political Economy, University of Chicago Press, vol. 78(1), pages 35-59, Jan.-Feb..
    8. Thomas FULLERTON & Enedina LICERIO & Phuntsho WANGMO, 2010. "Education, Infrastructure, and Regional Income Performance in Arkansas," Regional and Sectoral Economic Studies, Euro-American Association of Economic Development, vol. 10(1).
    9. Randall Akee & Maggie R. Jones & Sonya R. Porter, 2019. "Race Matters: Income Shares, Income Inequality, and Income Mobility for All U.S. Races," Demography, Springer;Population Association of America (PAA), vol. 56(3), pages 999-1021, June.
    10. Rauch James E., 1993. "Productivity Gains from Geographic Concentration of Human Capital: Evidence from the Cities," Journal of Urban Economics, Elsevier, vol. 34(3), pages 380-400, November.
    11. Albert Esteve & Joan García-Román & Iñaki Permanyer, 2012. "The Gender-Gap Reversal in Education and Its Effect on Union Formation: The End of Hypergamy?," Population and Development Review, The Population Council, Inc., vol. 38(3), pages 535-546, September.
    12. Thomas M. FULLERTON & Carlos R. MORALES & Adam G. WALKE, 2014. "The Effects Of Education, Infrastructure, And Demographics On Regional Income Performance In Missouri," Regional and Sectoral Economic Studies, Euro-American Association of Economic Development, vol. 14(1), pages 5-22.
    13. Määttänen, Niku & Terviö, Marko, 2014. "Income distribution and housing prices: An assignment model approach," Journal of Economic Theory, Elsevier, vol. 151(C), pages 381-410.
    14. Lucas Chancel & Alex Hough & Tancrède Voituriez, 2018. "Reducing Inequalities within Countries: Assessing the Potential of the Sustainable Development Goals," Global Policy, London School of Economics and Political Science, vol. 9(1), pages 5-16, February.
    15. Frederico Cantante, 2020. "Four profiles of inequality and tax redistribution in Europe," Palgrave Communications, Palgrave Macmillan, vol. 7(1), pages 1-7, December.
    16. Thomas M. Fullerton, 2001. "Educational attainment and border income performance," Economic and Financial Policy Review, Federal Reserve Bank of Dallas, issue Q III, pages 2-10.
    17. Pratap Kumar Mahakur & Narayan Chandra Nayak, 2019. "An investigation of intrastate income disparities and regional convergence in Odisha," Journal of Social and Economic Development, Springer;Institute for Social and Economic Change, vol. 21(2), pages 288-308, December.
    18. Yu Sang Chang & Sung Jun Jo & Yoo-Taek Lee & Yoonji Lee, 2021. "Population Density or Populations Size. Which Factor Determines Urban Traffic Congestion?," Sustainability, MDPI, vol. 13(8), pages 1-21, April.
    19. Rosenzweig, Mark R, 1995. "Why Are There Returns to Schooling?," American Economic Review, American Economic Association, vol. 85(2), pages 153-158, May.
    20. Charlotta Mellander & José Lobo & Kevin Stolarick & Zara Matheson, 2015. "Night-Time Light Data: A Good Proxy Measure for Economic Activity?," PLOS ONE, Public Library of Science, vol. 10(10), pages 1-18, October.
    21. Timo-Kolja Pfoertner & Hans-Juergen Andress & Christian Janssen, 2011. "Income or living standard and health in Germany: different ways of measurement of relative poverty with regard to self-rated health," International Journal of Public Health, Springer;Swiss School of Public Health (SSPH+), vol. 56(4), pages 373-384, August.
    22. Borrell, Luisa N. & Kodali, Hanish & Rodriguez-Alvarez, Elena, 2021. "Interracial/ethnic marriage and adverse birth outcomes: The effect of neighborhood racial/ethnic composition," Social Science & Medicine, Elsevier, vol. 270(C).
    23. Simon, Curtis J., 1998. "Human Capital and Metropolitan Employment Growth," Journal of Urban Economics, Elsevier, vol. 43(2), pages 223-243, March.
    24. Harry Anthony Patrinos, 2016. "Estimating the return to schooling using the Mincer equation," IZA World of Labor, Institute of Labor Economics (IZA), pages 278-278, July.
    25. Dang, Hai-Anh H. & Viet Nguyen, Cuong, 2021. "Gender inequality during the COVID-19 pandemic: Income, expenditure, savings, and job loss," World Development, Elsevier, vol. 140(C).
    26. Mary Townsend Hamilton, 1973. "Sex and Income Inequality among the Employed," The ANNALS of the American Academy of Political and Social Science, , vol. 409(1), pages 42-52, September.
    27. Stanley R. Bailey & Aliya Saperstein & Andrew Penner, 2014. "Race, color, and income inequality across the Americas," Demographic Research, Max Planck Institute for Demographic Research, Rostock, Germany, vol. 31(24), pages 735-756.
    28. Joel R. Malin & Chris Brown & Georgeta Ion & Isabell Ackeren & Nina Bremm & Ruth Luzmore & Jane Flood & Gul Muhammad Rind, 2020. "World-wide barriers and enablers to achieving evidence-informed practice in education: what can be learnt from Spain, England, the United States, and Germany?," Palgrave Communications, Palgrave Macmillan, vol. 7(1), pages 1-14, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Almada, Christa & Blanco-Gonzalez, Lorenzo & Eason, Patricia & Fullerton, Thomas, 2006. "Econometric Evidence Regarding Education and Border Income Performance," MPRA Paper 451, University Library of Munich, Germany, revised 2006.
    2. Thomas FULLERTON & Enedina LICERIO & Phuntsho WANGMO, 2010. "Education, Infrastructure, and Regional Income Performance in Arkansas," Regional and Sectoral Economic Studies, Euro-American Association of Economic Development, vol. 10(1).
    3. Marconi, G. & de Grip, A., 2014. "Education and growth with learning by doing," ROA Research Memorandum 010, Maastricht University, Research Centre for Education and the Labour Market (ROA).
    4. Hannum, Emily & Buchmann, Claudia, 2005. "Global Educational Expansion and Socio-Economic Development: An Assessment of Findings from the Social Sciences," World Development, Elsevier, vol. 33(3), pages 333-354, March.
    5. John V. Winters, 2013. "Human capital externalities and employment differences across metropolitan areas of the USA," Journal of Economic Geography, Oxford University Press, vol. 13(5), pages 799-822, September.
    6. David L. Sjoquist & John V. Winters, 2015. "State Merit-Based Financial Aid Programs And College Attainment," Journal of Regional Science, Wiley Blackwell, vol. 55(3), pages 364-390, June.
    7. Behrman, Jere R., 2010. "Investment in Education Inputs and Incentives," Handbook of Development Economics, in: Dani Rodrik & Mark Rosenzweig (ed.), Handbook of Development Economics, edition 1, volume 5, chapter 0, pages 4883-4975, Elsevier.
    8. Phanhpakit ONPHANHDALA, 2009. "Farmer Education and Agricultural Efficiency: Evidence from Lao PDR," GSICS Working Paper Series 20, Graduate School of International Cooperation Studies, Kobe University.
    9. Mohammad Amin & Asif M. Islam, 2022. "Does manager education play a role in the productivity of informal firms in developing economies? Evidence from firm‐level surveys," Review of Development Economics, Wiley Blackwell, vol. 26(2), pages 962-984, May.
    10. Backman, Mikaela, 2013. "Regional Variation of Returns to Education," Working Paper Series in Economics and Institutions of Innovation 300, Royal Institute of Technology, CESIS - Centre of Excellence for Science and Innovation Studies.
    11. John Winters, 2012. "Differences in employment outcomes for college town stayers and leavers," IZA Journal of Migration and Development, Springer;Forschungsinstitut zur Zukunft der Arbeit GmbH (IZA), vol. 1(1), pages 1-17, December.
    12. Bourguignon, Francois, 2005. "The Effect of Economic Growth on Social Structures," Handbook of Economic Growth, in: Philippe Aghion & Steven Durlauf (ed.), Handbook of Economic Growth, edition 1, volume 1, chapter 27, pages 1701-1747, Elsevier.
    13. Duranton, Gilles & Puga, Diego, 2014. "The Growth of Cities," Handbook of Economic Growth, in: Philippe Aghion & Steven Durlauf (ed.), Handbook of Economic Growth, edition 1, volume 2, chapter 5, pages 781-853, Elsevier.
    14. Mateos-Planas, Xavier, 2000. "Schooling and distortions in a vintage capital model," Discussion Paper Series In Economics And Econometrics 30, Economics Division, School of Social Sciences, University of Southampton.
    15. Aaron Chatterji & Edward Glaeser & William Kerr, 2014. "Clusters of Entrepreneurship and Innovation," Innovation Policy and the Economy, University of Chicago Press, vol. 14(1), pages 129-166.
    16. Iñaki Permanyer & Diederik Boertien, 2019. "A century of change in global education variability and gender differences in education," PLOS ONE, Public Library of Science, vol. 14(2), pages 1-22, February.
    17. Möller Joachim & Tubadji Annie, 2009. "The Creative Class, Bohemians and Local Labor Market Performance: A Micro-data Panel Study for Germany 1975–2004," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 229(2-3), pages 270-291, April.
    18. Liliana Sousa, 2013. "Community Determinants Of Immigrant Self-Employment: Human Capital Spillovers And Ethnic Enclaves," Working Papers 13-21, Center for Economic Studies, U.S. Census Bureau.
    19. repec:kap:iaecre:v:11:y:2005:i:2:p:231-242 is not listed on IDEAS
    20. Richard Florida & Charlotta Mellander & Kevin Stolarick, 2016. "Human capital in cities and suburbs," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 57(1), pages 91-123, July.
    21. Belal Fallah & Mark D. Partridge & Dan S. Rickman, 2014. "Geography and High-Tech Employment Growth in US Counties," Journal of Economic Geography, Oxford University Press, vol. 14(4), pages 683-720.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:palcom:v:10:y:2023:i:1:d:10.1057_s41599-023-01548-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: https://www.nature.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.