IDEAS home Printed from https://ideas.repec.org/p/fip/fedkrw/97176.html
   My bibliography  Save this paper

Understanding Models and Model Bias with Gaussian Processes

Author

Abstract

Despite growing interest in the use of complex models, such as machine learning (ML) models, for credit underwriting, ML models are difficult to interpret, and it is possible for them to learn relationships that yield de facto discrimination. How can we understand the behavior and potential biases of these models, especially if our access to the underlying model is limited? We argue that counterfactual reasoning is ideal for interpreting model behavior, and that Gaussian processes (GP) can provide approximate counterfactual reasoning while also incorporating uncertainty in the underlying model’s functional form. We illustrate with an exercise in which a simulated lender uses a biased machine model to decide credit terms. Comparing aggregate outcomes does not clearly reveal bias, but with a GP model we can estimate individual counterfactual outcomes. This approach can detect the bias in the lending model even when only a relatively small sample is available. To demonstrate the value of this approach for the more general task of model interpretability, we also show how the GP model’s estimates can be aggregated to recreate the partial density functions for the lending model.

Suggested Citation

  • Thomas R. Cook & Nathan M. Palmer, 2023. "Understanding Models and Model Bias with Gaussian Processes," Research Working Paper RWP 23-07, Federal Reserve Bank of Kansas City.
  • Handle: RePEc:fip:fedkrw:97176
    DOI: 10.18651/RWP2023-07
    as

    Download full text from publisher

    File URL: https://www.kansascityfed.org/Research%20Working%20Papers/documents/9592/rwp23-07cookpalmer.pdf
    File Function: Full Text
    Download Restriction: no

    File URL: https://libkey.io/10.18651/RWP2023-07?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Michael Funke & Marc Gronwald, 2009. "A Convex Hull Approach to Counterfactual Analysis of Trade Openness and Growth," Quantitative Macroeconomics Working Papers 20906, Hamburg University, Department of Economics.
    2. Marianne Bertrand & Sendhil Mullainathan, 2004. "Are Emily and Greg More Employable Than Lakisha and Jamal? A Field Experiment on Labor Market Discrimination," American Economic Review, American Economic Association, vol. 94(4), pages 991-1013, September.
    3. Thomas R. Cook & Greg Gupton & Zach Modig & Nathan M. Palmer, 2021. "Explaining Machine Learning by Bootstrapping Partial Dependence Functions and Shapley Values," Research Working Paper RWP 21-12, Federal Reserve Bank of Kansas City.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chowdhury, Shyamal & Ooi, Evarn & Slonim, Robert, 2017. "Racial discrimination and white first name adoption: a field experiment in the Australian labour market," Working Papers 2017-15, University of Sydney, School of Economics.
    2. Mujcic, Redzo & Frijters, Paul, 2013. "Still Not Allowed on the Bus: It Matters If You're Black or White!," IZA Discussion Papers 7300, Institute of Labor Economics (IZA).
    3. Anthony Edo & Nicolas Jacquemet & Constantine Yannelis, 2019. "Language skills and homophilous hiring discrimination: Evidence from gender and racially differentiated applications," Review of Economics of the Household, Springer, vol. 17(1), pages 349-376, March.
    4. Kevin Lang & Ariella Kahn-Lang Spitzer, 2020. "Race Discrimination: An Economic Perspective," Journal of Economic Perspectives, American Economic Association, vol. 34(2), pages 68-89, Spring.
    5. Keith Head & Thierry Mayer, 2008. "Detection Of Local Interactions From The Spatial Pattern Of Names In France," Journal of Regional Science, Wiley Blackwell, vol. 48(1), pages 67-95, February.
    6. David H Chae & Sean Clouston & Mark L Hatzenbuehler & Michael R Kramer & Hannah L F Cooper & Sacoby M Wilson & Seth I Stephens-Davidowitz & Robert S Gold & Bruce G Link, 2015. "Association between an Internet-Based Measure of Area Racism and Black Mortality," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-12, April.
    7. Elizabeth Hirsh & Hazel Hollingdale & Natasha Stecy-Hildebrandt, 2013. "Gender inequality in the workplace," Chapters, in: Deborah M. Figart & Tonia L. Warnecke (ed.), Handbook of Research on Gender and Economic Life, chapter 12, pages 183-199, Edward Elgar Publishing.
    8. Saugato Datta & Vikram Pathania, 2016. "For whom does the phone (not) ring? Discrimination in the rental housing market in Delhi, India," WIDER Working Paper Series 055, World Institute for Development Economic Research (UNU-WIDER).
    9. Kelley Sarussi & Thomas Walstrum, 2019. "Education and the Evolution of Earnings Across Population Groups Since 2000," Profitwise, Federal Reserve Bank of Chicago, issue 5, pages 1-13.
    10. Ritwik Banerjee & Nabanita Datta Gupta, 2015. "Awareness Programs and Change in Taste-Based Caste Prejudice," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-17, April.
    11. Bertrand, Jérémie & Burietz, Aurore, 2023. "(Loan) price and (loan officer) prejudice," Journal of Economic Behavior & Organization, Elsevier, vol. 210(C), pages 26-42.
    12. Jan Hanousek & Štěpán Jurajda, 2018. "Názvy společností a jejich vliv na výkonnost firem [Corporate Names and Performance]," Politická ekonomie, Prague University of Economics and Business, vol. 2018(6), pages 671-688.
    13. Bryson, Alex & Chevalier, Arnaud, 2015. "Is there a taste for racial discrimination amongst employers?," Labour Economics, Elsevier, vol. 34(C), pages 51-63.
    14. Bethany Everett & David Rehkopf & Richard Rogers, 2013. "The Nonlinear Relationship Between Education and Mortality: An Examination of Cohort, Race/Ethnic, and Gender Differences," Population Research and Policy Review, Springer;Southern Demographic Association (SDA), vol. 32(6), pages 893-917, December.
    15. Ola Bengtsson & John R. M. Hand, 2013. "Employee Compensation in Entrepreneurial Companies," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 22(2), pages 312-340, June.
    16. John M. Nunley & Adam Pugh & Nicholas Romero & Richard Alan Seals, Jr., 2014. "Unemployment, Underemployment, and Employment Opportunities: Results from a Correspondence Audit of the Labor Market for College Graduates," Auburn Economics Working Paper Series auwp2014-04, Department of Economics, Auburn University.
    17. Li, Xilin & Hsee, Christopher K., 2021. "Free-riding and cost-bearing in discrimination," Organizational Behavior and Human Decision Processes, Elsevier, vol. 163(C), pages 80-90.
    18. Button, Patrick & Walker, Brigham, 2020. "Employment discrimination against Indigenous Peoples in the United States: Evidence from a field experiment," Labour Economics, Elsevier, vol. 65(C).
    19. Bart Capéau & Lieve Eeman & Steven Groenez & Miet Lamberts, 2012. "Standardised Scores as a Way to measure and Compare Discrimination Across Dimensions," Working Papers ECARES ECARES 2012-022, ULB -- Universite Libre de Bruxelles.
    20. Letisha Engracia Cardoso Brown, 2020. "Eat to Live, Don’t Live to Eat: Black Men, Masculinity, Faith and Food," IJERPH, MDPI, vol. 17(12), pages 1-13, June.

    More about this item

    Keywords

    models; Gaussian process; model bias;
    All these keywords.

    JEL classification:

    • C10 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - General
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C18 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Methodolical Issues: General
    • C45 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Neural Networks and Related Topics

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:fip:fedkrw:97176. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Zach Kastens (email available below). General contact details of provider: https://edirc.repec.org/data/frbkcus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.