IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2105.07168.html
   My bibliography  Save this paper

Cohort Shapley value for algorithmic fairness

Author

Listed:
  • Masayoshi Mase
  • Art B. Owen
  • Benjamin B. Seiler

Abstract

Cohort Shapley value is a model-free method of variable importance grounded in game theory that does not use any unobserved and potentially impossible feature combinations. We use it to evaluate algorithmic fairness, using the well known COMPAS recidivism data as our example. This approach allows one to identify for each individual in a data set the extent to which they were adversely or beneficially affected by their value of a protected attribute such as their race. The method can do this even if race was not one of the original predictors and even if it does not have access to a proprietary algorithm that has made the predictions. The grounding in game theory lets us define aggregate variable importance for a data set consistently with its per subject definitions. We can investigate variable importance for multiple quantities of interest in the fairness literature including false positive predictions.

Suggested Citation

  • Masayoshi Mase & Art B. Owen & Benjamin B. Seiler, 2021. "Cohort Shapley value for algorithmic fairness," Papers 2105.07168, arXiv.org.
  • Handle: RePEc:arx:papers:2105.07168
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2105.07168
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mukund Sundararajan & Amir Najmi, 2019. "The many Shapley values for model explanation," Papers 1908.08474, arXiv.org, revised Feb 2020.
    2. Wei, Pengfei & Lu, Zhenzhou & Song, Jingwen, 2015. "Variable importance analysis: A comprehensive review," Reliability Engineering and System Safety, Elsevier, vol. 142(C), pages 399-432.
    3. Jiang, Tao & Owen, Art B., 2003. "Quasi-regression with shrinkage," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 62(3), pages 231-241.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Masayoshi Mase & Art B. Owen & Benjamin B. Seiler, 2022. "Variable importance without impossible data," Papers 2205.15750, arXiv.org, revised Apr 2023.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Masayoshi Mase & Art B. Owen & Benjamin B. Seiler, 2022. "Variable importance without impossible data," Papers 2205.15750, arXiv.org, revised Apr 2023.
    2. Asma Shaheen & Javed Iqbal, 2018. "Spatial Distribution and Mobility Assessment of Carcinogenic Heavy Metals in Soil Profiles Using Geostatistics and Random Forest, Boruta Algorithm," Sustainability, MDPI, vol. 10(3), pages 1-20, March.
    3. Shang, Xiaobing & Su, Li & Fang, Hai & Zeng, Bowen & Zhang, Zhi, 2023. "An efficient multi-fidelity Kriging surrogate model-based method for global sensitivity analysis," Reliability Engineering and System Safety, Elsevier, vol. 229(C).
    4. Alireza Rezazadeh & Yasamin Jafarian & Ali Kord, 2022. "Explainable Ensemble Machine Learning for Breast Cancer Diagnosis Based on Ultrasound Image Texture Features," Forecasting, MDPI, vol. 4(1), pages 1-13, February.
    5. Matteo Rucco & Giovanna Viticchi & Lorenzo Falsetti, 2020. "Towards Personalized Diagnosis of Glioblastoma in Fluid-Attenuated Inversion Recovery (FLAIR) by Topological Interpretable Machine Learning," Mathematics, MDPI, vol. 8(5), pages 1-27, May.
    6. Yun, Wanying & Lu, Zhenzhou & Feng, Kaixuan & Li, Luyi, 2019. "An elaborate algorithm for analyzing the Borgonovo moment-independent sensitivity by replacing the probability density function estimation with the probability estimation," Reliability Engineering and System Safety, Elsevier, vol. 189(C), pages 99-108.
    7. Hu'e Sullivan & Hurlin Christophe & P'erignon Christophe & Saurin S'ebastien, 2022. "Measuring the Driving Forces of Predictive Performance: Application to Credit Scoring," Papers 2212.05866, arXiv.org, revised Jun 2023.
    8. Hugh Chen & Scott M. Lundberg & Su-In Lee, 2022. "Explaining a series of models by propagating Shapley values," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    9. Mohamed Zine & Fouzi Harrou & Mohammed Terbeche & Mohammed Bellahcene & Abdelkader Dairi & Ying Sun, 2023. "E-Learning Readiness Assessment Using Machine Learning Methods," Sustainability, MDPI, vol. 15(11), pages 1-22, June.
    10. Cheng, Kai & Lu, Zhenzhou, 2018. "Sparse polynomial chaos expansion based on D-MORPH regression," Applied Mathematics and Computation, Elsevier, vol. 323(C), pages 17-30.
    11. Wenguang Zhang & Ting Lei & Yu Gong & Jun Zhang & Yirong Wu, 2022. "Using Explainable Artificial Intelligence to Identify Key Characteristics of Deep Poverty for Each Household," Sustainability, MDPI, vol. 14(16), pages 1-21, August.
    12. Wenxuan Wang & Hangshan Gao & Pengfei Wei & Changcong Zhou, 2017. "Extending first-passage method to reliability sensitivity analysis of motion mechanisms," Journal of Risk and Reliability, , vol. 231(5), pages 573-586, October.
    13. Lambert, Romain S.C. & Lemke, Frank & Kucherenko, Sergei S. & Song, Shufang & Shah, Nilay, 2016. "Global sensitivity analysis using sparse high dimensional model representations generated by the group method of data handling," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 128(C), pages 42-54.
    14. Hou, Tianfeng & Nuyens, Dirk & Roels, Staf & Janssen, Hans, 2019. "Quasi-Monte Carlo based uncertainty analysis: Sampling efficiency and error estimation in engineering applications," Reliability Engineering and System Safety, Elsevier, vol. 191(C).
    15. Yun, Wanying & Lu, Zhenzhou & Jiang, Xian, 2019. "An efficient method for moment-independent global sensitivity analysis by dimensional reduction technique and principle of maximum entropy," Reliability Engineering and System Safety, Elsevier, vol. 187(C), pages 174-182.
    16. Kleijnen, Jack P.C., 2017. "Regression and Kriging metamodels with their experimental designs in simulation: A review," European Journal of Operational Research, Elsevier, vol. 256(1), pages 1-16.
    17. Aras, Serkan & Hanifi Van, M., 2022. "An interpretable forecasting framework for energy consumption and CO2 emissions," Applied Energy, Elsevier, vol. 328(C).
    18. McFarland, John & DeCarlo, Erin, 2020. "A Monte Carlo framework for probabilistic analysis and variance decomposition with distribution parameter uncertainty," Reliability Engineering and System Safety, Elsevier, vol. 197(C).
    19. Mi, Jinhua & Beer, Michael & Li, Yan-Feng & Broggi, Matteo & Cheng, Yuhua, 2020. "Reliability and importance analysis of uncertain system with common cause failures based on survival signature," Reliability Engineering and System Safety, Elsevier, vol. 201(C).
    20. Ronald Richman & Mario V. Wuthrich, 2021. "LocalGLMnet: interpretable deep learning for tabular data," Papers 2107.11059, arXiv.org.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2105.07168. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.