IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1908.08474.html
   My bibliography  Save this paper

The many Shapley values for model explanation

Author

Listed:
  • Mukund Sundararajan
  • Amir Najmi

Abstract

The Shapley value has become a popular method to attribute the prediction of a machine-learning model on an input to its base features. The use of the Shapley value is justified by citing [16] showing that it is the \emph{unique} method that satisfies certain good properties (\emph{axioms}). There are, however, a multiplicity of ways in which the Shapley value is operationalized in the attribution problem. These differ in how they reference the model, the training data, and the explanation context. These give very different results, rendering the uniqueness result meaningless. Furthermore, we find that previously proposed approaches can produce counterintuitive attributions in theory and in practice---for instance, they can assign non-zero attributions to features that are not even referenced by the model. In this paper, we use the axiomatic approach to study the differences between some of the many operationalizations of the Shapley value for attribution, and propose a technique called Baseline Shapley (BShap) that is backed by a proper uniqueness result. We also contrast BShap with Integrated Gradients, another extension of Shapley value to the continuous setting.

Suggested Citation

  • Mukund Sundararajan & Amir Najmi, 2019. "The many Shapley values for model explanation," Papers 1908.08474, arXiv.org, revised Feb 2020.
  • Handle: RePEc:arx:papers:1908.08474
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1908.08474
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Friedman, Eric & Moulin, Herve, 1999. "Three Methods to Share Joint Costs or Surplus," Journal of Economic Theory, Elsevier, vol. 87(2), pages 275-312, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hugh Chen & Scott M. Lundberg & Su-In Lee, 2022. "Explaining a series of models by propagating Shapley values," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    2. Joohyun Jang & Woonyoung Jeong & Sangmin Kim & Byeongcheon Lee & Miyoung Lee & Jihoon Moon, 2023. "RAID: Robust and Interpretable Daily Peak Load Forecasting via Multiple Deep Neural Networks and Shapley Values," Sustainability, MDPI, vol. 15(8), pages 1-27, April.
    3. Babaei, Golnoosh & Giudici, Paolo & Raffinetti, Emanuela, 2023. "Explainable FinTech lending," Journal of Economics and Business, Elsevier, vol. 125.
    4. Masayoshi Mase & Art B. Owen & Benjamin B. Seiler, 2021. "Cohort Shapley value for algorithmic fairness," Papers 2105.07168, arXiv.org.
    5. Ronald Richman & Mario V. Wuthrich, 2021. "LocalGLMnet: interpretable deep learning for tabular data," Papers 2107.11059, arXiv.org.
    6. Lee Changro, 2022. "Training and Interpreting Machine Learning Models: Application in Property Tax Assessment," Real Estate Management and Valuation, Sciendo, vol. 30(1), pages 13-22, March.
    7. Li, Wei & Becker, Denis Mike, 2021. "Day-ahead electricity price prediction applying hybrid models of LSTM-based deep learning methods and feature selection algorithms under consideration of market coupling," Energy, Elsevier, vol. 237(C).
    8. Kristof Lommers & Ouns El Harzli & Jack Kim, 2021. "Confronting Machine Learning With Financial Research," Papers 2103.00366, arXiv.org, revised Mar 2021.
    9. Kumar, Rishabh & Koshiyama, Adriano & da Costa, Kleyton & Kingsman, Nigel & Tewarrie, Marvin & Kazim, Emre & Roy, Arunita & Treleaven, Philip & Lovell, Zac, 2023. "Deep learning model fragility and implications for financial stability and regulation," Bank of England working papers 1038, Bank of England.
    10. Wenguang Zhang & Ting Lei & Yu Gong & Jun Zhang & Yirong Wu, 2022. "Using Explainable Artificial Intelligence to Identify Key Characteristics of Deep Poverty for Each Household," Sustainability, MDPI, vol. 14(16), pages 1-21, August.
    11. Alireza Rezazadeh & Yasamin Jafarian & Ali Kord, 2022. "Explainable Ensemble Machine Learning for Breast Cancer Diagnosis Based on Ultrasound Image Texture Features," Forecasting, MDPI, vol. 4(1), pages 1-13, February.
    12. Chen, Yujia & Calabrese, Raffaella & Martin-Barragan, Belen, 2024. "Interpretable machine learning for imbalanced credit scoring datasets," European Journal of Operational Research, Elsevier, vol. 312(1), pages 357-372.
    13. Wei Li & Denis Mike Becker, 2021. "Day-ahead electricity price prediction applying hybrid models of LSTM-based deep learning methods and feature selection algorithms under consideration of market coupling," Papers 2101.05249, arXiv.org, revised Jul 2021.
    14. Mehmet Güney Celbiş & Pui‐hang Wong & Karima Kourtit & Peter Nijkamp, 2023. "Impacts of the COVID‐19 outbreak on older‐age cohorts in European Labor Markets: A machine learning exploration of vulnerable groups," Regional Science Policy & Practice, Wiley Blackwell, vol. 15(3), pages 559-584, April.
    15. Julliana Gonçalves Marques & Luiz Affonso Guedes & Márjory Cristiany da Costa Abreu, 2022. "Evaluating Time Influence over Performance of Machine-Learning-Based Diagnosis: A Case Study of COVID-19 Pandemic in Brazil," IJERPH, MDPI, vol. 20(1), pages 1-14, December.
    16. Aras, Serkan & Hanifi Van, M., 2022. "An interpretable forecasting framework for energy consumption and CO2 emissions," Applied Energy, Elsevier, vol. 328(C).
    17. Buckmann, Marcus & Joseph, Andreas, 2022. "An interpretable machine learning workflow with an application to economic forecasting," Bank of England working papers 984, Bank of England.
    18. Disha Bhattacharyya & Sudeep Pradhan & Shabbiruddin, 2023. "Barriers in Replacement of Conventional Vehicles by Electric Vehicles in India: A Decision-Making Approach," International Journal of Decision Support System Technology (IJDSST), IGI Global, vol. 15(1), pages 1-20, January.
    19. Masayoshi Mase & Art B. Owen & Benjamin B. Seiler, 2022. "Variable importance without impossible data," Papers 2205.15750, arXiv.org, revised Apr 2023.
    20. Hu'e Sullivan & Hurlin Christophe & P'erignon Christophe & Saurin S'ebastien, 2022. "Measuring the Driving Forces of Predictive Performance: Application to Credit Scoring," Papers 2212.05866, arXiv.org, revised Jun 2023.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dangxing Chen, 2023. "Can I Trust the Explanations? Investigating Explainable Machine Learning Methods for Monotonic Models," Papers 2309.13246, arXiv.org.
    2. Yves Sprumont, 2010. "An Axiomatization of the Serial Cost-Sharing Method," Econometrica, Econometric Society, vol. 78(5), pages 1711-1748, September.
    3. Juarez, Ruben & Ko, Chiu Yu & Xue, Jingyi, 2018. "Sharing sequential values in a network," Journal of Economic Theory, Elsevier, vol. 177(C), pages 734-779.
    4. Hervé Moulin & Yves Sprumont, 2007. "Fair allocation of production externalities : recent results," Revue d'économie politique, Dalloz, vol. 117(1), pages 7-36.
    5. Hougaard, Jens Leth & Tind, Jørgen, 2009. "Cost allocation and convex data envelopment," European Journal of Operational Research, Elsevier, vol. 194(3), pages 939-947, May.
    6. Koster, M.A.L., 1998. "Multi-Service Serial Cost Sharing : A Characterization of the Moulin-Shenker Rule," Discussion Paper 1998-06, Tilburg University, Center for Economic Research.
    7. Friedman, Eric J., 2012. "Asymmetric Cost Sharing mechanisms," Games and Economic Behavior, Elsevier, vol. 75(1), pages 139-151.
    8. Robert John Kolesar & Peter Bogetoft & Vanara Chea & Guido Erreygers & Sambo Pheakdey, 2022. "Advancing universal health coverage in the COVID-19 era: an assessment of public health services technical efficiency and applied cost allocation in Cambodia," Health Economics Review, Springer, vol. 12(1), pages 1-20, December.
    9. Hougaard, Jens Leth & Moreno-Ternero, Juan D. & Tvede, Mich & Østerdal, Lars Peter, 2017. "Sharing the proceeds from a hierarchical venture," Games and Economic Behavior, Elsevier, vol. 102(C), pages 98-110.
    10. Eric Bahel & Christian Trudeau, 2018. "Consistency requirements and pattern methods in cost sharing problems with technological cooperation," International Journal of Game Theory, Springer;Game Theory Society, vol. 47(3), pages 737-765, September.
    11. Friedman, Eric J., 2002. "Strategic properties of heterogeneous serial cost sharing," Mathematical Social Sciences, Elsevier, vol. 44(2), pages 145-154, November.
    12. Chen, Yan, 2003. "An experimental study of serial and average cost pricing mechanisms," Journal of Public Economics, Elsevier, vol. 87(9-10), pages 2305-2335, September.
    13. Watts, Alison, 2002. "Uniqueness of equilibrium in cost sharing games," Journal of Mathematical Economics, Elsevier, vol. 37(1), pages 47-70, February.
    14. Wang, Yun-Tong & Zhu, Daxin, 2002. "Ordinal proportional cost sharing," Journal of Mathematical Economics, Elsevier, vol. 37(3), pages 215-230, May.
    15. Mariusz Kaleta & Eugeniusz Toczyłowski, 2009. "A cost allocation framework for LP and GLP games," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 19(4), pages 27-46.
    16. Koster, M.A.L., 1998. "Multi-Service Serial Cost Sharing : A Characterization of the Moulin-Shenker Rule," Other publications TiSEM 4d029e40-e4e7-4f90-b963-d, Tilburg University, School of Economics and Management.
    17. Kumar, Rajnish, 2013. "Secure implementation in production economies," Mathematical Social Sciences, Elsevier, vol. 66(3), pages 372-378.
    18. Moulin, Herve, 2002. "Axiomatic cost and surplus sharing," Handbook of Social Choice and Welfare, in: K. J. Arrow & A. K. Sen & K. Suzumura (ed.), Handbook of Social Choice and Welfare, edition 1, volume 1, chapter 6, pages 289-357, Elsevier.
    19. Moulin, Herve & Sprumont, Yves, 2006. "Responsibility and cross-subsidization in cost sharing," Games and Economic Behavior, Elsevier, vol. 55(1), pages 152-188, April.
    20. Eric Friedman, 1997. "Weak and Strong Consistency in Additive Cost Sharing," Departmental Working Papers 199707, Rutgers University, Department of Economics.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1908.08474. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.