IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i10p1581-d1653461.html
   My bibliography  Save this article

The Shapley Value in Data Science: Advances in Computation, Extensions, and Applications

Author

Listed:
  • Lei Qin

    (School of Statistics, University of International Business and Economics, Beijing 100029, China
    Dong Fureng Institute of Economic and Social Development, Wuhan University, Wuhan 430072, China)

  • Yingqiu Zhu

    (School of Statistics, University of International Business and Economics, Beijing 100029, China)

  • Shaonan Liu

    (School of Statistics, University of International Business and Economics, Beijing 100029, China)

  • Xingjian Zhang

    (School of Statistics, University of International Business and Economics, Beijing 100029, China)

  • Yining Zhao

    (Sunwah International Business School, Liaoning University, Shenyang 110036, China)

Abstract

The Shapley value is a fundamental concept in data science, providing a principled framework for fair resource allocation, feature importance quantification, and improved interpretability of complex models. Its fundamental theory is based on four axiomatic proper ties, which underpin its widespread application. To address the inherent computational challenges of exact calculation, we discuss model-agnostic approximation techniques, such as Random Order Value, Least Squares Value, and Multilinear Extension Sampling, as well as specialized fast algorithms for linear, tree-based, and deep learning models. Recent extensions, such as Distributional Shapley and Weighted Shapley, have broadened the applications to data valuation, reinforcement learning, feature interaction analysis, and multi-party cooperation. Practical effectiveness has been demonstrated in health care, finance, industry, and the digital economy, with promising future directions for incorporating these techniques into emerging fields, such as data asset pricing and trading.

Suggested Citation

  • Lei Qin & Yingqiu Zhu & Shaonan Liu & Xingjian Zhang & Yining Zhao, 2025. "The Shapley Value in Data Science: Advances in Computation, Extensions, and Applications," Mathematics, MDPI, vol. 13(10), pages 1-21, May.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:10:p:1581-:d:1653461
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/10/1581/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/10/1581/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Luigi Zingales, 1995. "What Determines the Value of Corporate Votes?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 110(4), pages 1047-1073.
    2. Jackson, Matthew O., 2005. "Allocation rules for network games," Games and Economic Behavior, Elsevier, vol. 51(1), pages 128-154, April.
    3. Hashem Omrani & Mohaddeseh Amini & Mahdieh Babaei & Khatereh Shafaat, 2020. "Use Shapley value for increasing power distinguish of data envelopment analysis model: An application for estimating environmental efficiency of industrial producers in Iran," Energy & Environment, , vol. 31(4), pages 656-675, June.
    4. Roth, Alvin E, 1977. "The Shapley Value as a von Neumann-Morgenstern Utility," Econometrica, Econometric Society, vol. 45(3), pages 657-664, April.
    5. Riccardo Colini-Baldeschi & Marco Scarsini & Stefano Vaccari, 2018. "Variance Allocation and Shapley Value," Methodology and Computing in Applied Probability, Springer, vol. 20(3), pages 919-933, September.
    6. Haim Shalit, 2020. "Using the Shapley value of stocks as systematic risk," Journal of Risk Finance, Emerald Group Publishing Limited, vol. 21(4), pages 459-468, October.
    7. Gately, Dermot, 1974. "Sharing the Gains from Regional Cooperation: A Game Theoretic Application to Planning Investment in Electric Power," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 15(1), pages 195-208, February.
    8. Karl Michael Ortmann, 2016. "The link between the Shapley value and the beta factor," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 39(2), pages 311-325, November.
    9. Lemaire, Jean, 1984. "An Application of Game Theory: Cost Allocation," ASTIN Bulletin, Cambridge University Press, vol. 14(1), pages 61-81, April.
    10. Shapley, L. S. & Shubik, Martin, 1954. "A Method for Evaluating the Distribution of Power in a Committee System," American Political Science Review, Cambridge University Press, vol. 48(3), pages 787-792, September.
    11. Sanjith Gopalakrishnan & Daniel Granot & Frieda Granot & Greys Sošić & Hailong Cui, 2021. "Incentives and Emission Responsibility Allocation in Supply Chains," Management Science, INFORMS, vol. 67(7), pages 4172-4190, July.
    12. Guillermo Owen, 1972. "Multilinear Extensions of Games," Management Science, INFORMS, vol. 18(5-Part-2), pages 64-79, January.
    13. Jennifer K. Ryan & Lusheng Shao & Daewon Sun, 2022. "Contracting Mechanisms for Stable Sourcing Networks," Manufacturing & Service Operations Management, INFORMS, vol. 24(5), pages 2558-2576, September.
    14. Ruiz, Luis M. & Valenciano, Federico & Zarzuelo, Jose M., 1998. "The Family of Least Square Values for Transferable Utility Games," Games and Economic Behavior, Elsevier, vol. 24(1-2), pages 109-130, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Haim Shalit, 2021. "The Shapley value decomposition of optimal portfolios," Annals of Finance, Springer, vol. 17(1), pages 1-25, March.
    2. Benjamin R. Auer & Tobias Hiller, 2021. "Cost gap, Shapley, or nucleolus allocation: Which is the best game‐theoretic remedy for the low‐risk anomaly?," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 42(4), pages 876-884, June.
    3. Taylan Mavruk & Conny Overland & Stefan Sjögren, 2020. "Keeping it real or keeping it simple? Ownership concentration measures compared," European Financial Management, European Financial Management Association, vol. 26(4), pages 958-1005, September.
    4. Flores Díaz, Ramón Jesús & Molina, Elisenda & Tejada, Juan, 2013. "The Shapley group value," DES - Working Papers. Statistics and Econometrics. WS ws133430, Universidad Carlos III de Madrid. Departamento de Estadística.
    5. Carreras, Francesc & Giménez, José Miguel, 2011. "Power and potential maps induced by any semivalue: Some algebraic properties and computation by multilinear extensions," European Journal of Operational Research, Elsevier, vol. 211(1), pages 148-159, May.
    6. Elli Kraizberg, 2016. "Portfolio Management and Appropriation of Private Benefits of Control," Journal of Business, LAR Center Press, vol. 1(1), pages 60-72, March.
    7. Mika Widgrén, 2008. "The Impact of Council's Internal Decision-Making Rules on the Future EU," Discussion Papers 26, Aboa Centre for Economics.
    8. Takayuki Mizuno & Shohei Doi & Shuhei Kurizaki, 2020. "The power of corporate control in the global ownership network," PLOS ONE, Public Library of Science, vol. 15(8), pages 1-19, August.
    9. Gary Gorton & Frank Schmid, 2000. "Class Struggle Inside the Firm: A Study of German Codetermination," NBER Working Papers 7945, National Bureau of Economic Research, Inc.
    10. Yuto Ushioda & Masato Tanaka & Tomomi Matsui, 2022. "Monte Carlo Methods for the Shapley–Shubik Power Index," Games, MDPI, vol. 13(3), pages 1-14, June.
    11. Le Breton, Michel & Montero, Maria & Zaporozhets, Vera, 2012. "Voting power in the EU council of ministers and fair decision making in distributive politics," Mathematical Social Sciences, Elsevier, vol. 63(2), pages 159-173.
    12. D. Kilgour & Terrence Levesque, 1984. "The Canadian constitutional amending formula: Bargaining in the past and the future," Public Choice, Springer, vol. 44(3), pages 457-480, January.
    13. Serguei Kaniovski, 2008. "The exact bias of the Banzhaf measure of power when votes are neither equiprobable nor independent," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 31(2), pages 281-300, August.
    14. Stefan Napel & Mika Widgrén, 2011. "Strategic versus non-strategic voting power in the EU Council of Ministers: the consultation procedure," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 37(3), pages 511-541, September.
    15. Vito Fragnelli & Gianfranco Gambarelli, 2014. "Further open problems in cooperative games," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 24(4), pages 51-62.
    16. Fabrice Barthelemy & Mathieu Martin & Bertrand Tchantcho, 2011. "Some conjectures on the two main power indices," THEMA Working Papers 2011-14, THEMA (THéorie Economique, Modélisation et Applications), Université de Cergy-Pontoise.
    17. Hubert, Franz & Orlova, Ekaterina, 2018. "Network access and market power," Energy Economics, Elsevier, vol. 76(C), pages 170-185.
    18. M. J. Albizuri & A. Goikoetxea, 2021. "The Owen–Shapley Spatial Power Index in Three-Dimensional Space," Group Decision and Negotiation, Springer, vol. 30(5), pages 1027-1055, October.
    19. Crespi, R. & Renneboog, L.D.R., 2000. "United we stand : Corporate Monitoring by Shareholder Coalitions in the UK," Other publications TiSEM 226b4a58-7d8a-436c-8376-c, Tilburg University, School of Economics and Management.
    20. Josep Freixas & Montserrat Pons, 2017. "Using the Multilinear Extension to Study Some Probabilistic Power Indices," Group Decision and Negotiation, Springer, vol. 26(3), pages 437-452, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2025:i:10:p:1581-:d:1653461. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.