IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i10p1581-d1653461.html

The Shapley Value in Data Science: Advances in Computation, Extensions, and Applications

Author

Listed:
  • Lei Qin

    (School of Statistics, University of International Business and Economics, Beijing 100029, China
    Dong Fureng Institute of Economic and Social Development, Wuhan University, Wuhan 430072, China)

  • Yingqiu Zhu

    (School of Statistics, University of International Business and Economics, Beijing 100029, China)

  • Shaonan Liu

    (School of Statistics, University of International Business and Economics, Beijing 100029, China)

  • Xingjian Zhang

    (School of Statistics, University of International Business and Economics, Beijing 100029, China)

  • Yining Zhao

    (Sunwah International Business School, Liaoning University, Shenyang 110036, China)

Abstract

The Shapley value is a fundamental concept in data science, providing a principled framework for fair resource allocation, feature importance quantification, and improved interpretability of complex models. Its fundamental theory is based on four axiomatic properties, which underpin its widespread application. To address the inherent computational challenges of exact calculation, we discuss model-agnostic approximation techniques, such as Random Order Value, Least Squares Value, and Multilinear Extension Sampling, as well as specialized fast algorithms for linear, tree-based, and deep learning models. Recent extensions, such as Distributional Shapley and Weighted Shapley, have broadened the applications to data valuation, reinforcement learning, feature interaction analysis, and multi-party cooperation. Practical effectiveness has been demonstrated in health care, finance, industry, and the digital economy, with promising future directions for incorporating these techniques into emerging fields, such as data asset pricing and trading.
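
The computational challenge mentioned in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper: the function names, the toy game, and the sample size are all assumptions chosen for illustration. The first function computes exact Shapley values by enumerating every coalition, which costs exponential time in the number of players; the second averages marginal contributions over random player orderings, a standard Monte Carlo estimate in the spirit of the random-order view of the Shapley value.

```python
import itertools
import math
import random


def exact_shapley(players, value):
    """Exact Shapley values by enumerating every coalition (exponential cost)."""
    n = len(players)
    phi = {p: 0.0 for p in players}
    for p in players:
        others = [q for q in players if q != p]
        for k in range(n):
            # Weight of a size-k coalition that excludes p: k!(n-k-1)!/n!
            weight = math.factorial(k) * math.factorial(n - k - 1) / math.factorial(n)
            for coalition in itertools.combinations(others, k):
                s = frozenset(coalition)
                phi[p] += weight * (value(s | {p}) - value(s))
    return phi


def sampled_shapley(players, value, num_permutations=2000, seed=0):
    """Monte Carlo estimate: average marginal contributions over random orderings."""
    rng = random.Random(seed)
    phi = {p: 0.0 for p in players}
    order = list(players)
    for _ in range(num_permutations):
        rng.shuffle(order)
        coalition = frozenset()
        previous = value(coalition)
        for p in order:
            coalition = coalition | {p}
            current = value(coalition)
            phi[p] += current - previous  # marginal contribution of p in this ordering
            previous = current
    return {p: total / num_permutations for p, total in phi.items()}


def toy_value(coalition):
    """Hypothetical game: "a" and "b" earn 4 only together; "c" earns 1 alone."""
    v = 0.0
    if "a" in coalition and "b" in coalition:
        v += 4.0
    if "c" in coalition:
        v += 1.0
    return v


players = ["a", "b", "c"]
exact = exact_shapley(players, toy_value)
approx = sampled_shapley(players, toy_value)
print("exact: ", exact)
print("approx:", approx)
```

In this toy game the exact values split the joint payoff of "a" and "b" equally and give "c" its standalone contribution, and both sets of values sum to the value of the full coalition, which is the efficiency property. The sampled estimate replaces the exponential enumeration with a fixed number of orderings at the price of sampling error.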

Suggested Citation

  • Lei Qin & Yingqiu Zhu & Shaonan Liu & Xingjian Zhang & Yining Zhao, 2025. "The Shapley Value in Data Science: Advances in Computation, Extensions, and Applications," Mathematics, MDPI, vol. 13(10), pages 1-21, May.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:10:p:1581-:d:1653461

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/10/1581/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/10/1581/
    Download Restriction: no

    References listed on IDEAS

    1. Luigi Zingales, 1995. "What Determines the Value of Corporate Votes?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 110(4), pages 1047-1073.
    2. Jackson, Matthew O., 2005. "Allocation rules for network games," Games and Economic Behavior, Elsevier, vol. 51(1), pages 128-154, April.
    3. Hashem Omrani & Mohaddeseh Amini & Mahdieh Babaei & Khatereh Shafaat, 2020. "Use Shapley value for increasing power distinguish of data envelopment analysis model: An application for estimating environmental efficiency of industrial producers in Iran," Energy & Environment, vol. 31(4), pages 656-675, June.
    4. Roth, Alvin E, 1977. "The Shapley Value as a von Neumann-Morgenstern Utility," Econometrica, Econometric Society, vol. 45(3), pages 657-664, April.
    5. Riccardo Colini-Baldeschi & Marco Scarsini & Stefano Vaccari, 2018. "Variance Allocation and Shapley Value," Methodology and Computing in Applied Probability, Springer, vol. 20(3), pages 919-933, September.
    6. Shapley, L. S. & Shubik, Martin, 1954. "A Method for Evaluating the Distribution of Power in a Committee System," American Political Science Review, Cambridge University Press, vol. 48(3), pages 787-792, September.
    7. Sanjith Gopalakrishnan & Daniel Granot & Frieda Granot & Greys Sošić & Hailong Cui, 2021. "Incentives and Emission Responsibility Allocation in Supply Chains," Management Science, INFORMS, vol. 67(7), pages 4172-4190, July.
    8. Haim Shalit, 2020. "Using the Shapley value of stocks as systematic risk," Journal of Risk Finance, Emerald Group Publishing Limited, vol. 21(4), pages 459-468, October.
    9. Gately, Dermot, 1974. "Sharing the Gains from Regional Cooperation: A Game Theoretic Application to Planning Investment in Electric Power," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 15(1), pages 195-208, February.
    10. Karl Michael Ortmann, 2016. "The link between the Shapley value and the beta factor," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 39(2), pages 311-325, November.
    11. Guillermo Owen, 1972. "Multilinear Extensions of Games," Management Science, INFORMS, vol. 18(5-Part-2), pages 64-79, January.
    12. Lemaire, Jean, 1984. "An Application of Game Theory: Cost Allocation," ASTIN Bulletin, Cambridge University Press, vol. 14(1), pages 61-81, April.
    13. Jennifer K. Ryan & Lusheng Shao & Daewon Sun, 2022. "Contracting Mechanisms for Stable Sourcing Networks," Manufacturing & Service Operations Management, INFORMS, vol. 24(5), pages 2558-2576, September.
    14. Ruiz, Luis M. & Valenciano, Federico & Zarzuelo, Jose M., 1998. "The Family of Least Square Values for Transferable Utility Games," Games and Economic Behavior, Elsevier, vol. 24(1-2), pages 109-130, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Benjamin R. Auer & Tobias Hiller, 2021. "Cost gap, Shapley, or nucleolus allocation: Which is the best game‐theoretic remedy for the low‐risk anomaly?," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 42(4), pages 876-884, June.
    2. Haim Shalit, 2021. "The Shapley value decomposition of optimal portfolios," Annals of Finance, Springer, vol. 17(1), pages 1-25, March.
    3. Taylan Mavruk & Conny Overland & Stefan Sjögren, 2020. "Keeping it real or keeping it simple? Ownership concentration measures compared," European Financial Management, European Financial Management Association, vol. 26(4), pages 958-1005, September.
    4. Elli Kraizberg, 2016. "Portfolio Management and Appropriation of Private Benefits of Control," Journal of Business, LAR Center Press, vol. 1(1), pages 60-72, March.
    5. Flores Díaz, Ramón Jesús & Molina, Elisenda & Tejada, Juan, 2013. "The Shapley group value," DES - Working Papers. Statistics and Econometrics. WS ws133430, Universidad Carlos III de Madrid. Departamento de Estadística.
    6. Carreras, Francesc & Giménez, José Miguel, 2011. "Power and potential maps induced by any semivalue: Some algebraic properties and computation by multilinear extensions," European Journal of Operational Research, Elsevier, vol. 211(1), pages 148-159, May.
    7. Le Breton, Michel & Montero, Maria & Zaporozhets, Vera, 2012. "Voting power in the EU council of ministers and fair decision making in distributive politics," Mathematical Social Sciences, Elsevier, vol. 63(2), pages 159-173.
    8. D. Kilgour & Terrence Levesque, 1984. "The Canadian constitutional amending formula: Bargaining in the past and the future," Public Choice, Springer, vol. 44(3), pages 457-480, January.
    9. Serguei Kaniovski, 2008. "The exact bias of the Banzhaf measure of power when votes are neither equiprobable nor independent," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 31(2), pages 281-300, August.
    10. Vito Fragnelli & Gianfranco Gambarelli, 2014. "Further open problems in cooperative games," Operations Research and Decisions, Wroclaw University of Science and Technology, Faculty of Management, vol. 24(4), pages 51-62.
    11. Crespi, R. & Renneboog, L.D.R., 2000. "United we stand : Corporate Monitoring by Shareholder Coalitions in the UK," Other publications TiSEM 226b4a58-7d8a-436c-8376-c, Tilburg University, School of Economics and Management.
    12. Josep Freixas & Montserrat Pons, 2017. "Using the Multilinear Extension to Study Some Probabilistic Power Indices," Group Decision and Negotiation, Springer, vol. 26(3), pages 437-452, May.
    13. Widgrén, Mika, 2008. "The Impact of Council Voting Rules on EU Decision-Making," Discussion Papers 1162, The Research Institute of the Finnish Economy.
    14. Maria Montero & Martin Sefton & Ping Zhang, 2008. "Enlargement and the balance of power: an experimental study," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 30(1), pages 69-87, January.
    15. Alonso-Meijide, J.M. & Casas-Mendez, B. & Holler, M.J. & Lorenzo-Freire, S., 2008. "Computing power indices: Multilinear extensions and new characterizations," European Journal of Operational Research, Elsevier, vol. 188(2), pages 540-554, July.
    16. Roberto Roson & Franz Hubert, 2015. "Bargaining Power and Value Sharing in Distribution Networks: A Cooperative Game Theory Approach," Networks and Spatial Economics, Springer, vol. 15(1), pages 71-87, March.
    17. Surajit Borkotokey & Sujata Goala & Niharika Kakoty & Parishmita Boruah, 2022. "The component-wise egalitarian Myerson value for Network Games," Papers 2201.02793, arXiv.org.
    18. Calvo, Emilio & Lasaga, Javier & van den Nouweland, Anne, 1999. "Values of games with probabilistic graphs," Mathematical Social Sciences, Elsevier, vol. 37(1), pages 79-95, January.
    19. Nobel Prize Committee, 2012. "Alvin E. Roth and Lloyd S. Shapley: Stable allocations and the practice of market design," Nobel Prize in Economics documents 2012-1, Nobel Prize Committee.
    20. Borkotokey, Surajit & Chakrabarti, Subhadip & Gilles, Robert P. & Gogoi, Loyimee & Kumar, Rajnish, 2021. "Probabilistic network values," Mathematical Social Sciences, Elsevier, vol. 113(C), pages 169-180.

