Printed from https://ideas.repec.org/a/gam/jmathe/v13y2025i10p1581-d1653461.html

The Shapley Value in Data Science: Advances in Computation, Extensions, and Applications

Authors

  • Lei Qin

    (School of Statistics, University of International Business and Economics, Beijing 100029, China;
    Dong Fureng Institute of Economic and Social Development, Wuhan University, Wuhan 430072, China)

  • Yingqiu Zhu

    (School of Statistics, University of International Business and Economics, Beijing 100029, China)

  • Shaonan Liu

    (School of Statistics, University of International Business and Economics, Beijing 100029, China)

  • Xingjian Zhang

    (School of Statistics, University of International Business and Economics, Beijing 100029, China)

  • Yining Zhao

    (Sunwah International Business School, Liaoning University, Shenyang 110036, China)

Abstract

The Shapley value is a fundamental concept in data science, providing a principled framework for fair resource allocation, feature importance quantification, and improved interpretability of complex models. Its fundamental theory is based on four axiomatic properties, which underpin its widespread application. To address the inherent computational challenges of exact calculation, we discuss model-agnostic approximation techniques, such as Random Order Value, Least Squares Value, and Multilinear Extension Sampling, as well as specialized fast algorithms for linear, tree-based, and deep learning models. Recent extensions, such as Distributional Shapley and Weighted Shapley, have broadened the applications to data valuation, reinforcement learning, feature interaction analysis, and multi-party cooperation. Practical effectiveness has been demonstrated in health care, finance, industry, and the digital economy, with promising future directions for incorporating these techniques into emerging fields, such as data asset pricing and trading.
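
The computational challenge mentioned in the abstract can be illustrated with a minimal Python sketch. It is not taken from the article: the function names, the toy game, and the sample size are all assumptions made for illustration. The first function computes exact Shapley values by enumerating every coalition, which costs exponential time in the number of players. The second averages marginal contributions over randomly sampled player orderings, which is the general idea behind random-order (permutation sampling) approximations.

```python
import itertools
import math
import random


def exact_shapley(players, value):
    """Exact Shapley values by enumerating every coalition (exponential cost)."""
    n = len(players)
    phi = {p: 0.0 for p in players}
    for p in players:
        others = [q for q in players if q != p]
        for k in range(n):
            # Weight |S|! (n - |S| - 1)! / n! for coalitions S of size k.
            weight = math.factorial(k) * math.factorial(n - k - 1) / math.factorial(n)
            for subset in itertools.combinations(others, k):
                s = frozenset(subset)
                phi[p] += weight * (value(s | {p}) - value(s))
    return phi


def permutation_shapley(players, value, num_permutations=2000, seed=0):
    """Monte Carlo estimate: average marginal contributions over random orderings."""
    rng = random.Random(seed)
    phi = {p: 0.0 for p in players}
    order = list(players)
    for _ in range(num_permutations):
        rng.shuffle(order)
        coalition = frozenset()
        previous = value(coalition)
        for p in order:
            coalition = coalition | {p}
            current = value(coalition)
            phi[p] += current - previous
            previous = current
    return {p: total / num_permutations for p, total in phi.items()}


def toy_value(coalition):
    """Hypothetical game: additive payoffs plus a bonus when 'a' and 'b' cooperate."""
    base = {"a": 1.0, "b": 2.0, "c": 3.0}
    worth = sum(base[p] for p in coalition)
    if "a" in coalition and "b" in coalition:
        worth += 2.0
    return worth


players = ["a", "b", "c"]
exact = exact_shapley(players, toy_value)
approx = permutation_shapley(players, toy_value)
print("exact: ", exact)
print("approx:", approx)
```

In this toy game the cooperation bonus is split evenly between "a" and "b", so the exact values are 2, 3, and 3, and they sum to the worth of the full coalition (the efficiency property). The sampled estimate also sums to that worth, because each ordering's marginal contributions telescope, but the individual values carry Monte Carlo error that shrinks as the number of sampled orderings grows.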

Suggested Citation

  • Lei Qin & Yingqiu Zhu & Shaonan Liu & Xingjian Zhang & Yining Zhao, 2025. "The Shapley Value in Data Science: Advances in Computation, Extensions, and Applications," Mathematics, MDPI, vol. 13(10), pages 1-21, May.
  • Handle: RePEc:gam:jmathe:v:13:y:2025:i:10:p:1581-:d:1653461

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/10/1581/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/10/1581/
    Download Restriction: no
