IDEAS home Printed from https://ideas.repec.org/a/wsi/igtrxx/v24y2022i04ns0219198922500153.html
   My bibliography  Save this article

A Shapley Value Index for Market Basket Analysis: Efficient Computation Using an Harsanyi Dividend Representation

Author

Listed:
  • Jayden Fitzsimon

    (School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Queensland, Australia)

  • Shrikant Agrawal

    (School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Queensland, Australia)

  • Kirti Khade

    (School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Queensland, Australia)

  • Evan Shellshear

    (��Biarri, Brisbane, Queensland, Australia)

  • Jonathon Allport

    (��Biarri, Brisbane, Queensland, Australia)

  • Archie C. Chapman

    (School of Information Technology and Electrical Engineering, The University of Queensland, Brisbane, Queensland, Australia)

Abstract

Market basket analysis (MBA) aims to discover purchasing patterns and item associations from customer transaction data. A major drawback of current techniques for MBA is a lack of quantitative metrics to measure the real value associated with basket items. This paper addresses this gap by deriving a practical game-theoretic measure for MBA based on the Shapley value of cooperative games, which we call Shapley value index for MBA (SIMBA). The SIMBA of an item represents the average revenue it earns, including its influence on the revenue earned from sales of other items. A significant challenge when applying Shapley value-inspired approaches in practical domains is the exponential complexity of Shapley value computation. However, for the MBA domain, we show that SIMBA admits a scalable exact computation method that does not require sampling or other approximations. Specifically, a characteristic function for the MBA game is constructed so that the transaction dataset input corresponds to the game’s Harsanyi dividends. The relationship between Harsanyi dividends and the Shapley value is then exploited to efficiently compute SIMBA. This approach scales linearly in the number of transactions, making SIMBA a feasible approach for quantitative MBA. SIMBA can be used to screen conventional MBA techniques, such as association rules, to identify significant rules based on the items’ cross-selling capacity. This combination of existing MBA methods and SIMBA will generate rules based not only on frequency of co-occurrence, but also on the significance of the items. We demonstrate the working of the algorithm by analyzing openly available transaction data from an online retail store. To the best of our knowledge, this is the first time Shapley value is used in this way to solve market basket analyses of a practical size.

Suggested Citation

  • Jayden Fitzsimon & Shrikant Agrawal & Kirti Khade & Evan Shellshear & Jonathon Allport & Archie C. Chapman, 2022. "A Shapley Value Index for Market Basket Analysis: Efficient Computation Using an Harsanyi Dividend Representation," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 24(04), pages 1-29, December.
  • Handle: RePEc:wsi:igtrxx:v:24:y:2022:i:04:n:s0219198922500153
    DOI: 10.1142/S0219198922500153
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219198922500153
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219198922500153?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:igtrxx:v:24:y:2022:i:04:n:s0219198922500153. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/igtr/igtr.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.