IDEAS home Printed from https://ideas.repec.org/p/bay/rdwiwi/34994.html
   My bibliography  Save this paper

Hidden Variable Models for Market Basket Data. Statistical Performance and Managerial Implications

Author

Listed:
  • Hruschka, Harald

Abstract

We compare the performance of several hidden variable models, namely binary factor analysis, topic models (latent Dirichlet allocation, correlated topic model), the restricted Boltzmann machine and the deep belief net. We shortly present these models and outline their estimation. Performance is measured by log likelihood values of these models for a holdout data set of market baskets. For each model we estimate and evaluate variants with increasing numbers of hidden variables. Binary factor analysis vastly outperforms topic models. The restricted Boltzmann machine and the deep belief net on the other hand attain a similar performance advantage over binary factor analysis. For each model we interpret the relationships between the most important hidden variables and observed category purchases. To demonstrate managerial implications we compute relative basket size increase due to promoting each category for the better performing models. Recommendations based on the restricted Boltzmann machine and the deep belief net not only have lower uncertainty due to their statistical performance, they also have more managerial appeal than those derived for binary factor analysis. The impressive performances of the restricted Boltzmann machine and the deep belief net suggest to continue research by extending these models, e.g., by including marketing variables as predictors.

Suggested Citation

  • Hruschka, Harald, 2016. "Hidden Variable Models for Market Basket Data. Statistical Performance and Managerial Implications," University of Regensburg Working Papers in Business, Economics and Management Information Systems 489, University of Regensburg, Department of Economics.
  • Handle: RePEc:bay:rdwiwi:34994
    as

    Download full text from publisher

    File URL: https://epub.uni-regensburg.de/34994/1/dbn_baksets_diskp_all.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Bruno J.D. Jacobs & Bas Donkers & Dennis Fok, 2016. "Model-Based Purchase Predictions for Large Assortments," Marketing Science, INFORMS, vol. 35(3), pages 389-404, May.
    2. P. Seetharaman & Siddhartha Chib & Andrew Ainslie & Peter Boatwright & Tat Chan & Sachin Gupta & Nitin Mehta & Vithala Rao & Andrei Strijnev, 2005. "Models of Multi-Category Choice Behavior," Marketing Letters, Springer, vol. 16(3), pages 239-254, December.
    3. Chalmers, R. Philip, 2012. "mirt: A Multidimensional Item Response Theory Package for the R Environment," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i06).
    4. Grün, Bettina & Hornik, Kurt, 2011. "topicmodels: An R Package for Fitting Topic Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 40(i13).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mariflor Vega Carrasco & Ioanna Manolopoulou & Jason O'Sullivan & Rosie Prior & Mirco Musolesi, 2022. "Posterior summaries of grocery retail topic models: Evaluation, interpretability and credibility," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(3), pages 562-588, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Harald Hruschka, 2021. "Comparing unsupervised probabilistic machine learning methods for market basket analysis," Review of Managerial Science, Springer, vol. 15(2), pages 497-527, February.
    2. Andreas Falke & Harald Hruschka, 2022. "Analyzing browsing across websites by machine learning methods," Journal of Business Economics, Springer, vol. 92(5), pages 829-852, July.
    3. Justyna Klejdysz & Robin L. Lumsdaine, 2023. "Shifts in ECB Communication: A Textual Analysis of the Press Conference," International Journal of Central Banking, International Journal of Central Banking, vol. 19(2), pages 473-542, June.
    4. Schröder, Nadine & Falke, Andreas & Hruschka, Harald & Reutterer, Thomas, 2019. "Analyzing the Browsing Basket: A Latent Interests-Based Segmentation Tool," Journal of Interactive Marketing, Elsevier, vol. 47(C), pages 181-197.
    5. Martin Reisenbichler & Thomas Reutterer, 2019. "Topic modeling in marketing: recent advances and research opportunities," Journal of Business Economics, Springer, vol. 89(3), pages 327-356, April.
    6. Izolda Pristojkovic Suko & Magdalena Holter & Erwin Stolz & Elfriede Renate Greimel & Wolfgang Freidl, 2022. "Acculturation, Adaptation, and Health among Croatian Migrants in Austria and Ireland: A Cross-Sectional Study," IJERPH, MDPI, vol. 19(24), pages 1-15, December.
    7. Sandra Wankmüller, 2023. "A comparison of approaches for imbalanced classification problems in the context of retrieving relevant documents for an analysis," Journal of Computational Social Science, Springer, vol. 6(1), pages 91-163, April.
    8. Nana Kim & Daniel M. Bolt & James Wollack, 2022. "Noncompensatory MIRT For Passage-Based Tests," Psychometrika, Springer;The Psychometric Society, vol. 87(3), pages 992-1009, September.
    9. Arsenyan, Jbid & Mirowska, Agata & Piepenbrink, Anke, 2023. "Close encounters with the virtual kind: Defining a human-virtual agent coexistence framework," Technological Forecasting and Social Change, Elsevier, vol. 193(C).
    10. Sutthipong Meeyai, 2015. "Modeling Store Patronage: A Systematic Review," International Conference on Marketing and Business Development Journal, The Bucharest University of Economic Studies, vol. 1(1), pages 40-48, July.
    11. Mi Jung Lee & Daejin Kim & Sergio Romero & Ickpyo Hong & Nikolay Bliznyuk & Craig Velozo, 2022. "Examining Older Adults’ Home Functioning Using the American Housing Survey," IJERPH, MDPI, vol. 19(8), pages 1-13, April.
    12. Hong Joo Lee & Hoyeon Oh, 2020. "A Study on the Deduction and Diffusion of Promising Artificial Intelligence Technology for Sustainable Industrial Development," Sustainability, MDPI, vol. 12(14), pages 1-15, July.
    13. Maksym Polyakov & Morteza Chalak & Md. Sayed Iftekhar & Ram Pandit & Sorada Tapsuwan & Fan Zhang & Chunbo Ma, 2018. "Authorship, Collaboration, Topics, and Research Gaps in Environmental and Resource Economics 1991–2015," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 71(1), pages 217-239, September.
    14. Stefano Sbalchiero & Maciej Eder, 2020. "Topic modeling, long texts and the best number of topics. Some Problems and solutions," Quality & Quantity: International Journal of Methodology, Springer, vol. 54(4), pages 1095-1108, August.
    15. Martin Baumgaertner & Johannes Zahner, 2021. "Whatever it takes to understand a central banker - Embedding their words using neural networks," MAGKS Papers on Economics 202130, Philipps-Universität Marburg, Faculty of Business Administration and Economics, Department of Economics (Volkswirtschaftliche Abteilung).
    16. Daoud, Adel & Kohl, Sebastian, 2016. "How much do sociologists write about economic topics? Using big data to test some conventional views in economic sociology, 1890 to 2014," MPIfG Discussion Paper 16/7, Max Planck Institute for the Study of Societies.
    17. Shr-Wei Kao & Pin Luarn, 2020. "Topic Modeling Analysis of Social Enterprises: Twitter Evidence," Sustainability, MDPI, vol. 12(8), pages 1-20, April.
    18. Qian Wu & Monique Vanerum & Anouk Agten & Andrés Christiansen & Frank Vandenabeele & Jean-Michel Rigo & Rianne Janssen, 2021. "Certainty-Based Marking on Multiple-Choice Items: Psychometrics Meets Decision Theory," Psychometrika, Springer;The Psychometric Society, vol. 86(2), pages 518-543, June.
    19. Renan P. Monteiro & Gabriel Lins de Holanda Coelho & Paul H. P. Hanel & Emerson Diógenes Medeiros & Phillip Dyamond Gomes Silva, 2022. "The Efficient Assessment of Self-Esteem: Proposing the Brief Rosenberg Self-Esteem Scale," Applied Research in Quality of Life, Springer;International Society for Quality-of-Life Studies, vol. 17(2), pages 931-947, April.
    20. Melissa Gladstone & Gillian Lancaster & Gareth McCray & Vanessa Cavallera & Claudia R. L. Alves & Limbika Maliwichi & Muneera A. Rasheed & Tarun Dua & Magdalena Janus & Patricia Kariger, 2021. "Validation of the Infant and Young Child Development (IYCD) Indicators in Three Countries: Brazil, Malawi and Pakistan," IJERPH, MDPI, vol. 18(11), pages 1-19, June.

    More about this item

    Keywords

    Marketing; Market Basket Analysis; Factor Analysis; Topic Models; Restricted Boltzmann Machine; Deep Belief Net;
    All these keywords.

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bay:rdwiwi:34994. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Gernot Deinzer (email available below). General contact details of provider: https://edirc.repec.org/data/wfregde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.