IDEAS home Printed from https://ideas.repec.org/p/msh/ebswps/2023-19.html
   My bibliography  Save this paper

A High-dimensional Multinomial Logit Model

Author

Listed:
  • Didier Nibbering

Abstract

The number of parameters in a standard multinomial logit model increases linearly with the number of choice alternatives and number of explanatory variables. Since many modern applications involve large choice sets with categorical explanatory variables, which enter the model as large sets of binary dummies, the number of parameters in a multinomial logit model is often large. This paper proposes a new method for data-driven two-way parameter clustering over outcome categories and explanatory dummy categories in a multinomial logit model. A Bayesian Dirichlet process mixture model encourages parameters to cluster over the categories, which reduces the number of unique model parameters and provides interpretable clusters of categories. In an empirical application, we estimate the holiday preferences of 11 household types over 49 holiday destinations, and identify a small number of household segments with different preferences across clusters of holiday destinations.

Suggested Citation

  • Didier Nibbering, 2023. "A High-dimensional Multinomial Logit Model," Monash Econometrics and Business Statistics Working Papers 19/23, Monash University, Department of Econometrics and Business Statistics.
  • Handle: RePEc:msh:ebswps:2023-19
    as

    Download full text from publisher

    File URL: https://www.monash.edu/business/ebs/research/publications/ebs/2023/wp19-2023.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Luc Bauwens & Jean-François Carpantier & Arnaud Dufays, 2017. "Autoregressive Moving Average Infinite Hidden Markov-Switching Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 35(2), pages 162-182, April.
    2. Vincent, Martin & Hansen, Niels Richard, 2014. "Sparse group lasso and high dimensional multinomial classification," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 771-786.
    3. Richard F. MacLehose & David B. Dunson, 2010. "Bayesian Semiparametric Multiple Shrinkage," Biometrics, The International Biometric Society, vol. 66(2), pages 455-462, June.
    4. Geweke, John & Keane, Michael P & Runkle, David, 1994. "Alternative Computational Approaches to Inference in the Multinomial Probit Model," The Review of Economics and Statistics, MIT Press, vol. 76(4), pages 609-632, November.
    5. Cramer, J. S. & Ridder, G., 1991. "Pooling states in the multinomial logit model," Journal of Econometrics, Elsevier, vol. 47(2-3), pages 267-272, February.
    6. Carson, Richard T. & Louviere, Jordan J., 2014. "Statistical properties of consideration sets," Journal of choice modelling, Elsevier, vol. 13(C), pages 37-48.
    7. Conley, Timothy G. & Hansen, Christian B. & McCulloch, Robert E. & Rossi, Peter E., 2008. "A semi-parametric Bayesian approach to the instrumental variable problem," Journal of Econometrics, Elsevier, vol. 144(1), pages 276-305, May.
    8. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521747387.
    9. Geweke, John, 2007. "Interpretation and inference in mixture models: Simple MCMC works," Computational Statistics & Data Analysis, Elsevier, vol. 51(7), pages 3529-3550, April.
    10. Denzil G. Fiebig & Michael P. Keane & Jordan Louviere & Nada Wasi, 2010. "The Generalized Multinomial Logit Model: Accounting for Scale and Coefficient Heterogeneity," Marketing Science, INFORMS, vol. 29(3), pages 393-421, 05-06.
    11. Bruno J.D. Jacobs & Bas Donkers & Dennis Fok, 2016. "Model-Based Purchase Predictions for Large Assortments," Marketing Science, INFORMS, vol. 35(3), pages 389-404, May.
    12. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
    13. Newey, Whitney & West, Kenneth, 2014. "A simple, positive semi-definite, heteroscedasticity and autocorrelation consistent covariance matrix," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 33(1), pages 125-132.
    14. Howard D. Bondell & Brian J. Reich, 2009. "Simultaneous Factor Selection and Collapsing Levels in ANOVA," Biometrics, The International Biometric Society, vol. 65(1), pages 169-177, March.
    15. John Geweke & Gautam Gowrisankaran & Robert J. Town, 2003. "Bayesian Inference for Hospital Quality in a Selection Model," Econometrica, Econometric Society, vol. 71(4), pages 1215-1238, July.
    16. William Greene & David Hensher, 2010. "Does scale heterogeneity across individuals matter? An empirical assessment of alternative logit models," Transportation, Springer, vol. 37(3), pages 413-428, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Didier Nibbering, 2019. "A High-dimensional Multinomial Choice Model," Monash Econometrics and Business Statistics Working Papers 19/19, Monash University, Department of Econometrics and Business Statistics.
    2. David Hensher & John Rose & Zheng Li, 2012. "Does the choice model method and/or the data matter?," Transportation, Springer, vol. 39(2), pages 351-385, March.
    3. Line Bjørnskov Pedersen & Julie Riise & Arne Risa Hole & Dorte Gyrd-Hansen, 2014. "GPs' shifting agencies in choice of treatment," Applied Economics, Taylor & Francis Journals, vol. 46(7), pages 750-761, March.
    4. Hensher, David A., 2012. "Accounting for scale heterogeneity within and between pooled data sources," Transportation Research Part A: Policy and Practice, Elsevier, vol. 46(3), pages 480-486.
    5. Arne Hole & Julie Kolstad, 2012. "Mixed logit estimation of willingness to pay distributions: a comparison of models in preference and WTP space using data from a health-related choice experiment," Empirical Economics, Springer, vol. 42(2), pages 445-469, April.
    6. Chen, Tiantian & Fu, Xiaowen & Hensher, David A. & Li, Zhi-Chun & Sze, N.N., 2022. "Air travel choice, online meeting and passenger heterogeneity – An international study on travellers’ preference during a pandemic," Transportation Research Part A: Policy and Practice, Elsevier, vol. 165(C), pages 439-453.
    7. Holte, Jon Helgheim & Kjaer, Trine & Abelsen, Birgit & Olsen, Jan Abel, 2015. "The impact of pecuniary and non-pecuniary incentives for attracting young doctors to rural general practice," Social Science & Medicine, Elsevier, vol. 128(C), pages 1-9.
    8. Haile, Kaleab K. & Tirivayi, Nyasha & Tesfaye, Wondimagegn, 2019. "Farmers’ willingness to accept payments for ecosystem services on agricultural land: The case of climate-smart agroforestry in Ethiopia," Ecosystem Services, Elsevier, vol. 39(C).
    9. Mariel, Petr & Ayala, Amaya de & Hoyos, David & Abdullah, Sabah, 2013. "Selecting random parameters in discrete choice experiment for environmental valuation: A simulation experiment," Journal of choice modelling, Elsevier, vol. 7(C), pages 44-57.
    10. Alves, Maria Odete & Valente, Airton Saboya Jr, 2006. "Comunicação Rural Entre Três Atores Nas Áreas De Concentração De Fruteiras No Nordeste Brasileiro:," 44th Congress, July 23-27, 2006, Fortaleza, Ceará, Brazil 148515, Sociedade Brasileira de Economia, Administracao e Sociologia Rural (SOBER).
    11. Kaambwa, Billingsley & Lancsar, Emily & McCaffrey, Nicola & Chen, Gang & Gill, Liz & Cameron, Ian D. & Crotty, Maria & Ratcliffe, Julie, 2015. "Investigating consumers' and informal carers' views and preferences for consumer directed care: A discrete choice experiment," Social Science & Medicine, Elsevier, vol. 140(C), pages 81-94.
    12. Balogh, Péter & Békési, Dániel & Gorton, Matthew & Popp, József & Lengyel, Péter, 2016. "Consumer willingness to pay for traditional food products," Food Policy, Elsevier, vol. 61(C), pages 176-184.
    13. I. G. Ukpong & K. G. Balcombe & I. M. Fraser & F. J. Areal, 2019. "Preferences for Mitigation of the Negative Impacts of the Oil and Gas Industry in the Niger Delta Region of Nigeria," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 74(2), pages 811-843, October.
    14. Yuanyuan Gu & Arne Risa Hole & Stephanie Knox, 2013. "Fitting the generalized multinomial logit model in Stata," Stata Journal, StataCorp LP, vol. 13(2), pages 382-397, June.
    15. Shr, Yau-Huo & Ready, Richard C. & Orland, Brian & Echols, Stuart, 2017. "Do Visual Representations Influence Survey Responses? Evidence from a Choice Experiment on Landscape Attributes of Green Infrastructure," 2017 Annual Meeting, July 30-August 1, Chicago, Illinois 258397, Agricultural and Applied Economics Association.
    16. Sarrias, Mauricio & Daziano, Ricardo, 2017. "Multinomial Logit Models with Continuous and Discrete Individual Heterogeneity in R: The gmnl Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 79(i02).
    17. Nguyen, Manh-Hung & Nguyen, Thi Lan Anh & Nguyen, Tuan & Reynaud, Arnaud & Simioni, Michel & Hoang, Viet-Ngu, 2021. "Economic analysis of choices among differing measures to manage coastal erosion in Hoi An (a UNESCO World Heritage Site)," Economic Analysis and Policy, Elsevier, vol. 70(C), pages 529-543.
    18. Rocha, Luiz Eduardo Vasconcelos & Santos, Gilnei Costa & Bastos, Patricia de Melo Abrita, 2006. "Evolução Da Distribuição Da Renda E Da Pobreza Das Famílias Ocupadas E Residentes No Meio Rural Do Estado De Minas Gerais, De 1981 A 2003," 44th Congress, July 23-27, 2006, Fortaleza, Ceará, Brazil 148649, Sociedade Brasileira de Economia, Administracao e Sociologia Rural (SOBER).
    19. John C. Whitehead & Daniel K. Lew, 2020. "Estimating recreation benefits through joint estimation of revealed and stated preference discrete choice data," Empirical Economics, Springer, vol. 58(4), pages 2009-2029, April.
    20. Yuanyuan Gu & Richard Norman & Rosalie Viney, 2014. "Estimating Health State Utility Values From Discrete Choice Experiments—A Qaly Space Model Approach," Health Economics, John Wiley & Sons, Ltd., vol. 23(9), pages 1098-1114, September.

    More about this item

    Keywords

    large choice sets; Dirichlet process prior; multinomial logit model; highdimensional models;
    All these keywords.

    JEL classification:

    • C11 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Bayesian Analysis: General
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C25 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions; Probabilities
    • C35 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Discrete Regression and Qualitative Choice Models; Discrete Regressors; Proportions
    • C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:msh:ebswps:2023-19. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Professor Xibin Zhang (email available below). General contact details of provider: https://edirc.repec.org/data/dxmonau.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.