IDEAS home Printed from https://ideas.repec.org/a/eee/eejocm/v54y2025ics1755534525000016.html
   My bibliography  Save this article

Analysis of attribute importance in multinomial logit models using Shapley values-based methods

Author

Listed:
  • Salas, Patricio
  • De la Fuente, Rodrigo
  • Astroza, Sebastian
  • Carrasco, Juan Antonio

Abstract

This paper investigates the use of Shapley values-based methods to determine the importance of attributes in discrete choice models, specifically within a Multinomial Logit (MNL) framework. We extend the Shapley decomposition Shorrocks (2013) method from linear models. Additionally, the SHAP method Lundberg and Lee (2017) idea is applied to assess the impact of attributes on individual-level choice probability predictions. A simulation study demonstrates the effectiveness of these approaches under various experimental conditions, including attributes in several ranges and interaction terms. Finally, an empirical application is conducted using well-known travel mode choice datasets. The simulation results show that Shapley values accurately capture the global importance of attributes on goodness-of-fit. The SHAP method provides transparency in MNL model predictions, clarifying how changes in attribute values influence choice probabilities for each decision-maker. These methods offer a complementary perspective to traditional metrics like elasticities and traditional relative importance analysis Orme(2006). In the empirical application, Shapley decomposition highlights the most relevant attributes, while SHAP values uncover individual-level impacts that might not be apparent through elasticities alone. Global and individual-level analysis offers a more comprehensive understanding of attribute importance. In summary, integrating Shapley values with traditional metrics ensures a robust analysis, aiding practitioners and policymakers in making informed decisions based on broad trends and specific impacts.

Suggested Citation

  • Salas, Patricio & De la Fuente, Rodrigo & Astroza, Sebastian & Carrasco, Juan Antonio, 2025. "Analysis of attribute importance in multinomial logit models using Shapley values-based methods," Journal of choice modelling, Elsevier, vol. 54(C).
  • Handle: RePEc:eee:eejocm:v:54:y:2025:i:c:s1755534525000016
    DOI: 10.1016/j.jocm.2025.100538
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1755534525000016
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jocm.2025.100538?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Coldren, Gregory M. & Koppelman, Frank S. & Kasturirangan, Krishnan & Mukherjee, Amit, 2003. "Modeling aggregate air-travel itinerary shares: logit model development at a major US airline," Journal of Air Transport Management, Elsevier, vol. 9(6), pages 361-369.
    2. Alsaleh, Nael & Farooq, Bilal, 2021. "Interpretable data-driven demand modelling for on-demand transit services," Transportation Research Part A: Policy and Practice, Elsevier, vol. 154(C), pages 1-22.
    3. Gaudry, Marc J. I. & Jara-Diaz, Sergio R. & Ortuzar, Juan de Dios, 1989. "Value of time sensitivity to model specification," Transportation Research Part B: Methodological, Elsevier, vol. 23(2), pages 151-158, April.
    4. Stan Lipovetsky, 2015. "Analytical closed-form solution for binary logit regression by categorical predictors," Journal of Applied Statistics, Taylor & Francis Journals, vol. 42(1), pages 37-49, January.
    5. Irfan Ullah & Kai Liu & Toshiyuki Yamamoto & Rabia Emhamed Al Mamlook & Arshad Jamal, 2022. "A comparative performance of machine learning algorithm to predict electric vehicles energy consumption: A path towards sustainability," Energy & Environment, , vol. 33(8), pages 1583-1612, December.
    6. Coldren, Gregory M. & Koppelman, Frank S., 2005. "Modeling the competition among air-travel itinerary shares: GEV model development," Transportation Research Part A: Policy and Practice, Elsevier, vol. 39(4), pages 345-365, May.
    7. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, June.
    8. Maddala,G. S., 1986. "Limited-Dependent and Qualitative Variables in Econometrics," Cambridge Books, Cambridge University Press, number 9780521338257, Enero-Abr.
    9. Guevara, C. Angelo, 2015. "Critical assessment of five methods to correct for endogeneity in discrete-choice models," Transportation Research Part A: Policy and Practice, Elsevier, vol. 82(C), pages 240-254.
    10. Koppelman, Frank S. & Wen, Chieh-Hua, 2000. "The paired combinatorial logit model: properties, estimation and application," Transportation Research Part B: Methodological, Elsevier, vol. 34(2), pages 75-89, February.
    11. Hadas, Yuval & Gnecco, Giorgio & Sanguineti, Marcello, 2017. "An approach to transportation network analysis via transferable utility games," Transportation Research Part B: Methodological, Elsevier, vol. 105(C), pages 120-143.
    12. Koppelman, Frank S. & Wen, Chieh-Hua, 1998. "Alternative nested logit models: structure, properties and estimation," Transportation Research Part B: Methodological, Elsevier, vol. 32(5), pages 289-298, June.
    13. Stephane Hess & Denis Bolduc & John Polak, 2010. "Random covariance heterogeneity in discrete choice models," Transportation, Springer, vol. 37(3), pages 391-411, May.
    14. Joseph Deutsch & Jacques Silber, 2006. "Ethnic origin and multidimensional relative poverty in Israel: a study based on the 1995 Israeli census," Research in Labor Economics, in: The Economics of Immigration and Social Diversity, pages 235-264, Emerald Group Publishing Limited.
    15. Hensher, David A., 2008. "Assessing systematic sources of variation in public transport elasticities: Some comparative warnings," Transportation Research Part A: Policy and Practice, Elsevier, vol. 42(7), pages 1031-1042, August.
    16. Jara-Díaz, Sergio R. & Videla, Jorge, 1989. "Detection of income effect in mode choice: Theory and application," Transportation Research Part B: Methodological, Elsevier, vol. 23(6), pages 393-400, December.
    17. Marisol Castro & Francisco Martínez & Marcela Munizaga, 2013. "Estimation of a constrained multinomial logit model," Transportation, Springer, vol. 40(3), pages 563-581, May.
    18. Sun, Hongye & Kim, Giseung, 2021. "The composite impact of ICT industry on lowering carbon intensity: From the perspective of regional heterogeneity," Technology in Society, Elsevier, vol. 66(C).
    19. Chandra R. Bhat, 2000. "Incorporating Observed and Unobserved Heterogeneity in Urban Work Travel Mode Choice Modeling," Transportation Science, INFORMS, vol. 34(2), pages 228-238, May.
    20. Stan Lipovetsky & Michael Conklin, 2001. "Analysis of regression in game theory approach," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 17(4), pages 319-330, October.
    21. Jui-Chen Yang & Shelby D. Reed & Steve Hass & Mark B. Skeen & F. Reed Johnson, 2021. "Is Easier Better Than Harder? An Experiment on Choice Experiments for Benefit-Risk Tradeoff Preferences," Medical Decision Making, , vol. 41(2), pages 222-232, February.
    22. Sifringer, Brian & Lurkin, Virginie & Alahi, Alexandre, 2020. "Enhancing discrete choice models with representation learning," Transportation Research Part B: Methodological, Elsevier, vol. 140(C), pages 236-261.
    23. repec:eme:rlec11:s0147-9121(05)24008-6 is not listed on IDEAS
    24. Bhat, Chandra R., 1998. "Analysis of travel mode and departure time choice for urban shopping trips," Transportation Research Part B: Methodological, Elsevier, vol. 32(6), pages 361-371, August.
    25. Anthony Shorrocks, 2013. "Decomposition procedures for distributional analysis: a unified framework based on the Shapley value," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 11(1), pages 99-126, March.
    26. Kenneth E. Train & Clifford Winston, 2007. "Vehicle Choice Behavior And The Declining Market Share Of U.S. Automakers," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 48(4), pages 1469-1496, November.
    27. Hess, Stephane & Palma, David, 2019. "Apollo: A flexible, powerful and customisable freeware package for choice model estimation and application," Journal of choice modelling, Elsevier, vol. 32(C), pages 1-1.
    28. Haim Shalit, 2020. "The Shapley value of regression portfolios," Journal of Asset Management, Palgrave Macmillan, vol. 21(6), pages 506-512, October.
    29. Francisco Amador & Rosa González & Juan Dios Ortúzar, 2008. "On Confounding Preference Heterogeneity and Income Effect in Discrete Choice Models," Networks and Spatial Economics, Springer, vol. 8(2), pages 97-108, September.
    30. Emily Lancsar & Denzil G. Fiebig & Arne Risa Hole, 2017. "Discrete Choice Experiments: A Guide to Model Specification, Estimation and Software," PharmacoEconomics, Springer, vol. 35(7), pages 697-716, July.
    31. C. Angelo Guevara, 2017. "Mode-valued differences of in-vehicle travel time Savings," Transportation, Springer, vol. 44(5), pages 977-997, September.
    32. Yasasvi Popuri & Kimon Proussaloglou & Cemal Ayvalik & Frank Koppelman & Aimee Lee, 2011. "Importance of traveler attitudes in the choice of public transportation to work: findings from the Regional Transportation Authority Attitudinal Survey," Transportation, Springer, vol. 38(4), pages 643-661, July.
    33. Stephane Hess & Andrew Daly & Richard Batley, 2018. "Revisiting consistency with random utility maximisation: theory and implications for practical work," Theory and Decision, Springer, vol. 84(2), pages 181-204, March.
    34. Vij, Akshay & Walker, Joan L., 2016. "How, when and why integrated choice and latent variable models are latently useful," Transportation Research Part B: Methodological, Elsevier, vol. 90(C), pages 192-217.
    35. Daniel McFadden, 1977. "Quantitative Methods for Analyzing Travel Behaviour of Individuals: Some Recent Developments," Cowles Foundation Discussion Papers 474, Cowles Foundation for Research in Economics, Yale University.
    36. Osnat Israeli, 2007. "A Shapley-based decomposition of the R-Square of a linear regression," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 5(2), pages 199-212, August.
    37. David J. Mott & Koonal K. Shah & Juan Manuel Ramos-Goñi & Nancy J. Devlin & Oliver Rivero-Arias, 2021. "Valuing EQ-5D-Y-3L Health States Using a Discrete Choice Experiment: Do Adult and Adolescent Preferences Differ?," Medical Decision Making, , vol. 41(5), pages 584-596, July.
    38. Bhat, Chandra R., 1998. "A model of post home-arrival activity participation behavior," Transportation Research Part B: Methodological, Elsevier, vol. 32(6), pages 387-400, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lurkin, Virginie & Garrow, Laurie A. & Higgins, Matthew J. & Newman, Jeffrey P. & Schyns, Michael, 2017. "Accounting for price endogeneity in airline itinerary choice models: An application to Continental U.S. markets," Transportation Research Part A: Policy and Practice, Elsevier, vol. 100(C), pages 228-246.
    2. Morlotti, Chiara & Birolini, Sebastian & Malighetti, Paolo & Redondi, Renato, 2023. "A latent class approach to estimate air travelers’ propensity toward connecting itineraries," Research in Transportation Economics, Elsevier, vol. 99(C).
    3. C. Angelo Guevara, 2017. "Mode-valued differences of in-vehicle travel time Savings," Transportation, Springer, vol. 44(5), pages 977-997, September.
    4. Birolini, Sebastian & Cattaneo, Mattia & Malighetti, Paolo & Morlotti, Chiara, 2020. "Integrated origin-based demand modeling for air transportation," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 142(C).
    5. Fukushi, Mitsuyoshi & Delgado, Felipe & Raveau, Sebastián, 2024. "Impact of omitted variable and simultaneous estimation endogeneity in choice-based revenue management systems," Transportation Research Part A: Policy and Practice, Elsevier, vol. 179(C).
    6. Sander van Cranenburgh & Francisco Garrido-Valenzuela, 2023. "Computer vision-enriched discrete choice models, with an application to residential location choice," Papers 2308.08276, arXiv.org.
    7. Cho, Woohyun & Windle, Robert J. & Dresner, Martin E., 2017. "The impact of operational exposure and value-of-time on customer choice: Evidence from the airline industry," Transportation Research Part A: Policy and Practice, Elsevier, vol. 103(C), pages 455-471.
    8. Peter Davis & Pasquale Schiraldi, 2014. "The flexible coefficient multinomial logit (FC-MNL) model of demand for differentiated products," RAND Journal of Economics, RAND Corporation, vol. 45(1), pages 32-63, March.
    9. Isler, Cassiano Augusto & Blumenfeld, Marcelo & Caldeira, Gabriel Pereira & Roberts, Clive, 2024. "Long-Distance railway mode choice in Brazil: Evidence from a discrete choice experiment," Research in Transportation Economics, Elsevier, vol. 104(C).
    10. Thomas E. Guerrero & C. Angelo Guevara & Elisabetta Cherchi & Juan de Dios Ortúzar, 2021. "Addressing endogeneity in strategic urban mode choice models," Transportation, Springer, vol. 48(4), pages 2081-2102, August.
    11. Kim, Sung Hoo & Mokhtarian, Patricia L., 2023. "Finite mixture (or latent class) modeling in transportation: Trends, usage, potential, and future directions," Transportation Research Part B: Methodological, Elsevier, vol. 172(C), pages 134-173.
    12. Schmid, Basil & Becker, Felix & Axhausen, Kay W. & Widmer, Paul & Stein, Petra, 2023. "A simultaneous model of residential location, mobility tool ownership and mode choice using latent variables," Transportation Research Part A: Policy and Practice, Elsevier, vol. 178(C).
    13. José-Benito Pérez-López & Margarita Novales & Francisco-Alberto Varela-García & Alfonso Orro, 2020. "Residential Location Econometric Choice Modeling with Irregular Zoning: Common Border Spatial Correlation Metric," Networks and Spatial Economics, Springer, vol. 20(3), pages 785-802, September.
    14. Ali, Azam & Kalatian, Arash & Choudhury, Charisma F., 2023. "Comparing and contrasting choice model and machine learning techniques in the context of vehicle ownership decisions," Transportation Research Part A: Policy and Practice, Elsevier, vol. 173(C).
    15. Espino, Raquel & de Dios Ortúzar, Juan & Román, Concepción, 2007. "Understanding suburban travel demand: Flexible modelling with revealed and stated choice data," Transportation Research Part A: Policy and Practice, Elsevier, vol. 41(10), pages 899-912, December.
    16. Tinessa, Fiore & Marzano, Vittorio & Papola, Andrea, 2020. "Mixing distributions of tastes with a Combination of Nested Logit (CoNL) kernel: Formulation and performance analysis," Transportation Research Part B: Methodological, Elsevier, vol. 141(C), pages 1-23.
    17. van Cranenburgh, Sander & Garrido-Valenzuela, Francisco, 2025. "Computer vision-enriched discrete choice models, with an application to residential location choice," Transportation Research Part A: Policy and Practice, Elsevier, vol. 192(C).
    18. Lurkin, Virginie & Garrow, Laurie A. & Higgins, Matthew J. & Newman, Jeffrey P. & Schyns, Michael, 2018. "Modeling competition among airline itineraries," Transportation Research Part A: Policy and Practice, Elsevier, vol. 113(C), pages 157-172.
    19. Park, Arim & Chen, Roger & Cho, Soohyun & Zhao, Yao, 2023. "The determinants of online matching platforms for freight services," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 179(C).
    20. Perez-Lopez, Jose-Benito & Novales, Margarita & Orro, Alfonso, 2022. "Spatially correlated nested logit model for spatial location choice," Transportation Research Part B: Methodological, Elsevier, vol. 161(C), pages 1-12.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:eejocm:v:54:y:2025:i:c:s1755534525000016. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/journal-of-choice-modelling .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.