IDEAS home Printed from https://ideas.repec.org/a/eee/transb/v140y2020icp236-261.html
   My bibliography  Save this article

Enhancing discrete choice models with representation learning

Author

Listed:
  • Sifringer, Brian
  • Lurkin, Virginie
  • Alahi, Alexandre

Abstract

In discrete choice modeling (DCM), model misspecifications may lead to limited predictability and biased parameter estimates. In this paper, we propose a new approach for estimating choice models in which we divide the systematic part of the utility specification into (i) a knowledge-driven part, and (ii) a data-driven one, which learns a new representation from available explanatory variables. Our formulation increases the predictive power of standard DCM without sacrificing their interpretability. We show the effectiveness of our formulation by augmenting the utility specification of the Multinomial Logit (MNL) and the Nested Logit (NL) models with a new non-linear representation arising from a Neural Network (NN), leading to new choice models referred to as the Learning Multinomial Logit (L-MNL) and Learning Nested Logit (L-NL) models. Using multiple publicly available datasets based on revealed and stated preferences, we show that our models outperform the traditional ones, both in terms of predictive performance and accuracy in parameter estimation. All source code of the models are shared to promote open science.

Suggested Citation

  • Sifringer, Brian & Lurkin, Virginie & Alahi, Alexandre, 2020. "Enhancing discrete choice models with representation learning," Transportation Research Part B: Methodological, Elsevier, vol. 140(C), pages 236-261.
  • Handle: RePEc:eee:transb:v:140:y:2020:i:c:p:236-261
    DOI: 10.1016/j.trb.2020.08.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0191261520303830
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.trb.2020.08.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Torres, Cati & Hanley, Nick & Riera, Antoni, 2011. "How wrong can you be? Implications of incorrect utility function specification for welfare measurement in choice experiments," Journal of Environmental Economics and Management, Elsevier, vol. 62(1), pages 111-121, July.
    2. Patricia M. West & Patrick L. Brockett & Linda L. Golden, 1997. "A Comparative Analysis of Neural Networks and Statistical Methods for Predicting Consumer Choice," Marketing Science, INFORMS, vol. 16(4), pages 370-391.
    3. Zhou, Xiaolu & Wang, Mingshu & Li, Dongying, 2019. "Bike-sharing or taxi? Modeling the choices of travel mode in Chicago using machine learning," Journal of Transport Geography, Elsevier, vol. 79(C), pages 1-1.
    4. Glerum, Aurélie & Atasoy, Bilge & Bierlaire, Michel, 2014. "Using semi-open questions to integrate perceptions in choice models," Journal of choice modelling, Elsevier, vol. 10(C), pages 11-33.
    5. Vij, Akshay, 2013. "Incorporating the Influence of Latent Modal Preferences in Travel Demand Models," University of California Transportation Center, Working Papers qt7ng2z24q, University of California Transportation Center.
    6. Hensher, David A. & Ton, Tu T., 2000. "A comparison of the predictive potential of artificial neural networks and nested logit models for commuter mode choice," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 36(3), pages 155-172, September.
    7. Wong, Melvin & Farooq, Bilal & Bilodeau, Guillaume-Alexandre, 2018. "Discriminative conditional restricted Boltzmann machine for discrete choice and latent variable modelling," Journal of choice modelling, Elsevier, vol. 29(C), pages 152-168.
    8. Thomas Kneib & Bernhard Baumgartner & Winfried Steiner, 2007. "Semiparametric multinomial logit models for analysing consumer choice behaviour," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 91(3), pages 225-244, October.
    9. Hruschka, Harald & Fettes, Werner & Probst, Markus, 2004. "An empirical comparison of the validity of a neural net based multinomial logit choice model to alternative model specifications," European Journal of Operational Research, Elsevier, vol. 159(1), pages 166-180, November.
    10. Lee, Lung-Fei, 1982. "Specification error in multinomial logit models : Analysis of the omitted variable bias," Journal of Econometrics, Elsevier, vol. 20(2), pages 197-209, November.
    11. Marion Schindler & Bernhard Baumgartner & Harald Hruschka, 2007. "Nonlinear Effects in Brand Choice Models: Comparing Heterogeneous Latent Class To Homogeneous Nonlinear Models," Schmalenbach Business Review (sbr), LMU Munich School of Management, vol. 59(2), pages 118-137, April.
    12. Fernández-Antolín, Anna & Guevara, C. Angelo & de Lapparent, Matthieu & Bierlaire, Michel, 2016. "Correcting for endogeneity due to omitted attitudes: Empirical assessment of a modified MIS method using RP mode choice data," Journal of choice modelling, Elsevier, vol. 20(C), pages 1-15.
    13. Yafei Han & Francisco Camara Pereira & Moshe Ben-Akiva & Christopher Zegras, 2020. "A Neural-embedded Choice Model: TasteNet-MNL Modeling Taste Heterogeneity with Flexibility and Interpretability," Papers 2002.00922, arXiv.org, revised Jul 2022.
    14. Harald Hruschka, 2007. "Using a heterogeneous multinomial probit model with a neural net extension to model brand choice," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 26(2), pages 113-127.
    15. Guevara, C. Angelo, 2015. "Critical assessment of five methods to correct for endogeneity in discrete-choice models," Transportation Research Part A: Policy and Practice, Elsevier, vol. 82(C), pages 240-254.
    16. Abe, Makoto, 1999. "A Generalized Additive Model for Discrete-Choice Data," Journal of Business & Economic Statistics, American Statistical Association, vol. 17(3), pages 271-284, July.
    17. Vij, Akshay, 2013. "Incorporating the Influence of Latent Modal Preferences in Travel Demand Models," University of California Transportation Center, Working Papers qt7nq9p0cv, University of California Transportation Center.
    18. Yves Bentz & Dwight Merunka, 2000. "Neural networks and the multinomial logit for brand choice modelling: a hybrid approach," Post-Print hal-01822273, HAL.
    19. Junyi Shen, 2009. "Latent class model or mixed logit model? A comparison by transport mode choice data," Applied Economics, Taylor & Francis Journals, vol. 41(22), pages 2915-2924.
    20. McFadden, Daniel, 1974. "The measurement of urban travel demand," Journal of Public Economics, Elsevier, vol. 3(4), pages 303-328, November.
    21. Melvin Wong & Bilal Farooq, 2019. "ResLogit: A residual neural network logit model for data-driven choice modelling," Papers 1912.10058, arXiv.org, revised Feb 2021.
    22. J. S. Cramer, 2007. "Robustness of Logit Analysis: Unobserved Heterogeneity and Mis‐specified Disturbances," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 69(4), pages 545-555, August.
    23. Xiong, Yingge & Mannering, Fred L., 2013. "The heterogeneous effects of guardian supervision on adolescent driver-injury severities: A finite-mixture random-parameters approach," Transportation Research Part B: Methodological, Elsevier, vol. 49(C), pages 39-54.
    24. Vij, Akshay & Carrel, André & Walker, Joan L., 2013. "Incorporating the influence of latent modal preferences on travel mode choice behavior," Transportation Research Part A: Policy and Practice, Elsevier, vol. 54(C), pages 164-178.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yao, Rui & Bekhor, Shlomo, 2022. "A variational autoencoder approach for choice set generation and implicit perception of alternatives in choice modeling," Transportation Research Part B: Methodological, Elsevier, vol. 158(C), pages 273-294.
    2. Arkoudi, Ioanna & Krueger, Rico & Azevedo, Carlos Lima & Pereira, Francisco C., 2023. "Combining discrete choice models and neural networks through embeddings: Formulation, interpretability and performance," Transportation Research Part B: Methodological, Elsevier, vol. 175(C).
    3. Smeele, Nicholas V.R. & Chorus, Caspar G. & Schermer, Maartje H.N. & de Bekker-Grob, Esther W., 2023. "Towards machine learning for moral choice analysis in health economics: A literature review and research agenda," Social Science & Medicine, Elsevier, vol. 326(C).
    4. Shadi Haj-Yahia & Omar Mansour & Tomer Toledo, 2023. "Incorporating Domain Knowledge in Deep Neural Networks for Discrete Choice Models," Papers 2306.00016, arXiv.org.
    5. Ioanna Arkoudi & Carlos Lima Azevedo & Francisco C. Pereira, 2021. "Combining Discrete Choice Models and Neural Networks through Embeddings: Formulation, Interpretability and Performance," Papers 2109.12042, arXiv.org, revised Sep 2021.
    6. Georges Sfeir & Filipe Rodrigues & Maya Abou-Zeid, 2021. "Gaussian Process Latent Class Choice Models," Papers 2101.12252, arXiv.org.
    7. Krueger, Rico & Bierlaire, Michel & Daziano, Ricardo A. & Rashidi, Taha H. & Bansal, Prateek, 2021. "Evaluating the predictive abilities of mixed logit models with unobserved inter- and intra-individual heterogeneity," Journal of choice modelling, Elsevier, vol. 41(C).
    8. Sander van Cranenburgh & Francisco Garrido-Valenzuela, 2023. "Computer vision-enriched discrete choice models, with an application to residential location choice," Papers 2308.08276, arXiv.org.
    9. Beeramoole, Prithvi Bhat & Arteaga, Cristian & Pinz, Alban & Haque, Md Mazharul & Paz, Alexander, 2023. "Extensive hypothesis testing for estimation of mixed-Logit models," Journal of choice modelling, Elsevier, vol. 47(C).
    10. Dubey, Subodh & Cats, Oded & Hoogendoorn, Serge & Bansal, Prateek, 2022. "A multinomial probit model with Choquet integral and attribute cut-offs," Transportation Research Part B: Methodological, Elsevier, vol. 158(C), pages 140-163.
    11. Weitao Jian & Kunxu Chen & Junshu He & Sifan Wu & Hongli Li & Ming Cai, 2023. "A Federated Personal Mobility Service in Autonomous Transportation Systems," Mathematics, MDPI, vol. 11(12), pages 1-21, June.
    12. Lorena Torres Lahoz & Francisco Camara Pereira & Georges Sfeir & Ioanna Arkoudi & Mayara Moraes Monteiro & Carlos Lima Azevedo, 2023. "Attitudes and Latent Class Choice Models using Machine learning," Papers 2302.09871, arXiv.org.
    13. Qingyi Wang & Shenhao Wang & Yunhan Zheng & Hongzhou Lin & Xiaohu Zhang & Jinhua Zhao & Joan Walker, 2023. "Deep hybrid model with satellite imagery: how to combine demand modeling and computer vision for behavior analysis?," Papers 2303.04204, arXiv.org, revised Feb 2024.
    14. Ali, Azam & Kalatian, Arash & Choudhury, Charisma F., 2023. "Comparing and contrasting choice model and machine learning techniques in the context of vehicle ownership decisions," Transportation Research Part A: Policy and Practice, Elsevier, vol. 173(C).
    15. Hernandez, Jose Ignacio & van Cranenburgh, Sander & Chorus, Caspar & Mouter, Niek, 2023. "Data-driven assisted model specification for complex choice experiments data: Association rules learning and random forests for Participatory Value Evaluation experiments," Journal of choice modelling, Elsevier, vol. 46(C).
    16. Han, Yafei & Pereira, Francisco Camara & Ben-Akiva, Moshe & Zegras, Christopher, 2022. "A neural-embedded discrete choice model: Learning taste representation with strengthened interpretability," Transportation Research Part B: Methodological, Elsevier, vol. 163(C), pages 166-186.
    17. S. Van Cranenburgh & S. Wang & A. Vij & F. Pereira & J. Walker, 2021. "Choice modelling in the age of machine learning -- discussion paper," Papers 2101.11948, arXiv.org, revised Nov 2021.
    18. Sander Cranenburgh & Marco Kouwenhoven, 2021. "An artificial neural network based method to uncover the value-of-travel-time distribution," Transportation, Springer, vol. 48(5), pages 2545-2583, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Han, Yafei & Pereira, Francisco Camara & Ben-Akiva, Moshe & Zegras, Christopher, 2022. "A neural-embedded discrete choice model: Learning taste representation with strengthened interpretability," Transportation Research Part B: Methodological, Elsevier, vol. 163(C), pages 166-186.
    2. Yafei Han & Francisco Camara Pereira & Moshe Ben-Akiva & Christopher Zegras, 2020. "A Neural-embedded Choice Model: TasteNet-MNL Modeling Taste Heterogeneity with Flexibility and Interpretability," Papers 2002.00922, arXiv.org, revised Jul 2022.
    3. Smeele, Nicholas V.R. & Chorus, Caspar G. & Schermer, Maartje H.N. & de Bekker-Grob, Esther W., 2023. "Towards machine learning for moral choice analysis in health economics: A literature review and research agenda," Social Science & Medicine, Elsevier, vol. 326(C).
    4. Ioanna Arkoudi & Carlos Lima Azevedo & Francisco C. Pereira, 2021. "Combining Discrete Choice Models and Neural Networks through Embeddings: Formulation, Interpretability and Performance," Papers 2109.12042, arXiv.org, revised Sep 2021.
    5. Sanjana Hossain & Md. Sami Hasnine & Khandker Nurul Habib, 2021. "A latent class joint mode and departure time choice model for the Greater Toronto and Hamilton Area," Transportation, Springer, vol. 48(3), pages 1217-1239, June.
    6. Fowri, Hamid R. & Seyedabrishami, Seyedehsan, 2020. "Assessment of urban transportation pricing policies with incorporation of unobserved heterogeneity," Transport Policy, Elsevier, vol. 99(C), pages 12-19.
    7. Shamshiripour, Ali & Rahimi, Ehsan & (Kouros) Mohammadian, Abolfazl & Auld, Joshua, 2020. "Investigating the influence of latent lifestyles on productive travels: Insights into designing autonomous transit system," Transportation Research Part A: Policy and Practice, Elsevier, vol. 141(C), pages 469-484.
    8. Zhou, Heng & Norman, Richard & Xia, Jianhong(Cecilia) & Hughes, Brett & Kelobonye, Keone & Nikolova, Gabi & Falkmer, Torbjorn, 2020. "Analysing travel mode and airline choice using latent class modelling: A case study in Western Australia," Transportation Research Part A: Policy and Practice, Elsevier, vol. 137(C), pages 187-205.
    9. Ortelli, Nicola & Hillel, Tim & Pereira, Francisco C. & de Lapparent, Matthieu & Bierlaire, Michel, 2021. "Assisted specification of discrete choice models," Journal of choice modelling, Elsevier, vol. 39(C).
    10. Keskisaari, Ville & Ottelin, Juudit & Heinonen, Jukka, 2017. "Greenhouse gas impacts of different modality style classes using latent class travel behavior model," Journal of Transport Geography, Elsevier, vol. 65(C), pages 155-164.
    11. Kim, Sung Hoo & Mokhtarian, Patricia L., 2018. "Taste heterogeneity as an alternative form of endogeneity bias: Investigating the attitude-moderated effects of built environment and socio-demographics on vehicle ownership using latent class modelin," Transportation Research Part A: Policy and Practice, Elsevier, vol. 116(C), pages 130-150.
    12. S. Van Cranenburgh & S. Wang & A. Vij & F. Pereira & J. Walker, 2021. "Choice modelling in the age of machine learning -- discussion paper," Papers 2101.11948, arXiv.org, revised Nov 2021.
    13. Sfeir, Georges & Abou-Zeid, Maya & Rodrigues, Filipe & Pereira, Francisco Camara & Kaysi, Isam, 2021. "Latent class choice model with a flexible class membership component: A mixture model approach," Journal of choice modelling, Elsevier, vol. 41(C).
    14. Xuemei Fu, 2021. "How habit moderates the commute mode decision process: integration of the theory of planned behavior and latent class choice model," Transportation, Springer, vol. 48(5), pages 2681-2707, October.
    15. Ton, Danique & Bekhor, Shlomo & Cats, Oded & Duives, Dorine C. & Hoogendoorn-Lanser, Sascha & Hoogendoorn, Serge P., 2020. "The experienced mode choice set and its determinants: Commuting trips in the Netherlands," Transportation Research Part A: Policy and Practice, Elsevier, vol. 132(C), pages 744-758.
    16. Kim, Seheon & Rasouli, Soora, 2022. "The influence of latent lifestyle on acceptance of Mobility-as-a-Service (MaaS): A hierarchical latent variable and latent class approach," Transportation Research Part A: Policy and Practice, Elsevier, vol. 159(C), pages 304-319.
    17. Guevara, C. Angelo, 2018. "Overidentification tests for the exogeneity of instruments in discrete choice models," Transportation Research Part B: Methodological, Elsevier, vol. 114(C), pages 241-253.
    18. Hruschka, Harald & Fettes, Werner & Probst, Markus, 2004. "An empirical comparison of the validity of a neural net based multinomial logit choice model to alternative model specifications," European Journal of Operational Research, Elsevier, vol. 159(1), pages 166-180, November.
    19. Li, Zili & Washington, Simon P. & Zheng, Zuduo & Prato, Carlo G., 2023. "A Bayesian hierarchical approach to the joint modelling of Revealed and stated choices," Journal of choice modelling, Elsevier, vol. 47(C).
    20. Chenfeng Xiong & Lei Zhang, 2017. "Dynamic travel mode searching and switching analysis considering hidden model preference and behavioral decision processes," Transportation, Springer, vol. 44(3), pages 511-532, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transb:v:140:y:2020:i:c:p:236-261. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/548/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.