IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2101.12252.html
   My bibliography  Save this paper

Gaussian Process Latent Class Choice Models

Author

Listed:
  • Georges Sfeir
  • Filipe Rodrigues
  • Maya Abou-Zeid

Abstract

We present a Gaussian Process - Latent Class Choice Model (GP-LCCM) to integrate a non-parametric class of probabilistic machine learning within discrete choice models (DCMs). Gaussian Processes (GPs) are kernel-based algorithms that incorporate expert knowledge by assuming priors over latent functions rather than priors over parameters, which makes them more flexible in addressing nonlinear problems. By integrating a Gaussian Process within a LCCM structure, we aim at improving discrete representations of unobserved heterogeneity. The proposed model would assign individuals probabilistically to behaviorally homogeneous clusters (latent classes) using GPs and simultaneously estimate class-specific choice models by relying on random utility models. Furthermore, we derive and implement an Expectation-Maximization (EM) algorithm to jointly estimate/infer the hyperparameters of the GP kernel function and the class-specific choice parameters by relying on a Laplace approximation and gradient-based numerical optimization methods, respectively. The model is tested on two different mode choice applications and compared against different LCCM benchmarks. Results show that GP-LCCM allows for a more complex and flexible representation of heterogeneity and improves both in-sample fit and out-of-sample predictive power. Moreover, behavioral and economic interpretability is maintained at the class-specific choice model level while local interpretation of the latent classes can still be achieved, although the non-parametric characteristic of GPs lessens the transparency of the model.

Suggested Citation

  • Georges Sfeir & Filipe Rodrigues & Maya Abou-Zeid, 2021. "Gaussian Process Latent Class Choice Models," Papers 2101.12252, arXiv.org.
  • Handle: RePEc:arx:papers:2101.12252
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2101.12252
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ding, Chuan & Cao, Xinyu & Wang, Yunpeng, 2018. "Synergistic effects of the built environment and commuting programs on commute mode choice," Transportation Research Part A: Policy and Practice, Elsevier, vol. 118(C), pages 104-118.
    2. Wong, Melvin & Farooq, Bilal & Bilodeau, Guillaume-Alexandre, 2018. "Discriminative conditional restricted Boltzmann machine for discrete choice and latent variable modelling," Journal of choice modelling, Elsevier, vol. 29(C), pages 152-168.
    3. Yuan, Yuan & You, Wen & Boyle, Kevin J., 2015. "A guide to heterogeneity features captured by parametric and nonparametric mixing distributions for the mixed logit model," 2015 AAEA & WAEA Joint Annual Meeting, July 26-28, San Francisco, California 205733, Agricultural and Applied Economics Association.
    4. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, January.
    5. Train, Kenneth, 2016. "Mixed logit with a flexible mixing distribution," Journal of choice modelling, Elsevier, vol. 19(C), pages 40-53.
    6. Melvin Wong & Bilal Farooq, 2019. "ResLogit: A residual neural network logit model for data-driven choice modelling," Papers 1912.10058, arXiv.org, revised Feb 2021.
    7. William H. Greene & David A. Hensher, 2013. "Revealing additional dimensions of preference heterogeneity in a latent class mixed multinomial logit model," Applied Economics, Taylor & Francis Journals, vol. 45(14), pages 1897-1902, May.
    8. Sifringer, Brian & Lurkin, Virginie & Alahi, Alexandre, 2020. "Enhancing discrete choice models with representation learning," Transportation Research Part B: Methodological, Elsevier, vol. 140(C), pages 236-261.
    9. Sfeir, Georges & Abou-Zeid, Maya & Kaysi, Isam, 2020. "Multivariate count data models for adoption of new transport modes in an organization-based context," Transport Policy, Elsevier, vol. 91(C), pages 59-75.
    10. Zoubin Ghahramani, 2015. "Probabilistic machine learning and artificial intelligence," Nature, Nature, vol. 521(7553), pages 452-459, May.
    11. Angel Bujosa & Antoni Riera & Robert Hicks, 2010. "Combining Discrete and Continuous Representations of Preference Heterogeneity: A Latent Class Approach," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 47(4), pages 477-493, December.
    12. Chandra R. Bhat, 1997. "An Endogenous Segmentation Mode Choice Model with an Application to Intercity Travel," Transportation Science, INFORMS, vol. 31(1), pages 34-48, February.
    13. Daniel McFadden & Kenneth Train, 2000. "Mixed MNL models for discrete response," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 15(5), pages 447-470.
    14. Liang Tang & Chenfeng Xiong & Lei Zhang, 2015. "Decision tree method for modeling travel mode switching in a dynamic behavioral process," Transportation Planning and Technology, Taylor & Francis Journals, vol. 38(8), pages 833-850, December.
    15. Al-Ayyash, Zahwa & Abou-Zeid, Maya & Kaysi, Isam, 2016. "Modeling the demand for a shared-ride taxi service: An application to an organization-based context," Transport Policy, Elsevier, vol. 48(C), pages 169-182.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Smeele, Nicholas V.R. & Chorus, Caspar G. & Schermer, Maartje H.N. & de Bekker-Grob, Esther W., 2023. "Towards machine learning for moral choice analysis in health economics: A literature review and research agenda," Social Science & Medicine, Elsevier, vol. 326(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sfeir, Georges & Abou-Zeid, Maya & Rodrigues, Filipe & Pereira, Francisco Camara & Kaysi, Isam, 2021. "Latent class choice model with a flexible class membership component: A mixture model approach," Journal of choice modelling, Elsevier, vol. 41(C).
    2. Tinessa, Fiore & Marzano, Vittorio & Papola, Andrea, 2020. "Mixing distributions of tastes with a Combination of Nested Logit (CoNL) kernel: Formulation and performance analysis," Transportation Research Part B: Methodological, Elsevier, vol. 141(C), pages 1-23.
    3. Rico Krueger & Akshay Vij & Taha H. Rashidi, 2018. "A Dirichlet Process Mixture Model of Discrete Choice," Papers 1801.06296, arXiv.org.
    4. Akshay Vij & Rico Krueger, 2018. "Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions," Papers 1802.02299, arXiv.org.
    5. Vij, Akshay & Krueger, Rico, 2017. "Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions," Transportation Research Part B: Methodological, Elsevier, vol. 106(C), pages 76-101.
    6. Bansal, Prateek & Daziano, Ricardo A & Guerra, Erick, 2018. "Minorization-Maximization (MM) algorithms for semiparametric logit models: Bottlenecks, extensions, and comparisons," Transportation Research Part B: Methodological, Elsevier, vol. 115(C), pages 17-40.
    7. Krueger, Rico & Rashidi, Taha H. & Vij, Akshay, 2020. "A Dirichlet process mixture model of discrete choice: Comparisons and a case study on preferences for shared automated vehicles," Journal of choice modelling, Elsevier, vol. 36(C).
    8. Franceschinis, Cristiano & Thiene, Mara & Scarpa, Riccardo & Rose, John & Moretto, Michele & Cavalli, Raffaele, 2017. "Adoption of renewable heating systems: An empirical test of the diffusion of innovation theory," Energy, Elsevier, vol. 125(C), pages 313-326.
    9. Stephane Hess, 2014. "Latent class structures: taste heterogeneity and beyond," Chapters, in: Stephane Hess & Andrew Daly (ed.), Handbook of Choice Modelling, chapter 14, pages 311-330, Edward Elgar Publishing.
    10. Rico Krueger & Taha H. Rashidi & Akshay Vij, 2019. "Semi-Parametric Hierarchical Bayes Estimates of New Yorkers' Willingness to Pay for Features of Shared Automated Vehicle Services," Papers 1907.09639, arXiv.org.
    11. Georges Sfeir & Maya Abou-Zeid & Filipe Rodrigues & Francisco Camara Pereira & Isam Kaysi, 2020. "Semi-nonparametric Latent Class Choice Model with a Flexible Class Membership Component: A Mixture Model Approach," Papers 2007.02739, arXiv.org.
    12. Hocheol Jeon & Joseph A. Herriges, 2017. "Combining Revealed Preference Data with Stated Preference Data: A Latent Class Approach," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 68(4), pages 1053-1086, December.
    13. Bansal, Prateek & Daziano, Ricardo A. & Achtnicht, Martin, 2018. "Comparison of parametric and semiparametric representations of unobserved preference heterogeneity in logit models," Journal of choice modelling, Elsevier, vol. 27(C), pages 97-113.
    14. Catalina M. Torres & Sergio Colombo & Nick Hanley, 2014. "Incorrectly accounting for preference heterogeneity in choice experiments: what are the implications for welfare measurement?," DEA Working Papers 65, Universitat de les Illes Balears, Departament d'Economía Aplicada.
    15. Beeramoole, Prithvi Bhat & Arteaga, Cristian & Pinz, Alban & Haque, Md Mazharul & Paz, Alexander, 2023. "Extensive hypothesis testing for estimation of mixed-Logit models," Journal of choice modelling, Elsevier, vol. 47(C).
    16. Youssef M Aboutaleb & Mazen Danaf & Yifei Xie & Moshe Ben-Akiva, 2020. "Sparse Covariance Estimation in Logit Mixture Models," Papers 2001.05034, arXiv.org.
    17. Sarrias, Mauricio & Daziano, Ricardo A., 2018. "Individual-specific point and interval conditional estimates of latent class logit parameters," Journal of choice modelling, Elsevier, vol. 27(C), pages 50-61.
    18. Heiss, Florian & Hetzenecker, Stephan & Osterhaus, Maximilian, 2022. "Nonparametric estimation of the random coefficients model: An elastic net approach," Journal of Econometrics, Elsevier, vol. 229(2), pages 299-321.
    19. Gutiérrez-Vargas, Álvaro A. & Meulders, Michel & Vandebroek, Martina, 2023. "Modeling preference heterogeneity using model-based decision trees," Journal of choice modelling, Elsevier, vol. 46(C).
    20. Martínez-Pardo, Ana & Orro, Alfonso & Garcia-Alonso, Lorena, 2020. "Analysis of port choice: A methodological proposal adjusted with public data," Transportation Research Part A: Policy and Practice, Elsevier, vol. 136(C), pages 178-193.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2101.12252. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.