IDEAS home Printed from https://ideas.repec.org/a/eee/transb/v191y2025ics019126152400256x.html
   My bibliography  Save this article

Hierarchical Nearest Neighbor Gaussian Process models for discrete choice: Mode choice in New York City

Author

Listed:
  • Villarraga, Daniel F.
  • Daziano, Ricardo A.

Abstract

Standard Discrete Choice Models (DCMs) assume that unobserved effects that influence decision-making are independently and identically distributed among individuals. When unobserved effects are spatially correlated, the independence assumption does not hold, leading to biased standard errors and potentially biased parameter estimates. This paper proposes an interpretable Hierarchical Nearest Neighbor Gaussian Process (HNNGP) model to account for spatially correlated unobservables in discrete choice analysis. Gaussian Processes (GPs) are often regarded as lacking interpretability due to their non-parametric nature. However, we demonstrate how to incorporate GPs directly into the latent utility specification to flexibly model spatially correlated unobserved effects without sacrificing structural economic interpretation. To empirically test our proposed HNNGP models, we analyze binary and multinomial mode choices for commuting to work in New York City. For the multinomial case, we formulate and estimate HNNGPs with and without independence from irrelevant alternatives (IIA). Building on the interpretability of our modeling strategy, we provide both point estimates and credible intervals for the value of travel time savings in NYC. Finally, we compare the results from all proposed specifications with those derived from a standard logit model and a probit model with spatially autocorrelated errors (SAE) to showcase how accounting for different sources of spatial correlation in discrete choice can significantly impact inference. We also show that the HNNGP models attain better out-of-sample prediction performance when compared to the logit and probit SAE models, especially in the multinomial case.

Suggested Citation

  • Villarraga, Daniel F. & Daziano, Ricardo A., 2025. "Hierarchical Nearest Neighbor Gaussian Process models for discrete choice: Mode choice in New York City," Transportation Research Part B: Methodological, Elsevier, vol. 191(C).
  • Handle: RePEc:eee:transb:v:191:y:2025:i:c:s019126152400256x
    DOI: 10.1016/j.trb.2024.103132
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S019126152400256X
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.trb.2024.103132?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Abhirup Datta & Sudipto Banerjee & Andrew O. Finley & Alan E. Gelfand, 2016. "Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 800-812, April.
    2. Michel Goulard & Thibault Laurent & Christine Thomas-Agnan, 2017. "About predictions in spatial autoregressive models: optimal and almost optimal strategies," Spatial Economic Analysis, Taylor & Francis Journals, vol. 12(2-3), pages 304-325, July.
    3. Chandra Bhat, 2015. "A new spatial (social) interaction discrete choice model accommodating for unobserved effects due to endogenous network formation," Transportation, Springer, vol. 42(5), pages 879-914, September.
    4. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, Enero.
    5. Bhat, Chandra R., 2011. "The maximum approximate composite marginal likelihood (MACML) estimation of multinomial probit-based unordered response choice models," Transportation Research Part B: Methodological, Elsevier, vol. 45(7), pages 923-939, August.
    6. Bhat, Chandra R. & Sener, Ipek N. & Eluru, Naveen, 2010. "A flexible spatially dependent discrete choice model: Formulation and application to teenagers' weekday recreational activity participation," Transportation Research Part B: Methodological, Elsevier, vol. 44(8-9), pages 903-921, September.
    7. Lewandowski, Daniel & Kurowicka, Dorota & Joe, Harry, 2009. "Generating random correlation matrices based on vines and extended onion method," Journal of Multivariate Analysis, Elsevier, vol. 100(9), pages 1989-2001, October.
    8. Bhat, Chandra R. & Pinjari, Abdul R. & Dubey, Subodh K. & Hamdi, Amin S., 2016. "On accommodating spatial interactions in a Generalized Heterogeneous Data Model (GHDM) of mixed types of dependent variables," Transportation Research Part B: Methodological, Elsevier, vol. 94(C), pages 240-263.
    9. Wong, Timothy & Brownstone, David & Bunch, David S., 2019. "Aggregation biases in discrete choice models," Journal of choice modelling, Elsevier, vol. 31(C), pages 210-221.
    10. Frank Goetzke, 2008. "Network Effects in Public Transit Use: Evidence from a Spatially Autoregressive Mode Choice Model for New York," Urban Studies, Urban Studies Journal Limited, vol. 45(2), pages 407-417, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dong, Chunjiao & Shao, Chunfu & Clarke, David B. & Nambisan, Shashi S., 2018. "An innovative approach for traffic crash estimation and prediction on accommodating unobserved heterogeneities," Transportation Research Part B: Methodological, Elsevier, vol. 118(C), pages 407-428.
    2. Mondal, Aupal & Bhat, Chandra R., 2022. "A spatial rank-ordered probit model with an application to travel mode choice," Transportation Research Part B: Methodological, Elsevier, vol. 155(C), pages 374-393.
    3. Bhat, Chandra R. & Pinjari, Abdul R. & Dubey, Subodh K. & Hamdi, Amin S., 2016. "On accommodating spatial interactions in a Generalized Heterogeneous Data Model (GHDM) of mixed types of dependent variables," Transportation Research Part B: Methodological, Elsevier, vol. 94(C), pages 240-263.
    4. Vinayak, Pragun & Dias, Felipe F. & Astroza, Sebastian & Bhat, Chandra R. & Pendyala, Ram M. & Garikapati, Venu M., 2018. "Accounting for multi-dimensional dependencies among decision-makers within a generalized model framework: An application to understanding shared mobility service usage levels," Transport Policy, Elsevier, vol. 72(C), pages 129-137.
    5. Daniel F. Villarraga & Ricardo A. Daziano, 2025. "Designing Graph Convolutional Neural Networks for Discrete Choice with Network Effects," Papers 2503.09786, arXiv.org.
    6. Batram, Manuel & Bauer, Dietmar, 2019. "On consistency of the MACML approach to discrete choice modelling," Journal of choice modelling, Elsevier, vol. 30(C), pages 1-16.
    7. Bhat, Chandra R. & Astroza, Sebastian & Hamdi, Amin S., 2017. "A spatial generalized ordered-response model with skew normal kernel error terms with an application to bicycling frequency," Transportation Research Part B: Methodological, Elsevier, vol. 95(C), pages 126-148.
    8. Enam, Annesha & Konduri, Karthik C. & Pinjari, Abdul R. & Eluru, Naveen, 2018. "An integrated choice and latent variable model for multiple discrete continuous choice kernels: Application exploring the association between day level moods and discretionary activity engagement choi," Journal of choice modelling, Elsevier, vol. 26(C), pages 80-100.
    9. Paleti, Rajesh, 2018. "Generalized multinomial probit Model: Accommodating constrained random parameters," Transportation Research Part B: Methodological, Elsevier, vol. 118(C), pages 248-262.
    10. Daziano, Ricardo A., 2015. "Inference on mode preferences, vehicle purchases, and the energy paradox using a Bayesian structural choice model," Transportation Research Part B: Methodological, Elsevier, vol. 76(C), pages 1-26.
    11. Chandra R. Bhat & Subodh K. Dubey & Mohammad Jobair Bin Alam & Waleed H. Khushefati, 2015. "A New Spatial Multiple Discrete-Continuous Modeling Approach To Land Use Change Analysis," Journal of Regional Science, Wiley Blackwell, vol. 55(5), pages 801-841, November.
    12. Ricardo A. Daziano & Martin Achtnicht, 2014. "Forecasting Adoption of Ultra-Low-Emission Vehicles Using Bayes Estimates of a Multinomial Probit Model and the GHK Simulator," Transportation Science, INFORMS, vol. 48(4), pages 671-683, November.
    13. Tinessa, Fiore & Marzano, Vittorio & Papola, Andrea, 2020. "Mixing distributions of tastes with a Combination of Nested Logit (CoNL) kernel: Formulation and performance analysis," Transportation Research Part B: Methodological, Elsevier, vol. 141(C), pages 1-23.
    14. Bhat, Chandra R., 2011. "The maximum approximate composite marginal likelihood (MACML) estimation of multinomial probit-based unordered response choice models," Transportation Research Part B: Methodological, Elsevier, vol. 45(7), pages 923-939, August.
    15. Rossetti, Tomás & Daziano, Ricardo A., 2024. "Crowding multipliers on shared transportation in New York City: The effects of COVID-19 and implications for a sustainable future," Transport Policy, Elsevier, vol. 145(C), pages 224-236.
    16. Akinc, Deniz & Vandebroek, Martina, 2018. "Bayesian estimation of mixed logit models: Selecting an appropriate prior for the covariance matrix," Journal of choice modelling, Elsevier, vol. 29(C), pages 133-151.
    17. Oyama, Yuki & Murakami, Daisuke & Krueger, Rico, 2024. "A hierarchical Bayesian logit model for spatial multivariate choice data," Journal of choice modelling, Elsevier, vol. 52(C).
    18. Patil, Priyadarshan N. & Dubey, Subodh K. & Pinjari, Abdul R. & Cherchi, Elisabetta & Daziano, Ricardo & Bhat, Chandra R., 2017. "Simulation evaluation of emerging estimation techniques for multinomial probit models," Journal of choice modelling, Elsevier, vol. 23(C), pages 9-20.
    19. Akshay Vij & Rico Krueger, 2018. "Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions," Papers 1802.02299, arXiv.org.
    20. Chandra R. Bhat & Patrícia S. Lavieri, 2018. "A new mixed MNP model accommodating a variety of dependent non-normal coefficient distributions," Theory and Decision, Springer, vol. 84(2), pages 239-275, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transb:v:191:y:2025:i:c:s019126152400256x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/548/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.