Deep neural networks for choice analysis: A statistical learning theory perspective

My bibliography Save this article

Deep neural networks for choice analysis: A statistical learning theory perspective

Author

Listed:

Wang, Shenhao
Wang, Qingyi
Bailey, Nate
Zhao, Jinhua

Registered:

Abstract

Although researchers increasingly use deep neural networks (DNN) to analyze individual choices, overfitting and interpretability issues remain obstacles in theory and practice. This study presents a statistical learning theoretical framework to examine the tradeoff between estimation and approximation errors, and between the quality of prediction and of interpretation. It provides an upper bound on the estimation error of the prediction quality in DNN, measured by zero-one and log losses, shedding light on why DNN models do not overfit. It proposes a metric for interpretation quality by formulating a function approximation loss that measures the difference between true and estimated choice probability functions. It argues that the binary logit (BNL) and multinomial logit (MNL) models are the specific cases of DNNs, since the latter always has smaller approximation errors. We explore the relative performance of DNN and classical choice models through three simulation scenarios comparing DNN, BNL, and binary mixed logit models (BXL), as well as one experiment comparing DNN to BNL, BXL, MNL, and mixed logit (MXL) in analyzing the choice of trip purposes based on the National Household Travel Survey 2017. The results indicate that DNN can be used for choice analysis beyond the current practice of demand forecasting because it has the inherent utility interpretation and the power of automatically learning utility specification. Our results suggest DNN outperforms BNL, BXL, MNL, and MXL models in both prediction and interpretation when the sample size is large (≥O(104)), the input dimension is high, or the true data generating process is complex, while performing worse when the opposite is true. DNN outperforms BNL and BXL in zero-one, log, and approximation losses for most of the experiments, and the larger sample size leads to greater incremental value of using DNN over classical discrete choice models. Overall, this study introduces the statistical learning theory as a new foundation for high-dimensional data, complex statistical models, and non-asymptotic data regimes in choice analysis, and the experiments show the effective prediction and interpretation of DNN for its applications to policy and behavioral analysis.

Suggested Citation

Wang, Shenhao & Wang, Qingyi & Bailey, Nate & Zhao, Jinhua, 2021. "Deep neural networks for choice analysis: A statistical learning theory perspective," Transportation Research Part B: Methodological, Elsevier, vol. 148(C), pages 60-81.

Handle: RePEc:eee:transb:v:148:y:2021:i:c:p:60-81
DOI: 10.1016/j.trb.2021.03.011

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Edward L. Glaeser & Scott Duke Kominers & Michael Luca & Nikhil Naik, 2018. "Big Data And Big Cities: The Promises And Limitations Of Improved Measures Of Urban Life," Economic Inquiry, Western Economic Association International, vol. 56(1), pages 114-137, January.
- Edward L. Glaeser & Scott Duke Kominers & Michael Luca & Nikhil Naik, 2015. "Big Data and Big Cities: The Promises and Limitations of Improved Measures of Urban Life," NBER Working Papers 21778, National Bureau of Economic Research, Inc.
- Glaeser, Edward L. & Kominers, Scott Duke & Luca, Michael & Naik, Nikhil, 2015. "Big Data and Big Cities: The Promises and Limitations of Improved Measures for Urban Life," Working Paper Series 15-075, Harvard University, John F. Kennedy School of Government.
- Edward L. Glaeser & Scott Duke Kominers & Michael Luca & Nikhil Naik, 2015. "Big Data and Big Cities: The Promises and Limitations of Improved Measures of Urban Life," Harvard Business School Working Papers 16-065, Harvard Business School.
Allahviranloo, Mahdieh & Recker, Will, 2013. "Daily activity pattern recognition by using support vector machines with multiple classes," Transportation Research Part B: Methodological, Elsevier, vol. 58(C), pages 16-43.
Jonathan Cohen & Keith Marzilli Ericson & David Laibson & John Myles White, 2020. "Measuring Time Preferences," Journal of Economic Literature, American Economic Association, vol. 58(2), pages 299-347, June.
- Jonathan D. Cohen & Keith Marzilli Ericson & David Laibson & John Myles White, 2016. "Measuring Time Preferences," NBER Working Papers 22455, National Bureau of Economic Research, Inc.
Hensher, David A. & Ton, Tu T., 2000. "A comparison of the predictive potential of artificial neural networks and nested logit models for commuter mode choice," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 36(3), pages 155-172, September.
Kenneth Train, 1980. "A Structured Logit Model of Auto Ownership and Mode Choice," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 47(2), pages 357-370.
Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, January.
- Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521747387.
- Kenneth Train, 2003. "Discrete Choice Methods with Simulation," Online economics textbooks, SUNY-Oswego, Department of Economics, number emetr2.
Bartlett, Peter L. & Jordan, Michael I. & McAuliffe, Jon D., 2006. "Convexity, Classification, and Risk Bounds," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 138-156, March.
Yves Bentz & Dwight Merunka, 2000. "Neural networks and the multinomial logit for brand choice modelling: a hybrid approach," Post-Print hal-01822273, HAL.
Wang, Shenhao & Wang, Qingyi & Zhao, Jinhua, 2020. "Multitask learning deep neural networks to combine revealed and stated preference data," Journal of choice modelling, Elsevier, vol. 37(C).
Dong, Chunjiao & Shao, Chunfu & Clarke, David B. & Nambisan, Shashi S., 2018. "An innovative approach for traffic crash estimation and prediction on accommodating unobserved heterogeneities," Transportation Research Part B: Methodological, Elsevier, vol. 118(C), pages 407-428.
Sendhil Mullainathan & Jann Spiess, 2017. "Machine Learning: An Applied Econometric Approach," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 87-106, Spring.
Mozolin, M. & Thill, J. -C. & Lynn Usery, E., 2000. "Trip distribution forecasting with multilayer perceptron neural networks: A critical evaluation," Transportation Research Part B: Methodological, Elsevier, vol. 34(1), pages 53-73, January.
Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
Liang Tang & Chenfeng Xiong & Lei Zhang, 2015. "Decision tree method for modeling travel mode switching in a dynamic behavioral process," Transportation Planning and Technology, Taylor & Francis Journals, vol. 38(8), pages 833-850, December.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Smeele, Nicholas V.R. & Chorus, Caspar G. & Schermer, Maartje H.N. & de Bekker-Grob, Esther W., 2023. "Towards machine learning for moral choice analysis in health economics: A literature review and research agenda," Social Science & Medicine, Elsevier, vol. 326(C).
Dubey, Subodh & Cats, Oded & Hoogendoorn, Serge & Bansal, Prateek, 2022. "A multinomial probit model with Choquet integral and attribute cut-offs," Transportation Research Part B: Methodological, Elsevier, vol. 158(C), pages 140-163.
Qingyi Wang & Shenhao Wang & Yunhan Zheng & Hongzhou Lin & Xiaohu Zhang & Jinhua Zhao & Joan Walker, 2023. "Deep hybrid model with satellite imagery: how to combine demand modeling and computer vision for behavior analysis?," Papers 2303.04204, arXiv.org, revised Feb 2024.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Wang, Shenhao & Mo, Baichuan & Zhao, Jinhua, 2021. "Theory-based residual neural networks: A synergy of discrete choice models and deep neural networks," Transportation Research Part B: Methodological, Elsevier, vol. 146(C), pages 333-358.
Shenhao Wang & Baichuan Mo & Jinhua Zhao, 2020. "Theory-based residual neural networks: A synergy of discrete choice models and deep neural networks," Papers 2010.11644, arXiv.org.
Shenhao Wang & Baichuan Mo & Stephane Hess & Jinhua Zhao, 2021. "Comparing hundreds of machine learning classifiers and discrete choice models in predicting travel behavior: an empirical benchmark," Papers 2102.01130, arXiv.org.
Wang, Shenhao & Wang, Qingyi & Zhao, Jinhua, 2020. "Multitask learning deep neural networks to combine revealed and stated preference data," Journal of choice modelling, Elsevier, vol. 37(C).
Shenhao Wang & Qingyi Wang & Nate Bailey & Jinhua Zhao, 2018. "Deep Neural Networks for Choice Analysis: A Statistical Learning Theory Perspective," Papers 1810.10465, arXiv.org, revised Sep 2019.
Salon, Deborah, 2009. "Neighborhoods, cars, and commuting in New York City: A discrete choice approach," Transportation Research Part A: Policy and Practice, Elsevier, vol. 43(2), pages 180-196, February.
Ali, Azam & Kalatian, Arash & Choudhury, Charisma F., 2023. "Comparing and contrasting choice model and machine learning techniques in the context of vehicle ownership decisions," Transportation Research Part A: Policy and Practice, Elsevier, vol. 173(C).
Sfeir, Georges & Abou-Zeid, Maya & Rodrigues, Filipe & Pereira, Francisco Camara & Kaysi, Isam, 2021. "Latent class choice model with a flexible class membership component: A mixture model approach," Journal of choice modelling, Elsevier, vol. 41(C).
Han, Yafei & Pereira, Francisco Camara & Ben-Akiva, Moshe & Zegras, Christopher, 2022. "A neural-embedded discrete choice model: Learning taste representation with strengthened interpretability," Transportation Research Part B: Methodological, Elsevier, vol. 163(C), pages 166-186.
Smeele, Nicholas V.R. & Chorus, Caspar G. & Schermer, Maartje H.N. & de Bekker-Grob, Esther W., 2023. "Towards machine learning for moral choice analysis in health economics: A literature review and research agenda," Social Science & Medicine, Elsevier, vol. 326(C).
Domenico Piccolo & Rosaria Simone, 2019. "The class of cub models: statistical foundations, inferential issues and empirical evidence," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(3), pages 389-435, September.
Gelhausen, Marc Christopher, 2007. "A Generalized Neural Logit Model for Airport and Access Mode Choice in Germany," MPRA Paper 4313, University Library of Munich, Germany, revised 2007.
Lahiri, Kajal & Yang, Liu, 2013. "Forecasting Binary Outcomes," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1025-1106, Elsevier.
- Kajal Lahiri & Liu Yang, 2012. "Forecasting Binary Outcomes," Discussion Papers 12-09, University at Albany, SUNY, Department of Economics.
Ioanna Arkoudi & Carlos Lima Azevedo & Francisco C. Pereira, 2021. "Combining Discrete Choice Models and Neural Networks through Embeddings: Formulation, Interpretability and Performance," Papers 2109.12042, arXiv.org, revised Sep 2021.
Tranos, Emmanouil & Incera, Andre Carrascal & Willis, George, 2022. "Using the web to predict regional trade flows: data extraction, modelling, and validation," OSF Preprints 9bu5z, Center for Open Science.
Galdo, Virgilio & Li, Yue & Rama, Martin, 2021. "Identifying urban areas by combining human judgment and machine learning: An application to India," Journal of Urban Economics, Elsevier, vol. 125(C).
- Galdo,Virgilio & Li,Yue-000316086 & Rama,Martin G., 2020. "Identifying Urban Areas by Combining Human Judgment and Machine Learning : An Application to India," Policy Research Working Paper Series 0160, The World Bank.
Prithwiraj Choudhury & Dan Wang & Natalie A. Carlson & Tarun Khanna, 2019. "Machine learning approaches to facial and text analysis: Discovering CEO oral communication styles," Strategic Management Journal, Wiley Blackwell, vol. 40(11), pages 1705-1732, November.
Breunig, Christoph & Grabova, Iuliia & Haan, Peter & Weinhardt, Felix & Weizsäcker, Georg, 2021. "Long-run expectations of households," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 31, pages 1-1.
- Breunig, Christoph & Grabova, Iuliia & Haan, Peter & Weinhardt, Felix & Weizsäcker, Georg, 2021. "Long-run expectations of households," Journal of Behavioral and Experimental Finance, Elsevier, vol. 31(C).
- Breunig, Christoph & Grabova, Iuliia & Haan, Peter & Weinhardt, Felix & Weizsäcker, Georg, 2019. "Long-run Expectations of Households," Rationality and Competition Discussion Paper Series 218, CRC TRR 190 Rationality and Competition.
Shenhao Wang & Qingyi Wang & Jinhua Zhao, 2019. "Multitask Learning Deep Neural Networks to Combine Revealed and Stated Preference Data," Papers 1901.00227, arXiv.org, revised Aug 2019.
Christensen, Peter & Osman, Adam, 2021. "The Demand for Mobility: Evidence from an Experiment with Uber Riders," IZA Discussion Papers 14179, Institute of Labor Economics (IZA).

More about this item

Keywords

Deep neural networks; Choice modeling; Statistical learning theory; Interpretability;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:transb:v:148:y:2021:i:c:p:60-81. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/548/description#description .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Deep neural networks for choice analysis: A statistical learning theory perspective

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data