IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2101.10261.html
   My bibliography  Save this paper

Discrete Choice Analysis with Machine Learning Capabilities

Author

Listed:
  • Youssef M. Aboutaleb
  • Mazen Danaf
  • Yifei Xie
  • Moshe Ben-Akiva

Abstract

This paper discusses capabilities that are essential to models applied in policy analysis settings and the limitations of direct applications of off-the-shelf machine learning methodologies to such settings. Traditional econometric methodologies for building discrete choice models for policy analysis involve combining data with modeling assumptions guided by subject-matter considerations. Such considerations are typically most useful in specifying the systematic component of random utility discrete choice models but are typically of limited aid in determining the form of the random component. We identify an area where machine learning paradigms can be leveraged, namely in specifying and systematically selecting the best specification of the random component of the utility equations. We review two recent novel applications where mixed-integer optimization and cross-validation are used to algorithmically select optimal specifications for the random utility components of nested logit and logit mixture models subject to interpretability constraints.

Suggested Citation

  • Youssef M. Aboutaleb & Mazen Danaf & Yifei Xie & Moshe Ben-Akiva, 2021. "Discrete Choice Analysis with Machine Learning Capabilities," Papers 2101.10261, arXiv.org.
  • Handle: RePEc:arx:papers:2101.10261
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2101.10261
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Charles F. Manski, 2013. "Response to the Review of ‘Public Policy in an Uncertain World’," Economic Journal, Royal Economic Society, vol. 0, pages 412-415, August.
    2. Charles F. Manski, 1993. "Identification of Endogenous Social Effects: The Reflection Problem," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 60(3), pages 531-542.
    3. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555.
    4. Manski, Charles F., 2013. "Public Policy in an Uncertain World: Analysis and Decisions," Economics Books, Harvard University Press, number 9780674066892, Spring.
    5. Hess, Stephane & Train, Kenneth, 2017. "Correlation and scale in mixed logit models," Journal of choice modelling, Elsevier, vol. 23(C), pages 1-8.
    6. Lhéritier, Alix & Bocamazo, Michael & Delahaye, Thierry & Acuna-Agost, Rodrigo, 2019. "Airline itinerary choice modeling using machine learning," Journal of choice modelling, Elsevier, vol. 31(C), pages 198-209.
    7. Braun, Michael & McAuliffe, Jon, 2010. "Variational Inference for Large-Scale Models of Discrete Choice," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 324-335.
    8. Varian, H.R., 1993. "What Use is Economic Theory?," Papers 93-14, Michigan - Center for Research on Economic & Social Theory.
    9. Nicolas Depraetere & Martina Vandebroek, 2017. "A comparison of variational approximations for fast inference in mixed logit models," Computational Statistics, Springer, vol. 32(1), pages 93-125, March.
    10. Prateek Bansal & Rico Krueger & Michel Bierlaire & Ricardo A. Daziano & Taha H. Rashidi, 2019. "Bayesian Estimation of Mixed Multinomial Logit Models: Advances and Simulation-Based Evaluations," Papers 1904.03647, arXiv.org, revised Dec 2019.
    11. Jonathan James, 2018. "Estimation of Factor Structured Covariance Mixed Logit Models," Working Papers 1802, California Polytechnic State University, Department of Economics.
    12. Yves Bentz & Dwight Merunka, 2000. "Neural networks and the multinomial logit for brand choice modelling: a hybrid approach," Post-Print hal-01822273, HAL.
    13. James, Jonathan, 2018. "Estimation of factor structured covariance mixed logit models," Journal of choice modelling, Elsevier, vol. 28(C), pages 41-55.
    14. Youssef M Aboutaleb & Mazen Danaf & Yifei Xie & Moshe E Ben-Akiva, 2021. "Sparse covariance estimation in logit mixture models," The Econometrics Journal, Royal Economic Society, vol. 24(3), pages 377-398.
    15. Michael Keane & Nada Wasi, 2013. "Comparing Alternative Models Of Heterogeneity In Consumer Choice Behavior," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 28(6), pages 1018-1045, September.
    16. Susan Athey, 2018. "The Impact of Machine Learning on Economics," NBER Chapters, in: The Economics of Artificial Intelligence: An Agenda, pages 507-547, National Bureau of Economic Research, Inc.
    17. Allenby, Greg M. & Rossi, Peter E., 1998. "Marketing models of consumer heterogeneity," Journal of Econometrics, Elsevier, vol. 89(1-2), pages 57-78, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ali, Azam & Kalatian, Arash & Choudhury, Charisma F., 2023. "Comparing and contrasting choice model and machine learning techniques in the context of vehicle ownership decisions," Transportation Research Part A: Policy and Practice, Elsevier, vol. 173(C).
    2. Hamed Naseri & Edward Owen Douglas Waygood & Bobin Wang & Zachary Patterson, 2022. "Application of Machine Learning to Child Mode Choice with a Novel Technique to Optimize Hyperparameters," IJERPH, MDPI, vol. 19(24), pages 1-19, December.
    3. Dubey, Subodh & Cats, Oded & Hoogendoorn, Serge & Bansal, Prateek, 2022. "A multinomial probit model with Choquet integral and attribute cut-offs," Transportation Research Part B: Methodological, Elsevier, vol. 158(C), pages 140-163.
    4. Smeele, Nicholas V.R. & Chorus, Caspar G. & Schermer, Maartje H.N. & de Bekker-Grob, Esther W., 2023. "Towards machine learning for moral choice analysis in health economics: A literature review and research agenda," Social Science & Medicine, Elsevier, vol. 326(C).
    5. John V. Colias & Stella Park & Elizabeth Horn, 2021. "Optimizing B2B product offers with machine learning, mixed logit, and nonlinear programming," Journal of Marketing Analytics, Palgrave Macmillan, vol. 9(3), pages 157-172, September.
    6. John V. Colias & Stella Park & Elizabeth Horn, 2023. "Optimizing B2B Product Offers with Machine Learning, Mixed Logit, and Nonlinear Programming," Papers 2308.07830, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Krueger, Rico & Rashidi, Taha H. & Vij, Akshay, 2020. "A Dirichlet process mixture model of discrete choice: Comparisons and a case study on preferences for shared automated vehicles," Journal of choice modelling, Elsevier, vol. 36(C).
    2. Youssef M Aboutaleb & Mazen Danaf & Yifei Xie & Moshe Ben-Akiva, 2020. "Sparse Covariance Estimation in Logit Mixture Models," Papers 2001.05034, arXiv.org.
    3. Rico Krueger & Taha H. Rashidi & Akshay Vij, 2019. "Semi-Parametric Hierarchical Bayes Estimates of New Yorkers' Willingness to Pay for Features of Shared Automated Vehicle Services," Papers 1907.09639, arXiv.org.
    4. Akshay Vij & Rico Krueger, 2018. "Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions," Papers 1802.02299, arXiv.org.
    5. Coote, Leonard V. & Swait, Joffre & Adamowicz, Wiktor, 2021. "Separating generalizable from source-specific preference heterogeneity in the fusion of revealed and stated preferences," Journal of choice modelling, Elsevier, vol. 40(C).
    6. Vij, Akshay & Krueger, Rico, 2017. "Random taste heterogeneity in discrete choice models: Flexible nonparametric finite mixture distributions," Transportation Research Part B: Methodological, Elsevier, vol. 106(C), pages 76-101.
    7. Rico Krueger & Prateek Bansal & Michel Bierlaire & Ricardo A. Daziano & Taha H. Rashidi, 2019. "Variational Bayesian Inference for Mixed Logit Models with Unobserved Inter- and Intra-Individual Heterogeneity," Papers 1905.00419, arXiv.org, revised Jan 2020.
    8. Loaiza-Maya, Rubén & Smith, Michael Stanley & Nott, David J. & Danaher, Peter J., 2022. "Fast and accurate variational inference for models with many latent variables," Journal of Econometrics, Elsevier, vol. 230(2), pages 339-362.
    9. Michael P. Keane & Nada Wasi, 2013. "The Structure of Consumer Taste Heterogeneity in Revealed vs. Stated Preference Data," Economics Papers 2013-W10, Economics Group, Nuffield College, University of Oxford.
    10. Guido W. Imbens, 2020. "Potential Outcome and Directed Acyclic Graph Approaches to Causality: Relevance for Empirical Practice in Economics," Journal of Economic Literature, American Economic Association, vol. 58(4), pages 1129-1179, December.
    11. Wendong Zhang & Brent Sohngen, 2018. "Do U.S. Anglers Care about Harmful Algal Blooms? A Discrete Choice Experiment of Lake Erie Recreational Anglers," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 100(3), pages 868-888.
    12. Ali, Azam & Kalatian, Arash & Choudhury, Charisma F., 2023. "Comparing and contrasting choice model and machine learning techniques in the context of vehicle ownership decisions," Transportation Research Part A: Policy and Practice, Elsevier, vol. 173(C).
    13. Susan Athey & Guido W. Imbens, 2017. "The State of Applied Econometrics: Causality and Policy Evaluation," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 3-32, Spring.
    14. Rico Krueger & Akshay Vij & Taha H. Rashidi, 2018. "A Dirichlet Process Mixture Model of Discrete Choice," Papers 1801.06296, arXiv.org.
    15. Dionissi Aliprantis, 2011. "Assessing the evidence on neighborhood effects from moving to opportunity," Working Papers (Old Series) 1101, Federal Reserve Bank of Cleveland.
    16. Smeele, Nicholas V.R. & Chorus, Caspar G. & Schermer, Maartje H.N. & de Bekker-Grob, Esther W., 2023. "Towards machine learning for moral choice analysis in health economics: A literature review and research agenda," Social Science & Medicine, Elsevier, vol. 326(C).
    17. Susan Athey & Guido Imbens, 2016. "The Econometrics of Randomized Experiments," Papers 1607.00698, arXiv.org.
    18. Beeramoole, Prithvi Bhat & Arteaga, Cristian & Pinz, Alban & Haque, Md Mazharul & Paz, Alexander, 2023. "Extensive hypothesis testing for estimation of mixed-Logit models," Journal of choice modelling, Elsevier, vol. 47(C).
    19. Sarrias, Mauricio, 2020. "Individual-specific posterior distributions from Mixed Logit models: Properties, limitations and diagnostic checks," Journal of choice modelling, Elsevier, vol. 36(C).
    20. Bansal, Prateek & Krueger, Rico & Bierlaire, Michel & Daziano, Ricardo A. & Rashidi, Taha H., 2020. "Bayesian estimation of mixed multinomial logit models: Advances and simulation-based evaluations," Transportation Research Part B: Methodological, Elsevier, vol. 131(C), pages 124-142.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2101.10261. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.