IDEAS home Printed from https://ideas.repec.org/a/eee/eejocm/v28y2018icp167-182.html
   My bibliography  Save this article

Is your dataset big enough? Sample size requirements when using artificial neural networks for discrete choice analysis

Author

Listed:
  • Alwosheel, Ahmad
  • van Cranenburgh, Sander
  • Chorus, Caspar G.

Abstract

Artificial Neural Networks (ANNs) are increasingly used for discrete choice analysis. But, at present, it is unknown what sample size requirements are appropriate when using ANNs in this particular context. This paper fills this knowledge gap: we empirically establish a rule-of-thumb for ANN-based discrete choice analysis based on analyses of synthetic and real data. To investigate the effect of complexity of the data generating process on the minimum required sample size, we conduct extensive Monte Carlo analyses using a series of different model specifications with different levels of model complexity, including RUM and RRM models, with and without random taste parameters. Based on our analyses we advise to use a minimum sample size of fifty times the number of weights in the ANN; it should be noted, that the number of weights is generally much larger than the number of parameters in a discrete choice model. This rule-of-thumb is considerably more conservative than the rule-of-thumb that is most often used in the ANN community, which advises to use at least ten times the number of weights.

Suggested Citation

  • Alwosheel, Ahmad & van Cranenburgh, Sander & Chorus, Caspar G., 2018. "Is your dataset big enough? Sample size requirements when using artificial neural networks for discrete choice analysis," Journal of choice modelling, Elsevier, vol. 28(C), pages 167-182.
  • Handle: RePEc:eee:eejocm:v:28:y:2018:i:c:p:167-182
    DOI: 10.1016/j.jocm.2018.07.002
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1755534518300058
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jocm.2018.07.002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. McFadden, Daniel L., 1984. "Econometric analysis of qualitative response models," Handbook of Econometrics, in: Z. Griliches† & M. D. Intriligator (ed.), Handbook of Econometrics, edition 1, volume 2, chapter 24, pages 1395-1457, Elsevier.
    2. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, January.
    3. Huber, Joel & Train, Kenneth, 2000. "On the Similarity of Classical and Bayesian Estimates of Individual Mean Partworths," Department of Economics, Working Paper Series qt7zm4f51b, Department of Economics, Institute for Business and Economic Research, UC Berkeley.
    4. van Cranenburgh, Sander & Guevara, Cristian Angelo & Chorus, Caspar G., 2015. "New insights on random regret minimization models," Transportation Research Part A: Policy and Practice, Elsevier, vol. 74(C), pages 91-109.
    5. Davide Castelvecchi, 2016. "Can we open the black box of AI?," Nature, Nature, vol. 538(7623), pages 20-23, October.
    6. Caspar Chorus & Michel Bierlaire, 2013. "An empirical comparison of travel choice models that capture preferences for compromise alternatives," Transportation, Springer, vol. 40(3), pages 549-562, May.
    7. Hensher, David A. & Ton, Tu T., 2000. "A comparison of the predictive potential of artificial neural networks and nested logit models for commuter mode choice," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 36(3), pages 155-172, September.
    8. John Rose & Michiel Bliemer, 2013. "Sample size requirements for stated choice experiments," Transportation, Springer, vol. 40(5), pages 1021-1041, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nova, Gabriel & Guevara, C. Angelo & Hess, Stephane & Hancock, Thomas O., 2025. "A random utility maximisation model considering the information search process," Transportation Research Part B: Methodological, Elsevier, vol. 199(C).
    2. van Cranenburgh, Sander & Collins, Andrew T., 2019. "New software tools for creating stated choice experimental designs efficient for regret minimisation and utility maximisation decision rules," Journal of choice modelling, Elsevier, vol. 31(C), pages 104-123.
    3. Boeri, Marco & Scarpa, Riccardo & Chorus, Caspar G., 2014. "Stated choices and benefit estimates in the context of traffic calming schemes: Utility maximization, regret minimization, or both?," Transportation Research Part A: Policy and Practice, Elsevier, vol. 61(C), pages 121-135.
    4. Jang, Sunghoon & Hong, Doosun & Kim, Youngho & Lee, Chungwon, 2025. "Investigation of preferences for ride-sourcing taxis: Compromise effect and regret-based latent class modeling," Transportation Research Part A: Policy and Practice, Elsevier, vol. 192(C).
    5. John Buckell & Vrinda Vasavada & Sarah Wordsworth & Dean A. Regier & Matthew Quaife, 2022. "Utility maximization versus regret minimization in health choice behavior: Evidence from four datasets," Health Economics, John Wiley & Sons, Ltd., vol. 31(2), pages 363-381, February.
    6. Bibhuti Sharma & Mark Hickman & Neema Nassir, 2019. "Park-and-ride lot choice model using random utility maximization and random regret minimization," Transportation, Springer, vol. 46(1), pages 217-232, February.
    7. Abdurrahman B. Aydemir & Erkan Duman, 2021. "Migrant Networks and Destination Choice: Evidence from Moves across Turkish Provinces," Koç University-TUSIAD Economic Research Forum Working Papers 2109, Koc University-TUSIAD Economic Research Forum.
    8. Chorus, Caspar & van Cranenburgh, Sander & Daniel, Aemiro Melkamu & Sandorf, Erlend Dancke & Sobhani, Anae & Szép, Teodóra, 2021. "Obfuscation maximization-based decision-making: Theory, methodology and first empirical evidence," Mathematical Social Sciences, Elsevier, vol. 109(C), pages 28-44.
    9. Mittelhammer, Ron C. & Judge, George, 2011. "A family of empirical likelihood functions and estimators for the binary response model," Journal of Econometrics, Elsevier, vol. 164(2), pages 207-217, October.
    10. Kanchanaroek, Yingluk & Termansen, Mette & Quinn, Claire, 2013. "Property rights regimes in complex fishery management systems: A choice experiment application," Ecological Economics, Elsevier, vol. 93(C), pages 363-373.
    11. Villas-Boas, Sofia B & Taylor, Rebecca & Krovetz, Hannah, 2016. "Willingness to Pay for Low Water Footprint Food Choices During Drought," Department of Agricultural & Resource Economics, UC Berkeley, Working Paper Series qt9vh3x180, Department of Agricultural & Resource Economics, UC Berkeley.
    12. Kim, Junghun & Seung, Hyunchan & Lee, Jongsu & Ahn, Joongha, 2020. "Asymmetric preference and loss aversion for electric vehicles: The reference-dependent choice model capturing different preference directions," Energy Economics, Elsevier, vol. 86(C).
    13. Michel Beine & Marco Delogu & Lionel Ragot, 2017. "The Role of Fees in Foreign Education: Evidence From Italy and the United Kingdom," Working Papers 2017-04, CEPII research center.
    14. Schaefer, Thilo & Peichl, Andreas, 2006. "Documentation FiFoSiM: integrated tax benefit microsimulation and CGE model," FiFo Discussion Papers - Finanzwissenschaftliche Diskussionsbeiträge 06-10, University of Cologne, FiFo Institute for Public Economics.
    15. Daniele Pacifico, 2012. "Fitting nonparametric mixed logit models via expectation-maximization algorithm," Stata Journal, StataCorp LLC, vol. 12(2), pages 284-298, June.
    16. Erlend Dancke Sandorf & Danny Campbell & Caspar Chorus, 2022. "A simple satisficing model," PLOS ONE, Public Library of Science, vol. 17(10), pages 1-30, October.
    17. Peng Jing & Mengxuan Zhao & Meiling He & Long Chen, 2018. "Travel Mode and Travel Route Choice Behavior Based on Random Regret Minimization: A Systematic Review," Sustainability, MDPI, vol. 10(4), pages 1-20, April.
    18. Liu, Gang, 2007. "A behavioral model of work-trip mode choice in Shanghai," China Economic Review, Elsevier, vol. 18(4), pages 456-476.
    19. Byun, Hyunsuk & Shin, Jungwoo & Lee, Chul-Yong, 2018. "Using a discrete choice experiment to predict the penetration possibility of environmentally friendly vehicles," Energy, Elsevier, vol. 144(C), pages 312-321.
    20. Vidale, E & Pettenella, D & Gatto, P & Secco, L, 23. "What can we sell behind timber production?," Scandinavian Forest Economics: Proceedings of the Biennial Meeting of the Scandinavian Society of Forest Economics, Scandinavian Society of Forest Economics, issue 44, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:eejocm:v:28:y:2018:i:c:p:167-182. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/journal-of-choice-modelling .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.