Learning In Random Utility Models Via Online Decision Problems

My bibliography Save this paper

Learning In Random Utility Models Via Online Decision Problems

Author

Listed:

Emerson Melo
(Indiana University, Bloomington)

Registered:

Abstract

This paper studies the Random Utility Model (RUM) in environments where the decision maker is imperfectly informed about the payoffs associated to each of the alternatives he faces. By embedding the RUM into an online decision problem, we make four contributions. First, we propose a gradient-based learning algorithm and show that a large class of RUMs are Hannan consistent (Hannan [1957]); that is, the average difference between the expected payoffs generated by a RUM and that of the best ?xed policy in hindsight goes to zero as the number of periods increase. Second, we show that the class of Generalized Extreme Value (GEV) models can be implemented with our learning algorithm. Examples in the GEV class include the Nested Logit, Ordered, and Product Differentiation models among many others. Third, we show that our gradient-based algorithm is the dual, in a convex analysis sense, of the Follow the Regularized Leader (FTRL) algorithm, which is widely used in the Machine Learning literature. Finally, we discuss how our approach can incorporate recency bias and be used to implement prediction markets in general environments.javascript:void(0);

Suggested Citation

Emerson Melo, 2021. "Learning In Random Utility Models Via Online Decision Problems," CAEPR Working Papers 2022-003 Classification-D, Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington.

Handle: RePEc:inu:caeprp:2022003

Download full text from publisher

References listed on IDEAS

Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Small, Kenneth A, 1987. "A Discrete Choice Model for Ordered Alternatives," Econometrica, Econometric Society, vol. 55(2), pages 409-424, March.
Filip Matêjka & Alisdair McKay, 2015. "Rational Inattention to Discrete Choices: A New Foundation for the Multinomial Logit Model," American Economic Review, American Economic Association, vol. 105(1), pages 272-298, January.
- Filip Matejka & Alisdair McKay, 2011. "Rational Inattention to Discrete Choices: A New Foundation for the Multinomial Logit Model," CERGE-EI Working Papers wp442, The Center for Economic Research and Graduate Education - Economics Institute, Prague.
- Alisdair McKay & Filip Matejka, 2011. "Rational Inattention to Discrete Choices: A New Foundation for the Multinomial Logit Model," 2011 Meeting Papers 535, Society for Economic Dynamics.
- Alisdair McKay & Filip Matejka, 2011. "Rational Inattention to Discrete Choices: A New Foundation for the Multinomial Logit Model," Boston University - Department of Economics - Working Papers Series WP2011-026, Boston University - Department of Economics.
S. Cerreia-Vioglio & F. Maccheroni & M. Marinacci & A. Rustichini, 2017. "Multinomial logit processes and preference discovery: inside and outside the black box," Working Papers 615, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
- Simone Cerreia-Vioglio & Fabio Maccheroni & Massimo Marinacci & Aldo Rustichini, 2020. "Multinomial logit processes and preference discovery: inside and outside the black box," Papers 2004.13376, arXiv.org, revised Jan 2021.
Foster, Dean P. & Vohra, Rakesh, 1999. "Regret in the On-Line Decision Problem," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 7-35, October.
Mogens Fosgerau & Emerson Melo & André de Palma & Matthew Shum, 2020. "Discrete Choice And Rational Inattention: A General Equivalence Result," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 61(4), pages 1569-1589, November.
- Mogens Fosgerau & Emerson Melo & André de Palma & Matthew Shum, 2017. "Discrete Choice and Rational Inattention: a General Equivalence Result," Discussion Papers 17-26, University of Copenhagen. Department of Economics.
- Mogens Fosgerau & Emerson Melo & Andre de Palma & Matthew Shum, 2017. "Discrete Choice and Rational Inattention: a General Equivalence Result," Papers 1709.09117, arXiv.org.
- Mogens Fosgerau & Emerson Melo & André de Palma & Matthew Shum, 2017. "Discrete Choice and Rational Inattention: A General Equivalence Result," Working Papers hal-01501313, HAL.
Fudenberg, Drew & Levine, David K., 1995. "Consistency and cautious fictitious play," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 1065-1089.
- Fudenberg, Drew & Levine, David, 1995. "Consistency and Cautious Fictitious Play," Scholarly Articles 3198694, Harvard University Department of Economics.
- Drew Fudenberg & David K. Levine, 1996. "Consistency and Cautious Fictitious Play," Levine's Working Paper Archive 470, David K. Levine.
Fudenberg, Drew & Levine, David K., 2014. "Recency, Consistent Learning, and Nash Equilibrium," Scholarly Articles 13477947, Harvard University Department of Economics.
Loomes, Graham & Sugden, Robert, 1982. "Regret Theory: An Alternative Theory of Rational Choice under Uncertainty," Economic Journal, Royal Economic Society, vol. 92(368), pages 805-824, December.
McKelvey Richard D. & Palfrey Thomas R., 1995. "Quantal Response Equilibria for Normal Form Games," Games and Economic Behavior, Elsevier, vol. 10(1), pages 6-38, July.
- McKelvey, Richard D. & Palfrey, Thomas R., 1994. "Quantal Response Equilibria For Normal Form Games," Working Papers 883, California Institute of Technology, Division of the Humanities and Social Sciences.
- R. McKelvey & T. Palfrey, 2010. "Quantal Response Equilibria for Normal Form Games," Levine's Working Paper Archive 510, David K. Levine.
Sergiu Hart & Andreu Mas-Colell, 2013. "A Simple Adaptive Procedure Leading To Correlated Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46, World Scientific Publishing Co. Pte. Ltd..
- Sergiu Hart & Andreu Mas-Colell, 2000. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Econometrica, Econometric Society, vol. 68(5), pages 1127-1150, September.
- Sergiu Hart & Andreu Mas-Colell, 1996. "A simple adaptive procedure leading to correlated equilibrium," Economics Working Papers 200, Department of Economics and Business, Universitat Pompeu Fabra, revised Dec 1996.
- S. Hart & A. Mas-Collel, 2010. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Levine's Working Paper Archive 572, David K. Levine.
- Sergiu Hart & Andreu Mas-Colell, 1997. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Game Theory and Information 9703006, University Library of Munich, Germany, revised 25 Nov 1997.
Andrew Caplin & Daniel Martin, 2015. "A Testable Theory of Imperfect Perception," Economic Journal, Royal Economic Society, vol. 125(582), pages 184-202, February.
- Andrew Caplin & Daniel Martin, 2011. "A Testable Theory of Imperfect Perception," NBER Working Papers 17163, National Bureau of Economic Research, Inc.
- Andrew Caplin & Daniel Martin, 2013. "A Testable Theory of Imperfect Perception," Levine's Working Paper Archive 786969000000000649, David K. Levine.
- Andrew Caplin & Daniel Martin, 2015. "A Testable Theory of Imperfect Perception," PSE-Ecole d'économie de Paris (Postprint) halshs-01155313, HAL.
- Andrew Caplin & Daniel Martin, 2015. "A Testable Theory of Imperfect Perception," PSE - Labex "OSE-Ouvrir la Science Economique" halshs-01155313, HAL.
- Andrew Caplin & Daniel Martin, 2015. "A Testable Theory of Imperfect Perception," Post-Print halshs-01155313, HAL.
Daniel McFadden, 2001. "Economic Choices," American Economic Review, American Economic Association, vol. 91(3), pages 351-378, June.
- McFadden, Daniel L., 2000. "Economic Choices," Nobel Prize in Economics documents 2000-6, Nobel Prize Committee.
Jay Lu, 2016. "Random Choice and Private Information," Econometrica, Econometric Society, vol. 84, pages 1983-2027, November.
Bergemann, Dirk & Morris, Stephen, 2016. "Bayes correlated equilibrium and the comparison of information structures in games," Theoretical Economics, Econometric Society, vol. 11(2), May.
- Dirk Bergemann & Stephen Morris, 2013. "Bayes Correlated Equilibrium and the Comparison of Information Structures in Games," Working Papers 054-2013, Princeton University, Department of Economics, Econometric Research Program..
- Dirk Bergemann & Stephen Morris, 2013. "Bayes Correlated Equilibrium and the Comparison of Information Structures in Games," Cowles Foundation Discussion Papers 1909RR, Cowles Foundation for Research in Economics, Yale University, revised Oct 2014.
- Dirk Bergemann & Stephen Morris, 2015. "Bayes Correlated Equilibrium and the Comparison of Information Structures in Games," Levine's Bibliography 786969000000001085, UCLA Department of Economics.
- Dirk Bergemann & Stephen Morris, 2013. "Bayes Correlated Equilibrium and the Comparison of Information Structures in Games," Cowles Foundation Discussion Papers 1909R3, Cowles Foundation for Research in Economics, Yale University, revised Apr 2015.
Andrew Caplin & Mark Dean, 2015. "Revealed Preference, Rational Inattention, and Costly Information Acquisition," American Economic Review, American Economic Association, vol. 105(7), pages 2183-2203, July.
- Andrew Caplin & Mark Dean, 2014. "Revealed Preference, Rational Inattention, and Costly Information Acquisition," NBER Working Papers 19876, National Bureau of Economic Research, Inc.
Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555.
- Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521747387.
- Kenneth Train, 2003. "Discrete Choice Methods with Simulation," Online economics textbooks, SUNY-Oswego, Department of Economics, number emetr2.
Drew Fudenberg & Ryota Iijima & Tomasz Strzalecki, 2015. "Stochastic Choice and Revealed Perturbed Utility," Econometrica, Econometric Society, vol. 83, pages 2371-2409, November.
- Drew Fudenberg & Ryota Iijima & Tomasz Strzalecki, "undated". "Stochastic Choice and Revealed Perturbed Utility," Working Paper 136731, Harvard University OpenScholar.
Wen, Chieh-Hua & Koppelman, Frank S., 2001. "The generalized nested logit model," Transportation Research Part B: Methodological, Elsevier, vol. 35(7), pages 627-641, August.
Han Bleichrodt & Peter P. Wakker, 2015. "Regret Theory: A Bold Alternative to the Alternatives," Economic Journal, Royal Economic Society, vol. 0(583), pages 493-532, March.
Ryan Webb, 2019. "The (Neural) Dynamics of Stochastic Choice," Management Science, INFORMS, vol. 65(1), pages 230-255, January.
Gualdani, Cristina & Sinha, Shruti, 2019. "Identification and inference in discrete choice models with imperfect information," TSE Working Papers 19-1049, Toulouse School of Economics (TSE), revised Jun 2020.
Josef Hofbauer & William H. Sandholm, 2002. "On the Global Convergence of Stochastic Fictitious Play," Econometrica, Econometric Society, vol. 70(6), pages 2265-2294, November.
Robin Hanson, 2003. "Combinatorial Information Market Design," Information Systems Frontiers, Springer, vol. 5(1), pages 107-119, January.
Cristina Gualdani & Shruti Sinha, 2019. "Identification in discrete choice models with imperfect information," Papers 1911.04529, arXiv.org, revised Dec 2023.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
Loomes, Graham & Sugden, Robert, 1987. "Some implications of a more general form of regret theory," Journal of Economic Theory, Elsevier, vol. 41(2), pages 270-287, April.
David E. Bell, 1982. "Regret in Decision Making under Uncertainty," Operations Research, INFORMS, vol. 30(5), pages 961-981, October.
Robin Hanson, 2007. "Logarithmic Market Scoring Rules for Modular Combinatorial Information Aggregation," Journal of Prediction Markets, University of Buckingham Press, vol. 1(1), pages 3-15, February.
Paulo Natenzon, 2019. "Random Choice and Learning," Journal of Political Economy, University of Chicago Press, vol. 127(1), pages 419-457.
H.D. Block & Jacob Marschak, 1959. "Random Orderings and Stochastic Theories of Response," Cowles Foundation Discussion Papers 66, Cowles Foundation for Research in Economics, Yale University.
Drew Fudenberg & Peysakhovich, A, 2014. "Recency, Records and Recaps: Learning and Non-Equilibrium Behavior in a Simple Decision Problem," Working Paper 167691, Harvard University OpenScholar.
- Fudenberg, Drew & Peysakhovich, Alexander, 2014. "Recency, Records and Recaps: Learning and Non-Equilibrium Behavior in a Simple Decision Problem," Scholarly Articles 27755296, Harvard University Department of Economics.
Guiyun Feng & Xiaobo Li & Zizhuo Wang, 2017. "Technical Note—On the Relation Between Several Discrete Choice Models," Operations Research, INFORMS, vol. 65(6), pages 1516-1525, December.
repec:cup:cbooks:9781316779309 is not listed on IDEAS
Roughgarden,Tim, 2016. "Twenty Lectures on Algorithmic Game Theory," Cambridge Books, Cambridge University Press, number 9781316624791.
Roughgarden,Tim, 2016. "Twenty Lectures on Algorithmic Game Theory," Cambridge Books, Cambridge University Press, number 9781107172661.
Freund, Yoav & Schapire, Robert E., 1999. "Adaptive Game Playing Using Multiplicative Weights," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 79-103, October.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Emerson Melo, 2022. "On the uniqueness of quantal response equilibria and its application to network games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 74(3), pages 681-725, October.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Emerson Melo, 2021. "Learning in Random Utility Models Via Online Decision Problems," Papers 2112.10993, arXiv.org, revised Aug 2022.
Emerson Melo, 2022. "On the uniqueness of quantal response equilibria and its application to network games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 74(3), pages 681-725, October.
S. Cerreia-Vioglio & F. Maccheroni & M. Marinacci & A. Rustichini, 2017. "Multinomial logit processes and preference discovery: inside and outside the black box," Working Papers 615, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
- Simone Cerreia-Vioglio & Fabio Maccheroni & Massimo Marinacci & Aldo Rustichini, 2020. "Multinomial logit processes and preference discovery: inside and outside the black box," Papers 2004.13376, arXiv.org, revised Jan 2021.
Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2006. "Stochastic Approximations and Differential Inclusions, Part II: Applications," Mathematics of Operations Research, INFORMS, vol. 31(4), pages 673-695, November.
- Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions; Part II: Applications," Working Papers hal-00242974, HAL.
Simone Cerreia-Vioglio & Fabio Maccheroni & Massimo Marinacci, 2020. "Multinomial logit processes and preference discovery: outside and inside the black box," Working Papers 663, IGIER (Innocenzo Gasparini Institute for Economic Research), Bocconi University.
Duffy, Sean & Gussman, Steven & Smith, John, 2021. "Visual judgments of length in the economics laboratory: Are there brains in stochastic choice?," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 93(C).
Block, Juan I. & Fudenberg, Drew & Levine, David K., 2019. "Learning dynamics with social comparisons and limited memory," Theoretical Economics, Econometric Society, vol. 14(1), January.
Duffy, Sean & Smith, John, 2020. "An economist and a psychologist form a line: What can imperfect perception of length tell us about stochastic choice?," MPRA Paper 99417, University Library of Munich, Germany.
Mogens Fosgerau & Julien Monardo & André de Palma, 2019. "The Inverse Product Differentiation Logit Model," Working Papers hal-02183411, HAL.
- Mogens Fosgerau & Julien Monardo & André de Palma, 2022. "The Inverse Product Differentiation Logit Model," THEMA Working Papers 2022-22, THEMA (THéorie Economique, Modélisation et Applications), Université de Cergy-Pontoise.
- André De Palma & Mogens Fosgerau & Julien Monardo, 2021. "The Inverse Product Differentiation Logit Model," THEMA Working Papers 2021-04, THEMA (THéorie Economique, Modélisation et Applications), Université de Cergy-Pontoise.
Flynn, Joel P. & Sastry, Karthik A., 2023. "Strategic mistakes," Journal of Economic Theory, Elsevier, vol. 212(C).
Juan I Block & Drew Fudenberg & David K Levine, 2017. "Learning Dynamics Based on Social Comparisons," Levine's Working Paper Archive 786969000000001375, David K. Levine.
Sergiu Hart & Andreu Mas-Colell, 2013. "A General Class Of Adaptive Strategies," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 3, pages 47-76, World Scientific Publishing Co. Pte. Ltd..
- Hart, Sergiu & Mas-Colell, Andreu, 2001. "A General Class of Adaptive Strategies," Journal of Economic Theory, Elsevier, vol. 98(1), pages 26-54, May.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A general class of adaptative strategies," Economics Working Papers 373, Department of Economics and Business, Universitat Pompeu Fabra.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A General Class of Adaptive Strategies," Game Theory and Information 9904001, University Library of Munich, Germany, revised 23 Mar 2000.
Panayotis Mertikopoulos & William H. Sandholm, 2016. "Learning in Games via Reinforcement and Regularization," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1297-1324, November.
Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
- Ed Hopkins & Josef Hofbauer & Michel Benaim, 2005. "Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 135, Edinburgh School of Economics, University of Edinburgh.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2006. "Learning in Games with Unstable Equilibria," Levine's Bibliography 321307000000000547, UCLA Department of Economics.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2005. "Learning in Games with Unstable Equilibria," Levine's Bibliography 784828000000000609, UCLA Department of Economics.
Tommaso Denti & Doron Ravid, 2023. "Robust Predictions in Games with Rational Inattention," Papers 2306.09964, arXiv.org.
Xie, Erhao, 2021. "Empirical properties and identification of adaptive learning models in behavioral game theory," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 798-821.
Tim Roughgarden, 2018. "Complexity Theory, Game Theory, and Economics: The Barbados Lectures," Papers 1801.00734, arXiv.org, revised Feb 2020.
Cristina Gualdani & Shruti Sinha, 2019. "Identification in discrete choice models with imperfect information," Papers 1911.04529, arXiv.org, revised Dec 2023.
Andriy Zapechelnyuk, 2009. "Limit Behavior of No-regret Dynamics," Discussion Papers 21, Kyiv School of Economics.
Eric Friedman & Scott Shenker & Amy Greenwald, 1998. "Learning in Networks Contexts: Experimental Results from Simulations," Departmental Working Papers 199825, Rutgers University, Department of Economics.

More about this item

Keywords

Random utility models; Multinomial Logit Model; Generalized Nested Logit models; GEV class; Online optimization; Online learning; Hannan consistency; no-regret learning;
All these keywords.

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BIG-2022-03-07 (Big Data)
NEP-DCM-2022-03-07 (Discrete Choice Models)
NEP-UPT-2022-03-07 (Utility Models and Prospect Theory)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inu:caeprp:2022003. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Center for Applied Economics and Policy Research (email available below). General contact details of provider: https://edirc.repec.org/data/caeprus.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Learning In Random Utility Models Via Online Decision Problems

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data