Dynamic Decision-Making under Model Misspecification: A Stochastic Stability Approach

Dynamic Decision-Making under Model Misspecification: A Stochastic Stability Approach

Author

Listed:

Xinyu Dai
Daniel Chen
Yian Qian

Abstract

Dynamic decision-making under model uncertainty is central to many economic environments, yet existing bandit and reinforcement learning algorithms rely on the assumption of correct model specification. This paper studies the behavior and performance of one of the most commonly used Bayesian reinforcement learning algorithms, Thompson Sampling (TS), when the model class is misspecified. We first provide a complete dynamic classification of posterior evolution in a misspecified two-armed Gaussian bandit, identifying distinct regimes: correct model concentration, incorrect model concentration, and persistent belief mixing, characterized by the direction of statistical evidence and the model-action mapping. These regimes yield sharp predictions for limiting beliefs, action frequencies, and asymptotic regret. We then extend the analysis to a general finite model class and develop a unified stochastic stability framework that represents posterior evolution as a Markov process on the belief simplex. This approach characterizes two sufficient conditions to classify the ergodic and transient behaviors and provides inductive dimensional reductions of the posterior dynamics. Our results offer the first qualitative and geometric classification of TS under misspecification, bridging Bayesian learning with evolutionary dynamics, and also build the foundations of robust decision-making in structured bandits.

Suggested Citation

Xinyu Dai & Daniel Chen & Yian Qian, 2026. "Dynamic Decision-Making under Model Misspecification: A Stochastic Stability Approach," Papers 2602.17086, arXiv.org.

Handle: RePEc:arx:papers:2602.17086

Download full text from publisher

References listed on IDEAS

Ignacio Esponda & Demian Pouzo, 2016. "Berk–Nash Equilibrium: A Framework for Modeling Agents With Misspecified Models," Econometrica, Econometric Society, vol. 84, pages 1093-1130, May.
- Ignacio Esponda & Demian Pouzo, 2014. "Berk-Nash Equilibrium: A Framework for Modeling Agents with Misspecified Models," Papers 1411.1152, arXiv.org, revised Nov 2019.
He, Kevin & Libgober, Jonathan, 2025. "Misspecified learning and evolutionary stability," Journal of Economic Theory, Elsevier, vol. 230(C).
- Kevin He & Jonathan Libgober, 2025. "Misspecified learning and evolutionary stability," Papers 2509.16067, arXiv.org.
- Kevin He & Jonathan Libgober, 2025. "Misspecified Learning and Evolutionary Stability," PIER Working Paper Archive 25-020, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
Fudenberg, Drew & Lanzani, Giacomo, 2023. "Which misspecifications persist?," Theoretical Economics, Econometric Society, vol. 18(3), July.
Donald W K Andrews & Soonwoo Kwon, 2024. "Misspecified Moment Inequality Models: Inference and Diagnostics," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 91(1), pages 45-76.
Raffaella Giacomini & Toru Kitagawa & Harald Uhlig, 2019. "Estimation Under Ambiguity," CeMMAP working papers CWP24/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Karun Adusumilli, 2025. "Risk and Optimal Policies in Bandit Experiments," Econometrica, Econometric Society, vol. 93(3), pages 1003-1029, May.
Esponda, Ignacio & Pouzo, Demian & Yamamoto, Yuichi, 2021. "Asymptotic behavior of Bayesian learners with misspecified models," Journal of Economic Theory, Elsevier, vol. 195(C).
- Ignacio Esponda & Demian Pouzo & Yuichi Yamamoto, 2019. "Asymptotic Behavior of Bayesian Learners with Misspecified Models," Papers 1904.08551, arXiv.org, revised Oct 2019.
Lanzani, Giacomo, 2025. "Dynamic Concern for Misspecification," Department of Economics, Working Paper Series qt6zg4w2ff, Department of Economics, Institute for Business and Economic Research, UC Berkeley.
Karun Adusumilli, 2021. "Risk and optimal policies in bandit experiments," Papers 2112.06363, arXiv.org, revised May 2025.
Jorgen W. Weibull, 1997. "Evolutionary Game Theory," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262731215, December.
Drew Fudenberg & Giacomo Lanzani & Philipp Strack, 2021. "Limit Points of Endogenous Misspecified Learning," Econometrica, Econometric Society, vol. 89(3), pages 1065-1098, May.
Giacomo Lanzani, 2025. "Dynamic Concern for Misspecification," Econometrica, Econometric Society, vol. 93(4), pages 1333-1370, July.
repec:fth:iniesr:487 is not listed on IDEAS

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Ignacio Esponda & Demian Pouzo, 2026. "Learning and Equilibrium under Model Misspecification," Papers 2601.09891, arXiv.org.
He, Kevin & Libgober, Jonathan, 2025. "Misspecified learning and evolutionary stability," Journal of Economic Theory, Elsevier, vol. 230(C).
- Kevin He & Jonathan Libgober, 2025. "Misspecified Learning and Evolutionary Stability," PIER Working Paper Archive 25-020, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
- Kevin He & Jonathan Libgober, 2025. "Misspecified learning and evolutionary stability," Papers 2509.16067, arXiv.org.
Yingkai Li & Aleksandrs Slivkins, 2022. "Exploration and Incentivizing Participation in Randomized Trials," Papers 2202.06191, arXiv.org, revised Jan 2026.
Florian Mudekereza, 2026. "Motivating Innovation with Misspecified Contracts," Papers 2602.18879, arXiv.org.
J. Aislinn Bohren & Daniel N. Hauser, 2023. "Behavioral Foundations of Model Misspecification," PIER Working Paper Archive 23-007, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
Fudenberg, Drew & Gao, Ying & Pei, Harry, 2022. "A reputation for honesty," Journal of Economic Theory, Elsevier, vol. 204(C).
- Drew Fudenberg & Ying Gao & Harry Pei, 2020. "A Reputation for Honesty," Papers 2011.07159, arXiv.org.
Filippo Massari & Jonathan Newton, 2026. "Rational beliefs when the truth is not an option," International Journal of Game Theory, Springer;Game Theory Society, vol. 55(1), pages 1-26, June.
Kevin He & Jonathan Libgober, 2025. "Higher-Order Beliefs and (Mis)learning from Prices," PIER Working Paper Archive 25-018, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
Alfonso Maselli, 2025. "Misspecification Averse Preferences," PIER Working Paper Archive 25-010, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
Federico Echenique & Anqi Li, 2025. "Implicit Incentive Provision with Misspecified Learning," Papers 2512.01129, arXiv.org.
Ba, Cuimin & Gindin, Alice, 2023. "A multi-agent model of misspecified learning with overconfidence," Games and Economic Behavior, Elsevier, vol. 142(C), pages 315-338.
Yingkai Li & Argyris Oikonomou, 2024. "Dynamics and Contracts for an Agent with Misspecified Beliefs," Papers 2405.20423, arXiv.org.
Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," Working Papers halshs-03735680, HAL.
- Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," PSE Working Papers halshs-03735680, HAL.
Bowen, T. Renee & Galperti, Simone & Dmitriev, Danil, 2021. "Learning from Shared News: When Abundant Information Leads to Belief Polarization," CEPR Discussion Papers 15789, C.E.P.R. Discussion Papers.
Larry Samuelson & Jakub Steiner, 2024. "Robust latent data representations," ECON - Working Papers 460, Department of Economics - University of Zurich, revised Jul 2025.
Paul Heidhues & Botond Koszegi & Philipp Strack, 2023. "Misinterpreting Yourself," Cowles Foundation Discussion Papers 2378, Cowles Foundation for Research in Economics, Yale University.
Cho, In-Koo & Libgober, Jonathan, 2025. "Learning underspecified models," Journal of Economic Theory, Elsevier, vol. 226(C).
- In-Koo Cho & Jonathan Libgober, 2022. "Learning Underspecified Models," Papers 2207.10140, arXiv.org.
Daria Fedyaeva & Georgy Lukyanov & Hannah Tolli'e, 2025. "Learning to Unlearn: Education as a Remedy for Misspecified Beliefs," Papers 2510.24735, arXiv.org.
Philippe Jehiel & Erik Mohlin, 2023. "Categorization in Games: A Bias-Variance Perspective," Working Papers halshs-04154272, HAL.
- Jehiel, Philippe & Mohlin, Erik, 2025. "Categorization in Games: A Bias-Variance Perspective," Working Papers 2025:7, Lund University, Department of Economics.
Drew Fudenberg & Florian Mudekereza, 2026. "Complexity and Misspecification," Papers 2602.15674, arXiv.org.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2602.17086. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Dynamic Decision-Making under Model Misspecification: A Stochastic Stability Approach

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data