IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v65y2019i10p4598-4606.html
   My bibliography  Save this article

Markov Decision Processes with Exogenous Variables

Author

Listed:
  • Robert L. Bray

    (Kellogg School of Management, Northwestern University, Evanston, Illinois 60208-0814)

Abstract

I present two algorithms for solving dynamic programs with exogenous variables: endogenous value iteration and endogenous policy iteration. These algorithms are always at least as fast as relative value iteration and relative policy iteration, and they are faster when the endogenous variables converge to their stationary distributions sooner than the exogenous variables.

Suggested Citation

  • Robert L. Bray, 2019. "Markov Decision Processes with Exogenous Variables," Management Science, INFORMS, vol. 65(10), pages 4598-4606, October.
  • Handle: RePEc:inm:ormnsc:v:65:y:2019:i:10:p:4598-4606
    DOI: 10.1287/mnsc.2018.3158
    as

    Download full text from publisher

    File URL: https://doi.org/10.1287/mnsc.2018.3158
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.2018.3158?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Aguirregabiria, Victor & Mira, Pedro, 2010. "Dynamic discrete choice structural models: A survey," Journal of Econometrics, Elsevier, vol. 156(1), pages 38-67, May.
    2. Tauchen, George, 1986. "Finite state markov-chain approximations to univariate and vector autoregressions," Economics Letters, Elsevier, vol. 20(2), pages 177-181.
    3. Daniel Adelman & Angelo J. Mancini, 2016. "Optimality of Quasi-Open-Loop Policies for Discounted Semi-Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1222-1247, November.
    4. Julia L. Higle & James C. Bean & Robert L. Smith, 1990. "Deterministic Equivalence in Stochastic Infinite Horizon Problems," Mathematics of Operations Research, INFORMS, vol. 15(3), pages 396-407, August.
    5. Thomas E. Morton & William E. Wecker, 1977. "Discounting, Ergodicity and Convergence for Markov Decision Processes," Management Science, INFORMS, vol. 23(8), pages 890-900, April.
    6. Victor Aguirregabiria & Arvind Magesan, "undated". "Soultion and Estimation of Dynamic Discrete Choice Structural Models Using Euler Equations," Working Papers 2016-32, Department of Economics, University of Calgary, revised 24 May 2016.
    7. Victor Aguirregabiria & Arvind Magesan, 2013. "Euler Equations for the Estimation of Dynamic Discrete Choice Structural Models," Advances in Econometrics, in: Structural Econometric Models, volume 31, pages 3-44, Emerald Group Publishing Limited.
    8. Robert L. Bray, 2019. "Strong convergence and dynamic economic models," Quantitative Economics, Econometric Society, vol. 10(1), pages 43-65, January.
    9. Thomas E. Morton, 1971. "Technical Note—On the Asymptotic Convergence Rate of Cost Differences for Markovian Decision Processes," Operations Research, INFORMS, vol. 19(1), pages 244-248, February.
    10. Paarsch, Harry J. & Rust, John, 2009. "Valuing programs with deterministic and stochastic cycles," Journal of Economic Dynamics and Control, Elsevier, vol. 33(3), pages 614-623, March.
    11. Aguirregabiria, Victor & Magesan, Arvind, 2013. "Euler Equations for the Estimation of Dynamic Discrete Choice Structural," MPRA Paper 46056, University Library of Munich, Germany.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jiaming Mao & Jingzhi Xu, 2020. "Ensemble Learning with Statistical and Structural Models," Papers 2006.05308, arXiv.org.
    2. Christopher Ferrall, 2023. "Object Oriented (Dynamic) Programming: Closing the “Structural” Estimation Coding Gap," Computational Economics, Springer;Society for Computational Economics, vol. 62(3), pages 761-816, October.
    3. Guan, Xiangyang & Chen, Cynthia, 2021. "A behaviorally-integrated individual-level state-transition model that can predict rapid changes in evacuation demand days earlier," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 152(C).
    4. Michele Fioretti & Alexander Vostroknutov & Giorgio Coricelli, 2022. "Dynamic Regret Avoidance," American Economic Journal: Microeconomics, American Economic Association, vol. 14(1), pages 70-93, February.
    5. David Canning & Declan French & Michael Moore, 2016. "The Economics of Fertility Timing: An Euler Equation Approach," CHaRMS Working Papers 16-03, Centre for HeAlth Research at the Management School (CHaRMS).
    6. Victor Aguirregabiria & Allan Collard-Wexler & Stephen P. Ryan, 2021. "Dynamic Games in Empirical Industrial Organization," NBER Working Papers 29291, National Bureau of Economic Research, Inc.
    7. Harris, Jeremiah & Siebert, Ralph, 2017. "Firm-specific time preferences and postmerger firm performance," International Journal of Industrial Organization, Elsevier, vol. 53(C), pages 32-62.
    8. Christian Bayer & Falko Juessen, 2012. "On the Dynamics of Interstate Migration: Migration Costs and Self-Selection," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 15(3), pages 377-401, July.
    9. Gabriel Ulyssea, 2018. "Firms, Informality, and Development: Theory and Evidence from Brazil," American Economic Review, American Economic Association, vol. 108(8), pages 2015-2047, August.
    10. Myrto Kalouptsidi & Paul T. Scott & Eduardo Souza-Rodrigues, 2018. "Linear IV Regression Estimators for Structural Dynamic Discrete Choice Models," NBER Working Papers 25134, National Bureau of Economic Research, Inc.
    11. Victor Aguirregabiria & Cesar Alonso-Borrego, 2014. "Labor Contracts And Flexibility: Evidence From A Labor Market Reform In Spain," Economic Inquiry, Western Economic Association International, vol. 52(2), pages 930-957, April.
    12. Kalouptsidi, Myrto & Scott, Paul T. & Souza-Rodrigues, Eduardo, 2021. "Linear IV regression estimators for structural dynamic discrete choice models," Journal of Econometrics, Elsevier, vol. 222(1), pages 778-804.
    13. Nathan Yang, 2011. "An Empirical Model of Industry Dynamics with Common Uncertainty and Learning from the Actions of Competitors," Working Papers 11-16, NET Institute.
    14. Italo Lopez Garcia, 2015. "Human Capital and Labor Informality in Chile A Life-Cycle Approach," Working Papers WR-1087, RAND Corporation.
    15. Jeremiah Harris & Ralph Siebert, 2015. "Driven by the Discount Factor: Impact of Mergers on Market Performance in the Semiconductor Industry," CESifo Working Paper Series 5199, CESifo.
    16. Hancevic, Pedro Ignacio, 2017. "A dynamic approach to environmental compliance decisions in U.S. Electricity Market: The Acid Rain Program revisited," Energy Policy, Elsevier, vol. 106(C), pages 129-137.
    17. Robert L. Bray & Yuliang Yao & Yongrui Duan & Jiazhen Huo, 2019. "Ration Gaming and the Bullwhip Effect," Operations Research, INFORMS, vol. 67(2), pages 453-467, March.
    18. Kalouptsidi, Myrto & Scott, Paul T. & Souza-Rodrigues, Eduardo, 2018. "Linear IV Regression Estimators for Structural Dynamic Discrete Choice Models," CEPR Discussion Papers 13240, C.E.P.R. Discussion Papers.
    19. Nakashima, Kiyotaka & Ogawa, Toshiaki, 2020. "The Impacts of Strengthening Regulatory Surveillance on Bank Behavior: A Dynamic Analysis from Incomplete to Complete Enforcement of Capital Regulation in Microprudential Policy," MPRA Paper 99938, University Library of Munich, Germany.
    20. Laczó, Sarolta & Rossi, Raffaele, 2020. "Time-consistent consumption taxation," Journal of Monetary Economics, Elsevier, vol. 114(C), pages 194-220.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:65:y:2019:i:10:p:4598-4606. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.