IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v263y2017i2p664-678.html
   My bibliography  Save this article

A comparison of Monte Carlo tree search and rolling horizon optimization for large-scale dynamic resource allocation problems

Author

Listed:
  • Bertsimas, Dimitris
  • Griffith, J. Daniel
  • Gupta, Vishal
  • Kochenderfer, Mykel J.
  • Mišić, Velibor V.

Abstract

Dynamic resource allocation (DRA) problems constitute an important class of dynamic stochastic optimization problems that arise in many real-world applications. DRA problems are notoriously difficult to solve since they combine stochastic dynamics with intractably large state and action spaces. Although the artificial intelligence and operations research communities have independently proposed two successful frameworks for solving such problems—Monte Carlo tree search (MCTS) and rolling horizon optimization (RHO), respectively—the relative merits of these two approaches are not well understood. In this paper, we adapt MCTS and RHO to two problems – a problem inspired by tactical wildfire management and a classical problem involving the control of queueing networks – and undertake an extensive computational study comparing the two methods on large scale instances of both problems in terms of both the state and the action spaces. Both methods are able to greatly improve on a baseline, problem-specific heuristic. On smaller instances, the MCTS and RHO approaches perform comparably, but RHO outperforms MCTS as the size of the problem increases for a fixed computational budget.

Suggested Citation

  • Bertsimas, Dimitris & Griffith, J. Daniel & Gupta, Vishal & Kochenderfer, Mykel J. & Mišić, Velibor V., 2017. "A comparison of Monte Carlo tree search and rolling horizon optimization for large-scale dynamic resource allocation problems," European Journal of Operational Research, Elsevier, vol. 263(2), pages 664-678.
  • Handle: RePEc:eee:ejores:v:263:y:2017:i:2:p:664-678
    DOI: 10.1016/j.ejor.2017.05.032
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221717304605
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2017.05.032?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Bertsimas, Dimitris & Gupta, Shubham & Lulli, Guglielmo, 2014. "Dynamic resource allocation: A flexible and tractable modeling framework," European Journal of Operational Research, Elsevier, vol. 236(1), pages 14-26.
    2. Guillermo Gallego & Garrett van Ryzin, 1994. "Optimal Dynamic Pricing of Inventories with Stochastic Demand over Finite Horizons," Management Science, INFORMS, vol. 40(8), pages 999-1020, August.
    3. Dimitris Bertsimas & Sarah Stock Patterson, 1998. "The Air Traffic Flow Management Problem with Enroute Capacities," Operations Research, INFORMS, vol. 46(3), pages 406-422, June.
    4. Dragos Florin Ciocan & Vivek Farias, 2012. "Model Predictive Control for Dynamic Resource Allocation," Mathematics of Operations Research, INFORMS, vol. 37(3), pages 501-525, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Daniel R. Jiang & Lina Al-Kanj & Warren B. Powell, 2020. "Optimistic Monte Carlo Tree Search with Sampled Information Relaxation Dual Bounds," Operations Research, INFORMS, vol. 68(6), pages 1678-1697, November.
    2. Deniz Preil & Michael Krapp, 2022. "Artificial intelligence-based inventory management: a Monte Carlo tree search approach," Annals of Operations Research, Springer, vol. 308(1), pages 415-439, January.
    3. Germán Herrera Vidal & Jairo R. Coronado-Hernández & Claudia Minnaard, 2023. "Measuring manufacturing system complexity: a literature review," Journal of Intelligent Manufacturing, Springer, vol. 34(7), pages 2865-2888, October.
    4. Tapia, Tomás & Lorca, Álvaro & Olivares, Daniel & Negrete-Pincetic, Matías & Lamadrid L, Alberto J., 2021. "A robust decision-support method based on optimization and simulation for wildfire resilience in highly renewable power systems," European Journal of Operational Research, Elsevier, vol. 294(2), pages 723-733.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yanzhe (Murray) Lei & Stefanus Jasin & Amitabh Sinha, 2018. "Joint Dynamic Pricing and Order Fulfillment for E-commerce Retailers," Manufacturing & Service Operations Management, INFORMS, vol. 20(2), pages 269-284, May.
    2. Karsu, Özlem & Morton, Alec, 2015. "Inequity averse optimization in operational research," European Journal of Operational Research, Elsevier, vol. 245(2), pages 343-359.
    3. Huanan Zhang & Cong Shi & Chao Qin & Cheng Hua, 2016. "Stochastic regret minimization for revenue management problems with nonstationary demands," Naval Research Logistics (NRL), John Wiley & Sons, vol. 63(6), pages 433-448, September.
    4. Dimitris Bertsimas & Allison Chang & Velibor V. Mišić & Nishanth Mundru, 2019. "The Airlift Planning Problem," Transportation Science, INFORMS, vol. 53(3), pages 773-773, May.
    5. Chatwin, Richard E., 2000. "Optimal dynamic pricing of perishable products with stochastic demand and a finite set of prices," European Journal of Operational Research, Elsevier, vol. 125(1), pages 149-174, August.
    6. Schulte, Benedikt & Sachs, Anna-Lena, 2020. "The price-setting newsvendor with Poisson demand," European Journal of Operational Research, Elsevier, vol. 283(1), pages 125-137.
    7. Ali Hortaçsu & Olivia R. Natan & Hayden Parsley & Timothy Schwieg & Kevin R. Williams, 2021. "Organizational Structure and Pricing: Evidence from a Large U.S. Airline," NBER Working Papers 29508, National Bureau of Economic Research, Inc.
    8. Dasci, A. & Karakul, M., 2009. "Two-period dynamic versus fixed-ratio pricing in a capacity constrained duopoly," European Journal of Operational Research, Elsevier, vol. 197(3), pages 945-968, September.
    9. Thomas W. M. Vossen & Michael O. Ball, 2006. "Slot Trading Opportunities in Collaborative Ground Delay Programs," Transportation Science, INFORMS, vol. 40(1), pages 29-43, February.
    10. Aniruddha Dutta, 2019. "Capacity Allocation of Game Tickets Using Dynamic Pricing," Data, MDPI, vol. 4(4), pages 1-12, October.
    11. Namin, Aidin & Soysal, Gonca P. & Ratchford, Brian T., 2022. "Alleviating demand uncertainty for seasonal goods: An analysis of attribute-based markdown policy for fashion retailers," Journal of Business Research, Elsevier, vol. 145(C), pages 671-681.
    12. Yiwei Chen & Nikolaos Trichakis, 2021. "Technical Note—On Revenue Management with Strategic Customers Choosing When and What to Buy," Operations Research, INFORMS, vol. 69(1), pages 175-187, January.
    13. Davide Crapis & Bar Ifrach & Costis Maglaras & Marco Scarsini, 2017. "Monopoly Pricing in the Presence of Social Learning," Management Science, INFORMS, vol. 63(11), pages 3586-3608, November.
    14. Qi Chen & Qi Xu & Wenjie Wang, 2019. "Optimal Policies for the Pricing and Replenishment of Fashion Apparel considering the Effect of Fashion Level," Complexity, Hindawi, vol. 2019, pages 1-12, February.
    15. Adam J. Mersereau & Dan Zhang, 2012. "Markdown Pricing with Unknown Fraction of Strategic Customers," Manufacturing & Service Operations Management, INFORMS, vol. 14(3), pages 355-370, July.
    16. Hu, Qiying & Wei, Yihua & Xia, Yusen, 2010. "Revenue management for a supply chain with two streams of customers," European Journal of Operational Research, Elsevier, vol. 200(2), pages 582-598, January.
    17. Josef Broder & Paat Rusmevichientong, 2012. "Dynamic Pricing Under a General Parametric Choice Model," Operations Research, INFORMS, vol. 60(4), pages 965-980, August.
    18. Andreatta, Giovanni & Dell'Olmo, Paolo & Lulli, Guglielmo, 2011. "An aggregate stochastic programming model for air traffic flow management," European Journal of Operational Research, Elsevier, vol. 215(3), pages 697-704, December.
    19. Lingxiu Dong & Panos Kouvelis & Zhongjun Tian, 2009. "Dynamic Pricing and Inventory Control of Substitute Products," Manufacturing & Service Operations Management, INFORMS, vol. 11(2), pages 317-339, December.
    20. Banerjee, Pradeep K. & Turner, T. Rolf, 2012. "A flexible model for the pricing of perishable assets," Omega, Elsevier, vol. 40(5), pages 533-540.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:263:y:2017:i:2:p:664-678. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.