IDEAS home Printed from https://ideas.repec.org/a/spr/queues/v94y2020i3d10.1007_s11134-019-09624-z.html
   My bibliography  Save this article

Admit or preserve? Addressing server failures in cloud computing task management

Author

Listed:
  • Nadav Lavi

    (Tel-Aviv University)

  • Hanoch Levy

    (Tel-Aviv University)

Abstract

Cloud computing task management has a critical role in the efficient operation of the cloud resources, i.e., the servers. The task management handles critical and complicated decisions, overcoming the inherent dynamic nature of cloud computing systems and the additional complexity due to the large magnitude of resources in such systems (tens of thousands of servers). Due to the fact that servers may fail, task management is required to conduct both task admissions and task preservation decisions. Moreover, both these decisions require considering future system trajectories and the interplay between preservation and admission. In this paper we study the combined problem of task admission and preservation in a dynamic environment of cloud computing systems through analysis of a queueing system based on a Markov decision process (MDP). We show that the optimal operational policy is of a double switching curve type. On face value, the extraction of the optimal policy is rather complicated, yet our analysis reveals that the optimal policy can be reduced to a single rule, since the rules can effectively be decoupled. Based on this result, we propose two heuristic approaches that approximate the optimal rule for the most relevant system settings in cloud computing systems. Our results provide a simple policy scheme for the combined admission and preservation problem that can be applied in a complex cloud computing environments, and eliminate the need for sophisticated real-time control mechanisms.

Suggested Citation

  • Nadav Lavi & Hanoch Levy, 2020. "Admit or preserve? Addressing server failures in cloud computing task management," Queueing Systems: Theory and Applications, Springer, vol. 94(3), pages 279-325, April.
  • Handle: RePEc:spr:queues:v:94:y:2020:i:3:d:10.1007_s11134-019-09624-z
    DOI: 10.1007/s11134-019-09624-z
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11134-019-09624-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11134-019-09624-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. MILLER, Bruce L., 1969. "A queueing reward system with several customer classes," LIDAM Reprints CORE 41, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    2. Bruce L. Miller, 1969. "A Queueing Reward System with Several Customer Classes," Management Science, INFORMS, vol. 16(3), pages 234-245, November.
    3. Marcel F. Neuts & David M. Lucantoni, 1979. "A Markovian Queue with N Servers Subject to Breakdowns and Repairs," Management Science, INFORMS, vol. 25(9), pages 849-861, September.
    4. Dmitry Efrosinin, 2013. "Queueing model of a hybrid channel with faster link subject to partial and complete failures," Annals of Operations Research, Springer, vol. 202(1), pages 75-102, January.
    5. Ghoneim, Hussein A. & Stidham, Shaler, 1985. "Control of arrivals to two queues in series," European Journal of Operational Research, Elsevier, vol. 21(3), pages 399-409, September.
    6. I. L. Mitrany & B. Avi-Itzhak, 1968. "A Many-Server Queue with Service Interruptions," Operations Research, INFORMS, vol. 16(3), pages 628-638, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kyle Y. Lin, 2003. "Decentralized admission control of a queueing system: A game‐theoretic model," Naval Research Logistics (NRL), John Wiley & Sons, vol. 50(7), pages 702-718, October.
    2. Kyle Y. Lin & Sheldon M. Ross, 2003. "Admission Control with Incomplete Information of a Queueing System," Operations Research, INFORMS, vol. 51(4), pages 645-654, August.
    3. E. Carrizosa & E. Conde & M. Muñoz-Márquez, 1998. "Admission Policies in Loss Queueing Models with Heterogeneous Arrivals," Management Science, INFORMS, vol. 44(3), pages 311-320, March.
    4. Davide Mezzogori & Giovanni Romagnoli & Francesco Zammori, 2021. "Defining accurate delivery dates in make to order job-shops managed by workload control," Flexible Services and Manufacturing Journal, Springer, vol. 33(4), pages 956-991, December.
    5. van Jaarsveld, W.L. & Dekker, R., 2009. "Finding optimal policies in the (S - 1, S ) lost sales inventory model with multiple demand classes," Econometric Institute Research Papers EI 2009-14, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
    6. van Wijk, A.C.C. & Adan, I.J.B.F. & van Houtum, G.J., 2012. "Approximate evaluation of multi-location inventory models with lateral transshipments and hold back levels," European Journal of Operational Research, Elsevier, vol. 218(3), pages 624-635.
    7. Li, Xueping & Wang, Jiao & Sawhney, Rapinder, 2012. "Reinforcement learning for joint pricing, lead-time and scheduling decisions in make-to-order systems," European Journal of Operational Research, Elsevier, vol. 221(1), pages 99-109.
    8. Hessam Bavafa & Anne Canamucio & Steven C. Marcus & Christian Terwiesch & Rachel M. Werner, 2022. "Capacity Rationing in Primary Care: Provider Availability Shocks and Channel Diversion," Management Science, INFORMS, vol. 68(4), pages 2842-2859, April.
    9. Leon Yang Chu & Zhixi Wan & Dongyuan Zhan, 2018. "Harnessing the Double-edged Sword via Routing: Information Provision on Ride-hailing Platforms," Working Papers 18-04, NET Institute.
    10. Xufeng Yang & Wen Jiao & Juliang Zhang & Hong Yan, 2022. "Capacity management for a leasing system with different equipment and batch demands," Production and Operations Management, Production and Operations Management Society, vol. 31(7), pages 3004-3020, July.
    11. Retsef Levi & Ana Radovanović, 2010. "Provably Near-Optimal LP-Based Policies for Revenue Management in Systems with Reusable Resources," Operations Research, INFORMS, vol. 58(2), pages 503-507, April.
    12. Apurva Jain & Kamran Moinzadeh & Aussadavut Dumrongsiri, 2015. "Priority Allocation in a Rental Model with Decreasing Demand," Manufacturing & Service Operations Management, INFORMS, vol. 17(2), pages 236-248, May.
    13. Pedram Sahba & Bariş Balciog̃lu & Dragan Banjevic, 2013. "Analysis of the finite‐source multiclass priority queue with an unreliable server and setup time," Naval Research Logistics (NRL), John Wiley & Sons, vol. 60(4), pages 331-342, June.
    14. Miaomiao Yu & Yinghui Tang, 2022. "Analysis of a renewal batch arrival queue with a fault-tolerant server using shift operator method," Operational Research, Springer, vol. 22(3), pages 2831-2858, July.
    15. Huh, Woonghee Tim & Lee, Jaywon & Park, Heesang & Park, Kun Soo, 2019. "The potty parity problem: Towards gender equality at restrooms in business facilities," Socio-Economic Planning Sciences, Elsevier, vol. 68(C).
    16. Papier, Felix & Thonemann, Ulrich W., 2011. "Capacity rationing in rental systems with two customer classes and batch arrivals," Omega, Elsevier, vol. 39(1), pages 73-85, January.
    17. Demet Batur & Jennifer K. Ryan & Zhongyuan Zhao & Mehmet C. Vuran, 2019. "Dynamic Pricing of Wireless Internet Based on Usage and Stochastically Changing Capacity," Manufacturing & Service Operations Management, INFORMS, vol. 21(4), pages 833-852, October.
    18. Sergei V. Savin & Morris A. Cohen & Noah Gans & Ziv Katalan, 2005. "Capacity Management in Rental Businesses with Two Customer Bases," Operations Research, INFORMS, vol. 53(4), pages 617-631, August.
    19. Eugene A. Feinberg & Fenghsu Yang, 2016. "Optimal pricing for a GI/M/k/N queue with several customer types and holding costs," Queueing Systems: Theory and Applications, Springer, vol. 82(1), pages 103-120, February.
    20. Xufeng Yang & Juliang Zhang & Wen Jiao & Hong Yan, 2023. "Optimal Capacity Rationing Policy for a Container Leasing System with Multiple Kinds of Customers and Substitutable Containers," Management Science, INFORMS, vol. 69(3), pages 1468-1485, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:queues:v:94:y:2020:i:3:d:10.1007_s11134-019-09624-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.