IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v322y2025i1p133-146.html
   My bibliography  Save this article

Optimal capacity planning for cloud service providers with periodic, time-varying demand

Author

Listed:
  • Furman, Eugene
  • Diamant, Adam

Abstract

Allocating capacity to private cloud computing services is challenging because demand is time-varying, there are often no buffers, and customers can re-submit jobs a finite number of times. We model this setting using a multi-station queueing network where servers represent CPU cores and jobs not immediately processed retry several times. Assuming retrial rates are stationary and that there is a maximum number of retrial attempts, we determine an optimal service capacity and retrial interval under an admission control policy employed by our partner institution — the server informs customers when they should next attempt service without enforcement. We introduce a recursive representation of the offered load which approximates the fluid dynamics of the system. We then use this representation to develop a solution technique that minimizes the total variation in the constructed offered load. We prove this approach is linked to maximizing system throughput and that in certain settings, the optimal stationary and time-varying retrial intervals are equivalent. Utilizing a data set of cloud computing requests spanning a 24-hour period, our analysis indicates that the optimal policy prescribes a 10% reduction in capacity. We also investigate the fidelity of the fluid model and the sensitivity of our recommendations to the behavior of retrial jobs. We find that retrial-time announcements allow a provider to satisfy service level agreements while encouraging retrial jobs to be processed during off-peak periods. Further, the policy is suitably robust to a customer’s willingness to comply with the suggested retrial times.

Suggested Citation

  • Furman, Eugene & Diamant, Adam, 2025. "Optimal capacity planning for cloud service providers with periodic, time-varying demand," European Journal of Operational Research, Elsevier, vol. 322(1), pages 133-146.
  • Handle: RePEc:eee:ejores:v:322:y:2025:i:1:p:133-146
    DOI: 10.1016/j.ejor.2024.11.017
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221724008865
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2024.11.017?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Elif Akcali & Murray Côté & Chin Lin, 2006. "A network flow approach to optimizing hospital bed capacity decisions," Health Care Management Science, Springer, vol. 9(4), pages 391-404, November.
    2. Yi-Ju Chiang & Yen-Chieh Ouyang, 2014. "Profit Optimization in SLA-Aware Cloud Services with a Finite Capacity Queuing Model," Mathematical Problems in Engineering, Hindawi, vol. 2014, pages 1-11, June.
    3. Dieter Fiems, 2023. "Retrial queues with constant retrial times," Queueing Systems: Theory and Applications, Springer, vol. 103(3), pages 347-365, April.
    4. Gad Allon & Achal Bassamboo, 2011. "The Impact of Delaying the Delay Announcements," Operations Research, INFORMS, vol. 59(5), pages 1198-1210, October.
    5. Aguir, M. Salah & Aksin, O. Zeynep & Karaesmen, Fikri & Dallery, Yves, 2008. "On the interaction between retrials and sizing of call centers," European Journal of Operational Research, Elsevier, vol. 191(2), pages 398-408, December.
    6. Noa Zychlinski & Avishai Mandelbaum & Petar Momčilović, 2018. "Time-varying tandem queues with blocking: modeling, analysis, and operational insights via fluid models with reflection," Queueing Systems: Theory and Applications, Springer, vol. 89(1), pages 15-47, June.
    7. Galit B. Yom-Tov & Avishai Mandelbaum, 2014. "Erlang-R: A Time-Varying Queue with Reentrant Customers, in Support of Healthcare Staffing," Manufacturing & Service Operations Management, INFORMS, vol. 16(2), pages 283-299, May.
    8. Maxime C. Cohen & Philipp W. Keller & Vahab Mirrokni & Morteza Zadimoghaddam, 2019. "Overcommitment in Cloud Services: Bin Packing with Chance Constraints," Management Science, INFORMS, vol. 65(7), pages 3255-3271, July.
    9. Eugene Furman & Adam Diamant & Murat Kristal, 2021. "Customer Acquisition and Retention: A Fluid Approach for Staffing," Production and Operations Management, Production and Operations Management Society, vol. 30(11), pages 4236-4257, November.
    10. Wang, Guanglei & Ben-Ameur, Walid & Ouorou, Adam, 2019. "A Lagrange decomposition based branch and bound algorithm for the optimal mapping of cloud virtual machines," European Journal of Operational Research, Elsevier, vol. 276(1), pages 28-39.
    11. William A. Massey & Jamol Pender, 2018. "Dynamic rate Erlang-A queues," Queueing Systems: Theory and Applications, Springer, vol. 89(1), pages 127-164, June.
    12. Dimitrakopoulos, Yiannis & Economou, Antonis & Leonardos, Stefanos, 2021. "Strategic customer behavior in a queueing system with alternating information structure," European Journal of Operational Research, Elsevier, vol. 291(3), pages 1024-1040.
    13. Shlomo Halfin & Ward Whitt, 1981. "Heavy-Traffic Limits for Queues with Many Exponential Servers," Operations Research, INFORMS, vol. 29(3), pages 567-588, June.
    14. Mor Armony & Constantinos Maglaras, 2004. "Contact Centers with a Call-Back Option and Real-Time Delay Information," Operations Research, INFORMS, vol. 52(4), pages 527-545, August.
    15. Shi Chen & Hau Lee & Kamran Moinzadeh, 2019. "Pricing Schemes in Cloud Computing: Utilization‐Based vs. Reservation‐Based," Production and Operations Management, Production and Operations Management Society, vol. 28(1), pages 82-102, January.
    16. Shi Chen & Kamran Moinzadeh & Jing-Sheng Song & Yuan Zhong, 2023. "Cloud Computing Value Chains: Research from the Operations Management Perspective," Manufacturing & Service Operations Management, INFORMS, vol. 25(4), pages 1338-1356, July.
    17. Chen, Jian & Huang, George Q. & Wang, Jun-Qiang & Yang, Chen, 2019. "A cooperative approach to service booking and scheduling in cloud manufacturing," European Journal of Operational Research, Elsevier, vol. 273(3), pages 861-873.
    18. Manuel A. Nunez & Xue Bai & Linna Du, 2021. "Leveraging Slack Capacity in IaaS Contract Cloud Services," Production and Operations Management, Production and Operations Management Society, vol. 30(4), pages 883-901, April.
    19. Refael Hassin & Ricky Roet-Green, 2021. "On Queue-Length Information when Customers Travel to a Queue," Manufacturing & Service Operations Management, INFORMS, vol. 23(4), pages 989-1004, July.
    20. Yang, T. & Posner, M. J. M. & Templeton, J. G. C. & Li, H., 1994. "An approximation method for the M/G/1 retrial queue with general retrial times," European Journal of Operational Research, Elsevier, vol. 76(3), pages 552-562, August.
    21. Achal Bassamboo & Assaf Zeevi, 2009. "On a Data-Driven Method for Staffing Large Call Centers," Operations Research, INFORMS, vol. 57(3), pages 714-726, June.
    22. Gust, Gunther & Schlüter, Alexander & Feuerriegel, Stefan & Úbeda, Ignacio & Lee, Jonathan T. & Neumann, Dirk, 2024. "Designing electricity distribution networks: The impact of demand coincidence," European Journal of Operational Research, Elsevier, vol. 315(1), pages 271-288.
    23. Seung Bum Soh & Itai Gurvich, 2017. "Call Center Staffing: Service-Level Constraints and Index Priorities," Operations Research, INFORMS, vol. 65(2), pages 537-555, April.
    24. A. J. E. M. Janssen & J. S. H. van Leeuwaarden & Bert Zwart, 2011. "Refining Square-Root Safety Staffing by Expanding Erlang C," Operations Research, INFORMS, vol. 59(6), pages 1512-1522, December.
    25. Noah Gans & Haipeng Shen & Yong-Pin Zhou & Nikolay Korolev & Alan McCord & Herbert Ristock, 2015. "Parametric Forecasting and Stochastic Programming Models for Call-Center Workforce Scheduling," Manufacturing & Service Operations Management, INFORMS, vol. 17(4), pages 571-588, October.
    26. Defraeye, Mieke & Van Nieuwenhuyse, Inneke, 2016. "Staffing and scheduling under nonstationary demand for service: A literature review," Omega, Elsevier, vol. 58(C), pages 4-25.
    27. Dieter Fiems & Tuan Phung-Duc, 2019. "Light-traffic analysis of random access systems without collisions," Annals of Operations Research, Springer, vol. 277(2), pages 311-327, June.
    28. Zohar Feldman & Avishai Mandelbaum & William A. Massey & Ward Whitt, 2008. "Staffing of Time-Varying Queues to Achieve Time-Stable Performance," Management Science, INFORMS, vol. 54(2), pages 324-338, February.
    29. Rouba Ibrahim, 2018. "Sharing delay information in service systems: a literature survey," Queueing Systems: Theory and Applications, Springer, vol. 89(1), pages 49-79, June.
    30. Jerome Niyirora & Jamol Pender, 2016. "Optimal staffing in nonstationary service centers with constraints," Naval Research Logistics (NRL), John Wiley & Sons, vol. 63(8), pages 615-630, December.
    31. Pourbabai, Behnam, 1993. "Tandem behavior of a telecommunication system with repeated calls: II, A general case without buffers," European Journal of Operational Research, Elsevier, vol. 65(2), pages 247-258, March.
    32. Chen Li & Junjun Zheng & Hiroyuki Okamura & Tadashi Dohi, 2023. "Performance Evaluation of a Cloud Datacenter Using CPU Utilization Data," Mathematics, MDPI, vol. 11(3), pages 1-16, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Eugene Furman & Adam Diamant & Murat Kristal, 2021. "Customer Acquisition and Retention: A Fluid Approach for Staffing," Production and Operations Management, Production and Operations Management Society, vol. 30(11), pages 4236-4257, November.
    2. Noa Zychlinski, 2023. "Applications of fluid models in service operations management," Queueing Systems: Theory and Applications, Springer, vol. 103(1), pages 161-185, February.
    3. Eugene Furman & Alex Cressman & Saeha Shin & Alexey Kuznetsov & Fahad Razak & Amol Verma & Adam Diamant, 2021. "Prediction of personal protective equipment use in hospitals during COVID-19," Health Care Management Science, Springer, vol. 24(2), pages 439-453, June.
    4. Niyirora, Jerome & Zhuang, Jun, 2017. "Fluid approximations and control of queues in emergency departments," European Journal of Operational Research, Elsevier, vol. 261(3), pages 1110-1124.
    5. Defraeye, Mieke & Van Nieuwenhuyse, Inneke, 2016. "Staffing and scheduling under nonstationary demand for service: A literature review," Omega, Elsevier, vol. 58(C), pages 4-25.
    6. Smirnov, Dmitry & Huchzermeier, Arnd, 2020. "Analytics for labor planning in systems with load-dependent service times," European Journal of Operational Research, Elsevier, vol. 287(2), pages 668-681.
    7. Ward Whitt, 2018. "A broad view of queueing theory through one issue," Queueing Systems: Theory and Applications, Springer, vol. 89(1), pages 3-14, June.
    8. J. G. Dai & Pengyi Shi, 2017. "A Two-Time-Scale Approach to Time-Varying Queues in Hospital Inpatient Flow Management," Operations Research, INFORMS, vol. 65(2), pages 514-536, April.
    9. Heemskerk, M. & Mandjes, M. & Mathijsen, B., 2022. "Staffing for many-server systems facing non-standard arrival processes," European Journal of Operational Research, Elsevier, vol. 296(3), pages 900-913.
    10. Galit B. Yom-Tov & Avishai Mandelbaum, 2014. "Erlang-R: A Time-Varying Queue with Reentrant Customers, in Support of Healthcare Staffing," Manufacturing & Service Operations Management, INFORMS, vol. 16(2), pages 283-299, May.
    11. Achal Bassamboo & Rouba Ibrahim, 2021. "A General Framework to Compare Announcement Accuracy: Static vs. LES-Based Announcement," Management Science, INFORMS, vol. 67(7), pages 4191-4208, July.
    12. Guo, Xiaotong & He, Yong & Ignatius, Joshua, 2025. "Optimal security and pricing strategies for AI cloud service providers: Balancing effort and price discounts across public, private, and hybrid AI cloud models," International Journal of Production Economics, Elsevier, vol. 284(C).
    13. Pei, Zhi & Dai, Xu & Yuan, Yilun & Du, Rui & Liu, Changchun, 2021. "Managing price and fleet size for courier service with shared drones," Omega, Elsevier, vol. 104(C).
    14. Rouba Ibrahim & Mor Armony & Achal Bassamboo, 2017. "Does the Past Predict the Future? The Case of Delay Announcements in Service Systems," Management Science, INFORMS, vol. 63(6), pages 1762-1780, June.
    15. Rouba Ibrahim & Ward Whitt, 2011. "Wait-Time Predictors for Customer Service Systems with Time-Varying Demand and Capacity," Operations Research, INFORMS, vol. 59(5), pages 1106-1118, October.
    16. Achal Bassamboo & Assaf Zeevi, 2009. "On a Data-Driven Method for Staffing Large Call Centers," Operations Research, INFORMS, vol. 57(3), pages 714-726, June.
    17. Siddharth Prakash Singh & Mohammad Delasay & Alan Scheller‐Wolf, 2023. "Real‐time delay announcement under competition," Production and Operations Management, Production and Operations Management Society, vol. 32(3), pages 863-881, March.
    18. Matthieu Jonckheere & Balakrishna J. Prabhu, 2018. "Asymptotics of insensitive load balancing and blocking phases," Queueing Systems: Theory and Applications, Springer, vol. 88(3), pages 243-278, April.
    19. Legros, Benjamin & Jouini, Oualid & Akşin, O. Zeynep & Koole, Ger, 2020. "Front-office multitasking between service encounters and back-office tasks," European Journal of Operational Research, Elsevier, vol. 287(3), pages 946-963.
    20. Jian Xu & Hemant K. Jain & Dongxiao Gu & Changyong Liang, 2025. "Business-Process-Driven Service Composition in a Hybrid Cloud Environment," Information Systems Frontiers, Springer, vol. 27(1), pages 259-281, February.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:322:y:2025:i:1:p:133-146. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.