IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v304y2023i2p618-633.html
   My bibliography  Save this article

Balanced routing with partial information in a distributed parallel many-server queueing system

Author

Listed:
  • Zhong, Zhiheng
  • Cao, Ping

Abstract

We consider a queueing system that is comprised of multiple stations serving a single class of customers. Each station consists of many statistically identical servers, and has its own dedicated queue. In this paper, we propose a family of generalized load-balancing routing policies parameterized by vectors c and d with ξ-choice (abbreviated as LB(ξ,c,d)), that are suitable for dynamic routing with partial state information. Under the proposed policy, upon an arrival, a subset of ξ stations will be randomly collected, with their state information being retrieved, where ξ is a positive integer-valued random variable satisfying P{ξ≥2}>0; then, this new customer will be routed to: (i) the station with minimum expected delay modified by c when all the collected stations are fully occupied; or otherwise (ii) the station with maximum idleness ratio modified by d among the collected stations with idle servers. The parameters c and d are employed to control the relative expected delay and utilization levels across stations. Using asymptotic analysis, we derive diffusion limits of queue-length processes as well as their stationary distributions under the LB(ξ,c,d) policy in the Halfin–Whitt regime. Based on these stationary diffusion limit results, we develop a solution procedure to determine policy parameters under different optimization criteria. Finally, we provide numerical experiments to validate the accuracy of our diffusion approximation, and compare the performances under the LB(ξ,c,d) policies with other routing policies.

Suggested Citation

  • Zhong, Zhiheng & Cao, Ping, 2023. "Balanced routing with partial information in a distributed parallel many-server queueing system," European Journal of Operational Research, Elsevier, vol. 304(2), pages 618-633.
  • Handle: RePEc:eee:ejores:v:304:y:2023:i:2:p:618-633
    DOI: 10.1016/j.ejor.2022.02.042
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221722001631
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2022.02.042?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Itay Gurvich & Ward Whitt, 2009. "Queue-and-Idleness-Ratio Controls in Many-Server Service Systems," Mathematics of Operations Research, INFORMS, vol. 34(2), pages 363-396, May.
    2. Wu, Rong & Down, Douglas G., 2009. "Round robin scheduling of heterogeneous parallel servers in heavy traffic," European Journal of Operational Research, Elsevier, vol. 195(2), pages 372-380, June.
    3. Rami Atar & Anat Lev-Ari, 2018. "Workload-Dependent Dynamic Priority for the Multiclass Queue with Reneging," Mathematics of Operations Research, INFORMS, vol. 43(2), pages 494-515, May.
    4. Avishai Mandelbaum & Alexander L. Stolyar, 2004. "Scheduling Flexible Servers with Convex Delay Costs: Heavy-Traffic Optimality of the Generalized cμ-Rule," Operations Research, INFORMS, vol. 52(6), pages 836-855, December.
    5. Noah Gans & Ger Koole & Avishai Mandelbaum, 2003. "Telephone Call Centers: Tutorial, Review, and Research Prospects," Manufacturing & Service Operations Management, INFORMS, vol. 5(2), pages 79-141, September.
    6. Bruce Hajek, 1985. "Extremal Splittings of Point Processes," Mathematics of Operations Research, INFORMS, vol. 10(4), pages 543-556, November.
    7. Varun Gupta & Neil Walton, 2019. "Load Balancing in the Nondegenerate Slowdown Regime," Operations Research, INFORMS, vol. 67(1), pages 281-294, January.
    8. Shlomo Halfin & Ward Whitt, 1981. "Heavy-Traffic Limits for Queues with Many Exponential Servers," Operations Research, INFORMS, vol. 29(3), pages 567-588, June.
    9. Lei Ying & R. Srikant & Xiaohan Kang, 2017. "The Power of Slightly More than One Sample in Randomized Load Balancing," Mathematics of Operations Research, INFORMS, vol. 42(3), pages 692-722, August.
    10. Zhen Liu & Rhonda Righter, 1998. "Optimal Load Balancing on Distributed Homogeneous Unreliable Processors," Operations Research, INFORMS, vol. 46(4), pages 563-573, August.
    11. Itai Gurvich & Ward Whitt, 2010. "Service-Level Differentiation in Many-Server Service Systems via Queue-Ratio Routing," Operations Research, INFORMS, vol. 58(2), pages 316-328, April.
    12. Itai Gurvich, 2014. "Validity of Heavy-Traffic Steady-State Approximations in Multiclass Queueing Networks: The Case of Queue-Ratio Disciplines," Mathematics of Operations Research, INFORMS, vol. 39(1), pages 121-162, February.
    13. Kuang Xu & Yuan Zhong, 2020. "Information and Memory in Dynamic Resource Allocation," Operations Research, INFORMS, vol. 68(6), pages 1698-1715, November.
    14. Tolga Tezcan, 2008. "Optimal Control of Distributed Parallel Server Systems Under the Halfin and Whitt Regime," Mathematics of Operations Research, INFORMS, vol. 33(1), pages 51-90, February.
    15. Gad Allon & Sarang Deo & Wuqin Lin, 2013. "The Impact of Size and Occupancy of Hospital on the Extent of Ambulance Diversion: Theory and Evidence," Operations Research, INFORMS, vol. 61(3), pages 544-562, June.
    16. Ilyas Iyoob & Emrah Zarifoglu & A. B. Dieker, 2013. "Cloud Computing Operations Research," Service Science, INFORMS, vol. 5(2), pages 88-101, June.
    17. A. Charnes & W. W. Cooper, 1962. "Programming with linear fractional functionals," Naval Research Logistics Quarterly, John Wiley & Sons, vol. 9(3‐4), pages 181-186, September.
    18. Itay Gurvich & Ward Whitt, 2009. "Scheduling Flexible Servers with Convex Delay Costs in Many-Server Service Systems," Manufacturing & Service Operations Management, INFORMS, vol. 11(2), pages 237-253, June.
    19. Mor Armony & Amy R. Ward, 2010. "Fair Dynamic Routing in Large-Scale Heterogeneous-Server Systems," Operations Research, INFORMS, vol. 58(3), pages 624-637, June.
    20. Cao, Ping & Zhong, Zhiheng & Huang, Junfei, 2021. "Dynamic routing in a distributed parallel many-server service system: The effect of ξ-choice," European Journal of Operational Research, Elsevier, vol. 294(1), pages 219-235.
    21. Baris Ata & Xiaoshan Peng, 2018. "An Equilibrium Analysis of a Multiclass Queue with Endogenous Abandonments in Heavy Traffic," Operations Research, INFORMS, vol. 66(1), pages 163-183, January.
    22. J. G. Dai & Tolga Tezcan, 2011. "State Space Collapse in Many-Server Diffusion Limits of Parallel Server Systems," Mathematics of Operations Research, INFORMS, vol. 36(2), pages 271-320, May.
    23. Amarjit Budhiraja & Chihoon Lee, 2009. "Stationary Distribution Convergence for Generalized Jackson Networks in Heavy Traffic," Mathematics of Operations Research, INFORMS, vol. 34(1), pages 45-56, February.
    24. Hong Chen & Heng-Qing Ye, 2012. "Asymptotic Optimality of Balanced Routing," Operations Research, INFORMS, vol. 60(1), pages 163-179, February.
    25. Avishai Mandelbaum & Petar Momčilović & Yulia Tseytlin, 2012. "On Fair Routing from Emergency Departments to Hospital Wards: QED Queues with Heterogeneous Servers," Management Science, INFORMS, vol. 58(7), pages 1273-1291, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dijkstra, Sander & Baas, Stef & Braaksma, Aleida & Boucherie, Richard J., 2023. "Dynamic fair balancing of COVID-19 patients over hospitals based on forecasts of bed occupancy," Omega, Elsevier, vol. 116(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cao, Ping & Zhong, Zhiheng & Huang, Junfei, 2021. "Dynamic routing in a distributed parallel many-server service system: The effect of ξ-choice," European Journal of Operational Research, Elsevier, vol. 294(1), pages 219-235.
    2. Jinsheng Chen & Jing Dong & Pengyi Shi, 2020. "A survey on skill-based routing with applications to service operations management," Queueing Systems: Theory and Applications, Springer, vol. 96(1), pages 53-82, October.
    3. Noa Zychlinski, 2023. "Applications of fluid models in service operations management," Queueing Systems: Theory and Applications, Springer, vol. 103(1), pages 161-185, February.
    4. Amy R. Ward & Mor Armony, 2013. "Blind Fair Routing in Large-Scale Service Systems with Heterogeneous Customers and Servers," Operations Research, INFORMS, vol. 61(1), pages 228-243, February.
    5. Arapostathis, Ari & Pang, Guodong, 2019. "Infinite horizon asymptotic average optimality for large-scale parallel server networks," Stochastic Processes and their Applications, Elsevier, vol. 129(1), pages 283-322.
    6. Carri W. Chan & Linda V. Green & Suparerk Lekwijit & Lijian Lu & Gabriel Escobar, 2019. "Assessing the Impact of Service Level When Customer Needs Are Uncertain: An Empirical Investigation of Hospital Step-Down Units," Management Science, INFORMS, vol. 65(2), pages 751-775, February.
    7. Anton Braverman, 2020. "Steady-State Analysis of the Join-the-Shortest-Queue Model in the Halfin–Whitt Regime," Mathematics of Operations Research, INFORMS, vol. 45(3), pages 1069-1103, August.
    8. Itay Gurvich & Ward Whitt, 2009. "Queue-and-Idleness-Ratio Controls in Many-Server Service Systems," Mathematics of Operations Research, INFORMS, vol. 34(2), pages 363-396, May.
    9. J. G. Dai & Tolga Tezcan, 2011. "State Space Collapse in Many-Server Diffusion Limits of Parallel Server Systems," Mathematics of Operations Research, INFORMS, vol. 36(2), pages 271-320, May.
    10. Wyean Chan & Ger Koole & Pierre L'Ecuyer, 2014. "Dynamic Call Center Routing Policies Using Call Waiting and Agent Idle Times," Manufacturing & Service Operations Management, INFORMS, vol. 16(4), pages 544-560, October.
    11. Petar Momčilović & Amir Motaei, 2018. "QED limits for many-server systems under a priority policy," Queueing Systems: Theory and Applications, Springer, vol. 90(1), pages 125-159, October.
    12. Merve Bodur & James R. Luedtke, 2017. "Mixed-Integer Rounding Enhanced Benders Decomposition for Multiclass Service-System Staffing and Scheduling with Arrival Rate Uncertainty," Management Science, INFORMS, vol. 63(7), pages 2073-2091, July.
    13. Adan, Ivo J.B.F. & Boon, Marko A.A. & Weiss, Gideon, 2019. "Design heuristic for parallel many server systems," European Journal of Operational Research, Elsevier, vol. 273(1), pages 259-277.
    14. Zhenghua Long & Nahum Shimkin & Hailun Zhang & Jiheng Zhang, 2020. "Dynamic Scheduling of Multiclass Many-Server Queues with Abandonment: The Generalized cμ / h Rule," Operations Research, INFORMS, vol. 68(4), pages 1128-1230, July.
    15. Vijay Mehrotra & Kevin Ross & Geoff Ryder & Yong-Pin Zhou, 2012. "Routing to Manage Resolution and Waiting Time in Call Centers with Heterogeneous Servers," Manufacturing & Service Operations Management, INFORMS, vol. 14(1), pages 66-81, January.
    16. Hassan Hmedi & Ari Arapostathis & Guodong Pang, 2022. "Uniform stability of some large-scale parallel server networks," Queueing Systems: Theory and Applications, Springer, vol. 102(3), pages 509-552, December.
    17. Avishai Mandelbaum & Petar Momčilović & Yulia Tseytlin, 2012. "On Fair Routing from Emergency Departments to Hospital Wards: QED Queues with Heterogeneous Servers," Management Science, INFORMS, vol. 58(7), pages 1273-1291, July.
    18. Ohad Perry & Ward Whitt, 2011. "A Fluid Approximation for Service Systems Responding to Unexpected Overloads," Operations Research, INFORMS, vol. 59(5), pages 1159-1170, October.
    19. Mor Armony & Amy R. Ward, 2010. "Fair Dynamic Routing in Large-Scale Heterogeneous-Server Systems," Operations Research, INFORMS, vol. 58(3), pages 624-637, June.
    20. Jeunghyun Kim & Ramandeep S. Randhawa & Amy R. Ward, 2018. "Dynamic Scheduling in a Many-Server, Multiclass System: The Role of Customer Impatience in Large Systems," Manufacturing & Service Operations Management, INFORMS, vol. 20(2), pages 285-301, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:304:y:2023:i:2:p:618-633. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.