IDEAS home Printed from https://ideas.repec.org/a/inm/ormnsc/v67y2021i10p6089-6115.html
   My bibliography  Save this article

Marrying Stochastic Gradient Descent with Bandits: Learning Algorithms for Inventory Systems with Fixed Costs

Author

Listed:
  • Hao Yuan

    (Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, Michigan 48105)

  • Qi Luo

    (Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, Michigan 48105)

  • Cong Shi

    (Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, Michigan 48105)

Abstract

We consider a periodic-review single-product inventory system with fixed cost under censored demand. Under full demand distributional information, it is well known that the celebrated (s, S) policy is optimal. In this paper, we assume the firm does not know the demand distribution a priori and makes adaptive inventory ordering decisions in each period based only on the past sales (a.k.a. censored demand). Our performance measure is regret, which is the cost difference between a feasible learning algorithm and the clairvoyant (full-information) benchmark. Compared with prior literature, the key difficulty of this problem lies in the loss of joint convexity of the objective function as a result of the presence of fixed cost. We develop the first learning algorithm, termed the ( δ , S ) policy, that combines the power of stochastic gradient descent , bandit controls , and simulation-based methods in a seamless and nontrivial fashion. We prove that the cumulative regret is O ( log T T ) , which is provably tight up to a logarithmic factor. We also develop several technical results that are of independent interest. We believe that the developed framework could be widely applied to learning other important stochastic systems with partial convexity in the objectives.

Suggested Citation

  • Hao Yuan & Qi Luo & Cong Shi, 2021. "Marrying Stochastic Gradient Descent with Bandits: Learning Algorithms for Inventory Systems with Fixed Costs," Management Science, INFORMS, vol. 67(10), pages 6089-6115, October.
  • Handle: RePEc:inm:ormnsc:v:67:y:2021:i:10:p:6089-6115
    DOI: 10.1287/mnsc.2020.3799
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mnsc.2020.3799
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mnsc.2020.3799?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Suresh P. Sethi & Feng Cheng, 1997. "Optimality of ( s , S ) Policies in Inventory Models with Markovian Demand," Operations Research, INFORMS, vol. 45(6), pages 931-939, December.
    2. Li Chen & Erica L. Plambeck, 2008. "Dynamic Inventory Management with Learning About the Demand Distribution and Substitution Probability," Manufacturing & Service Operations Management, INFORMS, vol. 10(2), pages 236-256, May.
    3. Katy S. Azoury, 1985. "Bayes Solution to Dynamic Inventory Models Under Unknown Demand Distribution," Management Science, INFORMS, vol. 31(9), pages 1150-1160, September.
    4. Özalp Özer & Wei Wei, 2004. "Inventory Control with Limited Capacity and Advance Demand Information," Operations Research, INFORMS, vol. 52(6), pages 988-1000, December.
    5. Khouja, Moutaz & Goyal, Suresh, 2008. "A review of the joint replenishment problem literature: 1989-2005," European Journal of Operational Research, Elsevier, vol. 186(1), pages 1-16, April.
    6. Awi Federgruen & Paul Zipkin, 1984. "An Efficient Algorithm for Computing Optimal ( s , S ) Policies," Operations Research, INFORMS, vol. 32(6), pages 1268-1285, December.
    7. Donald L. Iglehart, 1963. "Optimality of (s, S) Policies in the Infinite Horizon Dynamic Inventory Problem," Management Science, INFORMS, vol. 9(2), pages 259-267, January.
    8. Cong Shi & Huanan Zhang & Xiuli Chao & Retsef Levi, 2014. "Approximation algorithms for capacitated stochastic inventory systems with setup costs," Naval Research Logistics (NRL), John Wiley & Sons, vol. 61(4), pages 304-319, June.
    9. Peng Hu & Ye Lu & Miao Song, 2019. "Joint Pricing and Inventory Control with Fixed and Convex/Concave Variable Production Costs," Production and Operations Management, Production and Operations Management Society, vol. 28(4), pages 847-877, April.
    10. Xiangwen Lu & Jing-Sheng Song & Kaijie Zhu, 2008. "Analysis of Perishable-Inventory Systems with Censored Demand Data," Operations Research, INFORMS, vol. 56(4), pages 1034-1038, August.
    11. Woonghee Tim Huh & Ganesh Janakiraman & John A. Muckstadt & Paat Rusmevichientong, 2009. "An Adaptive Algorithm for Finding the Optimal Base-Stock Policy in Lost Sales Inventory Systems with Censored Demand," Mathematics of Operations Research, INFORMS, vol. 34(2), pages 397-416, May.
    12. Georgia Perakis & Guillaume Roels, 2008. "Regret in the Newsvendor Model with Partial Information," Operations Research, INFORMS, vol. 56(1), pages 188-203, February.
    13. Huanan Zhang & Xiuli Chao & Cong Shi, 2020. "Closing the Gap: A Learning Algorithm for Lost-Sales Inventory Systems with Lead Times," Management Science, INFORMS, vol. 66(5), pages 1962-1980, May.
    14. Arthur F. Veinott, Jr. & Harvey M. Wagner, 1965. "Computing Optimal (s, S) Inventory Policies," Management Science, INFORMS, vol. 11(5), pages 525-552, March.
    15. Woonghee Tim Huh & Retsef Levi & Paat Rusmevichientong & James B. Orlin, 2011. "Adaptive Data-Driven Inventory Control with Censored Demand Based on Kaplan-Meier Estimator," Operations Research, INFORMS, vol. 59(4), pages 929-941, August.
    16. Yongpei Guan & Andrew J. Miller, 2008. "Polynomial-Time Algorithms for Stochastic Uncapacitated Lot-Sizing Problems," Operations Research, INFORMS, vol. 56(5), pages 1172-1183, October.
    17. Chen Shaoxiang & M. Lambrecht, 1996. "X-Y Band and Modified ( s , S ) Policy," Operations Research, INFORMS, vol. 44(6), pages 1013-1019, December.
    18. Xiangwen Lu & Jing-Sheng Song & Kaijie Zhu, 2005. "On “The Censored Newsvendor and the Optimal Acquisition of Information”," Operations Research, INFORMS, vol. 53(6), pages 1024-1026, December.
    19. Retsef Levi & Georgia Perakis & Joline Uichanco, 2015. "The Data-Driven Newsvendor Problem: New Bounds and Insights," Operations Research, INFORMS, vol. 63(6), pages 1294-1306, December.
    20. Xiuli Chao & Paul H. Zipkin, 2008. "Optimal Policy for a Periodic-Review Inventory System Under a Supply Capacity Contract," Operations Research, INFORMS, vol. 56(1), pages 59-68, February.
    21. Gregory A. Godfrey & Warren B. Powell, 2001. "An Adaptive, Distribution-Free Algorithm for the Newsvendor Problem with Censored Demands, with Applications to Inventory and Distribution," Management Science, INFORMS, vol. 47(8), pages 1101-1112, August.
    22. Ozgun Caliskan-Demirag & Youhua (Frank) Chen & Yi Yang, 2012. "Ordering Policies for Periodic-Review Inventory Systems with Quantity-Dependent Fixed Costs," Operations Research, INFORMS, vol. 60(4), pages 785-796, August.
    23. Weidong Chen & Cong Shi & Izak Duenyas, 2020. "Optimal Learning Algorithms for Stochastic Inventory Systems with Random Capacities," Production and Operations Management, Production and Operations Management Society, vol. 29(7), pages 1624-1649, July.
    24. D. Beyer & S. P. Sethi, 1997. "Average Cost Optimality in Inventory Models with Markovian Demands," Journal of Optimization Theory and Applications, Springer, vol. 92(3), pages 497-526, March.
    25. Yu-Sheng Zheng & A. Federgruen, 1991. "Finding Optimal (s, S) Policies Is About As Simple As Evaluating a Single Policy," Operations Research, INFORMS, vol. 39(4), pages 654-665, August.
    26. Woonghee Tim Huh & Paat Rusmevichientong, 2009. "A Nonparametric Asymptotic Analysis of Inventory Planning with Censored Demand," Mathematics of Operations Research, INFORMS, vol. 34(1), pages 103-123, February.
    27. Paul Zipkin, 2008. "On the Structure of Lost-Sales Inventory Models," Operations Research, INFORMS, vol. 56(4), pages 937-944, August.
    28. Retsef Levi & Robin O. Roundy & David B. Shmoys, 2007. "Provably Near-Optimal Sampling-Based Policies for Stochastic Inventory Control Models," Mathematics of Operations Research, INFORMS, vol. 32(4), pages 821-839, November.
    29. Retsef Levi & Cong Shi, 2013. "Approximation Algorithms for the Stochastic Lot-Sizing Problem with Order Lead Times," Operations Research, INFORMS, vol. 61(3), pages 593-602, June.
    30. Boxiao Chen & Xiuli Chao & Cong Shi, 2021. "Nonparametric Learning Algorithms for Joint Pricing and Inventory Control with Lost Sales and Censored Demand," Mathematics of Operations Research, INFORMS, vol. 46(2), pages 726-756, May.
    31. Warren Powell & Andrzej Ruszczyński & Huseyin Topaloglu, 2004. "Learning Algorithms for Separable Approximations of Discrete Stochastic Optimization Problems," Mathematics of Operations Research, INFORMS, vol. 29(4), pages 814-836, November.
    32. Boxiao Chen & Xiuli Chao & Hyun-Soo Ahn, 2019. "Coordinating Pricing and Inventory Replenishment with Nonparametric Demand Learning," Operations Research, INFORMS, vol. 67(4), pages 1035-1052, July.
    33. George R. Murray, Jr. & Edward A. Silver, 1966. "A Bayesian Analysis of the Style Goods Inventory Problem," Management Science, INFORMS, vol. 12(11), pages 785-797, July.
    34. Donald L. Iglehart, 1964. "The Dynamic Inventory Problem with Unknown Demand Distribution," Management Science, INFORMS, vol. 10(3), pages 429-440, April.
    35. Woonghee Tim Huh & Ganesh Janakiraman, 2008. "( s, S ) Optimality in Joint Inventory-Pricing Control: An Alternate Approach," Operations Research, INFORMS, vol. 56(3), pages 783-790, June.
    36. Apostolos N. Burnetas & Craig E. Smith, 2000. "Adaptive Ordering and Pricing for Perishable Products," Operations Research, INFORMS, vol. 48(3), pages 436-443, June.
    37. Qi Feng, 2010. "Integrating Dynamic Pricing and Replenishment Decisions Under Supply Capacity Uncertainty," Management Science, INFORMS, vol. 56(12), pages 2154-2172, December.
    38. Zhan Pang & Frank Y. Chen & Youyi Feng, 2012. "Technical Note---A Note on the Structure of Joint Inventory-Pricing Control with Leadtimes," Operations Research, INFORMS, vol. 60(3), pages 581-587, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jinzhi Bu & David Simchi-Levi & Li Wang, 2023. "Offline Pricing and Demand Learning with Censored Data," Management Science, INFORMS, vol. 69(2), pages 885-903, February.
    2. David Simchi-Levi & Rui Sun & Huanan Zhang, 2022. "Online Learning and Optimization for Revenue Management Problems with Add-on Discounts," Management Science, INFORMS, vol. 68(10), pages 7402-7421, October.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gah-Yi Ban, 2020. "Confidence Intervals for Data-Driven Inventory Policies with Demand Censoring," Operations Research, INFORMS, vol. 68(2), pages 309-326, March.
    2. Boxiao Chen & Xiuli Chao & Cong Shi, 2021. "Nonparametric Learning Algorithms for Joint Pricing and Inventory Control with Lost Sales and Censored Demand," Mathematics of Operations Research, INFORMS, vol. 46(2), pages 726-756, May.
    3. Satya S. Malladi & Alan L. Erera & Chelsea C. White, 2023. "Inventory control with modulated demand and a partially observed modulation process," Annals of Operations Research, Springer, vol. 321(1), pages 343-369, February.
    4. Woonghee Tim Huh & Paat Rusmevichientong, 2009. "A Nonparametric Asymptotic Analysis of Inventory Planning with Censored Demand," Mathematics of Operations Research, INFORMS, vol. 34(1), pages 103-123, February.
    5. Boxiao Chen & David Simchi-Levi & Yining Wang & Yuan Zhou, 2022. "Dynamic Pricing and Inventory Control with Fixed Ordering Cost and Incomplete Demand Information," Management Science, INFORMS, vol. 68(8), pages 5684-5703, August.
    6. Woonghee Tim Huh & Retsef Levi & Paat Rusmevichientong & James B. Orlin, 2011. "Adaptive Data-Driven Inventory Control with Censored Demand Based on Kaplan-Meier Estimator," Operations Research, INFORMS, vol. 59(4), pages 929-941, August.
    7. Cong Shi & Weidong Chen & Izak Duenyas, 2016. "Technical Note—Nonparametric Data-Driven Algorithms for Multiproduct Inventory Systems with Censored Demand," Operations Research, INFORMS, vol. 64(2), pages 362-370, April.
    8. Boxiao Chen & Xiuli Chao, 2020. "Dynamic Inventory Control with Stockout Substitution and Demand Learning," Management Science, INFORMS, vol. 66(11), pages 5108-5127, November.
    9. Aditya Jain & Nils Rudi & Tong Wang, 2015. "Demand Estimation and Ordering Under Censoring: Stock-Out Timing Is (Almost) All You Need," Operations Research, INFORMS, vol. 63(1), pages 134-150, February.
    10. Woonghee Tim Huh & Paat Rusmevichientong, 2014. "Online Sequential Optimization with Biased Gradients: Theory and Applications to Censored Demand," INFORMS Journal on Computing, INFORMS, vol. 26(1), pages 150-159, February.
    11. Lin An & Andrew A. Li & Benjamin Moseley & R. Ravi, 2023. "The Nonstationary Newsvendor with (and without) Predictions," Papers 2305.07993, arXiv.org, revised Oct 2023.
    12. Sandun C. Perera & Suresh P. Sethi, 2023. "A survey of stochastic inventory models with fixed costs: Optimality of (s, S) and (s, S)‐type policies—Discrete‐time case," Production and Operations Management, Production and Operations Management Society, vol. 32(1), pages 131-153, January.
    13. Weidong Chen & Cong Shi & Izak Duenyas, 2020. "Optimal Learning Algorithms for Stochastic Inventory Systems with Random Capacities," Production and Operations Management, Production and Operations Management Society, vol. 29(7), pages 1624-1649, July.
    14. Gah-Yi Ban & Cynthia Rudin, 2019. "The Big Data Newsvendor: Practical Insights from Machine Learning," Operations Research, INFORMS, vol. 67(1), pages 90-108, January.
    15. Gen Sakoda & Hideki Takayasu & Misako Takayasu, 2019. "Data Science Solutions for Retail Strategy to Reduce Waste Keeping High Profit," Sustainability, MDPI, vol. 11(13), pages 1-30, June.
    16. Jinzhi Bu & David Simchi-Levi & Li Wang, 2023. "Offline Pricing and Demand Learning with Censored Data," Management Science, INFORMS, vol. 69(2), pages 885-903, February.
    17. Andrew F. Siegel & Michael R. Wagner, 2021. "Profit Estimation Error in the Newsvendor Model Under a Parametric Demand Distribution," Management Science, INFORMS, vol. 67(8), pages 4863-4879, August.
    18. Jiri Chod & Mihalis G. Markakis & Nikolaos Trichakis, 2021. "On the Learning Benefits of Resource Flexibility," Management Science, INFORMS, vol. 67(10), pages 6513-6528, October.
    19. Sandun C. Perera & Suresh P. Sethi, 2023. "A survey of stochastic inventory models with fixed costs: Optimality of (s, S) and (s, S)‐type policies—Continuous‐time case," Production and Operations Management, Production and Operations Management Society, vol. 32(1), pages 154-169, January.
    20. Omar Besbes & Alp Muharremoglu, 2013. "On Implications of Demand Censoring in the Newsvendor Problem," Management Science, INFORMS, vol. 59(6), pages 1407-1424, June.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:67:y:2021:i:10:p:6089-6115. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.