IDEAS home Printed from https://ideas.repec.org/a/spr/joinma/v34y2023i1d10.1007_s10845-022-01982-5.html
   My bibliography  Save this article

Hybrid algorithm based on reinforcement learning for smart inventory management

Author

Listed:
  • Carlos Cuartas

    (Universidad EAFIT)

  • Jose Aguilar

    (Universidad EAFIT
    Universidad de Alcalá
    Universidad de Los Andes)

Abstract

This article proposes a hybrid algorithm based on reinforcement learning and the inventory management methodology called DDMRP (Demand Driven Material Requirement Planning) to determine the optimal time to buy a certain product, and how much quantity should be requested. For this, the inventory management problem is formulated as a Markov Decision Process where the environment with which the system interacts is designed from the concepts raised in the DDMRP methodology, and through the reinforcement learning algorithm—specifically, Q-Learning. The optimal policy is determined for making decisions about when and how much to buy. To determine the optimal policy, three approaches are proposed for the reward function: the first one is based on inventory levels; the second is an optimization function based on the distance of the inventory to its optimal level, and the third is a shaping function based on levels and distances to the optimal inventory. The results show that the proposed algorithm has promising results in scenarios with different characteristics, performing adequately in difficult case studies, with a diversity of situations such as scenarios with discontinuous or continuous demand, seasonal and non-seasonal behavior, and with high demand peaks, among others.

Suggested Citation

  • Carlos Cuartas & Jose Aguilar, 2023. "Hybrid algorithm based on reinforcement learning for smart inventory management," Journal of Intelligent Manufacturing, Springer, vol. 34(1), pages 123-149, January.
  • Handle: RePEc:spr:joinma:v:34:y:2023:i:1:d:10.1007_s10845-022-01982-5
    DOI: 10.1007/s10845-022-01982-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10845-022-01982-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10845-022-01982-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Dhahri, Issam & Chabchoub, Habib, 2007. "Nonlinear goal programming models quantifying the bullwhip effect in supply chain based on ARIMA parameters," European Journal of Operational Research, Elsevier, vol. 177(3), pages 1800-1810, March.
    2. Giannoccaro, Ilaria & Pontrandolfo, Pierpaolo, 2002. "Inventory management in supply chains: a reinforcement learning approach," International Journal of Production Economics, Elsevier, vol. 78(2), pages 153-161, July.
    3. Olumide Emmanuel Oluyisola & Swapnil Bhalla & Fabio Sgarbossa & Jan Ola Strandhagen, 2022. "Designing and developing smart production planning and control systems in the industry 4.0 era: a methodology and case study," Journal of Intelligent Manufacturing, Springer, vol. 33(1), pages 311-332, January.
    4. Angela Patricia Velasco Acosta & Christian Mascle & Pierre Baptiste, 2020. "Applicability of Demand-Driven MRP in a complex manufacturing environment," International Journal of Production Research, Taylor & Francis Journals, vol. 58(14), pages 4233-4245, July.
    5. Thomy Eko Saputro & Gonçalo Figueira & Bernardo Almada-Lobo, 2021. "Integrating supplier selection with inventory management under supply disruptions," International Journal of Production Research, Taylor & Francis Journals, vol. 59(11), pages 3304-3322, June.
    6. Haiying Che & Zixing Bai & Rong Zuo & Honglei Li, 2020. "A Deep Reinforcement Learning Approach to the Optimization of Data Center Task Scheduling," Complexity, Hindawi, vol. 2020, pages 1-12, August.
    7. Sushil Punia & Konstantinos Nikolopoulos & Surya Prakash Singh & Jitendra K. Madaan & Konstantia Litsiou, 2020. "Deep learning with long short-term memory networks and random forests for demand forecasting in multi-channel retail," International Journal of Production Research, Taylor & Francis Journals, vol. 58(16), pages 4964-4979, July.
    8. Fei Zhang & Zailin Guan & Li Zhang & Yanyan Cui & Pengxing Yi & Saif Ullah, 2019. "Inventory management for a remanufacture-to-order production with multi-components (parts)," Journal of Intelligent Manufacturing, Springer, vol. 30(1), pages 59-78, January.
    9. Matthias Thürer & Nuno O. Fernandes & Mark Stevenson, 2022. "Production planning and control in multi-stage assembly systems: an assessment of Kanban, MRP, OPT (DBR) and DDMRP by simulation," International Journal of Production Research, Taylor & Francis Journals, vol. 60(3), pages 1036-1050, February.
    10. Edward A. Silver, 1981. "Operations Research in Inventory Management: A Review and Critique," Operations Research, INFORMS, vol. 29(4), pages 628-645, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mahmut Parlar & David Perry, 1996. "Inventory models of future supply uncertainty with single and multiple suppliers," Naval Research Logistics (NRL), John Wiley & Sons, vol. 43(2), pages 191-210, March.
    2. Schmid, Verena & Doerner, Karl F. & Laporte, Gilbert, 2013. "Rich routing problems arising in supply chain management," European Journal of Operational Research, Elsevier, vol. 224(3), pages 435-448.
    3. Visentin, Andrea & Prestwich, Steven & Rossi, Roberto & Tarim, S. Armagan, 2021. "Computing optimal (R,s,S) policy parameters by a hybrid of branch-and-bound and stochastic dynamic programming," European Journal of Operational Research, Elsevier, vol. 294(1), pages 91-99.
    4. Gurkan, M. Edib & Tunc, Huseyin & Tarim, S. Armagan, 2022. "The joint stochastic lot sizing and pricing problem," Omega, Elsevier, vol. 108(C).
    5. Durdu Hakan Utku, 2023. "The Evaluation and Improvement of the Production Processes of an Automotive Industry Company via Simulation and Optimization," Sustainability, MDPI, vol. 15(3), pages 1-17, January.
    6. Fotuhi, Fateme & Huynh, Nathan & Vidal, Jose M. & Xie, Yuanchang, 2013. "Modeling yard crane operators as reinforcement learning agents," Research in Transportation Economics, Elsevier, vol. 42(1), pages 3-12.
    7. Kumar, Anupam & Evers, Philip T., 2015. "Setting safety stock based on imprecise records," International Journal of Production Economics, Elsevier, vol. 169(C), pages 68-75.
    8. Keun Hee Lee & Mali Abdollahian & Sergei Schreider & Sona Taheri, 2023. "Supply Chain Demand Forecasting and Price Optimisation Models with Substitution Effect," Mathematics, MDPI, vol. 11(11), pages 1-28, May.
    9. Azadi, Majid & Yousefi, Saeed & Farzipoor Saen, Reza & Shabanpour, Hadi & Jabeen, Fauzia, 2023. "Forecasting sustainability of healthcare supply chains using deep learning and network data envelopment analysis," Journal of Business Research, Elsevier, vol. 154(C).
    10. Gel, Esma S. & Salman, F. Sibel, 2022. "Dynamic ordering decisions with approximate learning of supply yield uncertainty," International Journal of Production Economics, Elsevier, vol. 243(C).
    11. Hadar Amrani & Eugene Khmelnitsky, 2017. "Optimal division of inventory between depot and bases," Naval Research Logistics (NRL), John Wiley & Sons, vol. 64(1), pages 3-18, February.
    12. P K Tripathy & W-M Wee & P R Majhi, 2003. "An EOQ model with process reliability considerations," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(5), pages 549-554, May.
    13. Parlar, Mahmut & Perry, David, 1995. "Analysis of a (Q, r, T) inventory policy with deterministic and random yields when future supply is uncertain," European Journal of Operational Research, Elsevier, vol. 84(2), pages 431-443, July.
    14. Bellantuono, Nicola & Giannoccaro, Ilaria & Pontrandolfo, Pierpaolo & Tang, Christopher S., 2009. "The implications of joint adoption of revenue sharing and advance booking discount programs," International Journal of Production Economics, Elsevier, vol. 121(2), pages 383-394, October.
    15. Monthatipkul, Chumpol & Yenradee, Pisal, 2008. "Inventory/distribution control system in a one-warehouse/multi-retailer supply chain," International Journal of Production Economics, Elsevier, vol. 114(1), pages 119-133, July.
    16. Sandun C. Perera & Suresh P. Sethi, 2023. "A survey of stochastic inventory models with fixed costs: Optimality of (s, S) and (s, S)‐type policies—Discrete‐time case," Production and Operations Management, Production and Operations Management Society, vol. 32(1), pages 131-153, January.
    17. Nikolopoulos, Konstantinos, 2021. "We need to talk about intermittent demand forecasting," European Journal of Operational Research, Elsevier, vol. 291(2), pages 549-559.
    18. Patanjal Kumar & Sachin Kumar Mangla & Yigit Kazancoglu & Ali Emrouznejad, 2023. "A decision framework for incorporating the coordination and behavioural issues in sustainable supply chains in digital economy," Annals of Operations Research, Springer, vol. 326(2), pages 721-749, July.
    19. Hsing-Chun Hung & Yuh-Wen Chen, 2023. "Striving to Achieve United Nations Sustainable Development Goals of Taiwanese SMEs by Adopting Industry 4.0," Sustainability, MDPI, vol. 15(3), pages 1-18, January.
    20. David Yao & Morton Klein, 1989. "Lot sizes under continuous demand: The backorder case," Naval Research Logistics (NRL), John Wiley & Sons, vol. 36(5), pages 615-624, October.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joinma:v:34:y:2023:i:1:d:10.1007_s10845-022-01982-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.