IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v221y2012i1p99-109.html

Reinforcement learning for joint pricing, lead-time and scheduling decisions in make-to-order systems

Authors
  • Li, Xueping
  • Wang, Jiao
  • Sawhney, Rapinder

Abstract

This paper investigates a problem faced by a make-to-order (MTO) firm that can accept or reject orders and can set prices and lead-times to influence demand. The model accounts for inventory holding costs for orders completed early, tardiness costs for orders delivered late, order rejection costs, variable manufacturing costs, and fixed costs. To maximize expected profit over an infinite planning horizon with stochastic demand, the firm must decide which orders to accept or reject, how to trade off price against lead-time, and how to weigh the potential for increased demand against capacity constraints. We model the problem as a Semi-Markov Decision Problem (SMDP) and develop a reinforcement learning (RL) based Q-learning algorithm (QLA) to solve it. In addition, we build a discrete-event simulation model to validate the performance of the QLA and compare the experimental results with two benchmark policies: the First-Come-First-Serve (FCFS) policy and a threshold heuristic policy. The results show that the QLA outperforms both benchmark policies.
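
To make the idea of learning an order-acceptance policy concrete, the following is a minimal sketch of tabular Q-learning on a toy problem. It is not the authors' QLA: the abstract does not give their state, action, or reward definitions, so everything here is an assumption made for illustration. In particular, the sketch uses a discrete-time, discounted simplification instead of an SMDP, a single price level, and hypothetical constants (MAX_BACKLOG, PRICE, TARDY_COST, REJECT_COST, SERVICE_PROB).

```python
import random

# All numbers below are illustrative assumptions, not values from the paper.
MAX_BACKLOG = 4        # largest backlog the toy shop can hold
PRICE = 10.0           # revenue for an accepted order
TARDY_COST = 5.0       # lateness penalty per order already waiting
REJECT_COST = 1.0      # cost of turning an order away
SERVICE_PROB = 0.5     # chance that one order completes in a period
ACTIONS = ("reject", "accept")


def step(backlog, action, rng):
    """Simulate one period with one arriving order: (reward, next_backlog)."""
    if action == "accept" and backlog < MAX_BACKLOG:
        reward = PRICE - TARDY_COST * backlog
        backlog += 1
    else:
        # A full shop turns the order away even if the agent says "accept".
        reward = -REJECT_COST
    if backlog > 0 and rng.random() < SERVICE_PROB:
        backlog -= 1
    return reward, backlog


def train(steps=400_000, alpha=0.02, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {s: {a: 0.0 for a in ACTIONS} for s in range(MAX_BACKLOG + 1)}
    state = 0
    for _ in range(steps):
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)                      # explore
        else:
            action = max(ACTIONS, key=lambda a: q[state][a])  # exploit
        reward, next_state = step(state, action, rng)
        # Standard Q-learning update toward the one-step bootstrapped target.
        target = reward + gamma * max(q[next_state].values())
        q[state][action] += alpha * (target - q[state][action])
        state = next_state
    return q


def greedy_policy(q):
    """Read off the accept/reject decision with the highest Q-value per state."""
    return {s: max(ACTIONS, key=lambda a: q[s][a]) for s in q}
```

Calling greedy_policy(train()) returns one accept/reject decision per backlog level. Under these assumed costs the learned policy accepts orders when the shop is empty and turns them away once the backlog makes the lateness penalty outweigh the price, which is the kind of capacity-aware acceptance rule the abstract describes.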

Suggested Citation

  • Li, Xueping & Wang, Jiao & Sawhney, Rapinder, 2012. "Reinforcement learning for joint pricing, lead-time and scheduling decisions in make-to-order systems," European Journal of Operational Research, Elsevier, vol. 221(1), pages 99-109.
  • Handle: RePEc:eee:ejores:v:221:y:2012:i:1:p:99-109
    DOI: 10.1016/j.ejor.2012.03.020

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221712002135
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2012.03.020?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item


    References listed on IDEAS

    1. E. Andrew Boyd & Ioana C. Bilegan, 2003. "Revenue Management and E-Commerce," Management Science, INFORMS, vol. 49(10), pages 1363-1386, October.
    2. Shiming Deng & Candace A. Yano, 2006. "Joint Production and Pricing Decisions with Setup Costs and Capacity Constraints," Management Science, INFORMS, vol. 52(5), pages 741-756, May.
    3. Izak Duenyas & Wallace J. Hopp, 1995. "Quoting Customer Lead Times," Management Science, INFORMS, vol. 41(1), pages 43-57, January.
    4. Kut C. So, 2000. "Price and Time Competition for Service Delivery," Manufacturing & Service Operations Management, INFORMS, vol. 2(4), pages 392-409, April.
    5. MILLER, Bruce L., 1969. "A queueing reward system with several customer classes," LIDAM Reprints CORE 41, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    6. Bruce L. Miller, 1969. "A Queueing Reward System with Several Customer Classes," Management Science, INFORMS, vol. 16(3), pages 234-245, November.
    7. Gosavi, Abhijit, 2004. "Reinforcement learning for long-run average cost," European Journal of Operational Research, Elsevier, vol. 155(3), pages 654-674, June.
    8. Hall, Nicholas G. & Magazine, Michael J., 1994. "Maximizing the value of a space mission," European Journal of Operational Research, Elsevier, vol. 78(2), pages 224-241, October.
    9. Feichtinger, Gustav & Hartl, Richard, 1985. "Optimal pricing and production in an inventory model," European Journal of Operational Research, Elsevier, vol. 19(1), pages 45-56, January.
    10. Easton, Fred F. & Moodie, Douglas R., 1999. "Pricing and lead time decisions for make-to-order firms with contingent orders," European Journal of Operational Research, Elsevier, vol. 116(2), pages 305-318, July.
    11. Stephen M. Gilbert, 2000. "Coordination of Pricing and Multiple-Period Production Across Multiple Constant Priced Goods," Management Science, INFORMS, vol. 46(12), pages 1602-1616, December.
    12. Dov Pekelman, 1974. "Simultaneous Price-Production Decisions," Operations Research, INFORMS, vol. 22(4), pages 788-794, August.
    13. Chen, Miao-Sheng & Chu, Mei-Chen, 2003. "The analysis of optimal control model in matching problem between manufacturing and marketing," European Journal of Operational Research, Elsevier, vol. 150(2), pages 293-303, October.
    14. So, Kut C. & Song, Jing-Sheng, 1998. "Price, delivery time guarantees and capacity selection," European Journal of Operational Research, Elsevier, vol. 111(1), pages 28-49, November.
    15. T. M. Whitin, 1955. "Inventory Control and Price Theory," Management Science, INFORMS, vol. 2(1), pages 61-68, October.
    16. Pinar Keskinocak & R. Ravi & Sridhar Tayur, 2001. "Scheduling and Reliable Lead-Time Quotation for Orders with Availability Intervals and Lead-Time Sensitive Revenues," Management Science, INFORMS, vol. 47(2), pages 264-279, February.
    17. Izak Duenyas, 1995. "Single Facility Due Date Setting with Multiple Customer Classes," Management Science, INFORMS, vol. 41(4), pages 608-619, April.
    18. Charnsirisakskul, Kasarin & Griffin, Paul M. & Keskinocak, Pinar, 2006. "Pricing and scheduling decisions with leadtime flexibility," European Journal of Operational Research, Elsevier, vol. 171(1), pages 153-169, May.
    19. Tapas K. Das & Abhijit Gosavi & Sridhar Mahadevan & Nicholas Marchalleck, 1999. "Solving Semi-Markov Decision Problems Using Average Reward Reinforcement Learning," Management Science, INFORMS, vol. 45(4), pages 560-574, April.
    20. Kim, DaeSoo & Lee, Won J., 1998. "Optimal joint pricing and lot sizing with fixed and variable capacity," European Journal of Operational Research, Elsevier, vol. 109(1), pages 212-227, August.
    21. Gilbert, Stephen M., 1999. "Coordination of pricing and multi-period production for constant priced goods," European Journal of Operational Research, Elsevier, vol. 114(2), pages 330-337, April.

    Citations


    Cited by:

    1. Olumide Emmanuel Oluyisola & Swapnil Bhalla & Fabio Sgarbossa & Jan Ola Strandhagen, 2022. "Designing and developing smart production planning and control systems in the industry 4.0 era: a methodology and case study," Journal of Intelligent Manufacturing, Springer, vol. 33(1), pages 311-332, January.
    2. Sun, Yiqi & Wu, Zhengping & Zhu, Wanshan, 2022. "When do firms benefit from joint price and lead-time competition?," European Journal of Operational Research, Elsevier, vol. 302(2), pages 497-517.
    3. Yang, Hongbing & Li, Wenchao & Wang, Bin, 2021. "Joint optimization of preventive maintenance and production scheduling for multi-state production systems based on reinforcement learning," Reliability Engineering and System Safety, Elsevier, vol. 214(C).
    4. Juan Pablo Usuga Cadavid & Samir Lamouri & Bernard Grabot & Robert Pellerin & Arnaud Fortin, 2020. "Machine learning applied in production planning and control: a state-of-the-art in the era of industry 4.0," Journal of Intelligent Manufacturing, Springer, vol. 31(6), pages 1531-1558, August.
    5. Yan, Yimo & Chow, Andy H.F. & Ho, Chin Pang & Kuo, Yong-Hong & Wu, Qihao & Ying, Chengshuo, 2022. "Reinforcement learning for logistics and supply chain management: Methodologies, state of the art, and future opportunities," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 162(C).
    6. Tanja Mlinar & Philippe Chevalier, 2016. "Pooling heterogeneous products for manufacturing environments," 4OR, Springer, vol. 14(2), pages 173-200, June.
    7. Zhai, Yue & Cheng, T.C.E., 2022. "Lead-time quotation and hedging coordination in make-to-order supply chain," European Journal of Operational Research, Elsevier, vol. 300(2), pages 449-460.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Abhijit Upasani & Reha Uzsoy, 2008. "Incorporating manufacturing lead times in joint production-marketing models: A review and some future directions," Annals of Operations Research, Springer, vol. 161(1), pages 171-188, July.
    2. Lamas, Alejandro & Chevalier, Philippe, 2018. "Joint dynamic pricing and lot-sizing under competition," European Journal of Operational Research, Elsevier, vol. 266(3), pages 864-876.
    3. Charnsirisakskul, Kasarin & Griffin, Paul M. & Keskinocak, Pinar, 2006. "Pricing and scheduling decisions with leadtime flexibility," European Journal of Operational Research, Elsevier, vol. 171(1), pages 153-169, May.
    4. Hyun-soo Ahn & Mehmet Gümüş & Philip Kaminsky, 2007. "Pricing and Manufacturing Decisions When Demand Is a Function of Prices in Multiple Periods," Operations Research, INFORMS, vol. 55(6), pages 1039-1057, December.
    5. Zhi-Long Chen & Nicholas G. Hall, 2010. "The Coordination of Pricing and Scheduling Decisions," Manufacturing & Service Operations Management, INFORMS, vol. 12(1), pages 77-92, April.
    6. Shiming Deng & Candace A. Yano, 2006. "Joint Production and Pricing Decisions with Setup Costs and Capacity Constraints," Management Science, INFORMS, vol. 52(5), pages 741-756, May.
    7. Slotnick, Susan A., 2011. "Order acceptance and scheduling: A taxonomy and review," European Journal of Operational Research, Elsevier, vol. 212(1), pages 1-11, July.
    8. Seçil Savaşaneril & Paul M. Griffin & Pınar Keskinocak, 2010. "Dynamic Lead-Time Quotation for an M/M/1 Base-Stock Inventory Queue," Operations Research, INFORMS, vol. 58(2), pages 383-395, April.
    9. Barış Ata & Tava Lennon Olsen, 2009. "Near-Optimal Dynamic Lead-Time Quotation and Scheduling Under Convex-Concave Customer Delay Costs," Operations Research, INFORMS, vol. 57(3), pages 753-768, June.
    10. A. Baykal Hafızoğlu & Esma S. Gel & Pınar Keskinocak, 2016. "Price and Lead Time Quotation for Contract and Spot Customers," Operations Research, INFORMS, vol. 64(2), pages 406-415, April.
    11. W K Chiang & Y Feng, 2010. "Retailer or e-tailer? Strategic pricing and economic-lot-size decisions in a competitive supply chain with drop-shipping," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(11), pages 1645-1653, November.
    12. Bajwa, Naeem & Sox, Charles R. & Ishfaq, Rafay, 2016. "Coordinating pricing and production decisions for multiple products," Omega, Elsevier, vol. 64(C), pages 86-101.
    13. Tanja Mlinar & Philippe Chevalier, 2016. "Pooling heterogeneous products for manufacturing environments," 4OR, Springer, vol. 14(2), pages 173-200, June.
    14. Cemal AKTÜRK & Sevinç GÜLSEÇEN, 2018. "Sipariş Teslim Tarihi Problemi İçin Çok Kriterli ve Çok Yöntemli Karar Destek Sistemi Önerisi," Istanbul Management Journal, Istanbul University Business School, vol. 29(84), pages 65-78, June.
    15. Gökçe Kahveciog̃lu & Barış Balcıog̃lu, 2016. "Coping with production time variability via dynamic lead-time quotation," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 38(4), pages 877-898, October.
    16. Feng, Jiejian & Zhang, Michael, 2017. "Dynamic quotation of leadtime and price for a Make-To-Order system with multiple customer classes and perfect information on customer preferences," European Journal of Operational Research, Elsevier, vol. 258(1), pages 334-342.
    17. Murat Erkoc & S. David Wu & Haresh Gurnani, 2008. "Delivery‐date and capacity management in a decentralized internal market," Naval Research Logistics (NRL), John Wiley & Sons, vol. 55(5), pages 390-405, August.
    18. Sabri Çelik & Costis Maglaras, 2008. "Dynamic Pricing and Lead-Time Quotation for a Multiclass Make-to-Order Queue," Management Science, INFORMS, vol. 54(6), pages 1132-1146, June.
    19. Erica L. Plambeck, 2004. "Optimal Leadtime Differentiation via Diffusion Approximations," Operations Research, INFORMS, vol. 52(2), pages 213-228, April.
    20. Philipp Afèche & Opher Baron & Yoav Kerner, 2013. "Pricing Time-Sensitive Services Based on Realized Performance," Manufacturing & Service Operations Management, INFORMS, vol. 15(3), pages 492-506, July.
