IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2211.04586.html

Learning to Price Supply Chain Contracts against a Learning Retailer

Author

Listed:
  • Xuejun Zhao
  • Ruihao Zhu
  • William B. Haskell

Abstract

The rise of big data analytics has automated the decision-making of companies and increased supply chain agility. In this paper, we study the supply chain contract design problem faced by a data-driven supplier who needs to respond to the inventory decisions of the downstream retailer. Both the supplier and the retailer are uncertain about the market demand and need to learn about it sequentially. The goal for the supplier is to develop data-driven pricing policies with sublinear regret bounds under a wide range of possible retailer inventory policies for a fixed time horizon. To capture the dynamics induced by the retailer's learning policy, we first make a connection to non-stationary online learning by following the notion of variation budget. The variation budget quantifies the impact of the retailer's learning strategy on the supplier's decision-making. We then propose dynamic pricing policies for the supplier for both discrete and continuous demand. We also note that our proposed pricing policy only requires access to the support of the demand distribution, but critically, does not require the supplier to have any prior knowledge about the retailer's learning policy or the demand realizations. We examine several well-known data-driven policies for the retailer, including sample average approximation, distributionally robust optimization, and parametric approaches, and show that our pricing policies lead to sublinear regret bounds in all these cases. At the managerial level, we answer affirmatively that there is a pricing policy with a sublinear regret bound under a wide range of retailer's learning policies, even though she faces a learning retailer and an unknown demand distribution. Our work also provides a novel perspective in data-driven operations management where the principal has to learn to react to the learning policies employed by other agents in the system.

Suggested Citation

  • Xuejun Zhao & Ruihao Zhu & William B. Haskell, 2022. "Learning to Price Supply Chain Contracts against a Learning Retailer," Papers 2211.04586, arXiv.org.
  • Handle: RePEc:arx:papers:2211.04586
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2211.04586
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ming Chen & Zhi-Long Chen, 2015. "Recent Developments in Dynamic Pricing Research: Multiple Products, Competition, and Limited Demand Information," Production and Operations Management, Production and Operations Management Society, vol. 24(5), pages 704-731, May.
    2. Katy S. Azoury, 1985. "Bayes Solution to Dynamic Inventory Models Under Unknown Demand Distribution," Management Science, INFORMS, vol. 31(9), pages 1150-1160, September.
    3. N. Bora Keskin & John R. Birge, 2019. "Dynamic Selling Mechanisms for Product Differentiation and Learning," Operations Research, INFORMS, vol. 67(4), pages 1069-1089, July.
    4. Omar Besbes & Assaf Zeevi, 2011. "On the Minimax Complexity of Pricing in a Changing Environment," Operations Research, INFORMS, vol. 59(1), pages 66-79, February.
    5. N. Bora Keskin & Yuexing Li & Jing-Sheng Song, 2022. "Data-Driven Dynamic Pricing and Ordering with Perishable Inventory in a Changing Environment," Management Science, INFORMS, vol. 68(3), pages 1938-1958, March.
    6. Josef Broder & Paat Rusmevichientong, 2012. "Dynamic Pricing Under a General Parametric Choice Model," Operations Research, INFORMS, vol. 60(4), pages 965-980, August.
    7. Wang Chi Cheung & David Simchi-Levi & He Wang, 2017. "Technical Note—Dynamic Pricing and Demand Learning with Limited Price Experimentation," Operations Research, INFORMS, vol. 65(6), pages 1722-1731, December.
    8. Arnoud V. den Boer & N. Bora Keskin, 2022. "Dynamic Pricing with Demand Learning and Reference Effects," Management Science, INFORMS, vol. 68(10), pages 7112-7130, October.
    9. Arnoud V. den Boer & N. Bora Keskin, 2020. "Discontinuous Demand Functions: Estimation and Pricing," Management Science, INFORMS, vol. 66(10), pages 4516-4534, October.
    10. Georgia Perakis & Guillaume Roels, 2007. "The Price of Anarchy in Supply Chains: Quantifying the Efficiency of Price-Only Contracts," Management Science, INFORMS, vol. 53(8), pages 1249-1268, August.
    11. Basak Kalkanci & Kay-Yut Chen & Feryal Erhun, 2011. "Contract Complexity and Performance Under Asymmetric Demand Information: An Experimental Evaluation," Management Science, INFORMS, vol. 57(4), pages 689-704, April.
    12. N. Bora Keskin & Assaf Zeevi, 2017. "Chasing Demand: Learning and Earning in a Changing Environment," Mathematics of Operations Research, INFORMS, vol. 42(2), pages 277-307, May.
    13. Alison L. Gibbs & Francis Edward Su, 2002. "On Choosing and Bounding Probability Metrics," International Statistical Review, International Statistical Institute, vol. 70(3), pages 419-435, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Qing Feng & Ruihao Zhu & Stefanus Jasin, 2025. "Temporal Fairness in Learning and Earning: Price Protection Guarantee and Phase Transitions," Operations Research, INFORMS, vol. 73(2), pages 775-797, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xuejun Zhao & Ruihao Zhu & William B. Haskell, 2026. "Learning to Price Supply Chain Contracts Against a Learning Retailer," Management Science, INFORMS, vol. 72(3), pages 2168-2187, March.
    2. Yang, Xiangyu & Zhang, Jianghua & Hu, Jian-Qiang & Hu, Jiaqiao, 2024. "Nonparametric multi-product dynamic pricing with demand learning via simultaneous price perturbation," European Journal of Operational Research, Elsevier, vol. 319(1), pages 191-205.
    3. N. Bora Keskin & Yuexing Li & Jing-Sheng Song, 2022. "Data-Driven Dynamic Pricing and Ordering with Perishable Inventory in a Changing Environment," Management Science, INFORMS, vol. 68(3), pages 1938-1958, March.
    4. Boxiao Chen & David Simchi-Levi & Yining Wang & Yuan Zhou, 2022. "Dynamic Pricing and Inventory Control with Fixed Ordering Cost and Incomplete Demand Information," Management Science, INFORMS, vol. 68(8), pages 5684-5703, August.
    5. John R. Birge & Hongfan (Kevin) Chen & N. Bora Keskin & Amy Ward, 2024. "To Interfere or Not To Interfere: Information Revelation and Price-Setting Incentives in a Multiagent Learning Environment," Operations Research, INFORMS, vol. 72(6), pages 2391-2412, November.
    6. N. Bora Keskin & Meng Li, 2024. "Selling Quality-Differentiated Products in a Markovian Market with Unknown Transition Probabilities," Operations Research, INFORMS, vol. 72(3), pages 885-902, May.
    7. Maxime C. Cohen & Sentao Miao & Yining Wang, 2025. "Dynamic Pricing with Fairness Constraints," Operations Research, INFORMS, vol. 73(6), pages 3027-3043, November.
    8. Ningyuan Chen & Guillermo Gallego, 2022. "A Primal–Dual Learning Algorithm for Personalized Dynamic Pricing with an Inventory Constraint," Mathematics of Operations Research, INFORMS, vol. 47(4), pages 2585-2613, November.
    9. Giovanni Gatti Pinheiro & Thomas Fiig & Michael D. Wittman & Michael Defoin-Platel & Riccardo D. Jadanza, 2022. "Demand change detection in airline revenue management," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 21(6), pages 581-595, December.
    10. Kazemi, Mohammad Sadegh & Fotopoulos, Stergios B. & Wang, Xinchang, 2023. "Minimizing online retailers’ revenue loss under a time-varying willingness-to-pay distribution," International Journal of Production Economics, Elsevier, vol. 257(C).
    11. Xiao, Baichun & Yang, Wei, 2021. "A Bayesian learning model for estimating unknown demand parameter in revenue management," European Journal of Operational Research, Elsevier, vol. 293(1), pages 248-262.
    12. Thomas Loots & Arnoud V. den Boer, 2023. "Data‐driven collusion and competition in a pricing duopoly with multinomial logit demand," Production and Operations Management, Production and Operations Management Society, vol. 32(4), pages 1169-1186, April.
    13. Xuejun Zhao & William B. Haskell & Guodong Yu, 2024. "Supply Chain Contracts in the Small Data Regime," Manufacturing & Service Operations Management, INFORMS, vol. 26(4), pages 1387-1401, July.
    14. Xiaocheng Li & Zeyu Zheng, 2024. "Dynamic Pricing with External Information and Inventory Constraint," Management Science, INFORMS, vol. 70(9), pages 5985-6001, September.
    15. John R. Birge & Hongfan (Kevin) Chen & N. Bora Keskin, 2025. "Markdown Policies for Demand Learning with Forward-Looking Customers," Operations Research, INFORMS, vol. 73(5), pages 2550-2566, September.
    16. Boxiao Chen & Yining Wang & Yuan Zhou, 2024. "Optimal Policies for Dynamic Pricing and Inventory Control with Nonparametric Censored Demands," Management Science, INFORMS, vol. 70(5), pages 3362-3380, May.
    17. Ying Zhong & L. Jeff Hong & Guangwu Liu, 2021. "Earning and Learning with Varying Cost," Production and Operations Management, Production and Operations Management Society, vol. 30(8), pages 2379-2394, August.
    18. Tao Shen & Yifan Cui, 2026. "Proxy-Aided Demand Learning with an Application to Various Pricing Problems," Operations Research, INFORMS, vol. 74(2), pages 770-787, March.
    19. Huanan Zhang & Stefanus Jasin, 2022. "Online Learning and Optimization of (Some) Cyclic Pricing Policies in the Presence of Patient Customers," Manufacturing & Service Operations Management, INFORMS, vol. 24(2), pages 1165-1182, March.
    20. Xi Chen & Sentao Miao & Yining Wang, 2023. "Differential Privacy in Personalized Pricing with Nonparametric Demand Models," Operations Research, INFORMS, vol. 71(2), pages 581-602, March.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2211.04586. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.