
Last-Iterate Convergence in No-Regret Learning: Games with Reference Effects Under Logit Demand

Authors
  • Mengzi Amy Guo

    (Department of Industrial Engineering and Operations Research, University of California, Berkeley, Berkeley, California 94720)

  • Donghao Ying

    (Department of Industrial Engineering and Operations Research, University of California, Berkeley, Berkeley, California 94720)

  • Javad Lavaei

    (Department of Industrial Engineering and Operations Research, University of California, Berkeley, Berkeley, California 94720)

  • Zuo-Jun Max Shen

    (Department of Industrial Engineering and Operations Research, University of California, Berkeley, Berkeley, California 94720; and Faculty of Engineering and Faculty of Business and Economics, University of Hong Kong, Hong Kong, China)

Abstract

This work examines the behavior of the online projected gradient ascent (OPGA) algorithm and its variant in a repeated oligopoly price competition under reference effects. In particular, we consider multiple firms engaged in a multiperiod price competition, where consecutive periods are linked by the reference price update and each firm has access only to its own first-order feedback. Consumers assess their willingness to pay by comparing the current price against the memory-based reference price, and their choices follow the multinomial logit (MNL) model. We use the notion of stationary Nash equilibrium (SNE), defined as the fixed point of the equilibrium pricing policy, to simultaneously capture the long-run equilibrium and stability. We first study loss-neutral reference effects and show that if the firms employ the OPGA algorithm (adjusting their prices using the first-order derivatives of their log-revenues), the price and reference price paths attain last-iterate convergence to the unique SNE, thereby guaranteeing no-regret learning and market stability. Moreover, with appropriate step-sizes, we prove that this algorithm exhibits a convergence rate of Õ(1/t²) in terms of the squared distance and achieves a constant dynamic regret. Despite the simplicity of the algorithm, its convergence analysis is challenging because the model lacks typical properties, such as strong monotonicity and variational stability, that are ordinarily used in the convergence analysis of online games. The inherently asymmetric nature of reference effects motivates exploration beyond loss-neutrality. When loss-averse reference effects are introduced, we propose a variant of the original algorithm, named conservative-OPGA (C-OPGA), to handle the nonsmooth revenue functions and show that the price and reference price achieve last-iterate convergence to the set of SNEs at a rate of O(1/t). Finally, we demonstrate the practicality and robustness of OPGA and C-OPGA by theoretically showing that these algorithms can also adapt to firm-differentiated step-sizes and inexact gradients.
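
The loss-neutral OPGA update described in the abstract can be illustrated with a minimal sketch. The abstract does not give the model's functional forms, so everything below is an assumption made for illustration only: a linear utility a_i - b_i p_i + c_i (r_i - p_i), an outside option with utility normalized to zero, an exponential-smoothing reference update with memory parameter alpha, a step-size of 1/t, a common price interval [lo, hi], and all function and parameter names. It should not be read as the authors' exact specification.

```python
import math

def mnl_demand(prices, refs, a, b, c):
    """MNL market shares under an assumed loss-neutral linear utility.

    Assumed utility of firm i: a[i] - b[i]*p_i + c[i]*(r_i - p_i);
    the outside option has utility 0 (weight exp(0) = 1).
    """
    utils = [a[i] - b[i] * prices[i] + c[i] * (refs[i] - prices[i])
             for i in range(len(prices))]
    weights = [math.exp(u) for u in utils]
    total = 1.0 + sum(weights)
    return [w / total for w in weights]

def log_revenue_grad(i, prices, refs, a, b, c):
    """Derivative of log(p_i * d_i) with respect to firm i's own price.

    Under the assumed utility, d(log d_i)/dp_i = -(b_i + c_i) * (1 - d_i).
    """
    d = mnl_demand(prices, refs, a, b, c)
    return 1.0 / prices[i] - (b[i] + c[i]) * (1.0 - d[i])

def project(x, lo, hi):
    """Euclidean projection of a scalar onto the interval [lo, hi]."""
    return min(max(x, lo), hi)

def run_opga(p0, r0, a, b, c, alpha=0.5, lo=0.1, hi=10.0, steps=2000):
    """Hypothetical OPGA loop: each firm uses only its own gradient."""
    prices, refs = list(p0), list(r0)
    n = len(prices)
    for t in range(1, steps + 1):
        eta = 1.0 / t  # assumed diminishing step-size
        grads = [log_revenue_grad(i, prices, refs, a, b, c) for i in range(n)]
        new_prices = [project(prices[i] + eta * grads[i], lo, hi)
                      for i in range(n)]
        # Assumed reference update: blend old reference with posted price.
        refs = [alpha * refs[i] + (1.0 - alpha) * prices[i] for i in range(n)]
        prices = new_prices
    return prices, refs

if __name__ == "__main__":
    a, b, c = [1.0, 1.0], [1.0, 1.0], [0.5, 0.5]
    prices, refs = run_opga([2.0, 3.0], [1.0, 1.0], a, b, c)
    print("final prices:", prices)
    print("final reference prices:", refs)
```

One way to check whether a run has settled near a stationary point of this sketch is to evaluate `log_revenue_grad` at the returned prices and reference prices: values near zero for every firm, together with reference prices close to the posted prices, indicate that neither path is still moving appreciably.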

Suggested Citation

  • Mengzi Amy Guo & Donghao Ying & Javad Lavaei & Zuo-Jun Max Shen, 2026. "Last-Iterate Convergence in No-Regret Learning: Games with Reference Effects Under Logit Demand," Management Science, INFORMS, vol. 72(2), pages 1007-1024, February.
  • Handle: RePEc:inm:ormnsc:v:72:y:2026:i:2:p:1007-1024
    DOI: 10.1287/mnsc.2023.03464

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mnsc.2023.03464
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hansheng Jiang & Junyu Cao & Zuo-Jun Max Shen, 2024. "Intertemporal Pricing via Nonparametric Estimation: Integrating Reference Effects and Consumer Heterogeneity," Manufacturing & Service Operations Management, INFORMS, vol. 26(1), pages 28-46, January.
    2. Yan, Xiaoming & Zhao, Wenhan & Yu, Yugang, 2022. "Optimal product line design with reference price effects," European Journal of Operational Research, Elsevier, vol. 302(3), pages 1045-1062.
    3. Mengzi Amy Guo & Hansheng Jiang & Zuo-Jun Max Shen, 2025. "Multiproduct Dynamic Pricing with Reference Effects Under Logit Demand," Manufacturing & Service Operations Management, INFORMS, vol. 27(5), pages 1645-1663, September.
    4. Robert Slonim & Ellen Garbarino, 2009. "Similarities and differences between stockpiling and reference effects," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 30(6), pages 351-371.
    5. Pennesi, Daniele, 2025. "A behavioral model of consumer response to price information," Journal of Economic Behavior & Organization, Elsevier, vol. 230(C).
    6. Necati Tereyağoğlu & Peter S. Fader & Senthil Veeraraghavan, 2018. "Multiattribute Loss Aversion and Reference Dependence: Evidence from the Performing Arts Industry," Management Science, INFORMS, vol. 64(1), pages 421-436, January.
    7. Sojin Jung & Hyeon Jeong Cho & Byoungho Ellie Jin, 2020. "Does effective cost transparency increase price fairness? An analysis of apparel brand strategies," Journal of Brand Management, Palgrave Macmillan, vol. 27(5), pages 495-507, September.
    8. Kopalle, Praveen K. & Kannan, P.K. & Boldt, Lin Bao & Arora, Neeraj, 2012. "The impact of household level heterogeneity in reference price effects on optimal retailer pricing policies," Journal of Retailing, Elsevier, vol. 88(1), pages 102-114.
    9. Dmitri Kuksov & Kangkang Wang, 2014. "The Bright Side of Loss Aversion in Dynamic and Competitive Markets," Marketing Science, INFORMS, vol. 33(5), pages 693-711, September.
    10. Wolk, Agnieszka & Spann, Martin, 2008. "The effects of reference prices on bidding behavior in interactive pricing mechanisms," Journal of Interactive Marketing, Elsevier, vol. 22(4), pages 2-18.
    11. Wang, Zeming & Veldman, Jasper & Teunter, Ruud, 2025. "Process improvement under the reference price effect," European Journal of Operational Research, Elsevier, vol. 322(3), pages 937-948.
    12. Ahrens, Steffen & Pirschel, Inske & Snower, Dennis J., 2017. "A theory of price adjustment under loss aversion," Journal of Economic Behavior & Organization, Elsevier, vol. 134(C), pages 78-95.
    13. Zhao, Wenhan & Yan, Xiaoming & Yu, Yugang, 2025. "Product price and delivery-time commitment decisions with reference effects," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 197(C).
    14. Zhang, Jie & Kevin Chiang, Wei–yu & Liang, Liang, 2014. "Strategic pricing with reference effects in a competitive supply chain," Omega, Elsevier, vol. 44(C), pages 126-135.
    15. van Oest, Rutger, 2013. "Why are Consumers Less Loss Averse in Internal than External Reference Prices?," Journal of Retailing, Elsevier, vol. 89(1), pages 62-71.
    16. Sojin Jung & Hyeon Jeong Cho & Byoungho Ellie Jin, 0. "Does effective cost transparency increase price fairness? An analysis of apparel brand strategies," Journal of Brand Management, Palgrave Macmillan, vol. 0, pages 1-13.
    17. Lillian L. Cheng & Kent B. Monroe, 2013. "An appraisal of behavioral price research (part 1): price as a physical stimulus," AMS Review, Springer;Academy of Marketing Science, vol. 3(3), pages 103-129, September.
    18. David R. Bell & James M. Lattin, 2000. "Looking for Loss Aversion in Scanner Panel Data: The Confounding Effect of Price Response Heterogeneity," Marketing Science, INFORMS, vol. 19(2), pages 185-200, May.
    19. Ma, Deqing & Li, Kaifu & Hu, Jinsong & Wang, Xue, 2024. "How to leverage blockchain to react to consumer reference effects and develop a distribution strategy in online retailing?," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 187(C).
    20. Zhenyu Hu & Javad Nasiry, 2018. "Are Markets with Loss-Averse Consumers More Sensitive to Losses?," Management Science, INFORMS, vol. 64(3), pages 1384-1395, March.

