IDEAS home Printed from https://ideas.repec.org/a/pal/jorapm/v19y2020i5d10.1057_s41272-020-00228-4.html
   My bibliography  Save this article

Reinforcement learning applied to airline revenue management

Author

Listed:
  • Nicolas Bondoux

    (Research, Innovation and Ventures, Amadeus S.A.S.)

  • Anh Quan Nguyen

    (Research, Innovation and Ventures, Amadeus S.A.S.)

  • Thomas Fiig

    (Amadeus IT Group)

  • Rodrigo Acuna-Agost

    (Research, Innovation and Ventures, Amadeus S.A.S.)

Abstract

Reinforcement learning (RL) is an area of machine learning concerned with how agents take actions to optimize a given long-term reward by interacting with the environment they are placed in. Some well-known recent applications include self-driving cars and computers playing games with super-human performance. One of the main advantages of this approach is that there is no need to explicitly model the nature of the interactions with the environment. In this work, we present a new airline Revenue Management System (RMS) based on RL, which does not require a demand forecaster. The optimization module remains but works in a different way. It is theoretically proven that RL converges to the optimal solution; however, in practice, the system may require a significant amount of data (a booking history with millions of daily departures) to learn the optimal policies. To overcome these difficulties, we present a novel model that integrates domain knowledge with a deep neural network trained on GPUs. The results are very encouraging in different scenarios and open the door for a new generation of RMSs that could automatically learn by directly interacting with customers.

Suggested Citation

  • Nicolas Bondoux & Anh Quan Nguyen & Thomas Fiig & Rodrigo Acuna-Agost, 2020. "Reinforcement learning applied to airline revenue management," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 19(5), pages 332-348, October.
  • Handle: RePEc:pal:jorapm:v:19:y:2020:i:5:d:10.1057_s41272-020-00228-4
    DOI: 10.1057/s41272-020-00228-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/s41272-020-00228-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1057/s41272-020-00228-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Garrett van Ryzin & Jeff McGill, 2000. "Revenue Management Without Forecasting or Optimization: An Adaptive Algorithm for Determining Airline Seat Protection Levels," Management Science, INFORMS, vol. 46(6), pages 760-775, June.
    2. Guillermo Gallego & Garrett van Ryzin, 1994. "Optimal Dynamic Pricing of Inventories with Stochastic Demand over Finite Horizons," Management Science, INFORMS, vol. 40(8), pages 999-1020, August.
    3. Arnoud V. den Boer & Bert Zwart, 2014. "Simultaneously Learning and Optimizing Using Controlled Variance Pricing," Management Science, INFORMS, vol. 60(3), pages 770-783, March.
    4. Kalyan Talluri & Garrett van Ryzin, 2004. "Revenue Management Under a General Discrete Choice Model of Consumer Behavior," Management Science, INFORMS, vol. 50(1), pages 15-33, January.
    5. Ravi Kumar & Ang Li & Wei Wang, 2018. "Learning and optimizing through dynamic pricing," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 17(2), pages 63-77, April.
    6. Balvers, Ronald J & Cosimano, Thomas F, 1990. "Actively Learning about Demand and the Dynamics of Price Adjustment," Economic Journal, Royal Economic Society, vol. 100(402), pages 882-898, September.
    7. Philipp Bartke & Natalia Kliewer & Catherine Cleophas, 2018. "Benchmarking filter-based demand estimates for airline revenue management," EURO Journal on Transportation and Logistics, Springer;EURO - The Association of European Operational Research Societies, vol. 7(1), pages 57-88, March.
    8. Arnoud V. den Boer & Bert Zwart, 2015. "Dynamic Pricing and Learning with Finite Inventories," Operations Research, INFORMS, vol. 63(4), pages 965-978, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Rodrigo Acuna-Agost & Eoin Thomas & Alix Lhéritier, 2021. "Price elasticity estimation for deep learning-based choice models: an application to air itinerary choices," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 20(3), pages 213-226, June.
    2. Alexander Kastius & Rainer Schlosser, 2022. "Dynamic pricing under competition using reinforcement learning," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 21(1), pages 50-63, February.
    3. Neda Etebari Alamdari & Gilles Savard, 2021. "Deep reinforcement learning in seat inventory control problem: an action generation approach," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 20(5), pages 566-579, October.
    4. Fleckenstein, David & Klein, Robert & Steinhardt, Claudius, 2023. "Recent advances in integrating demand management and vehicle routing: A methodological review," European Journal of Operational Research, Elsevier, vol. 306(2), pages 499-518.
    5. Justin Dumouchelle & Emma Frejinger & Andrea Lodi, 2024. "Reinforcement learning for freight booking control problems," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 23(4), pages 318-345, August.
    6. Nicolas Eschenbaum & Filip Mellgren & Philipp Zahn, 2022. "Robust Algorithmic Collusion," Papers 2201.00345, arXiv.org, revised Jan 2022.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Morlotti, Chiara & Mantin, Benny & Malighetti, Paolo & Redondi, Renato, 2024. "Price volatility of revenue managed goods: Implications for demand and price elasticity," European Journal of Operational Research, Elsevier, vol. 312(3), pages 1039-1058.
    2. Athanassios N. Avramidis & Arnoud V. Boer, 2021. "Dynamic pricing with finite price sets: a non-parametric approach," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 94(1), pages 1-34, August.
    3. Arnoud V. den Boer & N. Bora Keskin, 2020. "Discontinuous Demand Functions: Estimation and Pricing," Management Science, INFORMS, vol. 66(10), pages 4516-4534, October.
    4. Ravi Kumar & Ang Li & Wei Wang, 2018. "Learning and optimizing through dynamic pricing," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 17(2), pages 63-77, April.
    5. William L. Cooper & Tito Homem-de-Mello, 2007. "Some Decomposition Methods for Revenue Management," Transportation Science, INFORMS, vol. 41(3), pages 332-353, August.
    6. William L. Cooper & Tito Homem-de-Mello & Anton J. Kleywegt, 2015. "Learning and Pricing with Models That Do Not Explicitly Incorporate Competition," Operations Research, INFORMS, vol. 63(1), pages 86-103, February.
    7. Xiao, Baichun & Yang, Wei, 2021. "A Bayesian learning model for estimating unknown demand parameter in revenue management," European Journal of Operational Research, Elsevier, vol. 293(1), pages 248-262.
    8. Thomas Loots & Arnoud V. den Boer, 2023. "Data‐driven collusion and competition in a pricing duopoly with multinomial logit demand," Production and Operations Management, Production and Operations Management Society, vol. 32(4), pages 1169-1186, April.
    9. Hu, Qiying & Wei, Yihua & Xia, Yusen, 2010. "Revenue management for a supply chain with two streams of customers," European Journal of Operational Research, Elsevier, vol. 200(2), pages 582-598, January.
    10. Jason Rhuggenaath & Alp Akcay & Yingqian Zhang & Uzay Kaymak, 2022. "Setting Reserve Prices in Second-Price Auctions with Unobserved Bids," INFORMS Journal on Computing, INFORMS, vol. 34(6), pages 2950-2967, November.
    11. Lingxiu Dong & Panos Kouvelis & Zhongjun Tian, 2009. "Dynamic Pricing and Inventory Control of Substitute Products," Manufacturing & Service Operations Management, INFORMS, vol. 11(2), pages 317-339, December.
    12. Boxiao Chen & Xiuli Chao & Cong Shi, 2021. "Nonparametric Learning Algorithms for Joint Pricing and Inventory Control with Lost Sales and Censored Demand," Mathematics of Operations Research, INFORMS, vol. 46(2), pages 726-756, May.
    13. Christian Borgs & Ozan Candogan & Jennifer Chayes & Ilan Lobel & Hamid Nazerzadeh, 2014. "Optimal Multiperiod Pricing with Service Guarantees," Management Science, INFORMS, vol. 60(7), pages 1792-1811, July.
    14. Sen, Alper & Zhang, Alex X., 2009. "Style goods pricing with demand learning," European Journal of Operational Research, Elsevier, vol. 196(3), pages 1058-1075, August.
    15. Yingjie Lan & Huina Gao & Michael O. Ball & Itir Karaesmen, 2008. "Revenue Management with Limited Demand Information," Management Science, INFORMS, vol. 54(9), pages 1594-1609, September.
    16. Omar Besbes & Assaf Zeevi, 2009. "Dynamic Pricing Without Knowing the Demand Function: Risk Bounds and Near-Optimal Algorithms," Operations Research, INFORMS, vol. 57(6), pages 1407-1420, December.
    17. Kevin R. Williams, 2017. "Dynamic Airline Pricing and Seat Availability," Cowles Foundation Discussion Papers 2103, Cowles Foundation for Research in Economics, Yale University.
    18. Athanassios N. Avramidis, 2020. "A pricing problem with unknown arrival rate and price sensitivity," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 92(1), pages 77-106, August.
    19. Dongdong Yu & Miyu Wan & Chunlin Luo, 2022. "Dynamic pricing and dual‐channel choice in the presence of strategic consumers," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 43(6), pages 2392-2408, September.
    20. Wang, Xiubin & Regan, Amelia, 2006. "Dynamic yield management when aircraft assignments are subject to swap," Transportation Research Part B: Methodological, Elsevier, vol. 40(7), pages 563-576, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:jorapm:v:19:y:2020:i:5:d:10.1057_s41272-020-00228-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.palgrave.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.