
Providing real-time en-route suggestions to CAVs for congestion mitigation: A two-way deep reinforcement learning approach

Author

Listed:
  • Ma, Xiaoyu
  • He, Xiaozheng

Abstract

This research investigates the effectiveness of information provision for congestion reduction in Connected Autonomous Vehicle (CAV) systems. The inherent advantages of CAVs, such as vehicle-to-everything communication, advanced vehicle autonomy, and reduced human involvement, make them conducive to achieving Correlated Equilibrium (CE). Leveraging these advantages, this research proposes a reinforcement learning framework involving CAVs and an information provider, where CAVs conduct real-time learning to minimize their individual travel time, while the information provider offers real-time route suggestions aiming to minimize the system’s total travel time. The en-route routing problem of the CAVs is formulated as a Markov game and the information provision problem is formulated as a single-agent Markov decision process. Then, this research develops a customized two-way deep reinforcement learning approach to solve the interrelated problems, accounting for their unique characteristics. Moreover, CE has been formulated within the proposed framework. Theoretical analysis rigorously proves the realization of CE and that the proposed framework can effectively mitigate congestion without compromising individual user optimality. Numerical results demonstrate the effectiveness of this approach. Our research contributes to the advancement of congestion reduction strategies in CAV systems with the mitigation of the conflict between system-level and individual-level goals using CE as a theoretical foundation. The results highlight the potential of information provision in fostering coordination and correlation among CAVs, thereby enhancing traffic efficiency and achieving system-level goals in smart transportation.
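To make the Correlated Equilibrium (CE) idea in the abstract concrete, the following minimal sketch checks the CE condition for route recommendations in a toy two-vehicle, two-route congestion game. It is not the paper's two-way deep reinforcement learning method. The route names, the linear travel-time function, its numbers, and all function names are hypothetical, chosen only for illustration. A recommendation distribution is a CE when no vehicle can lower its expected travel time by deviating from the route it was recommended.

```python
ROUTES = ("A", "B")

# Hypothetical latency parameters (assumptions, not taken from the paper).
FREE_FLOW = {"A": 10.0, "B": 12.0}   # travel time on an empty route
DELAY = {"A": 6.0, "B": 4.0}         # added time per vehicle on the route


def travel_time(route, load):
    """Travel time on `route` when `load` vehicles use it."""
    return FREE_FLOW[route] + DELAY[route] * load


def cost(player, joint):
    """Travel time of `player` under the joint route choice `joint`."""
    route = joint[player]
    load = sum(1 for r in joint if r == route)
    return travel_time(route, load)


def is_correlated_equilibrium(dist, tol=1e-9):
    """Check the CE condition for a distribution over joint recommendations.

    `dist` maps joint route tuples to probabilities. For every player,
    recommended route, and alternative route, the probability-weighted
    saving from deviating must not be positive.
    """
    n_players = len(next(iter(dist)))
    for i in range(n_players):
        for rec in ROUTES:
            for dev in ROUTES:
                if dev == rec:
                    continue
                saving = 0.0
                for joint, p in dist.items():
                    if joint[i] != rec:
                        continue
                    deviated = joint[:i] + (dev,) + joint[i + 1:]
                    saving += p * (cost(i, joint) - cost(i, deviated))
                if saving > tol:
                    return False
    return True


def expected_total_time(dist):
    """Expected system travel time under the recommendation distribution."""
    return sum(
        p * sum(cost(i, joint) for i in range(len(joint)))
        for joint, p in dist.items()
    )


# The provider splits the two vehicles across routes with equal probability.
split = {("A", "B"): 0.5, ("B", "A"): 0.5}
# The provider always sends both vehicles to route A.
all_on_a = {("A", "A"): 1.0}
```

Under these assumed numbers, `is_correlated_equilibrium(split)` returns `True` with an expected total travel time of 32.0, while `all_on_a` fails the check (a vehicle told to take route A would save time by switching to B) and has a total of 44.0. In this toy case the recommendation that splits the vehicles is both individually stable and better for the system, which is the kind of alignment the abstract describes.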

Suggested Citation

  • Ma, Xiaoyu & He, Xiaozheng, 2024. "Providing real-time en-route suggestions to CAVs for congestion mitigation: A two-way deep reinforcement learning approach," Transportation Research Part B: Methodological, Elsevier, vol. 189(C).
  • Handle: RePEc:eee:transb:v:189:y:2024:i:c:s0191261524001383
    DOI: 10.1016/j.trb.2024.103014

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0191261524001383
    Download Restriction: Full text for ScienceDirect subscribers only



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ning, Yuqiang & Du, Lili, 2023. "Robust and resilient equilibrium routing mechanism for traffic congestion mitigation built upon correlated equilibrium and distributed optimization," Transportation Research Part B: Methodological, Elsevier, vol. 168(C), pages 170-205.
    2. Le Zhang & Lijing Lyu & Shanshui Zheng & Li Ding & Lang Xu, 2022. "A Q-Learning-Based Approximate Solving Algorithm for Vehicular Route Game," Sustainability, MDPI, vol. 14(19), pages 1-14, September.
    3. John Geanakoplos, 1993. "Common Knowledge," Cowles Foundation Discussion Papers 1062, Cowles Foundation for Research in Economics, Yale University.
    4. Fukuda, Satoshi, 2024. "The existence of universal qualitative belief spaces," Journal of Economic Theory, Elsevier, vol. 216(C).
    5. Arfi, Badredine, 2007. "Quantum social game theory," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 374(2), pages 794-820.
    6. Dirk Bergemann & Stephen Morris, 2019. "Information Design: A Unified Perspective," Journal of Economic Literature, American Economic Association, vol. 57(1), pages 44-95, March.
    7. Samet, Dov, 1990. "Ignoring ignorance and agreeing to disagree," Journal of Economic Theory, Elsevier, vol. 52(1), pages 190-207, October.
    8. Radzvilas, Mantas, 2016. "Hypothetical Bargaining and the Equilibrium Selection Problem in Non-Cooperative Games," MPRA Paper 70248, University Library of Munich, Germany.
    9. Konstantinos Georgalos & Indrajit Ray & Sonali SenGupta, 2020. "Nash versus coarse correlation," Experimental Economics, Springer;Economic Science Association, vol. 23(4), pages 1178-1204, December.
    10. Antonio Cabrales & Michalis Drouvelis & Zeynep Gurguy & Indrajit Ray, 2017. "Transparency is Overrated: Communicating in a Coordination Game with Private Information," CESifo Working Paper Series 6781, CESifo.
    11. Qin, Cheng-Zhong & Yang, Chun-Lei, 2009. "An Explicit Approach to Modeling Finite-Order Type Spaces and Applications," University of California at Santa Barbara, Economics Working Paper Series qt8hq7j89k, Department of Economics, UC Santa Barbara.
    12. Sergiu Hart, 2013. "Adaptive Heuristics," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 11, pages 253-287, World Scientific Publishing Co. Pte. Ltd.
    13. Itzhak Gilboa, 1993. "Can Free Choice Be Known?," Discussion Papers 1055, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
    14. Hellman, Ziv, 2011. "Iterated expectations, compact spaces, and common priors," Games and Economic Behavior, Elsevier, vol. 72(1), pages 163-171, May.
    15. Chirantan Ganguly & Indrajit Ray, 2023. "Simple Mediation in a Cheap-Talk Game," Games, MDPI, vol. 14(3), pages 1-14, June.
    16. Robert Nau, 2001. "De Finetti was Right: Probability Does Not Exist," Theory and Decision, Springer, vol. 51(2), pages 89-124, December.
    17. Carranza, Luis & Galdon-Sanchez, Jose E., 2004. "Financial intermediation, variability and the development process," Journal of Development Economics, Elsevier, vol. 73(1), pages 27-54, February.
    18. Carsten Helm, 1998. "International Cooperation Behind the Veil of Uncertainty – The Case of Transboundary Acidification," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 12(2), pages 185-201, September.
    19. Ehud Lehrer & Eilon Solan, 2007. "Learning to play partially-specified equilibrium," Levine's Working Paper Archive 122247000000001436, David K. Levine.
    20. Lenzo, Justin & Sarver, Todd, 2006. "Correlated equilibrium in evolutionary models with subpopulations," Games and Economic Behavior, Elsevier, vol. 56(2), pages 271-284, August.

