IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v658y2025ics0378437124007994.html

Coupled vehicle-signal control based on Stackelberg Game Enabled Multi-agent Reinforcement Learning in mixed traffic environment

Author

Listed:
  • Zhang, Xinshao
  • He, Zhaocheng
  • Zhu, Yiting
  • Huang, Wei

Abstract

Related studies on traffic control in partially connected environments either did not consider the collaboration of traffic signal control and vehicular control, or did not consider others’ responsive actions before decision-making in coupled vehicle-signal control. Thus, we propose a Stackelberg Game Enabled Multi-agent Reinforcement Learning (SGMRL) method for coupled vehicle-signal control at an intersection with mixed traffic flow of Connected and Automated Vehicles (CAVs)/Human Driven Vehicles (HDVs). A two-stage framework is applied in SGMRL to learn optimal signal control strategy and CAV platoon strategies in mixed flows of all entrance roads at an intersection. Stackelberg game theory is introduced in SGMRL to make an asynchronous decision-making mechanism. The signal controller is a leader that allocates green times to different phases based on predictions of vehicles’ responsive actions, and CAVs in different directions are followers that form platoons and adjust speeds to adapt to the signal lights decided by the leader. Moreover, CAV platoons in different directions are regarded as agents and form a multi-agent learning framework with the signal controller. Then, an improved Dueling Double Deep Q Network (ID3QN) algorithm is investigated to calculate the Stackelberg equilibrium for the control problem. Experimental results demonstrate that the proposed model effectively reduces the overall waiting time and queue length of all vehicles, in the mixed traffic environment with different CAV penetration rates.

Suggested Citation

  • Zhang, Xinshao & He, Zhaocheng & Zhu, Yiting & Huang, Wei, 2025. "Coupled vehicle-signal control based on Stackelberg Game Enabled Multi-agent Reinforcement Learning in mixed traffic environment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 658(C).
  • Handle: RePEc:eee:phsmap:v:658:y:2025:i:c:s0378437124007994
    DOI: 10.1016/j.physa.2024.130289
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437124007994
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2024.130289?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Qiu, Jiahua & Du, Lili, 2023. "Cooperative trajectory control for synchronizing the movement of two connected and autonomous vehicles separated in a mixed traffic flow," Transportation Research Part B: Methodological, Elsevier, vol. 174(C).
    2. Yao, Zhihong & Jin, Yuting & Jiang, Haoran & Hu, Lu & Jiang, Yangsheng, 2022. "CTM-based traffic signal optimization of mixed traffic flow with connected automated vehicles and human-driven vehicles," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 603(C).
    3. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    4. Yu, Chunhui & Feng, Yiheng & Liu, Henry X. & Ma, Wanjing & Yang, Xiaoguang, 2018. "Integrated optimization of traffic signals and vehicle trajectories at isolated urban intersections," Transportation Research Part B: Methodological, Elsevier, vol. 112(C), pages 89-112.
    5. Zhang, Xiaoshun & Bao, Tao & Yu, Tao & Yang, Bo & Han, Chuanjia, 2017. "Deep transfer Q-learning with virtual leader-follower for supply-demand Stackelberg game of smart grid," Energy, Elsevier, vol. 133(C), pages 348-365.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wang, Jiawen & Zhou, Liping & Yang, Chengcheng, 2025. "Decision model based on driving-mode misidentification for mixed AV–HDV straight–left conflict interactions at two-phase signalized intersections," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 677(C).
    2. Shen, Shiquan & He, Huafeng & Li, Zongdian & Liu, Huan & Liu, Xi & Chen, Zheng & Chen, Yuguang, 2026. "Traffic strategy of signalized intersection integrating vehicle lane change and velocity trajectory optimization in mixed traffic environment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 682(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gao, Yuan & Hu, Zehuan & Yamate, Shun & Otomo, Junichiro & Chen, Wei-An & Liu, Mingzhe & Xu, Tingting & Ruan, Yingjun & Shang, Juan, 2025. "Unlocking predictive insights and interpretability in deep reinforcement learning for Building-Integrated Photovoltaic and Battery (BIPVB) systems," Applied Energy, Elsevier, vol. 384(C).
    2. Luqin Fan & Jing Zhang & Yu He & Ying Liu & Tao Hu & Heng Zhang, 2021. "Optimal Scheduling of Microgrid Based on Deep Deterministic Policy Gradient and Transfer Learning," Energies, MDPI, vol. 14(3), pages 1-15, January.
    3. Gao, Yuan & Matsunami, Yuki & Miyata, Shohei & Akashi, Yasunori, 2022. "Multi-agent reinforcement learning dealing with hybrid action spaces: A case study for off-grid oriented renewable building energy system," Applied Energy, Elsevier, vol. 326(C).
    4. Gao, Yuan & Matsunami, Yuki & Miyata, Shohei & Akashi, Yasunori, 2022. "Operational optimization for off-grid renewable building energy system using deep reinforcement learning," Applied Energy, Elsevier, vol. 325(C).
    5. Hongxin Yu & Lihui Zhang & Meng Zhang & Fengyue Jin & Yibing Wang, 2024. "Coordinated Ramp Metering Considering the Dynamics of Mixed-Autonomy Traffic," Sustainability, MDPI, vol. 16(22), pages 1-26, November.
    6. Daniel Russo, 2023. "Approximation Benefits of Policy Gradient Methods with Aggregated States," Management Science, INFORMS, vol. 69(11), pages 6898-6911, November.
    7. Tulika Saha & Sriparna Saha & Pushpak Bhattacharyya, 2020. "Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-28, July.
    8. Zhao, Jing & Knoop, Victor L. & Wang, Meng, 2020. "Two-dimensional vehicular movement modelling at intersections based on optimal control," Transportation Research Part B: Methodological, Elsevier, vol. 138(C), pages 1-22.
    9. Peng, Jiali & Shangguan, Wei & Peng, Cong & Chai, Linguo, 2024. "Uncertainty modeling of connected and automated vehicle penetration rate under mixed traffic environment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 639(C).
    10. Hamid Ebrahimi, 2026. "A Novel Evidential Uncertainty Framework for Hybrid Models in Rainfall Simulation," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 40(2), pages 1-24, January.
    11. Mahmoud Mahfouz & Angelos Filos & Cyrine Chtourou & Joshua Lockhart & Samuel Assefa & Manuela Veloso & Danilo Mandic & Tucker Balch, 2019. "On the Importance of Opponent Modeling in Auction Markets," Papers 1911.12816, arXiv.org.
    12. Lixiang Zhang & Yan Yan & Yaoguang Hu, 2024. "Deep reinforcement learning for dynamic scheduling of energy-efficient automated guided vehicles," Journal of Intelligent Manufacturing, Springer, vol. 35(8), pages 3875-3888, December.
    13. Imen Azzouz & Wiem Fekih Hassen, 2023. "Optimization of Electric Vehicles Charging Scheduling Based on Deep Reinforcement Learning: A Decentralized Approach," Energies, MDPI, vol. 16(24), pages 1-18, December.
    14. Benjamin Heinbach & Peter Burggräf & Johannes Wagner, 2024. "gym-flp: A Python Package for Training Reinforcement Learning Algorithms on Facility Layout Problems," SN Operations Research Forum, Springer, vol. 5(1), pages 1-26, March.
    15. Jacob W. Crandall & Mayada Oudah & Tennom & Fatimah Ishowo-Oloko & Sherief Abdallah & Jean-François Bonnefon & Manuel Cebrian & Azim Shariff & Michael A. Goodrich & Iyad Rahwan, 2018. "Cooperating with machines," Nature Communications, Nature, vol. 9(1), pages 1-12, December.
      • Abdallah, Sherief & Bonnefon, Jean-François & Cebrian, Manuel & Crandall, Jacob W. & Ishowo-Oloko, Fatimah & Oudah, Mayada & Rahwan, Iyad & Shariff, Azim & Tennom,, 2017. "Cooperating with Machines," TSE Working Papers 17-806, Toulouse School of Economics (TSE).
      • Abdallah, Sherief & Bonnefon, Jean-François & Cebrian, Manuel & Crandall, Jacob W. & Ishowo-Oloko, Fatimah & Oudah, Mayada & Rahwan, Iyad & Shariff, Azim & Tennom,, 2017. "Cooperating with Machines," IAST Working Papers 17-68, Institute for Advanced Study in Toulouse (IAST).
    16. Sun, Alexander Y., 2020. "Optimal carbon storage reservoir management through deep reinforcement learning," Applied Energy, Elsevier, vol. 278(C).
    17. Konstantin Avrachenkov & Vivek S. Borkar & Pratik Shah, 2026. "Lagrangian index policy for restless bandits with average reward," Queueing Systems: Theory and Applications, Springer, vol. 110(1), pages 1-34, March.
    18. Yassine Chemingui & Adel Gastli & Omar Ellabban, 2020. "Reinforcement Learning-Based School Energy Management System," Energies, MDPI, vol. 13(23), pages 1-21, December.
    19. Hamsa Bastani & Osbert Bastani & Wichinpong Park Sinchaisri, 2026. "Improving Human Sequential Decision Making with Reinforcement Learning," Management Science, INFORMS, vol. 72(1), pages 733-755, January.
    20. Woo Jae Byun & Bumkyu Choi & Seongmin Kim & Joohyun Jo, 2023. "Practical Application of Deep Reinforcement Learning to Optimal Trade Execution," FinTech, MDPI, vol. 2(3), pages 1-16, June.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:658:y:2025:i:c:s0378437124007994. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.