IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v18y2026i3p1510-d1855535.html

Coordinated Multi-Intersection Traffic Signal Control Using a Policy-Regulated Deep Q-Network

Author

Listed:
  • Lin Ma

    (Linxia Daohe Investment Co., Ltd., Linxia City 731100, China)

  • Yan Liu

    (School of Traffic and Transportation, Lanzhou Jiaotong University, Lanzhou 730070, China)

  • Yang Liu

    (School of Traffic and Transportation, Lanzhou Jiaotong University, Lanzhou 730070, China)

  • Changxi Ma

    (School of Traffic and Transportation, Lanzhou Jiaotong University, Lanzhou 730070, China)

  • Shanpu Wang

    (School of Traffic and Transportation, Lanzhou Jiaotong University, Lanzhou 730070, China)

Abstract

Coordinated control across multiple signalized intersections is essential for mitigating congestion propagation in urban road networks. However, existing DQN-based approaches often suffer from unstable action switching, limited interpretability, and insufficient capability to model spatial spillback between adjacent intersections. To address these limitations, this study proposes a Policy-Regulated and Aligned Deep Q-Network (PRA-DQN) for cooperative multi-intersection signal control. A differentiable policy function is introduced and explicitly trained to align with the optimal Q-value-derived target distribution, yielding more stable and interpretable policy behavior. In addition, a cooperative reward structure integrating local delay, movement pressure, and upstream–downstream interactions enables agents to simultaneously optimize local efficiency and regional coordination. A parameter-sharing multi-agent framework further enhances scalability and learning consistency across intersections. Simulation experiments conducted on a 2 × 2 SUMO grid show that PRA-DQN consistently outperforms fixed-time, classical DQN, distributed DQN, and pressure/wave-based baselines. Compared with fixed-time control, PRA-DQN reduces maximum queue length by 21.17%, average queue length by 18.75%, and average waiting time by 17.71%. Moreover, relative to classical DQN coordination, PRA-DQN achieves an additional 7.53% reduction in average waiting time. These results confirm the effectiveness and superiority of the proposed method in suppressing congestion propagation and improving network-level traffic performance. The proposed PRA-DQN provides a practical and scalable basis for real-time deployment of coordinated signal control and can be readily extended to larger networks and time-varying demand conditions.

Suggested Citation

  • Lin Ma & Yan Liu & Yang Liu & Changxi Ma & Shanpu Wang, 2026. "Coordinated Multi-Intersection Traffic Signal Control Using a Policy-Regulated Deep Q-Network," Sustainability, MDPI, vol. 18(3), pages 1-23, February.
  • Handle: RePEc:gam:jsusta:v:18:y:2026:i:3:p:1510-:d:1855535
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/18/3/1510/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/18/3/1510/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:18:y:2026:i:3:p:1510-:d:1855535. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.