Author
Listed:
- Yifei Qi
(College of Electrical Engineering, Jiangsu University, Zhenjiang 212013, China)
- Bo Wang
(College of Electrical Engineering, Jiangsu University, Zhenjiang 212013, China)
Abstract
The public deployment of electric vehicle charging stations must simultaneously balance construction economics, user accessibility, queueing pressure, feeder security, tail risk under demand uncertainty, and spatial fairness. These criteria are strongly coupled, yet most existing studies either rely on static optimization with limited behavioral realism or use multi-agent reinforcement learning for short-term charging operation rather than for long-term siting. This paper proposes a preference-conditioned multi-agent deep deterministic policy gradient (PC-MADDPG) framework for the urban charging station siting problem in a coupled traffic–distribution environment. Candidate charging sites are modeled as cooperative agents under centralized training and decentralized execution. Each agent outputs a continuous pile-allocation action, which is repaired into an integer expansion plan under a budget constraint. The environment evaluates each plan through attraction-based demand assignment, queue approximation, LinDistFlow-style feeder analysis, and a six-objective performance vector, including annual net cost, travel burden, service inconvenience, grid penalty, CVaR of unmet charging demand, and equity loss. On a reproducible benchmark with 12 demand zones, 10 candidate sites, an 11-bus radial feeder, and 16 stochastic daily scenarios, the proposed framework generates a non-dominated archive with 42 unique feasible plans. A representative PC-MADDPG solution opens 5 of 10 candidate sites and installs 20 fast-charging piles, achieving 99.88% mean demand coverage with an annual profit of 2.083 M$ and a maximum line utilization of 0.999. Relative to the NoGrid ablation, the selected full model reduces grid penalty by 23.87% and equity Gini by 51.08%, with only a 0.35% profit concession. Relative to the NoRisk ablation, the CVaR of unmet demand is lowered by 69.70%. Compared with a demand-greedy baseline, the proposed method reduces grid penalty by 11.72% and equity Gini by 25.19% while preserving similar demand coverage. These results provide proof-of-concept evidence, on a reproducible coupled benchmark, that preference-conditioned multi-agent learning can serve as a practical many-objective siting engine for charging-infrastructure planning when coupled traffic and feeder constraints are explicitly modeled.
Suggested Citation
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:14:y:2026:i:9:p:1464-:d:1929426. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.