A novel optimal bipartite consensus control scheme for unknown multi-agent systems via model-free reinforcement learning

Author

Listed:

Peng, Zhinan
Hu, Jiangping
Shi, Kaibo
Luo, Rui
Huang, Rui
Ghosh, Bijoy Kumar
Huang, Jiuke

Abstract

In this paper, the optimal bipartite consensus control (OBCC) problem is investigated for unknown multi-agent systems (MASs) with coopetition networks. A novel distributed OBCC scheme based on a model-free reinforcement learning method is proposed, which does not require knowledge of the agents’ dynamics. First, coopetition networks are used to model the cooperative and competitive interactions among agents, and the OBCC problem is formulated by introducing a local neighbor bipartite consensus error and a performance index function (PIF) for each agent. Second, to obtain the OBCC laws, a policy iteration algorithm (PIA) is employed to learn the solutions to the discrete-time (DT) Hamilton-Jacobi-Bellman (HJB) equations. Third, to implement the proposed method, a data-driven actor-critic neural network (NN) framework is adopted to approximate the control laws and the PIFs, respectively, in an online learning manner. Finally, simulation results are given to demonstrate the effectiveness of the developed approach.
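The abstract does not state the error definition used in the paper. As a minimal sketch, assuming the form commonly used for signed (coopetition) graphs, e_i = sum_j |a_ij| (x_i - sgn(a_ij) x_j), the local neighbor bipartite consensus error could be computed as follows. The function names, the scalar states, and the 4-agent adjacency matrix are hypothetical and chosen only for illustration.

```python
def sign(value):
    """Return -1, 0, or 1 according to the sign of value."""
    return (value > 0) - (value < 0)


def bipartite_errors(adj, states):
    """Local neighbor bipartite consensus error of each agent.

    Assumed form (not taken from the paper):
        e_i = sum_j |a_ij| * (x_i - sgn(a_ij) * x_j)
    A positive a_ij is a cooperative link, a negative a_ij a competitive one.
    """
    n = len(states)
    errors = []
    for i in range(n):
        e_i = 0.0
        for j in range(n):
            a_ij = adj[i][j]
            if a_ij != 0:
                e_i += abs(a_ij) * (states[i] - sign(a_ij) * states[j])
        errors.append(e_i)
    return errors


# Hypothetical signed graph: agents {0, 1} and {2, 3} form two groups.
# Links inside a group are cooperative (+1), links across groups competitive (-1).
ADJ = [
    [0, 1, -1, 0],
    [1, 0, 0, -1],
    [-1, 0, 0, 1],
    [0, -1, 1, 0],
]

# Bipartite consensus: equal magnitude, opposite sign across the two groups.
print(bipartite_errors(ADJ, [2.0, 2.0, -2.0, -2.0]))  # [0.0, 0.0, 0.0, 0.0]

# Ordinary consensus is not an equilibrium of the signed errors.
print(bipartite_errors(ADJ, [2.0, 2.0, 2.0, 2.0]))  # [4.0, 4.0, 4.0, 4.0]
```

Under this assumed definition, all errors vanish exactly when the two groups agree in magnitude and differ in sign, which is the bipartite consensus condition the control laws are meant to reach.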

Suggested Citation

Peng, Zhinan & Hu, Jiangping & Shi, Kaibo & Luo, Rui & Huang, Rui & Ghosh, Bijoy Kumar & Huang, Jiuke, 2020. "A novel optimal bipartite consensus control scheme for unknown multi-agent systems via model-free reinforcement learning," Applied Mathematics and Computation, Elsevier, vol. 369(C).

Handle: RePEc:eee:apmaco:v:369:y:2020:i:c:s0096300319308136
DOI: 10.1016/j.amc.2019.124821



Citations


Cited by:

Zhao, Huarong & Peng, Li & Yu, Hongnian, 2022. "Quantized model-free adaptive iterative learning bipartite consensus tracking for unknown nonlinear multi-agent systems," Applied Mathematics and Computation, Elsevier, vol. 412(C).
Wang, Xiaoling & Su, Housheng, 2020. "Completely model-free RL-based consensus of continuous-time multi-agent systems," Applied Mathematics and Computation, Elsevier, vol. 382(C).
Wang, Yun & Fang, Tian & Kong, Qingkai & Li, Feng, 2024. "Zero-sum game-based optimal control for discrete-time Markov jump systems: A parallel off-policy Q-learning method," Applied Mathematics and Computation, Elsevier, vol. 467(C).
Li, Xiaoqing & Nguang, Sing Kiong & She, Kun & Cheng, Jun & Zhong, Shouming, 2021. "Resilient controller synthesis for Markovian jump systems with probabilistic faults and gain fluctuations under stochastic sampling operational mechanism," Applied Mathematics and Computation, Elsevier, vol. 392(C).
Wang, Changlin, 2024. "Social media platform-oriented topic mining and information security analysis by big data and deep convolutional neural network," Technological Forecasting and Social Change, Elsevier, vol. 199(C).
Li, Baoxing & Han, Tao & Xiao, Bo & Zhan, Xi-Sheng & Yan, Huaicheng, 2022. "Leader-following bipartite consensus of multiple uncertain Euler-Lagrange systems under deception attacks," Applied Mathematics and Computation, Elsevier, vol. 428(C).
Meng, Hao & Pang, Denghao & Cao, Jinde & Guo, Yechen & Niazi, Azmat Ullah Khan, 2024. "Optimal bipartite consensus control for heterogeneous unknown multi-agent systems via reinforcement learning," Applied Mathematics and Computation, Elsevier, vol. 476(C).
Shen, Ziwen & Dong, Tao & Huang, Tingwen, 2025. "Data-driven bipartite synchronization control of multi-agent systems with asymmetric input saturation over switching networks," Applied Mathematics and Computation, Elsevier, vol. 494(C).
Jinfeng Wang & Hui Dong & Fenghua Chen & Mai The Vu & Ali Dokht Shakibjoo & Ardashir Mohammadzadeh, 2023. "Formation Control of Non-Holonomic Mobile Robots: Predictive Data-Driven Fuzzy Compensator," Mathematics, MDPI, vol. 11(8), pages 1-21, April.
Lv, Yuan-Wei & Yang, Guang-Hong & Dimirovski, Georgi Marko, 2025. "Distributed adaptive moving horizon estimation for multi-sensor networks subject to quantization effects," Applied Mathematics and Computation, Elsevier, vol. 488(C).
Hou, Rui & Cui, Lizhi & Bu, Xuhui & Yang, Junqi, 2021. "Distributed formation control for multiple non-holonomic wheeled mobile robots with velocity constraint by using improved data-driven iterative learning," Applied Mathematics and Computation, Elsevier, vol. 395(C).
Jia, Guolong & Yang, Qing & Liu, Jinxu & Shen, Hao, 2025. "Reinforcement learning-based linear quadratic tracking control for partially unknown Markov jump singular interconnected systems," Applied Mathematics and Computation, Elsevier, vol. 491(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Jian, Long & Hu, Jiangping & Wang, Jun & Shi, Kaibo, 2019. "Observer-based output feedback distributed event-triggered control for linear multi-agent systems under general directed graphs," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 534(C).
Jian, Long & Hu, Jiangping & Wang, Jun & Shi, Kaibo, 2019. "Distributed event-triggered protocols with Kx-functional observer for leader-following multi-agent systems," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 535(C).
Li, Xiaoqing & Nguang, Sing Kiong & She, Kun & Cheng, Jun & Zhong, Shouming, 2021. "Resilient controller synthesis for Markovian jump systems with probabilistic faults and gain fluctuations under stochastic sampling operational mechanism," Applied Mathematics and Computation, Elsevier, vol. 392(C).
Wang, Xin & Su, Housheng, 2019. "Consensus of hybrid multi-agent systems by event-triggered/self-triggered strategy," Applied Mathematics and Computation, Elsevier, vol. 359(C), pages 490-501.
Jing Bai & Guoguang Wen & Ahmed Rahmani & Xing Chu & Yongguang Yu, 2016. "Consensus with a reference state for fractional-order multi-agent systems," International Journal of Systems Science, Taylor & Francis Journals, vol. 47(1), pages 222-234, January.
Wang, Bo & Cheng, Jun & Zhou, Xia, 2020. "A multiple hierarchical structure strategy to quantized control of Markovian switching systems," Applied Mathematics and Computation, Elsevier, vol. 373(C).
Cai, Yuliang & Dai, Jing & Zhang, Huaguang & Wang, Yingchun, 2021. "Fixed-time leader-following/containment consensus of nonlinear multi-agent systems based on event-triggered mechanism," Applied Mathematics and Computation, Elsevier, vol. 396(C).
Sakthivel, R. & Joby, Maya & Wang, Chao & Kaviarasan, B., 2018. "Finite-time fault-tolerant control of neutral systems against actuator saturation and nonlinear actuator faults," Applied Mathematics and Computation, Elsevier, vol. 332(C), pages 425-436.
Ruan, Xiaoli & Xu, Chen & Feng, Jianwen & Wang, Jingyi & Zhao, Yi, 2022. "Adaptive dynamic event-triggered control for multi-agent systems with matched uncertainties under directed topologies," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 586(C).
Baojie Zheng & Xiaowu Mu, 2016. "Formation-containment control of second-order multi-agent systems with only sampled position data," International Journal of Systems Science, Taylor & Francis Journals, vol. 47(15), pages 3609-3618, November.
Cheng-Lin Liu & Fei Liu, 2014. "Adjacent‐Compensation Consensus Algorithm in Asynchronously Coupled Form for Second‐Order Multiagent Network under Communication Delay," Abstract and Applied Analysis, John Wiley & Sons, vol. 2014(1).
Chen, Feng & Chen, Yuming & Zhu, Quanxin & Zhang, Qimin, 2023. "Stability of stochastic systems with semi-Markovian switching and impulses," Chaos, Solitons & Fractals, Elsevier, vol. 177(C).
Zhang, Zhiming & Zheng, Wei & Lam, H.K. & Wen, Shuhuan & Sun, Fuchun & Xie, Ping, 2020. "Stability analysis and output feedback control for stochastic networked systems with multiple communication delays and nonlinearities using fuzzy control technique," Applied Mathematics and Computation, Elsevier, vol. 386(C).
Guo, Beibei & Xiao, Yu, 2023. "Intermittent synchronization for multi-link and multi-delayed large-scale systems with semi-Markov jump and its application of Chua’s circuits," Chaos, Solitons & Fractals, Elsevier, vol. 174(C).
Xia, ZeLiang & He, Shuping, 2022. "Finite-time asynchronous H∞ fault-tolerant control for nonlinear hidden markov jump systems with actuator and sensor faults," Applied Mathematics and Computation, Elsevier, vol. 428(C).
Long, Mingkang & Su, Housheng & Liu, Bo, 2019. "Second-order controllability of two-time-scale multi-agent systems," Applied Mathematics and Computation, Elsevier, vol. 343(C), pages 299-313.
Hongjie Li & Ming Chen & Shigen Shen & Lin Li, 2013. "Delay‐Distribution‐Dependent Consensus for Second‐Order Leader‐Follower Nonlinear Multiagent Systems via Pinning Control," Abstract and Applied Analysis, John Wiley & Sons, vol. 2013(1).
Xi, Jianxiang & Shi, Zongying & Zhong, Yisheng, 2012. "Admissible consensus and consensualization of high-order linear time-invariant singular swarm systems," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(23), pages 5839-5849.
Chen, Yonghui & Zhang, Xian & Xue, Yu, 2022. "Global exponential synchronization of high-order quaternion Hopfield neural networks with unbounded distributed delays and time-varying discrete delays," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 193(C), pages 173-189.
Guo, Wanli & He, Wennuo & Shi, Lili & Sun, Wen & Lu, Xiaoqing, 2021. "Fixed-time consensus tracking for nonlinear stochastically disturbed multi-agent systems via discontinuous protocols," Applied Mathematics and Computation, Elsevier, vol. 400(C).


