CASIA OpenIR

Browse/Search Results: 10 items in total, showing items 1-10

Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning (Conference paper)
Queensland, 2023-6
Authors:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  Views/Downloads: 201/68  |  Submitted: 2023/06/29
multi-agent  reinforcement learning  policy gradient  
Evolution of opinions with estimation and interference (Conference paper)
Proceedings of 41st Chinese Control Conference, Hefei, 2022.7.25-27
Authors:  Liu, Yifa;  Cheng, Long
Adobe PDF(214Kb)  |  Views/Downloads: 116/43  |  Submitted: 2023/06/28
Opinion dynamics, Self-cognition, Estimation  
Commander-Soldiers Reinforcement Learning for Cooperative Multi-Agent Systems (Conference paper)
Italy, 2022-7
Authors:  Chen YQ(陈逸群);  Yang Wei;  Tianle Zhang;  Shiguang Wu;  Hongxing Chang
Adobe PDF(15907Kb)  |  Views/Downloads: 121/28  |  Submitted: 2023/06/28
Empirical Learning of Decision Parameters for Agent-Based Model (Conference paper)
Macau, China, 2022
Authors:  Song B(宋冰);  Xiong G(熊刚);  Zhu F(朱凤华);  Wu X(武许可);  Lv Y(吕宜生);  Ye P(叶佩军)
Adobe PDF(1359Kb)  |  Views/Downloads: 118/44  |  Submitted: 2023/06/26
Calibration of Agent-Based Model Using Reinforcement Learning (Conference paper)
Beijing, 2021
Authors:  Song B(宋冰);  Xiong G(熊刚);  Yu S(于松民);  Ye P(叶佩军);  Dong X(董西松);  Lv Y(吕宜生)
Adobe PDF(437Kb)  |  Views/Downloads: 110/40  |  Submitted: 2023/06/26
STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-Agent Cooperation (Conference paper)
Online, 2020-11
Authors:  Huimu Wang;  Zhen Liu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(916Kb)  |  Views/Downloads: 91/0  |  Submitted: 2021/06/24
Multi-Agent Cooperation and Competition with Two-Level Graph Attention Network (Conference paper)
Online, 2020-11
Authors:  Shiguang, Wu;  Zhiqiang, Pu;  Jianqiang, Yi;  Huimu, Wang
Adobe PDF(1185Kb)  |  Views/Downloads: 138/1  |  Submitted: 2021/06/24
Adaptive flocking of multi-agent system with uncertain nonlinear dynamics and unknown disturbances using neural networks (Conference paper)
Online, August 20-21
Authors:  Shiguang Wu;  Zhiqiang Pu;  Jianqiang Yi;  Jinlin Su;  Tianyi Xiong;  Tenghai Qiu
Adobe PDF(2014Kb)  |  Views/Downloads: 145/50  |  Submitted: 2022/04/06
Multi-Agent Formation Control with Obstacles Avoidance under Restricted Communication through Graph Reinforcement Learning (Conference paper)
Online, 2020.06
Authors:  Huimu, Wang;  Tenghai, Qiu;  Zhen, Liu;  Zhiqiang, Pu;  Jianqiang, Yi
Adobe PDF(1461Kb)  |  Views/Downloads: 170/35  |  Submitted: 2021/06/24
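A Modeling Method for Autonomous Signalized Intersections Based on Auctions and Marginal Benefit (基于拍卖和边际效益的自主信号交叉口建模方法) (Conference paper)
China, 2017
Authors:  赵伊瑶;  沈震;  张淅鹏;  熊刚;  朱凤华;  刘陶忠
Adobe PDF(811Kb)  |  Views/Downloads: 289/78  |  Submitted: 2017/12/31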