CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文
, Padua, Italy, 2022年07月
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  收藏  |  浏览/下载:47/16  |  提交时间:2024/06/05
Traffic Signal Control Based on Reinforcement Learning and Fuzzy Neural Network 会议论文
, Macau, China, October 8-12, 2022
作者:  Zhao, Hongxia;  Chen, Songhang;  Zhu, Fenghua;  Tang, Haina
Adobe PDF(565Kb)  |  收藏  |  浏览/下载:42/18  |  提交时间:2024/06/03
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:39/8  |  提交时间:2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:42/8  |  提交时间:2024/05/28
Learning to Coordinate via Multiple Graph Neural Networks 会议论文
, BALI, Indonesia, December 8-12, 2021
作者:  Zhiwei Xu;  Bin Zhang;  Yunpeng Bai;  Dapeng Li;  Guoliang Fan
Adobe PDF(2047Kb)  |  收藏  |  浏览/下载:56/22  |  提交时间:2024/05/28
Intrinsic Reward with Peer Incentives for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Online, 18-23 July 2022
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Wu SG(吴士广);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(2189Kb)  |  收藏  |  浏览/下载:234/69  |  提交时间:2023/06/12
Multi-Target Encirclement with Collision Avoidance via Deep Reinforcement Learning using Relational Graphs 会议论文
, Philadelphia, PA, USA, May 23-27, 2022
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(4277Kb)  |  收藏  |  浏览/下载:178/41  |  提交时间:2023/06/12
Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning 会议论文
, 中国桂林, 2022-7-9
作者:  Shan QF(单钦锋);  Wang WJ(王伟杰);  Guo DF(郭丁飞);  Sun XR(孙向荣);  Jia LH(贾立好)
Adobe PDF(494Kb)  |  收藏  |  浏览/下载:162/51  |  提交时间:2023/06/05
Deep learning  Mechatronics  Navigation  Reinforcement learning  Cost function  Real-time systems  Trajectory