CASIA OpenIR

Browse/Search results: 25 items in total, showing items 1-10

Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning [Conference paper]
, Queensland, 2023-6
Authors:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  Views/Downloads: 196/67  |  Submitted: 2023/06/29
multi-agent  reinforcement learning  policy gradient  
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction [Conference paper]
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  Views/Downloads: 140/34  |  Submitted: 2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Knowledge Transfer from Situation Evaluation to Multi-agent Reinforcement Learning [Conference paper]
, New Delhi, India, 2022-11-22 to 2022-11-26
Authors:  Chen M(陈敏);  Pu ZQ(蒲志强);  Pan Y(潘一);  Yi JQ(易建强)
Adobe PDF(4734Kb)  |  Views/Downloads: 129/47  |  Submitted: 2023/06/27
Multi-agent reinforcement learning  Transfer learning  
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks [Conference paper]
, Washington DC, USA, 2023-2-7
Authors:  Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang;  Kaiqi Huang
Adobe PDF(2037Kb)  |  Views/Downloads: 176/59  |  Submitted: 2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent  
Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning [Conference paper]
, Guilin, China, 2022-7-9
Authors:  Shan QF(单钦锋);  Wang WJ(王伟杰);  Guo DF(郭丁飞);  Sun XR(孙向荣);  Jia LH(贾立好)
Adobe PDF(494Kb)  |  Views/Downloads: 98/28  |  Submitted: 2023/06/05
Deep learning  Mechatronics  Navigation  Reinforcement learning  Cost function  Real-time systems  Trajectory  
POPO: Pessimistic Offline Policy Optimization [Conference paper]
, Singapore, Singapore, 23-27 May 2022
Authors:  He Q(何强);  Hou XW(侯新文);  Liu Y(刘禹)
Adobe PDF(1200Kb)  |  Views/Downloads: 165/34  |  Submitted: 2022/06/27
reinforcement learning  offline optimization  out-of-distribution  
Trajectory-based Split Hindsight Reverse Curriculum Learning [Conference paper]
, Prague, Czech Republic, 2021-9
Authors:  Wu, Jiaxi;  Zhang, Dianmin;  Zhong, Shanlin;  Qiao, Hong
Adobe PDF(5094Kb)  |  Views/Downloads: 200/43  |  Submitted: 2022/06/14
Reinforcement Learning  Curriculum Learning  
Learning Smooth and Omnidirectional Locomotion for Quadruped Robots [Conference paper]
, Chongqing, China, 2021-7
Authors:  Wu, Jiaxi;  Wang, Chen'an;  Zhang, Dianmin;  Zhong, Shanlin;  Wang, Boxing;  Qiao, Hong
Adobe PDF(1436Kb)  |  Views/Downloads: 164/41  |  Submitted: 2022/06/14
Quadruped Robot  Reinforcement Learning  
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games [Conference paper]
, Shenzhen, China, 05-09 July 2021
Authors:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  Views/Downloads: 199/37  |  Submitted: 2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance  
Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning [Conference paper]
, Suzhou, China, May 14-16, 2021
Authors:  Ma, Ruichen;  Wang, Yu;  Wang, Rui;  Wang, Shuo
Adobe PDF(855Kb)  |  Views/Downloads: 71/28  |  Submitted: 2023/08/02
Omnidirectional Drift Control  Undulating Fin  Underwater Biomimetic Vehicle-manipulator System (UBVMS)  Reinforcement Learning  Twin Delayed Deep Deterministic policy gradient (TD3)