CASIA OpenIR

浏览/检索结果: 共135条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks 会议论文
, Macao, China, 2023-8
作者:  Pei Xu;  Junge Zhang;  Kaiqi Huang
Adobe PDF(1369Kb)  |  收藏  |  浏览/下载:206/64  |  提交时间:2023/06/19
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:196/67  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:140/34  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Counterfactual Debiasing for Fact Verification 会议论文
, Toronto, Canada, 7.9-7.14, 2023
作者:  Xu WZ(许伟志);  Liu Q(刘强);  Wu S(吴书);  Wang L(王亮)
Adobe PDF(1287Kb)  |  收藏  |  浏览/下载:143/42  |  提交时间:2023/06/26
Curiosity-Driven and Victim-Aware Adversarial Policies 会议论文
, Austin TX, USA, December 5-9, 2022
作者:  Gong C(龚晨);  Yang Z(杨洲);  Bai YP(白云鹏);  Shi JK(史杰克);  Sinha Arunesh;  Xu BW(徐博文);  Lo David;  Hou XW(侯新文);  Fan GL(范国梁)
Adobe PDF(4090Kb)  |  收藏  |  浏览/下载:106/44  |  提交时间:2023/06/27
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning 会议论文
, Kigali City, Rwanda, Africa, 2023-5-5
作者:  Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao,Zhang;  Dongbin, Zhao;  Yuzheng, Zhuang;  Ping, Luo;  Bin, Wang;  Jianye, Hao
Adobe PDF(3492Kb)  |  收藏  |  浏览/下载:112/35  |  提交时间:2023/06/29
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:120/37  |  提交时间:2023/06/28
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文
, Washington DC, USA, 2023-2-7
作者:  Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang;  Kaiqi Huang
Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:176/59  |  提交时间:2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent  
Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information 会议论文
, Jinghong, China, 05-09 December 2022
作者:  Junhang Wei;  Shaowei Cui;  Peng Hao;  Shuo Wang
Adobe PDF(933Kb)  |  收藏  |  浏览/下载:131/50  |  提交时间:2023/10/25
Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning 会议论文
, 中国桂林, 2022-7-9
作者:  Shan QF(单钦锋);  Wang WJ(王伟杰);  Guo DF(郭丁飞);  Sun XR(孙向荣);  Jia LH(贾立好)
Adobe PDF(494Kb)  |  收藏  |  浏览/下载:98/28  |  提交时间:2023/06/05
Deep learning  Mechatronics  Navigation  Reinforcement learning  Cost function  Real-time systems  Trajectory