CASIA OpenIR

浏览/检索结果: 共82条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Policy Iteration Algorithm for Constrained Cost Optimal Control of Discrete-Time Nonlinear System 会议论文
, Shenzhen, China, 2021.7.18-22
作者:  Li, Tao;  Wei, Qinglai;  Li, Hongyang;  Song, Ruizhuo
Adobe PDF(920Kb)  |  收藏  |  浏览/下载:9/4  |  提交时间:2024/05/28
D2AH-PPO: Playing ViZDoom With Object-Aware Hierarchical Reinforcement Learning 会议论文
, 中国重庆, 2024-5-7
作者:  Niu LY(钮龙宇);  Wan J(万军)
Adobe PDF(1645Kb)  |  收藏  |  浏览/下载:2/1  |  提交时间:2024/05/28
Position Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文
, Liuzhou, China, 20-22 November 2020
作者:  Ma, Ruichen;  Wang, Yu;  Gao, Zisen;  Zhao, Tianzi;  Wang, Rui;  Wang, Shuo;  Zhou, Chao
Adobe PDF(927Kb)  |  收藏  |  浏览/下载:75/34  |  提交时间:2023/08/03
Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文
, Suzhou, China, May 14-16, 2021
作者:  Ma, Ruichen;  Wang, Yu;  Wang, Rui;  Wang, Shuo
Adobe PDF(855Kb)  |  收藏  |  浏览/下载:82/30  |  提交时间:2023/08/02
Omnidirectional Drift Control  Undulating Fin  Underwater Biomimetic Vehicle-manipulator System (UBVMS)  Reinforcement Learning  Twin Delayed Deep Deterministic policy gradient (TD3)  
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning 会议论文
, Kigali City, Rwanda, Africa, 2023-5-5
作者:  Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao,Zhang;  Dongbin, Zhao;  Yuzheng, Zhuang;  Ping, Luo;  Bin, Wang;  Jianye, Hao
Adobe PDF(3492Kb)  |  收藏  |  浏览/下载:135/38  |  提交时间:2023/06/29
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:221/72  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:120/39  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:103/31  |  提交时间:2023/06/27
Curiosity-Driven and Victim-Aware Adversarial Policies 会议论文
, Austin TX, USA, December 5-9, 2022
作者:  Gong C(龚晨);  Yang Z(杨洲);  Bai YP(白云鹏);  Shi JK(史杰克);  Sinha Arunesh;  Xu BW(徐博文);  Lo David;  Hou XW(侯新文);  Fan GL(范国梁)
Adobe PDF(4090Kb)  |  收藏  |  浏览/下载:117/47  |  提交时间:2023/06/27
Locomotion Control of a Hybrid Propulsion Biomimetic Underwater Vehicle via Deep Reinforcement Learning 会议论文
, Xining, China, 15-19 July 2021
作者:  Zhang Tiandong;  Wang Rui;  Wang Yu;  Wang Shuo
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:58/20  |  提交时间:2023/06/14