CASIA OpenIR

Browse/Search Results: 4 items in total, showing items 1-4

UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF (3402Kb)  |  Views/Downloads: 242/26  |  Submitted: 2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Neuro-Optimal Trajectory Tracking With Value Iteration of Discrete-Time Nonlinear Dynamics (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Wang, Ding;  Ha, Mingming;  Cheng, Long
Views/Downloads: 252/0  |  Submitted: 2022/01/27
Trajectory  Heuristic algorithms  Convergence  Trajectory tracking  Stability criteria  Optimal control  Dynamic programming  Adaptive critic design  discrete-time nonlinear plants  neuro-optimal trajectory tracking  uniformly ultimately bounded stability  value iteration  
Target Tracking Control of a Biomimetic Underwater Vehicle Through Deep Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Wang, Yu;  Tang, Chong;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Min;  Hou, Zengguang
Views/Downloads: 212/0  |  Submitted: 2022/01/27
Reinforcement learning  Target tracking  Robots  Sports  Aerospace electronics  Mobile robots  Underwater vehicles  Biomimetic underwater vehicle (BUV)  reinforcement learning  target tracking control  
Adaptive Critic Learning for Constrained Optimal Event-Triggered Control With Discounted Cost (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Volume: 32, Issue: 1, Pages: 91-104
Authors:  Yang, Xiong;  Wei, Qinglai
Views/Downloads: 174/0  |  Submitted: 2021/06/15
Nonlinear systems  Optimal control  Robustness  Cost function  Adaptive systems  Adaptive critic designs (ACDs)  adaptive critic learning (ACL)  adaptive dynamic programming (ADP)  constrained optimal control  event-triggered control (ETC)  reinforcement learning (RL)