CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共36条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:240/25  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
作者:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:282/51  |  提交时间:2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
MGRL: Graph neural network based inference in a Markov network with Reinforcement Learning for visual navigation 期刊论文
Neurocomputing, 2021, 卷号: 0, 期号: 0, 页码: 0
作者:  Lu, Yi;  Chen, Yaran;  Zhao, Dongbin;  Li, Dong
浏览  |  Adobe PDF(976Kb)  |  收藏  |  浏览/下载:249/72  |  提交时间:2020/10/19
Visual navigation, graph neural network, Markov network, reinforcement learning, probabilistic graph model  
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 6, 页码: 2064-2076
作者:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(4274Kb)  |  收藏  |  浏览/下载:369/114  |  提交时间:2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation  
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 8, 页码: 2874-2885
作者:  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(1021Kb)  |  收藏  |  浏览/下载:415/122  |  提交时间:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
Adaptive cruise control via adaptive dynamic programming with experience replay 期刊论文
SOFT COMPUTING, 2019, 卷号: 23, 期号: 12, 页码: 4131-4144
作者:  Wang, Bin;  Zhao, Dongbin;  Cheng, Jin
收藏  |  浏览/下载:222/0  |  提交时间:2019/07/11
Adaptive cruise control  Adaptive dynamic programming  Experience replay  Reinforcement learning  Neural networks  
Reinforcement Learning and Deep Learning based Lateral Control for Autonomous Driving 期刊论文
IEEE Computational Intelligence Magazine, IEEE Computational Intelligence Magazine, 2019, 2019, 卷号: 14, 14, 期号: 2, 页码: 83-98, 83-98
作者:  Dong Li;  Dongbin Zhao;  Qichao Zhang;  Yaran Chen
浏览  |  Adobe PDF(2205Kb)  |  收藏  |  浏览/下载:363/104  |  提交时间:2019/04/25
Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:293/38  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  收藏  |  浏览/下载:411/183  |  提交时间:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Policy Iteration for Hinfinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE Transactions on Cybernetics, 2017, 期号: PP, 页码: 1-9
作者:  Yuanheng Zhu;  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(894Kb)  |  收藏  |  浏览/下载:341/163  |  提交时间:2017/09/13
Adaptive Dynamic Programming (Adp)  H∞ Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)