CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 12, 页码: 3337-3348
作者:  Luo, Biao;  Yang, Yin;  Liu, Derong
收藏  |  浏览/下载:263/0  |  提交时间:2019/01/08
Data-based  experience replay  neural networks (NNs)  off-policy  optimal control  Q-learning (QL)  
A Review of Computational Intelligence for StarCraft AI 会议论文
, Bangalore, India, 18-21 Nov. 2018
作者:  Tang, Zhentao;  Shao, Kun;  Zhu, Yuanheng;  Li, Dong;  Zhao, Dongbin;  Huang, Tingwen
Adobe PDF(131Kb)  |  收藏  |  浏览/下载:466/219  |  提交时间:2019/04/25
Learning Evasion Strategy in Pursuit-Evasion by Deep Q-network 会议论文
, Beijing, China, 20-24 Aug. 2018
作者:  Zhu, Jiagang;  Zou, Wei;  Zhu, Zheng
浏览  |  Adobe PDF(1396Kb)  |  收藏  |  浏览/下载:226/63  |  提交时间:2020/06/10
Path Planning of Multiagent Constrained Formation through Deep Reinforcement Learning 会议论文
, Rio de Janeiro, Brazil, July 8-13, 2018
作者:  Sui Zezhi;  Pu Zhiqiang;  Yi Jianqiang;  Tan Xiangmin
浏览  |  Adobe PDF(1849Kb)  |  收藏  |  浏览/下载:188/47  |  提交时间:2020/07/08
Research on Autonomous Maneuvering Decision of UCAV Based on Deep Reinforcement Learning 会议论文
CCDC2018, Shenyang, China, June 9-11, 2018
作者:  Zhang, Yesheng;  Zu, Wei;  Gao, Yang;  Chang, Hongxing
浏览  |  Adobe PDF(1271Kb)  |  收藏  |  浏览/下载:667/262  |  提交时间:2018/05/09
Air Combat  Autonomous Maneuvering Decision  Deep Reinforcement Learning  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 6, 页码: 2099-2111
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning
浏览  |  Adobe PDF(1045Kb)  |  收藏  |  浏览/下载:369/113  |  提交时间:2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
深度强化学习在多机对战战术决策中的应用研究 学位论文
, 北京: 中国科学院大学, 2018
作者:  张业胜
Adobe PDF(6414Kb)  |  收藏  |  浏览/下载:572/8  |  提交时间:2018/06/19
深度强化学习  机动决策  战术决策  空战仿真  多机协同  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  收藏  |  浏览/下载:399/180  |  提交时间:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Visual navigation with Actor-Critic deep reinforcement learning 会议论文
, Rio, Brazil, 2018-01
作者:  Kun Shao;  Dongbin Zhao;  Yuanheng Zhu;  Qichao Zhang
浏览  |  Adobe PDF(1827Kb)  |  收藏  |  浏览/下载:290/122  |  提交时间:2019/04/22
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:279/34  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)