中国科学院自动化研究所机构知识库(CASIA OpenIR): 检索

浏览/检索结果: 共3条，第1-3条

帮助

已选(0)清除条数/页：排序方式：
	Continuous-Time Time-Varying Policy Iteration 期刊论文 IEEE TRANSACTIONS ON CYBERNETICS, 2020, 卷号: 50, 期号: 12, 页码: 4958-4971 作者: Wei, Qinglai; Liao, Zehua; Yang, Zhanyu; Li, Benkai; Liu, Derong Adobe PDF(3149Kb) \| 收藏 \| 浏览/下载：293/54 \| 提交时间：2021/03/02 Optimal control Nonlinear systems Time-varying systems Mathematical model Dynamic programming Approximation algorithms Iterative algorithms Adaptive critic designs adaptive dynamic programming (ADP) neuro-dynamic programming nonlinear systems optimal control policy iteration
	Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics 期刊论文 IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 8, 页码: 2874-2885 作者: Zhang, Qichao; Zhao, Dongbin Adobe PDF(1021Kb) \| 收藏 \| 浏览/下载：448/131 \| 提交时间：2019/07/12 Integral reinforcement learning (IRL) neural network (NN) nonzero-sum (NZS) games off-policy single-critic unknown drift dynamics
	Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay 期刊论文 IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 12, 页码: 3337-3348 作者: Luo, Biao; Yang, Yin; Liu, Derong 收藏 \| 浏览/下载：321/0 \| 提交时间：2019/01/08 Data-based experience replay neural networks (NNs) off-policy optimal control Q-learning (QL)

中国科学院自动化研究所机构知识库