CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:157/35  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Manifold Regularized Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 932-943
作者:  Li, Hongliang;  Liu, Derong;  Wang, Ding
收藏  |  浏览/下载:175/0  |  提交时间:2018/10/10
Adaptive Dynamic Programming  Approximate Dynamic Programming  Approximate Policy Iteration (Api)  Manifold Regularization  Reinforcement Learning (Rl)  
Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems 期刊论文
COGNITIVE COMPUTATION, 2015, 卷号: 7, 期号: 6, 页码: 763-771
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
Adobe PDF(809Kb)  |  收藏  |  浏览/下载:244/38  |  提交时间:2016/01/18
Approximate Policy Iteration  Approximation Error  Optimal Control  Fuzzy Approximator  
Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design 期刊论文
AUTOMATICA, 2014, 卷号: 50, 期号: 12, 页码: 3281-3290
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
浏览  |  Adobe PDF(668Kb)  |  收藏  |  浏览/下载:261/93  |  提交时间:2015/08/12
Nonlinear Optimal Control  Reinforcement Learning  Off-policy  Data-based Approximate Policy Iteration  Neural Network  Hamilton-jacobi-bellman Equation