CASIA OpenIR

Browse/Search results: 2 items in total, showing items 1-2

Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space (Journal article)
ARTIFICIAL INTELLIGENCE REVIEW, 2021, Pages: 36
Authors: Yang, Yongliang; Zhu, Hufei; Zhang, Qichao; Zhao, Bo; Li, Zhenning; Wunsch, Donald C.
Views/Downloads: 199/0  |  Submitted: 2021/11/02
Reproducing kernel Hilbert space  Actor-critic learning  Value function approximation  Online sparsification  Non-parametric learning  
Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 6, Pages: 2929-2943
Authors: Song, Ruizhuo; Wei, Qinglai; Zhang, Huaguang; Lewis, Frank L.
Views/Downloads: 206/0  |  Submitted: 2021/08/15
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  discrete-time  nonzero-sum (NZS)  off-policy  reinforcement learning (RL)