CASIA OpenIR

Browse/Search results: 2 items in total, showing items 1-2

Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms (Journal article)
NEURAL NETWORKS, 2024, Volume: 169, Pages: 764-777
Authors:  Chen, Yurou;  Zhang, Fengyi;  Liu, Zhiyong
Views/Downloads: 38/0  |  Submitted: 2024/02/22
Reinforcement Learning  Policy gradient  Actor-critic  Value function  Bias-variance trade-off  
Policy generation network for zero-shot policy learning (Journal article)
COMPUTATIONAL INTELLIGENCE, 2023, Pages: 27
Authors:  Qian, Yiming;  Zhang, Fengyi;  Liu, Zhiyong
Views/Downloads: 61/0  |  Submitted: 2023/11/17
knowledge representation  lifelong reinforcement learning  zero-shot policy generation