CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms 期刊论文
NEURAL NETWORKS, 2024, 卷号: 169, 页码: 764-777
作者:  Chen, Yurou;  Zhang, Fengyi;  Liu, Zhiyong
收藏  |  浏览/下载:49/0  |  提交时间:2024/02/22
Reinforcement Learning  Policy gradient  Actor-critic  Value function  Bias-variance trade-off  
Synergetic learning for unknown nonlinear H. control using neural networks 期刊论文
NEURAL NETWORKS, 2023, 卷号: 168, 页码: 287-299
作者:  Zhu, Liao;  Guo, Ping;  Wei, Qinglai
收藏  |  浏览/下载:98/0  |  提交时间:2023/12/21
H. control  Nonlinear systems  Adaptive dynamic programming  Temporal difference  Neural network  Data-driven  
DAPath: Distance-aware knowledge graph reasoning based on deep reinforcement learning 期刊论文
NEURAL NETWORKS, 2021, 卷号: 135, 页码: 1-12
作者:  Tiwari, Prayag;  Zhu, Hongyin;  Pandey, Hari Mohan
收藏  |  浏览/下载:254/0  |  提交时间:2021/03/15
Knowledge graph reasoning  Reinforcement learning  Graph self-attention  GRU  
Learning to Activate Logic Rules for Textual Reasoning 期刊论文
NEURAL NETWORKS, 2018, 期号: 106, 页码: 42-49
作者:  Yao, Yiqun;  Xu, Jiaming;  Shi, Jing;  Xu, Bo
收藏  |  浏览/下载:108/0  |  提交时间:2020/10/27
Natural Language Reasoning  Memory Networks  Image Schema  Logic Rules  Reinforcement Learning  
Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks 期刊论文
NEURAL NETWORKS, 2018, 期号: 1, 页码: 1-27
作者:  Shi, Jing;  Xu, Jiaming;  Yao, Yiqun;  Xu, Bo
收藏  |  浏览/下载:48/0  |  提交时间:2020/10/27
One-shot Learning  Memory  Attention  Deep Reinforcement  
Improved value iteration for neural-network-based stochastic optimal control design 期刊论文
NEURAL NETWORKS, 2020, 卷号: 124, 页码: 280-295
作者:  Liang, Mingming;  Wang, Ding;  Liu, Derong
Adobe PDF(5875Kb)  |  收藏  |  浏览/下载:269/58  |  提交时间:2020/06/02
Adaptive critic designs  Adaptive dynamic programming  Neural networks  Optimal control  Stochastic processes  Value iteration  
Reinforcement learning solution for HJB equation arising in constrained optimal control problem 期刊论文
NEURAL NETWORKS, 2015, 卷号: 71, 期号: 0, 页码: 150-158
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
浏览  |  Adobe PDF(530Kb)  |  收藏  |  浏览/下载:482/206  |  提交时间:2016/03/30
Constrained Optimal Control  Data-based  Off-policy Reinforcement Learning  Hamilton-jacobi-bellman Equation  The Method Of Weighted Residuals  
An iterative is an element of-optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state 期刊论文
NEURAL NETWORKS, 2012, 卷号: 32, 期号: x, 页码: 236-244
作者:  Wei, Qinglai;  Liu, Derong;  Derong Liu
浏览  |  Adobe PDF(438Kb)  |  收藏  |  浏览/下载:214/66  |  提交时间:2015/08/12
Adaptive Dynamic Programming  Approximate Dynamic Programming  Is An Element Of-optimal Control  Finite Horizon  Neural Networks  
Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning 期刊论文
NEURAL NETWORKS, 2014, 卷号: 55, 页码: 30-41
作者:  Yang, Xiong;  Liu, Derong;  Wang, Ding;  Wei, Qinglai
浏览  |  Adobe PDF(684Kb)  |  收藏  |  浏览/下载:395/146  |  提交时间:2015/08/12
Adaptive Critic Design  Neural Network  Nonaffine Nonlinear System  Online Learning  Reinforcement Learning