CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:214/1  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Robust C-Loss Kernel Classifiers 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 3, 页码: 510-522
作者:  Xu, Guibiao;  Hu, Bao-Gang;  Principe, Jose C.
浏览  |  Adobe PDF(3169Kb)  |  收藏  |  浏览/下载:397/158  |  提交时间:2018/01/05
Correntropy  Half-quadratic (Hq) Optimization  Kernel Classifier  Loss Function  
Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 卷号: 27, 期号: 2, 页码: 444-458
作者:  Wei, Qinglai;  Song, Ruizhuo;  Yan, Pengfei
浏览  |  Adobe PDF(2204Kb)  |  收藏  |  浏览/下载:425/139  |  提交时间:2016/06/14
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Recurrent Neural Network (Rnn)  Reinforcement Learning  
Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 866-879
作者:  Wei, Qinglai;  Liu, Derong;  Yang, Xiong
浏览  |  Adobe PDF(2408Kb)  |  收藏  |  浏览/下载:295/117  |  提交时间:2015/09/21
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration  Neural Networks (Nns)  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
作者:  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2156Kb)  |  收藏  |  浏览/下载:277/112  |  提交时间:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation  
Error Bounds of Adaptive Dynamic Programming Algorithms for Solving Undiscounted Optimal Control Problems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 6, 页码: 1323-1334
作者:  Liu, Derong;  Li, Hongliang;  Wang, Ding
Adobe PDF(1114Kb)  |  收藏  |  浏览/下载:301/93  |  提交时间:2015/09/17
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  
Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 卷号: 25, 期号: 3, 页码: 621-634
作者:  Liu, Derong;  Wei, Qinglai
Adobe PDF(2635Kb)  |  收藏  |  浏览/下载:218/86  |  提交时间:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Discrete-time Policy Iteration  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning