CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:228/4  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:237/6  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:361/82  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 6, 页码: 2064-2076
作者:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
浏览  |  Adobe PDF(4274Kb)  |  收藏  |  浏览/下载:394/121  |  提交时间:2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation  
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 714-725
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
浏览  |  Adobe PDF(547Kb)  |  收藏  |  浏览/下载:460/189  |  提交时间:2017/05/05
Adaptive Dynamic Programming (Adp)  H-infinity Control  Policy Iteration (Pi)  Zero-sum Game (Zsg)  
A pdf-Free Change Detection Test Based on Density Difference Estimation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 2, 页码: 324-334
作者:  Bu, Li;  Alippi, Cesare;  Zhao, Dongbin
浏览  |  Adobe PDF(2468Kb)  |  收藏  |  浏览/下载:395/114  |  提交时间:2017/05/04
Concept Drift  Least Squares Density-difference (Lsdd)-based Method  Probability Density Function (Pdf)-free  Three-level Threshold Mechanism  
Decentralized Stabilization for a Class of Continuous-Time Nonlinear Interconnected Systems Using Online Learning Optimal Control Approach 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 卷号: 25, 期号: 2, 页码: 418-428
作者:  Liu, Derong;  Wang, Ding;  Li, Hongliang
Adobe PDF(1311Kb)  |  收藏  |  浏览/下载:291/108  |  提交时间:2015/08/12
Adaptive Dynamic Programming  Decentralized Control  Large-scale Systems  Neural Networks  Nonlinear Interconnected Systems  Optimal Control  Policy Iteration  Reinforcement Learning  
Policy Iteration Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 卷号: 25, 期号: 3, 页码: 621-634
作者:  Liu, Derong;  Wei, Qinglai
Adobe PDF(2635Kb)  |  收藏  |  浏览/下载:224/87  |  提交时间:2015/08/12
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Discrete-time Policy Iteration  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning