CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:60/2  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
PIRNet: Personality-Enhanced Iterative Refinement Network for Emotion Recognition in Conversation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 12
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2013Kb)  |  收藏  |  浏览/下载:400/7  |  提交时间:2022/09/19
Emotion recognition  Iterative methods  Context modeling  Psychology  Oral communication  Logic gates  Learning systems  Contextual information  emotion recognition in conversation (ERC)  iterative method  Personality-enhanced Iterative Refinement Network (PIRNet)  personality influence  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:231/4  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:274/32  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 卷号: 28, 期号: 3, 页码: 714-725
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
浏览  |  Adobe PDF(547Kb)  |  收藏  |  浏览/下载:463/189  |  提交时间:2017/05/05
Adaptive Dynamic Programming (Adp)  H-infinity Control  Policy Iteration (Pi)  Zero-sum Game (Zsg)  
Multiple Actor-Critic Structures for Continuous-Time Optimal Control Using Input-Output Data 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 851-865
作者:  Song, Ruizhuo;  Lewis, Frank;  Wei, Qinglai;  Zhang, Hua-Guang;  Jiang, Zhong-Ping;  Levine, Dan;  Qinglai Wei
浏览  |  Adobe PDF(3455Kb)  |  收藏  |  浏览/下载:420/179  |  提交时间:2015/09/21
Actor-critic  Approximate Dynamic Programming (Adp)  Category  Optimal Control  Shunting Inhibitory Artificial Neural Network (Siann)  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
作者:  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2156Kb)  |  收藏  |  浏览/下载:286/115  |  提交时间:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation