CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Peer Incentive Reinforcement Learning for Cooperative Multi-Agent Games 期刊论文
IEEE Transactions on Games, 2022, 页码: 1-14
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(18835Kb)  |  收藏  |  浏览/下载:134/33  |  提交时间:2023/06/12
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:197/69  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Dual-discriminator adversarial framework for data-free quantization 期刊论文
NEUROCOMPUTING, 2022, 卷号: 511, 页码: 67-77
作者:  Li, Zhikai;  Ma, Liping;  Long, Xianlei;  Xiao, Junrui;  Gu, Qingyi
Adobe PDF(1512Kb)  |  收藏  |  浏览/下载:369/80  |  提交时间:2022/11/21
Model compression  Quantized neural networks  Data-free quantization  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:252/12  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum