中国科学院自动化研究所机构知识库(CASIA OpenIR): 检索

浏览/检索结果: 共3条，第1-3条

帮助

已选(0)清除条数/页：排序方式：
	NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13 作者: Chai, Jiajun; Zhu, Yuanheng; Zhao, Dongbin Adobe PDF(2469Kb) \| 收藏 \| 浏览/下载：65/5 \| 提交时间：2023/11/16 Large-scale multiagent neighboring communication reinforcement learning (RL) variational information flow
	Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241 作者: Zhu, Yuanheng; Zhao, Dongbin Adobe PDF(2838Kb) \| 收藏 \| 浏览/下载：258/15 \| 提交时间：2022/06/10 Games Nash equilibrium Mathematical model Markov processes Convergence Dynamic programming Training Deep reinforcement learning (DRL) generalized policy iteration (GPI) Markov game (MG) Nash equilibrium Q network zero sum
	Synthesis of Cooperative Adaptive Cruise Control With Feedforward Strategies 期刊论文 IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 卷号: 69, 期号: 4, 页码: 3615-3627 作者: Zhu, Yuanheng; Zhao, Dongbin; He, Haibo Adobe PDF(2462Kb) \| 收藏 \| 浏览/下载：210/16 \| 提交时间：2020/06/22 Cooperative cruise control H-infinity-norm L-2-gain time-delay system state-space model

中国科学院自动化研究所机构知识库