CASIA OpenIR

Browse/Search results: 4 items in total, showing items 1-4

Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game (Journal article)
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, Pages: 10
Authors:  Li, Hongyang;  Wei, Qinglai
Adobe PDF(716Kb)  |  Views/Downloads: 212/54  |  Submitted: 2022/06/14
Research on Distributed Iterative Control Methods Based on Adaptive Dynamic Programming (基于自适应动态规划的分布式迭代控制方法研究) (Thesis)
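Doctor of Engineering, School of Artificial Intelligence: University of Chinese Academy of Sciences, 2022
Author:  Li Hongyang (李洪阳)
Adobe PDF(3786Kb)  |  Views/Downloads: 304/26  |  Submitted: 2022/06/14
Adaptive dynamic programming, optimal control, distributed control, intelligent control, reinforcement learning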
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  Views/Downloads: 235/6  |  Submitted: 2022/06/10
Games, Nash equilibrium, Mathematical model, Markov processes, Convergence, Dynamic programming, Training, Deep reinforcement learning (DRL), generalized policy iteration (GPI), Markov game (MG), Q network, zero sum
On Iterative Proportional Updating: Limitations and Improvements for General Population Synthesis (Journal article)
IEEE Transactions on Cybernetics, 2022, Volume: 52, Issue: 3, Pages: 1726-1735
Authors:  Peijun Ye;  Bin Tian;  Yisheng Lv;  Qijie Li;  Fei-Yue Wang
Adobe PDF(1066Kb)  |  Views/Downloads: 242/51  |  Submitted: 2020/10/15
Agent-based simulation, bilevel optimization, iterative proportional updating (IPU), population synthesis