CASIA OpenIR

浏览/检索结果: 共17条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:44/8  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:251/12  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Event-triggered optimal control for discrete-time multi-player non-zero-sum games using parallel control 期刊论文
INFORMATION SCIENCES, 2022, 卷号: 584, 页码: 519-535
作者:  Lu, Jingwei;  Wei, Qinglai;  Wang, Ziyang;  Zhou, Tianmin;  Wang, Fei-Yue
收藏  |  浏览/下载:248/0  |  提交时间:2021/12/28
Event-triggered  Non-zero-sum games  Parallel control  Neural network  Adaptive dynamic programming  
Event-Triggered Adaptive Dynamic Programming for Zero-Sum Game of Partially Unknown Continuous-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 9, 页码: 3189-3199
作者:  Xue, Shan;  Luo, Biao;  Liu, Derong
收藏  |  浏览/下载:264/0  |  提交时间:2020/09/28
Adaptive dynamic programming (ADP)  event-triggered control  Hamilton-Jacobi-Isaacs (HJI) equation  neural network (NN) identifier  zero-sum (ZS) game  
Actor-Critic-Identifier Structure-Based Decentralized Neuro-Optimal Control of Modular Robot Manipulators With Environmental Collisions 期刊论文
IEEE ACCESS, 2019, 卷号: 7, 页码: 96148-96165
作者:  Dong, Bo;  An, Tianjiao;  Zhou, Fan;  Liu, Keping;  Yu, Weibo;  Li, Yuanchun
收藏  |  浏览/下载:295/0  |  提交时间:2019/12/16
Adaptive dynamic programming  collision identification  decentralized optimal control  modular robot manipulators  zero-sum game  
Decentralized robust zero-sum neuro-optimal control for modular robot manipulators in contact with uncertain environments: theory and experimental verification 期刊论文
NONLINEAR DYNAMICS, 2019, 卷号: 97, 期号: 1, 页码: 503-524
作者:  Dong, Bo;  An, Tianjiao;  Zhou, Fan;  Liu, Keping;  Li, Yuanchun
收藏  |  浏览/下载:266/0  |  提交时间:2019/12/16
Modular robot manipulators  Adaptive dynamic programming  Decentralized control  Optimal control  Zero-sum game  
Model-Free Adaptive Control for Unknown Nonlinear Zero-Sum Differential Game 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 5, 页码: 1633-1646
作者:  Zhong, Xiangnan;  He, Haibo;  Wang, Ding;  Ni, Zhen
收藏  |  浏览/下载:182/0  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  Globalized Dual Heuristic Dynamic Programming (Gdhp)  Model-free  Neural Networks  Zero-sum Game  
On Mixed Data and Event Driven Design for Adaptive-Critic-Based Nonlinear H-infinity Control 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 993-1005
作者:  Wang, Ding;  Mu, Chaoxu;  Liu, Derong;  Ma, Hongwen
收藏  |  浏览/下载:210/0  |  提交时间:2018/10/10
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Data Driven Control  Event Driven Control  Hamilton-jacobi-isaacs (Hji) Equation  Neural Network Identification  Nonlinear H-infinity Control  Zero-sum Game  
Generative Adversarial Networks: Introduction and Outlook 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2017, 卷号: 4, 期号: 4, 页码: 588-598
作者:  Kunfeng Wang;  Chao Gou;  Yanjie Duan;  Yilun Lin;  Xinhu Zheng;  Fei-Yue Wang
浏览  |  Adobe PDF(16945Kb)  |  收藏  |  浏览/下载:383/46  |  提交时间:2018/01/08
Acp Approach  Adversarial Learning  Generative Adversarial Networks (Gans)  Generative Models  Parallel Intelligence  Zero-sum Game  
Chaotic system optimal tracking using data-based synchronous method with unknown dynamics and disturbances 期刊论文
CHINESE PHYSICS B, 2017, 卷号: 26, 期号: 3
作者:  Song, Ruizhuo;  Wei, Qinglai
收藏  |  浏览/下载:122/0  |  提交时间:2017/05/05
Adaptive Dynamic Programming  Approximate Dynamic Programming  Chaotic System  Zero-sum