CASIA OpenIR

浏览/检索结果: 共160条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Secure Tracking Control via Fixed-Time Convergent Reinforcement Learning for a UAV CPS 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1699-1701
作者:  Zhenyu Gong;  Feisheng Yang
Adobe PDF(604Kb)  |  收藏  |  浏览/下载:34/12  |  提交时间:2024/06/07
Distributed Optimal Variational GNE Seeking in Merely Monotone Games 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1621-1630
作者:  Wangli He;  Yanzhen Wang
Adobe PDF(2076Kb)  |  收藏  |  浏览/下载:25/12  |  提交时间:2024/06/07
Distributed algorithms  equilibria selection  generalized Nash equilibrium (GNE)  merely monotone games  
Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions 期刊论文
IEEE Transactions on Fuzzy Systems, 2024, 页码: 1
作者:  Qingxu Fu;  Zhiqiang Pu;  Yi Pan;  Tenghai Qiu;  Jianqiang Yi
Adobe PDF(4975Kb)  |  收藏  |  浏览/下载:30/10  |  提交时间:2024/06/05
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:31/9  |  提交时间:2024/06/05
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:32/4  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
A Constrained Path Following Method for Snake-like Manipulators via Controlled Winding Uncoiling Strategy 会议论文
, Yokohama, Japan, 2024-5-13
作者:  Mingrui, Luo;  Yunong, Tian;  Yinghua, Cao;  Minghao, Chen;  Yanfeng, Zhang;  En, Li;  Min, Tan
Adobe PDF(5791Kb)  |  收藏  |  浏览/下载:42/16  |  提交时间:2024/06/03
基于深度强化学习的大规模群体智能决策方法研究 学位论文
, 2024
作者:  付清旭
Adobe PDF(39071Kb)  |  收藏  |  浏览/下载:53/6  |  提交时间:2024/05/29
大规模,群体系统,协同,决策,深度强化学习,多智能体系统  
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:46/19  |  提交时间:2024/05/29
Policy Iteration Algorithm for Constrained Cost Optimal Control of Discrete-Time Nonlinear System 会议论文
, Shenzhen, China, 2021.7.18-22
作者:  Li, Tao;  Wei, Qinglai;  Li, Hongyang;  Song, Ruizhuo
Adobe PDF(920Kb)  |  收藏  |  浏览/下载:44/18  |  提交时间:2024/05/28
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 卷号: 35, 期号: 3, 页码: 3251 - 3264
作者:  Wei, Qinglai;  Li, Tao
Adobe PDF(8471Kb)  |  收藏  |  浏览/下载:46/16  |  提交时间:2024/05/28
Adaptive dynamic programming  approximate dynamic programming  constrained cost  optimal control  reinforcement learning