CASIA OpenIR

浏览/检索结果: 共10条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions 期刊论文
IEEE Transactions on Fuzzy Systems, 2024, 页码: 1
作者:  Qingxu Fu;  Zhiqiang Pu;  Yi Pan;  Tenghai Qiu;  Jianqiang Yi
Adobe PDF(4975Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/06/05
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/06/05
A Constrained Path Following Method for Snake-like Manipulators via Controlled Winding Uncoiling Strategy 会议论文
, Yokohama, Japan, 2024-5-13
作者:  Mingrui, Luo;  Yunong, Tian;  Yinghua, Cao;  Minghao, Chen;  Yanfeng, Zhang;  En, Li;  Min, Tan
Adobe PDF(5791Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/06/03
基于深度强化学习的大规模群体智能决策方法研究 学位论文
, 2024
作者:  付清旭
Adobe PDF(39071Kb)  |  收藏  |  浏览/下载:24/1  |  提交时间:2024/05/29
大规模,群体系统,协同,决策,深度强化学习,多智能体系统  
多智能体强化学习预训练方法研究 学位论文
, 2024
作者:  孟令辉
Adobe PDF(5071Kb)  |  收藏  |  浏览/下载:30/3  |  提交时间:2024/05/28
多智能体强化学习  预训练方法  神经网络  表示学习  在线强化评估  
多智能体策略一致性奖励塑造算法研究 学位论文
, 2024
作者:  杨晨
Adobe PDF(6011Kb)  |  收藏  |  浏览/下载:16/0  |  提交时间:2024/05/27
多智能体系统  深度强化学习  信用分配  奖励塑造  
Recursive Filtering for Stochastic Systems With Filter-and-Forward Successive Relays 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 5, 页码: 1202-1212
作者:  Hailong Tan;  Bo Shen;  Qi Li;  Hongjian Liu
Adobe PDF(1711Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/04/10
Filter-and-forward successive relay (FFSR)  recursive filtering  relay network  stochastic system  time-varying system  
Designing Proportional-Integral Consensus Protocols for Second-Order Multi-Agent Systems Using Delayed and Memorized State Information 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 4, 页码: 878-892
作者:  Honghai Wang;  Qing-Long Han
Adobe PDF(2068Kb)  |  收藏  |  浏览/下载:39/15  |  提交时间:2024/03/18
Consensus protocol  Hurwitz stability  multi-agent systems  quasi-polynomials  time delay  
Prescribed-Time Fully Distributed Nash Equilibrium Seeking Strategy in Networked Games 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 261-263
作者:  Cheng Qian;  Lei Ding
Adobe PDF(928Kb)  |  收藏  |  浏览/下载:115/46  |  提交时间:2024/01/02
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 18-36
作者:  Ding Wang;  Ning Gao;  Derong Liu;  Jinna Li;  Frank L. Lewis
Adobe PDF(1945Kb)  |  收藏  |  浏览/下载:265/184  |  提交时间:2024/01/02
Adaptive dynamic programming (ADP)  advanced control  complex environment  data-driven control  event-triggered design  intelligent control  neural networks  nonlinear systems  optimal control  reinforcement learning (RL)