已选(0)清除
条数/页: 排序方式: |
| 自适应分布式聚合博弈广义纳什均衡算法 期刊论文 自动化学报, 2024, 卷号: 50, 期号: 6, 页码: 1210-1220 作者: 时侠圣; 任璐; 孙长银 Adobe PDF(1595Kb)  |  收藏  |  浏览/下载:12/5  |  提交时间:2024/07/02 聚合博弈 自适应 比例积分 梯度跟踪 一般线性多智能体系统 |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui; Ruan Jingqing; Xing Dengpeng; Xu Bo Adobe PDF(1450Kb)  |  收藏  |  浏览/下载:35/12  |  提交时间:2024/06/11 |
| Distributed Optimal Variational GNE Seeking in Merely Monotone Games 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1621-1630 作者: Wangli He; Yanzhen Wang Adobe PDF(2076Kb)  |  收藏  |  浏览/下载:25/12  |  提交时间:2024/06/07 Distributed algorithms equilibria selection generalized Nash equilibrium (GNE) merely monotone games |
| Ultimately Bounded Output Feedback Control for Networked Nonlinear Systems With Unreliable Communication Channel: A Buffer-Aided Strategy 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1566-1578 作者: Yuhan Zhang; Zidong Wang; Lei Zou; Yun Chen; Guoping Lu Adobe PDF(2016Kb)  |  收藏  |  浏览/下载:28/9  |  提交时间:2024/06/07 Buffer-aided strategy neural networks nonlinear control output-feedback control unreliable communication channel |
| A cooperation and decision-making framework in dynamic confrontation for multi-agent systems 期刊论文 Computers and Electrical Engineering, 2024, 页码: 118 作者: Lexing Wang; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:35/9  |  提交时间:2024/06/06 |
| FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文 IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13 作者: Guangzheng Hu; Yuanheng Zhu; Haoran Li; Dongbin Zhao Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:32/4  |  提交时间:2024/06/05 Games Q-learning Task analysis Optimization Convergence Training Nash equilibrium Multi-agent reinforcement learning minimax-Q learning two-team zero-sum Markov games |
| 类脑心理揣测脉冲神经网络模型研究 学位论文 , 2024 作者: Zhao,Zhuoya Adobe PDF(23946Kb)  |  收藏  |  浏览/下载:26/2  |  提交时间:2024/06/04 类脑心理揣测模型 脉冲神经网络 多智能体社会交互 区分自我和他人 类脑心理揣测模型 脉冲神经网络 多智能体社会交互 区分自我和他人 类脑心理揣测模型 脉冲神经网络 多智能体社会交互 区分自我和他人 |
| Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories 期刊论文 IEEE Communications Surveys and Tutorials, 2024, 页码: 50 作者: Yang,Ning; Chen,Shuo; Zhang,Haijun; Berry,Randall Adobe PDF(1694Kb)  |  收藏  |  浏览/下载:43/4  |  提交时间:2024/06/01 Reinforcement learning, mobile edge computing, offloading scheduling, content caching, and communication |
| 基于强化学习的多智能体协同决策关键问题研究 学位论文 , 2024 作者: 徐志伟 Adobe PDF(12464Kb)  |  收藏  |  浏览/下载:79/7  |  提交时间:2024/05/28 强化学习 多智能体系统 协同与合作 分层决策 对比学习 |
| 视觉自监督学习关键技术研究 学位论文 , 2024 作者: Li, Zhaowen(李朝闻) Adobe PDF(42567Kb)  |  收藏  |  浏览/下载:51/3  |  提交时间:2024/05/27 请输入关键词 |