| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game. Conference paper, online, 2020-5. Authors: Liu MS (刘民颂); Zhu YH (朱圆恒); Zhao DB (赵冬斌)
Adobe PDF (727 Kb) | Views/Downloads: 34/14 | Submitted: 2024/07/04 |
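| Deep Reinforcement Learning Methods for Multi-Robot Games (面向多机器人博弈的深度强化学习方法). Thesis, 2024. Author: 胡光政
Adobe PDF (17740 Kb) | Views/Downloads: 44/0 | Submitted: 2024/07/04 | Keywords: multi-agent deep reinforcement learning; multi-robot games; minimax Q-learning; value decomposition; maximum entropy |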
| FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game. Journal article, IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, pp. 1-13. Authors: Guangzheng Hu; Yuanheng Zhu; Haoran Li; Dongbin Zhao
Adobe PDF (2144 Kb) | Views/Downloads: 52/11 | Submitted: 2024/06/05 | Keywords: Games; Q-learning; Task analysis; Optimization; Convergence; Training; Nash equilibrium; Multi-agent reinforcement learning; minimax-Q learning; two-team zero-sum Markov games |
| Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems. Conference paper, online, 2022. Authors: Qingxu Fu; Tenghai Qiu; Jianqiang Yi; Zhiqiang Pu; Shiguang Wu
Adobe PDF (5807 Kb) | Views/Downloads: 50/18 | Submitted: 2024/06/05 |
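| Research on Large-Scale Swarm Intelligent Decision-Making Methods Based on Deep Reinforcement Learning (基于深度强化学习的大规模群体智能决策方法研究). Thesis, 2024. Author: 付清旭
Adobe PDF (39071 Kb) | Views/Downloads: 64/6 | Submitted: 2024/05/29 | Keywords: large-scale; swarm systems; cooperation; decision-making; deep reinforcement learning; multi-agent systems |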
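| Adversarial Decision-Making for Continuous-Action Aerial Games Based on Deep Reinforcement Learning (基于深度强化学习的连续动作空中博弈对抗决策). Thesis, 2023. Author: 李伟凡
Adobe PDF (43167 Kb) | Views/Downloads: 519/19 | Submitted: 2023/06/26 | Keywords: reinforcement learning; deep reinforcement learning; self-attention network; intelligent decision-making; multi-agent systems |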
| Multiagent Adversarial Collaborative Learning via Mean-Field Theory. Journal article, IEEE Transactions on Cybernetics, 2021, vol. 51, no. 10, pp. 4994-5007. Authors: Luo, Guiyang; Zhang, Hui; He, Haibo; Li, Jinglin; Wang, Fei-Yue
Views/Downloads: 224/0 | Submitted: 2021/12/28 | Keywords: Games; Training; Collaborative work; Task analysis; Nash equilibrium; Sociology; Statistics; Adversarial collaborative learning (ACL); friend-or-foe Q-learning; mean-field theory; multiagent reinforcement learning (MARL) |
| Multi-Agent Cooperation and Competition with Two-Level Graph Attention Network. Conference paper, online, 2020-11. Authors: Shiguang, Wu; Zhiqiang, Pu; Jianqiang, Yi; Huimu, Wang
Adobe PDF (1185 Kb) | Views/Downloads: 176/1 | Submitted: 2021/06/24 |
| A Probabilistic Matrix Factorization Method for Link Sign Prediction in Social Networks. Conference paper, New York, NY, USA, July 16-21, 2016. Authors: Luo G (罗冠); Weiming Hu
Adobe PDF (410 Kb) | Views/Downloads: 207/54 | Submitted: 2019/10/08 |
| Clique-based cooperative multiagent reinforcement learning using factor graphs. Journal article, IEEE/CAA Journal of Automatica Sinica, 2015, vol. 3, no. 1, pp. 248-256. Authors: Zhang, Zhen; Zhao DB (赵冬斌)
Adobe PDF (707 Kb) | Views/Downloads: 243/99 | Submitted: 2017/12/30 | Keywords: Reinforcement Learning; Factor Graphs |