已选(0)清除
条数/页: 排序方式: |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui ; Ruan Jingqing; Xing Dengpeng ; Xu Bo![](/image/person.jpg)
Adobe PDF(1450Kb)  |   收藏  |  浏览/下载:54/23  |  提交时间:2024/06/11 |
| Second-Order Global Attention Networks for Graph Classification and Regression 会议论文 , Beijing, China, August 27-28, 2022 作者: Hu Fenyu ; Cui Zeyu ; Wu Shu ; Liu Qiang; Wu Jinlin ; Wang Liang ; Tan Tieniu![](/image/person.jpg)
Adobe PDF(69424Kb)  |   收藏  |  浏览/下载:238/77  |  提交时间:2023/07/06 |
| Empirical Policy Optimization for n-Player Markov Games 期刊论文 IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775} 作者: Yuanheng Zhu ; Weifan Li ; Mengchen Zhao; Jianye Hao; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(1739Kb)  |   收藏  |  浏览/下载:118/49  |  提交时间:2023/04/26 |
| Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 9, 页码: 1673-1686 作者: Wenzhang Liu; Lu Dong; Dan Niu; Changyin Sun
Adobe PDF(5554Kb)  |   收藏  |  浏览/下载:184/79  |  提交时间:2022/08/19 Knowledge transfer multi-agent systems reinforcement learning successor features |
| 基于深度强化学习的群体协同决策方法研究 学位论文 工学博士, 中国科学院自动化研究所: 中国科学院自动化研究所, 2022 作者: 吴士广![](/image/person.jpg)
Adobe PDF(14260Kb)  |   收藏  |  浏览/下载:459/25  |  提交时间:2022/06/15 群体系统 协同决策 深度强化学习 多智能体强化学习 图注意力网络 |
| Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241 作者: Zhu, Yuanheng ; Zhao, Dongbin![](/image/person.jpg)
Adobe PDF(2838Kb)  |   收藏  |  浏览/下载:260/16  |  提交时间:2022/06/10 Games Nash equilibrium Mathematical model Markov processes Convergence Dynamic programming Training Deep reinforcement learning (DRL) generalized policy iteration (GPI) Markov game (MG) Nash equilibrium Q network zero sum |