CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共24条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:126/40  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat 期刊论文
IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, 页码: DOI: 10.1109/TSMC.2023.3270444
作者:  Jiajun Chai;  Wenzhang Chen;  Yuanheng Zhu;  Zong-xin Yao,;  Dongbin Zhao
Adobe PDF(9249Kb)  |  收藏  |  浏览/下载:225/113  |  提交时间:2023/04/26
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:96/39  |  提交时间:2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:210/0  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Reinforcement Learning for Build-Order Production in StarCraft II 会议论文
, Cordoba, Granada, and Seville, Spain, 30 June-6 July 2018
作者:  Zhentao Tang;  Dongbin Zhao;  Yuanheng Zhu;  Ping Guo
Adobe PDF(2680Kb)  |  收藏  |  浏览/下载:171/53  |  提交时间:2021/07/07
深度强化学习进展: 从 AlphaGo 到 AlphaGo Zero 期刊论文
控 制 理 论 与 应 用, 2017, 卷号: 34, 期号: 12, 页码: 1529-1546
作者:  唐振韬;  邵 坤;  赵冬斌;  朱圆恒
Adobe PDF(8232Kb)  |  收藏  |  浏览/下载:230/36  |  提交时间:2021/07/05
深度强化学习  AlphaGo Zero  深度学习  强化学习  人工智能  
A Review of Computational Intelligence for StarCraft AI 会议论文
, Bangalore, India, 18-21 Nov. 2018
作者:  Tang, Zhentao;  Shao, Kun;  Zhu, Yuanheng;  Li, Dong;  Zhao, Dongbin;  Huang, Tingwen
浏览  |  Adobe PDF(131Kb)  |  收藏  |  浏览/下载:495/226  |  提交时间:2019/04/25
Visual navigation with Actor-Critic deep reinforcement learning 会议论文
, Rio, Brazil, 2018-01
作者:  Kun Shao;  Dongbin Zhao;  Yuanheng Zhu;  Qichao Zhang
Adobe PDF(1827Kb)  |  收藏  |  浏览/下载:314/128  |  提交时间:2019/04/22
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, 卷号: 3, 期号: 1, 页码: 73-84
作者:  Kun Shao;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(4125Kb)  |  收藏  |  浏览/下载:343/133  |  提交时间:2019/04/22
Reinforcement Learning, Transfer Learning, Curriculum Learning, Neural Network, Game Ai  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:297/40  |  提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)