已选(0)清除
条数/页: 排序方式: |
| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文 , 线上, 2020-5 作者: Liu MS(刘民颂); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(727Kb)  |  收藏  |  浏览/下载:16/8  |  提交时间:2024/07/04 |
| Enhancing Reinforcement Learning via Transformer-based State Predictive Representations 期刊论文 IEEE Transactions on Artificial Intelligence, 2024, 页码: 1 - 12 作者: Liu MS(刘民颂); Zhu YH(朱圆恒); Chen YR(陈亚冉); Zhao DB(赵冬斌) Adobe PDF(1162Kb)  |  收藏  |  浏览/下载:28/8  |  提交时间:2024/06/24 |
| Boosting On-Policy Actor–Critic With Shallow Updates in Critic 期刊论文 IEEE Transactions on Neural Networks and Learning Systems, 2024, 页码: 1-10 作者: Luntong Li; Yuanheng Zhu Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:38/13  |  提交时间:2024/06/05 |
| MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12 作者: Boyu Li; Haran Li; Yuanheng Zhu; Dongbin Zhao Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:27/8  |  提交时间:2024/06/05 |
| NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13 作者: Chai, Jiajun; Zhu, Yuanheng; Zhao, Dongbin Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:61/3  |  提交时间:2023/11/16 Large-scale multiagent neighboring communication reinforcement learning (RL) variational information flow |
| Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming 会议论文 , Budapest, Hungary, 2019-7-14 作者: Zhu YH(朱圆恒); Haibo He; Dongbin Zhao; Zhongsheng Hou Adobe PDF(679Kb)  |  收藏  |  浏览/下载:73/35  |  提交时间:2023/05/22 |
| A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat 期刊论文 IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, 页码: DOI: 10.1109/TSMC.2023.3270444 作者: Jiajun Chai; Wenzhang Chen; Yuanheng Zhu; Zong-xin Yao,; Dongbin Zhao Adobe PDF(9249Kb)  |  收藏  |  浏览/下载:284/124  |  提交时间:2023/04/26 |
| Empirical Policy Optimization for n-Player Markov Games 期刊论文 IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775} 作者: Yuanheng Zhu; Weifan Li; Mengchen Zhao; Jianye Hao; Dongbin Zhao Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:110/44  |  提交时间:2023/04/26 |
| UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12 作者: Chai, Jiajun; Li, Weifan; Zhu, Yuanheng; Zhao, Dongbin; Ma, Zhe; Sun, Kewu; Ding, Jishiyu Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:287/37  |  提交时间:2022/01/27 Multi-agent systems Training Task analysis Reinforcement learning Sun Learning systems Semantics Centralized training with decentralized execution (CTDE) multiagent reinforcement learning StarCraft II |
| Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 15 作者: Yang, Xiong; Zhu, Yuanheng; Dong, Na; Wei, Qinglai Adobe PDF(1578Kb)  |  收藏  |  浏览/下载:234/14  |  提交时间:2022/01/27 Adaptive critic designs (ACDs) adaptive dynamic programming (ADP) decentralized event-driven control input constraint reinforcement learning (RL) |