已选(0)清除
条数/页: 排序方式: |
| Driving Control with Deep and Reinforcement Learning in The Open Racing Car Simulator 会议论文 , Siem Reap, Cambodia, 2018, 12, 13-16 作者: Yuanheng Zhu ; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(697Kb)  |   收藏  |  浏览/下载:8/3  |  提交时间:2024/06/05 |
| MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12 作者: Boyu Li; Haran Li; Yuanheng Zhu ; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(9953Kb)  |   收藏  |  浏览/下载:6/3  |  提交时间:2024/06/05 |
| FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文 IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13 作者: Guangzheng Hu; Yuanheng Zhu ; Haoran Li; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(2144Kb)  |   收藏  |  浏览/下载:5/0  |  提交时间:2024/06/05 |
| Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文 , 昆士兰, 2023-6 作者: Li WF(李伟凡) ; Zhu YH(朱圆恒) ; Zhao DB(赵冬斌)![](/image/person.jpg)
Adobe PDF(4104Kb)  |   收藏  |  浏览/下载:231/75  |  提交时间:2023/06/29 multi-agent reinforcement learning policy gradient |
| Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming 会议论文 , Budapest, Hungary, 2019-7-14 作者: Zhu YH(朱圆恒) ; Haibo He; Dongbin Zhao ; Zhongsheng Hou
Adobe PDF(679Kb)  |   收藏  |  浏览/下载:63/31  |  提交时间:2023/05/22 |
| Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2022, 页码: doi={10.1109/TCDS.2022.3218940} 作者: Minsong Liu ; Luntong Li; Shuai Hao; Yuanheng Zhu ; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(12013Kb)  |   收藏  |  浏览/下载:77/20  |  提交时间:2023/04/26 |
| Vision-based control in the open racing car simulator with deep and reinforcement learning 期刊论文 Journal of Ambient Intelligence and Humanized Computing, 2019, 页码: doi={10.1007/s12652-019-01503-y} 作者: Yuanheng Zhu ; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(2210Kb)  |   收藏  |  浏览/下载:51/12  |  提交时间:2023/04/26 |
| Empirical Policy Optimization for n-Player Markov Games 期刊论文 IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775} 作者: Yuanheng Zhu ; Weifan Li ; Mengchen Zhao; Jianye Hao; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(1739Kb)  |   收藏  |  浏览/下载:99/40  |  提交时间:2023/04/26 |
| UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12 作者: Chai, Jiajun ; Li, Weifan ; Zhu, Yuanheng ; Zhao, Dongbin ; Ma, Zhe; Sun, Kewu; Ding, Jishiyu
Adobe PDF(3402Kb)  |   收藏  |  浏览/下载:259/29  |  提交时间:2022/01/27 Multi-agent systems Training Task analysis Reinforcement learning Sun Learning systems Semantics Centralized training with decentralized execution (CTDE) multiagent reinforcement learning StarCraft II |
| Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors 期刊论文 IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 卷号: 18, 期号: 3, 页码: 1097-1108 作者: Zhu, Yuanheng ; Zhao, Dongbin ; He, Haibo
Adobe PDF(2666Kb)  |   收藏  |  浏览/下载:195/2  |  提交时间:2021/08/15 Microscopy Feedback control Mathematical model Data models Dynamic programming Psychology Computational modeling Adaptive dynamic programming (ADP) heterogeneous corridors macroscopic pedestrian dynamics optimal feedback control pedestrian flow |