已选(0)清除
条数/页: 排序方式: |
| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文 , 线上, 2020-5 作者: Liu MS(刘民颂) ; Zhu YH(朱圆恒) ; Zhao DB(赵冬斌)![](/image/person.jpg)
Adobe PDF(727Kb)  |   收藏  |  浏览/下载:25/11  |  提交时间:2024/07/04 |
| Enhancing Reinforcement Learning via Transformer-based State Predictive Representations 期刊论文 IEEE Transactions on Artificial Intelligence, 2024, 页码: 1 - 12 作者: Liu MS(刘民颂) ; Zhu YH(朱圆恒) ; Chen YR(陈亚冉) ; Zhao DB(赵冬斌)![](/image/person.jpg)
Adobe PDF(1162Kb)  |   收藏  |  浏览/下载:33/9  |  提交时间:2024/06/24 |
| FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文 IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13 作者: Guangzheng Hu ; Yuanheng Zhu ; Haoran Li ; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(2144Kb)  |   收藏  |  浏览/下载:46/9  |  提交时间:2024/06/05 Games Q-learning Task analysis Optimization Convergence Training Nash equilibrium Multi-agent reinforcement learning minimax-Q learning two-team zero-sum Markov games |
| Deep Reinforcement Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1 - 16 作者: Liu, Yuqi ; Zhang, Qichao ; Gao, Yinfeng; Zhao, Dongbin![](/image/person.jpg)
Adobe PDF(22863Kb)  |   收藏  |  浏览/下载:47/16  |  提交时间:2024/06/03 Reinforcement Learning Autonomous Driving Intersection Navigating |
| A Reinforcement Learning Benchmark for Autonomous Driving in Intersection Scenarios 会议论文 , Orlando, FL, USA, 2022-1-24 作者: Liu, Yuqi ; Zhang, Qichao ; Zhao, Dongbin![](/image/person.jpg)
Adobe PDF(1537Kb)  |   收藏  |  浏览/下载:44/23  |  提交时间:2024/06/03 |
| Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文 , 昆士兰, 2023-6 作者: Li WF(李伟凡) ; Zhu YH(朱圆恒) ; Zhao DB(赵冬斌)![](/image/person.jpg)
Adobe PDF(4104Kb)  |   收藏  |  浏览/下载:261/81  |  提交时间:2023/06/29 multi-agent reinforcement learning policy gradient |
| Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文 , Xiamen, China, 2019-12-6 作者: Li WF(李伟凡) ; Zhu YH(朱圆恒) ; Zhao DB(赵冬斌)![](/image/person.jpg)
Adobe PDF(488Kb)  |   收藏  |  浏览/下载:156/49  |  提交时间:2023/06/28 reinforcement learning unsupervised clustering matrix game |
| Multi-Objective Neural Architecture Search for Light-Weight Model 会议论文 , Hangzhou, China, 22-24 November 2019 作者: Nannan Li ; Yaran Chen ; Zixiang Ding ; Dongbin Zhao ; Zhonghua Pang; Ruisheng Qin
Adobe PDF(430Kb)  |   收藏  |  浏览/下载:169/58  |  提交时间:2023/06/27 Neural architecture search light-weight multi-objective reinforcement learning image classification |
| A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat 期刊论文 IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, 页码: DOI: 10.1109/TSMC.2023.3270444 作者: Jiajun Chai ; Wenzhang Chen; Yuanheng Zhu ; Zong-xin Yao,; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(9249Kb)  |   收藏  |  浏览/下载:294/128  |  提交时间:2023/04/26 |
| Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241 作者: Zhu, Yuanheng ; Zhao, Dongbin![](/image/person.jpg)
Adobe PDF(2838Kb)  |   收藏  |  浏览/下载:256/13  |  提交时间:2022/06/10 Games Nash equilibrium Mathematical model Markov processes Convergence Dynamic programming Training Deep reinforcement learning (DRL) generalized policy iteration (GPI) Markov game (MG) Nash equilibrium Q network zero sum |