CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共69条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:22/10  |  提交时间:2024/07/04
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:35/15  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473
作者:  Liu MS(刘民颂);  Li LT(李伦通);  Hao S(郝帅);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/06/24
Enhancing Reinforcement Learning via Transformer-based State Predictive Representations 期刊论文
IEEE Transactions on Artificial Intelligence, 2024, 页码: 1 - 12
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Chen YR(陈亚冉);  Zhao DB(赵冬斌)
Adobe PDF(1162Kb)  |  收藏  |  浏览/下载:29/8  |  提交时间:2024/06/24
Driving Control with Deep and Reinforcement Learning in The Open Racing Car Simulator 会议论文
, Siem Reap, Cambodia, 2018, 12, 13-16
作者:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(697Kb)  |  收藏  |  浏览/下载:31/15  |  提交时间:2024/06/05
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12
作者:  Boyu Li;  Haran Li;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:32/10  |  提交时间:2024/06/05
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:44/8  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:258/81  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
Benchmarking lane-changing decision-making for deep reinforcement learning 会议论文
, Guangzhou, China, 2021-11
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(1117Kb)  |  收藏  |  浏览/下载:162/55  |  提交时间:2023/05/30
Lane change decision-making through deep reinforcement learning with rule-based constraints 会议论文
, Budapest, Hungary, 2019-7
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌);  Chen YR(陈亚冉)
Adobe PDF(295Kb)  |  收藏  |  浏览/下载:145/43  |  提交时间:2023/05/30
Lane Change  Decision-making  Deep Reinforcement Learning  Deep Q-Network