CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共49条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473
作者:  Liu MS(刘民颂);  Li LT(李伦通);  Hao S(郝帅);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:37/13  |  提交时间:2024/06/24
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12
作者:  Boyu Li;  Haran Li;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:31/10  |  提交时间:2024/06/05
Multi-task safe reinforcement learning for navigating intersections in dense traffic 期刊论文
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 卷号: 360, 期号: 17, 页码: 13737-13760
作者:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  收藏  |  浏览/下载:83/16  |  提交时间:2024/02/22
Adaptive Search for Broad Attention based Vision Transformers 期刊论文
IEEE Transactions on Evolutionary Computation, 2023, 页码: 0-0
作者:  Nannan Li;  Yaran Chen;  Dongbin Zhao
Adobe PDF(824Kb)  |  收藏  |  浏览/下载:194/60  |  提交时间:2023/06/28
Benchmarking lane-changing decision-making for deep reinforcement learning 会议论文
, Guangzhou, China, 2021-11
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(1117Kb)  |  收藏  |  浏览/下载:157/54  |  提交时间:2023/05/30
Lane change decision-making through deep reinforcement learning with rule-based constraints 会议论文
, Budapest, Hungary, 2019-7
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌);  Chen YR(陈亚冉)
Adobe PDF(295Kb)  |  收藏  |  浏览/下载:144/43  |  提交时间:2023/05/30
Lane Change  Decision-making  Deep Reinforcement Learning  Deep Q-Network  
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:193/68  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming 会议论文
, Budapest, Hungary, 2019-7-14
作者:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF(679Kb)  |  收藏  |  浏览/下载:73/35  |  提交时间:2023/05/22
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:111/44  |  提交时间:2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:250/12  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum