CASIA OpenIR

Browse/Search results: 12 items in total, showing items 1-10

Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control [Conference paper]
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
Authors: Chao Li; Chen Gong; Qiang He; Xinwen Hou
Adobe PDF (1457Kb) | Views/Downloads: 41/12 | Submitted: 2024/05/30
Explainable Reinforcement Learning via a Causal World Model [Conference paper]
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, Macao, China, 2023-08-22
Authors: Yu ZY(余忠蔚); Ruan JQ(阮景晴); Xing DP(邢登鹏)
Adobe PDF (850Kb) | Views/Downloads: 53/24 | Submitted: 2024/05/28
Reinforcement learning  Explainable artificial intelligence  Causal inference
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery [Conference paper]
Changsha, 2023-11
Authors: Kaishen Wang; Jingqing Ruan; Qingyang Zhang; Dengpeng Xing
Adobe PDF (2044Kb) | Views/Downloads: 40/22 | Submitted: 2024/05/28
Parallel Learning Based Foundation Model for Networked Traffic Signal Control [Conference paper]
Bilbao, Bizkaia, Spain, 2022-9-24
Authors: Zhao, Chen; Dai, Xingyuan; Chen, Yuanyuan; Yilun, Lin; Lv, Yisheng; Wang, Fei-Yue
Adobe PDF (1112Kb) | Views/Downloads: 32/13 | Submitted: 2024/05/28
Multi-task safe reinforcement learning for navigating intersections in dense traffic [Journal article]
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, Volume: 360, Issue: 17, Pages: 13737-13760
Authors: Liu, Yuqi; Gao, Yinfeng; Zhang, Qichao; Ding, Dawei; Zhao, Dongbin
Adobe PDF (3095Kb) | Views/Downloads: 86/18 | Submitted: 2024/02/22
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 13
Authors: Chai, Jiajun; Zhu, Yuanheng; Zhao, Dongbin
Adobe PDF (2469Kb) | Views/Downloads: 62/3 | Submitted: 2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning [Conference paper]
Kigali City, Rwanda, Africa, 2023-5-5
Authors: Junjie, Wang; Yao, Mu; Dong, Li; Qichao, Zhang; Dongbin, Zhao; Yuzheng, Zhuang; Ping, Luo; Bin, Wang; Jianye, Hao
Adobe PDF (3492Kb) | Views/Downloads: 173/46 | Submitted: 2023/06/29
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning [Conference paper]
Queensland, 2023-6
Authors: Li WF(李伟凡); Zhu YH(朱圆恒); Zhao DB(赵冬斌)
Adobe PDF (4104Kb) | Views/Downloads: 260/81 | Submitted: 2023/06/29
multi-agent  reinforcement learning  policy gradient  
Curiosity-Driven and Victim-Aware Adversarial Policies [Conference paper]
Austin TX, USA, December 5-9, 2022
Authors: Gong C(龚晨); Yang Z(杨洲); Bai YP(白云鹏); Shi JK(史杰克); Sinha Arunesh; Xu BW(徐博文); Lo David; Hou XW(侯新文); Fan GL(范国梁)
Adobe PDF (4090Kb) | Views/Downloads: 132/51 | Submitted: 2023/06/27
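Overtaking and Lane-Change Decision-Making Method Based on Deep Reinforcement Learning (基于深度强化学习的超车换道决策方法) [Thesis]
2023
Author: Wang Junjie (王俊杰)
Adobe PDF (17475Kb) | Views/Downloads: 193/3 | Submitted: 2023/06/26
Deep reinforcement learning, autonomous driving, lane-change decision-making, model-based value expansion, dynamics generalization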