CASIA OpenIR

浏览/检索结果: 共28条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Tacit Commitments Emergence in Multi-agent Reinforcement Learning 会议论文
, New Delhi, India, 2023-7
作者:  Liu BY(刘博寅);  Zhiqiang Pu;  Junlong Gao;  Jianqiang Yi;  Zhenyu Guo
Adobe PDF(932Kb)  |  收藏  |  浏览/下载:15/6  |  提交时间:2024/07/15
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks 会议论文
, Queensland, Australia, 2023-6
作者:  Hu GZ(胡光政);  Li HR(李浩然);  Liu SS(刘莎莎);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(2785Kb)  |  收藏  |  浏览/下载:33/9  |  提交时间:2024/07/04
Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation 会议论文
, 新奥尔良, 2023-12-9 至 2023-12-15
作者:  Keji He;  Chenyang Si;  Zhihe Lu;  Yan Huang;  Liang Wang;  Xinchao Wang
Adobe PDF(2505Kb)  |  收藏  |  浏览/下载:41/15  |  提交时间:2024/06/26
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:35/14  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/06/25
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:47/11  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Multi-objective Deep Reinforcement Learning for Mobile Edge Computing 会议论文
, Singapore, 2023/8/24-27
作者:  Yang,Ning;  Wen,Junrui;  Zhang,Meng;  Tang,Ming
Adobe PDF(499Kb)  |  收藏  |  浏览/下载:51/18  |  提交时间:2024/06/05
mobile edge computing  multi-objective reinforcement learning  resource scheduling  
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文
, Gold Coast, Australia, 2023-6
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:44/10  |  提交时间:2024/06/05
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:41/12  |  提交时间:2024/06/05
Continuous Exploration via Multiple Perspectives in Sparse Reward Environment 会议论文
, 厦门国际会议中心, 2023-10-13
作者:  Chen ZP(陈忠鹏);  Guan Q(关强)
Adobe PDF(2260Kb)  |  收藏  |  浏览/下载:36/11  |  提交时间:2024/06/04
Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation