CASIA OpenIR

浏览/检索结果: 共278条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Tacit Commitments Emergence in Multi-agent Reinforcement Learning 会议论文
, New Delhi, India, 2023-7
作者:  Liu BY(刘博寅);  Zhiqiang Pu;  Junlong Gao;  Jianqiang Yi;  Zhenyu Guo
Adobe PDF(932Kb)  |  收藏  |  浏览/下载:28/10  |  提交时间:2024/07/15
Improved Self-Propelled Swarms Model with Enhanced Convergence Efficiency 会议论文
, Tianjing, China, 2020
作者:  Boyin Liu;  Zhiqiang Pu;  Shiguang Wu;  Lele Wang
Adobe PDF(210Kb)  |  收藏  |  浏览/下载:35/15  |  提交时间:2024/07/12
A Semantic and Structural Transformer for Code Summarization Generation 会议论文
, 澳大利亚, 2023.6.8
作者:  Ruyi Ji;  Zhenyu Tong;  Tiejian Luo;  Jing Liu;  Libo Zhang
Adobe PDF(912Kb)  |  收藏  |  浏览/下载:35/14  |  提交时间:2024/07/08
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:49/22  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks 会议论文
, Queensland, Australia, 2023-6
作者:  Hu GZ(胡光政);  Li HR(李浩然);  Liu SS(刘莎莎);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(2785Kb)  |  收藏  |  浏览/下载:41/11  |  提交时间:2024/07/04
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:33/14  |  提交时间:2024/07/04
Gait Learning for 3D Bipedal Robots Based on a Combined Strategy of Hybrid Zero Dynamics Feedback Control and Periodic Reward 会议论文
, 中国湖南长沙, 2024-5-25
作者:  Cui LZ(崔凌志);  Tianqi Deng;  Lihua Ma;  Wenhao He
Adobe PDF(690Kb)  |  收藏  |  浏览/下载:42/16  |  提交时间:2024/07/01
Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation 会议论文
, 新奥尔良, 2023-12-9 至 2023-12-15
作者:  Keji He;  Chenyang Si;  Zhihe Lu;  Yan Huang;  Liang Wang;  Xinchao Wang
Adobe PDF(2505Kb)  |  收藏  |  浏览/下载:52/18  |  提交时间:2024/06/26
Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision 会议论文
, 线上, 2021-12-7至2021-12-10
作者:  Keji He;  Yan Huang;  Qi Wu;  Jianhua Yang;  Dong An;  Shuanglin Sima;  Liang Wang
Adobe PDF(871Kb)  |  收藏  |  浏览/下载:41/11  |  提交时间:2024/06/26
Adaptive Multi-Agent Coordination among Different Team Attribute Tasks via Contextual Meta-Reinforcement Learning 会议论文
, 河南开封, 2024年5月17-19日
作者:  Huang, Shangjing;  Zhao, Zijie;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(15515Kb)  |  收藏  |  浏览/下载:36/12  |  提交时间:2024/06/26