CASIA OpenIR

浏览/检索结果: 共370条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks 会议论文
, Queensland, Australia, 2023-6
作者:  Hu GZ(胡光政);  Li HR(李浩然);  Liu SS(刘莎莎);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(2785Kb)  |  收藏  |  浏览/下载:20/4  |  提交时间:2024/07/04
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:7/4  |  提交时间:2024/07/04
Optimizing Reward Function Weights and Enhancing Control Mechanisms for Bipedal Robots Using LSTM and Attention Mechanisms 会议论文
, 河北保定, 2023-8-16
作者:  Cui LZ(崔凌志);  Tianqi Deng;  Lihua Ma;  Wenhao He
Adobe PDF(541Kb)  |  收藏  |  浏览/下载:7/1  |  提交时间:2024/07/01
Adaptive Multi-Agent Coordination among Different Team Attribute Tasks via Contextual Meta-Reinforcement Learning 会议论文
, 河南开封, 2024年5月17-19日
作者:  Huang, Shangjing;  Zhao, Zijie;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(15515Kb)  |  收藏  |  浏览/下载:10/4  |  提交时间:2024/06/26
Online Optimization of Normalized CPGs for a Multi-Joint Robotic Fish 会议论文
, 中国,上海, 2021年7月
作者:  Tong R(仝茹);  Wu ZX(吴正兴);  Wang J(王健);  Tan M(谭民);  Yu JZ(喻俊志)
Adobe PDF(456Kb)  |  收藏  |  浏览/下载:6/4  |  提交时间:2024/06/26
V2X-BGN: Camera-based V2X-Collaborative 3D Object Detection with BEV Global Non-Maximum Suppression 会议论文
, Jeju Island, South Korea, June 2-5, 2024
作者:  Zhang Caiji;  Tian Bin;  Meng Shi;  Qi Shuangying;  Sun Yang;  Ai Yunfeng;  Chen Long
Adobe PDF(1659Kb)  |  收藏  |  浏览/下载:14/6  |  提交时间:2024/06/25
V2X  
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/06/25
强化学习,分层强化学习  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:9/5  |  提交时间:2024/06/25
强化学习,分层强化学习  
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:11/5  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning 会议论文
, Vancouver, British Columbia, Canada, 2019-08-22至2019-08-26
作者:  Xingyuan Gao;  Xiaoguang Zhao;  Min Tan
Adobe PDF(1500Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/06/21