CASIA OpenIR

浏览/检索结果: 共59条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:104/41  |  提交时间:2023/06/29
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:113/54  |  提交时间:2023/06/12
Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文
, Online, 05 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(523Kb)  |  收藏  |  浏览/下载:86/36  |  提交时间:2023/06/12
Marine autonomous navigation for biomimetic underwater robots based on deep stereo attention network 会议论文
, Prague, Czech Republic, 2021年9月27日-2021年10月1日
作者:  Yan, Shuaizheng;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(4783Kb)  |  收藏  |  浏览/下载:130/56  |  提交时间:2023/06/12
Autonomous underwater vehicles  Visualization  Navigation  Biological system modeling  Real-time systems  
Benchmarking lane-changing decision-making for deep reinforcement learning 会议论文
, Guangzhou, China, 2021-11
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(1117Kb)  |  收藏  |  浏览/下载:104/39  |  提交时间:2023/05/30
A Collaborative Communication-Qmix Approach for Large-scale Networked Traffic Signal Control 会议论文
, Indianapolis, IN, United States, 2021-9-19
作者:  Chen, Xiaoyu;  Xiong, Gang;  Lv, Yisheng;  Chen, yuanyuan;  Song, bing;  Wang, Feiyue
Adobe PDF(1208Kb)  |  收藏  |  浏览/下载:232/59  |  提交时间:2022/06/16
ADEL: Autonomous Developmental Evolutionary Learning for Robotic Manipulation 会议论文
, 北京, 2021-8
作者:  Li YM(李一鸣)
Adobe PDF(9586Kb)  |  收藏  |  浏览/下载:130/15  |  提交时间:2022/06/16
Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments 会议论文
, 线上会议, 2021-9
作者:  Wu Shiguang;  Qiu Tenghai;  Pu Zhiqiang;  Yi Jianqiang
Adobe PDF(1396Kb)  |  收藏  |  浏览/下载:218/63  |  提交时间:2022/06/16
Reinforcement Learning Based Variable Impedance Control for High Precision Human-robot Collaboration Tasks 会议论文
, 线上, July 3-5, 2021
作者:  Meng, Yan;  Su, Jianhua;  Wu, Jiaxi
Adobe PDF(6621Kb)  |  收藏  |  浏览/下载:179/41  |  提交时间:2022/06/15
Dynamic Dual Gating Neural Networks 会议论文
, Online, 2021
作者:  Li, Fanrong;  Li, Gang;  He, Xiangyu;  Cheng, Jian
Adobe PDF(1988Kb)  |  收藏  |  浏览/下载:157/46  |  提交时间:2022/06/14