CASIA OpenIR

Browse/Search results: 185 records in total, showing 1-10

Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning (Conference paper)
Queensland, 2023-6
Authors: Li WF(李伟凡); Zhu YH(朱圆恒); Zhao DB(赵冬斌)
Adobe PDF(4104Kb) | Views/Downloads: 195/66 | Submitted: 2023/06/29
Keywords: multi-agent; reinforcement learning; policy gradient
Curiosity-Driven and Victim-Aware Adversarial Policies (Conference paper)
Austin TX, USA, December 5-9, 2022
Authors: Gong C(龚晨); Yang Z(杨洲); Bai YP(白云鹏); Shi JK(史杰克); Sinha Arunesh; Xu BW(徐博文); Lo David; Hou XW(侯新文); Fan GL(范国梁)
Adobe PDF(4090Kb) | Views/Downloads: 104/43 | Submitted: 2023/06/27
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems (Conference paper)
Queensland, Australia, June 18-23, 2023
Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强); Ai XL(艾晓琳); Yuan GM(袁莞迈)
Adobe PDF(612Kb) | Views/Downloads: 128/44 | Submitted: 2023/06/12
Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information (Conference paper)
Jinghong, China, 05-09 December 2022
Authors: Junhang Wei; Shaowei Cui; Peng Hao; Shuo Wang
Adobe PDF(933Kb) | Views/Downloads: 129/50 | Submitted: 2023/10/25
Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning (Conference paper)
Guilin, China, 2022-7-9
Authors: Shan QF(单钦锋); Wang WJ(王伟杰); Guo DF(郭丁飞); Sun XR(孙向荣); Jia LH(贾立好)
Adobe PDF(494Kb) | Views/Downloads: 96/28 | Submitted: 2023/06/05
Keywords: Deep learning; Mechatronics; Navigation; Reinforcement learning; Cost function; Real-time systems; Trajectory
Second-Order Global Attention Networks for Graph Classification and Regression (Conference paper)
Beijing, China, August 27-28, 2022
Authors: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu
Adobe PDF(69424Kb) | Views/Downloads: 166/67 | Submitted: 2023/07/06
Evolution of opinions with estimation and interference (Conference paper)
Proceedings of 41st Chinese Control Conference, Hefei, 2022.7.25-27
Authors: Liu, Yifa; Cheng, Long
Adobe PDF(214Kb) | Views/Downloads: 115/43 | Submitted: 2023/06/28
Keywords: Opinion dynamics; Self-cognition; Estimation
Deconfounding Physical Dynamics with Global Causal Relation and Confounder Transmission for Counterfactual Prediction (Conference paper)
Vancouver, Canada (attended online), 2022-2
Authors: Li, Zongzhao; Zhu, Xiangyu; Lei, Zhen; Zhang, Zhaoxiang
Adobe PDF(5435Kb) | Views/Downloads: 98/33 | Submitted: 2023/07/14
A Peer-to-Peer Distributed Bisecting K-means (Conference paper)
Online, 2022-2-19
Author: Gao HY(高浩元)
Adobe PDF(4307Kb) | Views/Downloads: 164/46 | Submitted: 2022/06/17
Continuous-Time Linear Parallel Output Regulation (Conference paper)
Beijing, China, 22-24 October 2021
Authors: Li, Hongyang; Wei, Qinglai; Wang, Fei-Yue
Adobe PDF(912Kb) | Views/Downloads: 179/60 | Submitted: 2022/06/14