CASIA OpenIR

Browse/Search Results: 79 items in total, showing items 1-10

PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction (Conference paper)
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors: Bai FS (白丰硕); Zhang HM (张鸿铭); Tao TY (陶天阳); Wu ZH (武志亨); Wang YN (王燕娜); Xu B (徐博)
Adobe PDF (1663Kb) | Views/Downloads: 141/34 | Submitted: 2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration (Journal article)
Machine Intelligence Research, 2023, pages: 1-12
Authors: Wang Xin; Wei Qinglai; Li Tao; Zhang Jie
Adobe PDF (1556Kb) | Views/Downloads: 145/56 | Submitted: 2023/06/26
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems (Conference paper)
Queensland, Australia, June 18-23, 2023
Authors: Zhang TL (张天乐); Liu Z (刘振); Pu ZQ (蒲志强); Yi JQ (易建强); Ai XL (艾晓琳); Yuan GM (袁莞迈)
Adobe PDF (612Kb) | Views/Downloads: 128/44 | Submitted: 2023/06/12
Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning (Conference paper)
Guilin, China, 2022-7-9
Authors: Shan QF (单钦锋); Wang WJ (王伟杰); Guo DF (郭丁飞); Sun XR (孙向荣); Jia LH (贾立好)
Adobe PDF (494Kb) | Views/Downloads: 98/28 | Submitted: 2023/06/05
Deep learning  Mechatronics  Navigation  Reinforcement learning  Cost function  Real-time systems  Trajectory  
Second-Order Global Attention Networks for Graph Classification and Regression (Conference paper)
Beijing, China, August 27-28, 2022
Authors: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu
Adobe PDF (69424Kb) | Views/Downloads: 168/67 | Submitted: 2023/07/06
LEARN EFFECTIVE REPRESENTATION FOR DEEP REINFORCEMENT LEARNING (Conference paper)
Taipei, Taiwan, 26 August 2022
Authors: Zhan Yuan; Xu Zhiwei; Fan Guoliang
Adobe PDF (2093Kb) | Views/Downloads: 114/42 | Submitted: 2023/06/08
Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution (Conference paper)
Padua, Italy, 18-23 July 2022
Authors: Yunpeng Bai; Chen Gong; Bin Zhang; Guoliang Fan; Xinwen Hou; Yu Liu
Adobe PDF (8946Kb) | Views/Downloads: 93/30 | Submitted: 2023/06/14
A Peer-to-Peer Distributed Bisecting K-means (Conference paper)
Online, 2022-2-19
Author: Gao HY (高浩元)
Adobe PDF (4307Kb) | Views/Downloads: 164/46 | Submitted: 2022/06/17
Continuous-Time Linear Parallel Output Regulation (Conference paper)
Beijing, China, 22-24 October 2021
Authors: Li, Hongyang; Wei, Qinglai; Wang, Fei-Yue
Adobe PDF (912Kb) | Views/Downloads: 181/60 | Submitted: 2022/06/14
Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning (Conference paper)
Italy, 2022-07
Authors: Yang GK (杨光开); Chenhao (陈皓); Junge Zhang (张俊格); Qiyue Yin (尹奇跃); Kaiqi Huang (黄凯奇)
Adobe PDF (2924Kb) | Views/Downloads: 224/49 | Submitted: 2022/07/12