CASIA OpenIR

浏览/检索结果: 共107条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:140/34  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems 会议论文
, Queensland, Australia, June 18-23, 2023
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Ai XL(艾晓琳);  Yuan GM(袁莞迈)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:128/44  |  提交时间:2023/06/12
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:168/67  |  提交时间:2023/07/06
Evolution of opinions with estimation and interference 会议论文
Proceedings of 41st Chinese Control Conference, Hefei, 2022.7.25-27
作者:  Liu, Yifa;  Cheng, Long
Adobe PDF(214Kb)  |  收藏  |  浏览/下载:115/43  |  提交时间:2023/06/28
Opinion dynamics, Self-cognition, Estimation  
Meta-Imitation Learning by Watching Video Demonstrations 会议论文
, 线上, 2022.4.25-2022.4.29
作者:  Li, Jiayi;  Lu, Tao;  Cao, Xiaoge;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(8968Kb)  |  收藏  |  浏览/下载:197/47  |  提交时间:2022/06/14
Distributed event-triggered formation control for a multi-robotic fish system 会议论文
, Harbin, China, 5-7 August 2022
作者:  Dai, Shijie(戴时捷);  Zhengxing Wu;  Min Tan;  Junzhi Yu
Adobe PDF(366Kb)  |  收藏  |  浏览/下载:104/32  |  提交时间:2023/06/15
MiaoSuan Wargame: A Multi-Mode Integrated Platform for Imperfect Information Game 会议论文
, Beijing, China, August 21-24, 2022
作者:  Jiale Xu;  Jian Hu;  Shixian Wang;  Xuyang Yang;  Wancheng Ni
Adobe PDF(726Kb)  |  收藏  |  浏览/下载:58/16  |  提交时间:2023/06/28
open platform  human-computer gaming  AI evaluation  Turing test  imperfect information game  wargame  
Multi-UAV Cooperative Short-Range Combat via Attention-Based Reinforcement Learning using Individual Reward Shaping 会议论文
, Kyoto, Japan, October 23-27, 2022
作者:  Zhang TL(张天乐);  Qiu TH(丘腾海);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(896Kb)  |  收藏  |  浏览/下载:106/38  |  提交时间:2023/06/12
Multi-Target Encirclement with Collision Avoidance via Deep Reinforcement Learning using Relational Graphs 会议论文
, Philadelphia, PA, USA, May 23-27, 2022
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(4277Kb)  |  收藏  |  浏览/下载:104/28  |  提交时间:2023/06/12
知识和数据协同驱动的群体智能决策方法研究综述 期刊论文
自动化学报, 2022, 卷号: 48, 期号: 3, 页码: 1-17
作者:  蒲志强;  易建强;  刘振;  丘腾海;  孙金林;  李非墨
Adobe PDF(1352Kb)  |  收藏  |  浏览/下载:256/63  |  提交时间:2022/04/02
群体智能  知识与数据协同  多智能体  决策智能