CASIA OpenIR

浏览/检索结果: 共18条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:169/38  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:127/52  |  提交时间:2023/06/29
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:138/28  |  提交时间:2023/06/21
Distributed event-triggered formation control for a multi-robotic fish system 会议论文
, Harbin, China, 5-7 August 2022
作者:  Dai, Shijie(戴时捷);  Zhengxing Wu;  Min Tan;  Junzhi Yu
Adobe PDF(366Kb)  |  收藏  |  浏览/下载:118/34  |  提交时间:2023/06/15
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems 会议论文
, Queensland, Australia, June 18-23, 2023
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Ai XL(艾晓琳);  Yuan GM(袁莞迈)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:150/46  |  提交时间:2023/06/12
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:122/56  |  提交时间:2023/06/12
Multi-Target Encirclement with Collision Avoidance via Deep Reinforcement Learning using Relational Graphs 会议论文
, Philadelphia, PA, USA, May 23-27, 2022
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(4277Kb)  |  收藏  |  浏览/下载:129/33  |  提交时间:2023/06/12
A Multi Domain Knowledge Enhanced Matching Network for Response Selection in Retrieval-Based Dialogue Systems 会议论文
, Singapore, Singapore, 2022-05
作者:  Chen, Xiuyi;  Chen, Feilong;  Xu, Shuang;  Xu, Bo
Adobe PDF(1248Kb)  |  收藏  |  浏览/下载:234/52  |  提交时间:2022/06/27
Multi-robot cooperative target encirclement through learning distributed transferable policy 会议论文
, Online, July 19-24
作者:  Zhang Tianle;  Liu Zhen;  Wu Shiguang;  Pu Zhiqiang;  Yi Jianqiang
Adobe PDF(949Kb)  |  收藏  |  浏览/下载:177/54  |  提交时间:2022/06/16
A hybrid formation control design for multi-robot system with obstacle avoidance 会议论文
, Guangzhou, July 27-30
作者:  Wu Shiguang;  Sui Zezhi;  Yi Jianqiang;  Pu Zhiqiang
Adobe PDF(751Kb)  |  收藏  |  浏览/下载:136/48  |  提交时间:2022/06/16