CASIA OpenIR

浏览/检索结果: 共27条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文
, Padua, Italy, 2022年07月
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  收藏  |  浏览/下载:8/1  |  提交时间:2024/06/05
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文
, 北京华腾美居酒店, 2023-12-9
作者:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  收藏  |  浏览/下载:5/2  |  提交时间:2024/06/03
Cross-modal Prototype Learning for Zero-shot Handwriting Recognition 会议论文
, Sydney, Australia, 20-25 Septemper 2019
作者:  Ao, Xiang;  Zhang, Xu-Yao;  Yang, Hong-Ming;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(226Kb)  |  收藏  |  浏览/下载:16/7  |  提交时间:2024/05/30
printed character  handwritten character  cross-modal  prototype learning  zero-shot  
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:163/62  |  提交时间:2023/06/29
Multi-UAV Cooperative Short-Range Combat via Attention-Based Reinforcement Learning using Individual Reward Shaping 会议论文
, Kyoto, Japan, October 23-27, 2022
作者:  Zhang TL(张天乐);  Qiu TH(丘腾海);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(896Kb)  |  收藏  |  浏览/下载:142/45  |  提交时间:2023/06/12
Multi-Target Encirclement with Collision Avoidance via Deep Reinforcement Learning using Relational Graphs 会议论文
, Philadelphia, PA, USA, May 23-27, 2022
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(4277Kb)  |  收藏  |  浏览/下载:141/34  |  提交时间:2023/06/12
Robot Navigation among External Autonomous Agents through Deep Reinforcement Learning using Graph Attention Network 会议论文
, Berlin, Germany, July 12-17, 2020
作者:  Zhang TL(张天乐);  Qiu TH(丘腾海);  Pu ZQ(蒲志强);  Liu Z(刘振);  Yi JQ(易建强)
Adobe PDF(496Kb)  |  收藏  |  浏览/下载:106/32  |  提交时间:2023/06/12
A Brain-Inspired Causal Reasoning Model Based on Spiking Neural Networks 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Fang HongJian(方宏坚);  Zeng Yi(曾毅)
Adobe PDF(1959Kb)  |  收藏  |  浏览/下载:206/65  |  提交时间:2022/09/26
A Working Memory Model for Task-oriented Dialog Response Generation 会议论文
, Florence, Italy, 2019-07
作者:  Chen, Xiuyi;  Xu, Jiaming;  Xu, Bo
Adobe PDF(792Kb)  |  收藏  |  浏览/下载:169/56  |  提交时间:2022/06/27
Deep Behavioral Cloning for Traffic Control with Virtual Expert Demonstration Under a Parallel Learning Framework 会议论文
, 北京, 2020-12
作者:  Li Xiaoshuang;  Zhu Fenghua;  Wang Fei-Yue
Adobe PDF(770Kb)  |  收藏  |  浏览/下载:178/71  |  提交时间:2022/06/16