CASIA OpenIR

浏览/检索结果: 共19条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Neighbor-view Enhanced Model for Vision and Language Navigation 会议论文
Proceedings of the ACM International Conference on Multimedia, Chengdu, China, 2021-10-20
作者:  Dong An;  Yuankai Qi;  Yan Huang;  Qi Wu;  Liang Wang;  Tieniu Tan
Adobe PDF(2412Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/05/28
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:184/70  |  提交时间:2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:160/57  |  提交时间:2023/06/28
Locomotion Control of a Hybrid Propulsion Biomimetic Underwater Vehicle via Deep Reinforcement Learning 会议论文
, Xining, China, 15-19 July 2021
作者:  Zhang Tiandong;  Wang Rui;  Wang Yu;  Wang Shuo
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:72/24  |  提交时间:2023/06/14
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:146/62  |  提交时间:2023/06/12
Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文
, Online, 05 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(523Kb)  |  收藏  |  浏览/下载:128/51  |  提交时间:2023/06/12
Modeling Inter-Claim Interactions for Verifying Multiple Claims 会议论文
, 线上, 2021年11月
作者:  Wang S(王帅);  Mao WJ(毛文吉)
Adobe PDF(1336Kb)  |  收藏  |  浏览/下载:179/34  |  提交时间:2022/07/01
fact checking knowledge graph  
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文
, Shenzhen, China, 05-09 July 2021
作者:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:238/43  |  提交时间:2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance  
Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments 会议论文
, 线上会议, 2021-9
作者:  Wu Shiguang;  Qiu Tenghai;  Pu Zhiqiang;  Yi Jianqiang
Adobe PDF(1396Kb)  |  收藏  |  浏览/下载:257/76  |  提交时间:2022/06/16
DIMSAN: Fast Exploration with the Synergy between Density-based Intrinsic Motivation and Self-adaptive Action Noise 会议论文
, 西安, 2021.5.30-2021.6.5
作者:  Li, Jiayi;  Li, Boyao;  Lu, Tao;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(5599Kb)  |  收藏  |  浏览/下载:197/37  |  提交时间:2022/06/14