CASIA OpenIR

浏览/检索结果: 共89条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文
, Online, February 22–March 1, 2022
作者:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:27/10  |  提交时间:2024/06/11
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文
, 北京华腾美居酒店, 2023-12-9
作者:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  收藏  |  浏览/下载:23/8  |  提交时间:2024/06/03
Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering 会议论文
, Torino, Italia, 2024-5
作者:  Wang, Chenhao;  Cao, Pengfei;  Li, Jiachun;  Chen, Yubo;  Liu, Kang;  Jiang, Xiaojian;  Xu, Jiexin;  Li, Qiuxia;  Jun Zhao
Adobe PDF(909Kb)  |  收藏  |  浏览/下载:35/8  |  提交时间:2024/05/30
CN-AutoMIC: Distilling Chinese Commonsense Knowledge from Pretrained Language Models 会议论文
, Abu Dhabi, United Arab Emirates, 2022-12
作者:  Wang, Chenhao;  Li, Jiachun;  Chen, Yubo;  Liu, Kang;  Zhao, Jun
Adobe PDF(848Kb)  |  收藏  |  浏览/下载:31/8  |  提交时间:2024/05/30
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:46/19  |  提交时间:2024/05/29
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:29/7  |  提交时间:2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:28/5  |  提交时间:2024/05/28
Design of a Robotic Fish Based on a Passive Flexible Mechanism 会议论文
, 云南大理, 2019.12.6
作者:  Lu Ben;  Yuzhuo Fu;  Qianqian Zou;  Sai Deng;  Chao Zhou
Adobe PDF(10378Kb)  |  收藏  |  浏览/下载:34/10  |  提交时间:2024/05/28
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:16/10  |  提交时间:2024/05/28
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship 会议论文
, New Orleans, 2023-12
作者:  Shiyu, Hu;  Dailing, Zhang;  Meiqi, Wu;  Xiaokun, Feng;  Xuchen, Li;  Xin, Zhao;  Kaiqi, Huang
Adobe PDF(6215Kb)  |  收藏  |  浏览/下载:115/26  |  提交时间:2024/01/22