CASIA OpenIR

浏览/检索结果: 共107条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering 会议论文
, Torino, Italia, 2024-5
作者:  Wang, Chenhao;  Cao, Pengfei;  Li, Jiachun;  Chen, Yubo;  Liu, Kang;  Jiang, Xiaojian;  Xu, Jiexin;  Li, Qiuxia;  Jun Zhao
Adobe PDF(909Kb)  |  收藏  |  浏览/下载:3/1  |  提交时间:2024/05/30
Human-robot object handover: Recent progress and future direction 期刊论文
Biomimetic Intelligence and Robotics, 2024, 卷号: 4, 页码: 100145
作者:  Duan, Haonan;  Yang, Yifan;  Li, Daheng;  Wang, Peng
Adobe PDF(1839Kb)  |  收藏  |  浏览/下载:3/2  |  提交时间:2024/05/29
Human–robot interactions  Object handover  
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:6/0  |  提交时间:2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:5/0  |  提交时间:2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:6/1  |  提交时间:2024/05/28
Learning to Coordinate via Multiple Graph Neural Networks 会议论文
, BALI, Indonesia, December 8-12, 2021
作者:  Zhiwei Xu;  Bin Zhang;  Yunpeng Bai;  Dapeng Li;  Guoliang Fan
Adobe PDF(2047Kb)  |  收藏  |  浏览/下载:9/4  |  提交时间:2024/05/28
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/05/28
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:139/41  |  提交时间:2023/06/28