CASIA OpenIR

浏览/检索结果: 共100条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:18/2  |  提交时间:2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:35/15  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Generative Calibration for In-context Learning 会议论文
, Singapore, 2023-10-6
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(763Kb)  |  收藏  |  浏览/下载:38/17  |  提交时间:2024/06/06
Alignment Rationale for Natural Language Inference 会议论文
, Online, 2021-8-1
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Zhao Yang;  Jun Zhao;  Kang Liu
Adobe PDF(1280Kb)  |  收藏  |  浏览/下载:42/14  |  提交时间:2024/06/06
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文
, Gold Coast, Australia, 2023-6
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:44/10  |  提交时间:2024/06/05
T-Agent: A Term-Aware Agent for Medical Dialogue Generation 会议论文
, Yokohama, Japan, 2024-6-30 - 2023-7-5
作者:  Zefa Hu;  Haozhi Zhao;  Yuanyuan Zhao;  Shuang Xu;  Bo Xu
Adobe PDF(483Kb)  |  收藏  |  浏览/下载:54/16  |  提交时间:2024/05/29
A Wire-driven Elastic Robotic Fish and its Design and CPG-Based Control 期刊论文
Journal of Intelligent & Robotic Systems, 2023, 卷号: 107, 期号: 1, 页码: 4
作者:  Xiaocun Liao;  Chao Zhou;  Jian Wang;  Junfeng Fan;  Zhuoliang Zhang
Adobe PDF(1749Kb)  |  收藏  |  浏览/下载:53/18  |  提交时间:2024/05/28
Robotic fish  Wire-driven mode  Elastic component  Kinematics model  Body wave  
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:53/15  |  提交时间:2024/05/28
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:47/19  |  提交时间:2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:37/7  |  提交时间:2024/05/28