CASIA OpenIR

浏览/检索结果: 共112条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:14/5  |  提交时间:2024/06/25
强化学习,分层强化学习  
MULFE: A Multi-Level Benchmark for Free Text Model Editing 会议论文
, Bangkok, Thailand, 2024-08
作者:  Wang, Chenhao;  Cao, Pengfei;  Jin, Zhuoran;  Chen, Yubo;  Zeng, Daojian;  Liu, Kang;  Zhao, Jun
Adobe PDF(571Kb)  |  收藏  |  浏览/下载:13/5  |  提交时间:2024/06/25
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:33/13  |  提交时间:2024/06/11
A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文
, Seoul, Korea, 2024.4.14-2024.4.19
作者:  Meng Linghui;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(964Kb)  |  收藏  |  浏览/下载:29/10  |  提交时间:2024/06/11
Generative Calibration for In-context Learning 会议论文
, Singapore, 2023-10-6
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(763Kb)  |  收藏  |  浏览/下载:28/10  |  提交时间:2024/06/06
Alignment Rationale for Natural Language Inference 会议论文
, Online, 2021-8-1
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Zhao Yang;  Jun Zhao;  Kang Liu
Adobe PDF(1280Kb)  |  收藏  |  浏览/下载:31/12  |  提交时间:2024/06/06
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:53/17  |  提交时间:2024/06/05
Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems 会议论文
, online, 2022
作者:  Qingxu Fu;  Tenghai Qiu;  Jianqiang Yi;  Zhiqiang Pu;  Shiguang Wu
Adobe PDF(5807Kb)  |  收藏  |  浏览/下载:31/9  |  提交时间:2024/06/05
Information bottleneck based knowledge selection for commonsense reasoning 期刊论文
Information Sciences, 2024, 卷号: 660, 页码: 120134
作者:  Zhao Yang;  Yuanzhe Zhang;  Pengfei Cao;  Cao Liu;  Jiansong Chen;  Jun Zhao;  Kang Liu
Adobe PDF(1069Kb)  |  收藏  |  浏览/下载:40/13  |  提交时间:2024/05/30
Commonsense reasoning  Knowledge selection  Information bottleneck  KG-augmented model  
Toward the Intelligent, Safe Exploration of a Biomimetic Underwater Robot: Modeling, Planning, and Control 期刊论文
Biomimetics, 2024, 期号: 9, 页码: 126
作者:  Wang, Yu;  Wang, Jian;  Yu Lianyi;  Kong Shihan;  Yu Junzhi
Adobe PDF(1171Kb)  |  收藏  |  浏览/下载:43/13  |  提交时间:2024/05/30