CASIA OpenIR

浏览/检索结果: 共198条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning 期刊
创刊日期: 2018,
主办者:  Liu BY(刘博寅)
Adobe PDF(5797Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/07/12
QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:16/2  |  提交时间:2024/07/12
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:19/4  |  提交时间:2024/07/12
UNSUPERVISED LEARNING OF NEURAL SEMANTIC MAPPINGS WITH THE HUNGARIAN ALGORITHM FOR COMPOSITIONAL SEMANTICS 会议论文
, Seoul, South Korea, 2024-04
作者:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(294Kb)  |  收藏  |  浏览/下载:41/18  |  提交时间:2024/06/27
Prediction and Calibration: Complex Reasoning over Knowledge Graph with Bi-directional Directed Acyclic Graph Neural Network 会议论文
, Toronto, Canada, 2023.07.09-2023.07.14
作者:  Yao Xu;  Shizhu HE;  Li Cai;  Kang Liu;  Jun Zhao
Adobe PDF(628Kb)  |  收藏  |  浏览/下载:33/11  |  提交时间:2024/06/20
Query2Triple: Unified Query Encoding for Answering Diverse Complex Queries over Knowledge Graphs 会议论文
, Singapore, 2023.11.06-2023.11.10
作者:  Yao Xu;  Shizhu HE;  Cunguang Wang;  Li Cai;  Kang Liu;  Jun Zhao
Adobe PDF(811Kb)  |  收藏  |  浏览/下载:33/11  |  提交时间:2024/06/20
Immersion and Invariance Based Composite Adaptive Control for Nonlinear Systems with Both Parametric and Non-Parametric Uncertainties 会议论文
, Berlin, Germany, 2020.7.12-17
作者:  Zhen Liu;  Zhiqiang Pu;  Tenghai Qiu;  Huimu Wang;  Jianqiang Yi
Adobe PDF(1554Kb)  |  收藏  |  浏览/下载:44/16  |  提交时间:2024/06/20
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:31/8  |  提交时间:2024/06/11
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:66/22  |  提交时间:2024/06/05
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文
, Padua, Italy, 2022年07月
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  收藏  |  浏览/下载:40/12  |  提交时间:2024/06/05