CASIA OpenIR

浏览/检索结果: 共17条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:12/5  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:12/5  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:11/6  |  提交时间:2024/06/25
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:20/5  |  提交时间:2024/06/11
Interpreting Sentiment Composition with Latent Semantic Tree 会议论文
, Toronto, Canada, 2023-7-9
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jiansong Chen;  Jun Zhao;  Kang Liu
Adobe PDF(509Kb)  |  收藏  |  浏览/下载:35/16  |  提交时间:2024/06/06
A cooperation and decision-making framework in dynamic confrontation for multi-agent systems 期刊论文
Computers and Electrical Engineering, 2024, 页码: 118
作者:  Lexing Wang;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:29/7  |  提交时间:2024/06/06
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process 会议论文
, Singapore, 2023-12
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(592Kb)  |  收藏  |  浏览/下载:45/19  |  提交时间:2024/05/30
Hedonic Coalition Formation for Distributed Task Allocation in Heterogeneous Multi-agent System 期刊论文
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 页码: 13
作者:  Wang, Lexing;  Qiu, Tenghai;  Pu, Zhiqiang;  Yi, Jianqiang;  Zhu, Jinying;  Yuan, Wanmai
Adobe PDF(2578Kb)  |  收藏  |  浏览/下载:109/13  |  提交时间:2024/03/13
Coalition formation  hedonic games  heterogeneous agents  Nash stable  task allocation  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:141/2  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
Deep reinforcement learning based multi-target coverage with connectivity guaranteed 期刊论文
IEEE Transactions on Industrial Informatics, 2022, 期号: 2022, 页码: 1-12
作者:  Shiguang Wu;  Zhiqiang Pu;  Tenghai Qiu;  Jianqiang Yi;  Tianle Zhang
Adobe PDF(15731Kb)  |  收藏  |  浏览/下载:281/33  |  提交时间:2022/04/02
Multi-target coverage  multi-robot system  connectivity maintenance  deep reinforcement learning