CASIA OpenIR

浏览/检索结果: 共33条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:41/17  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:23/11  |  提交时间:2024/06/25
A cooperation and decision-making framework in dynamic confrontation for multi-agent systems 期刊论文
Computers and Electrical Engineering, 2024, 卷号: 118, 页码: 118
作者:  Lexing Wang;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:52/15  |  提交时间:2024/06/06
Multi-agent system  Target allocation  Decision making  Swarm motion control  
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:46/13  |  提交时间:2024/06/05
Hedonic Coalition Formation for Distributed Task Allocation in Heterogeneous Multi-agent System 期刊论文
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 页码: 13
作者:  Wang, Lexing;  Qiu, Tenghai;  Pu, Zhiqiang;  Yi, Jianqiang;  Zhu, Jinying;  Yuan, Wanmai
Adobe PDF(2578Kb)  |  收藏  |  浏览/下载:124/17  |  提交时间:2024/03/13
Coalition formation  hedonic games  heterogeneous agents  Nash stable  task allocation  
基于噪声对比估计的权重自适应对抗生成式模仿学习 期刊论文
模式识别与人工智能, 2023, 卷号: 36, 期号: 4, 页码: 300-312
作者:  关伟凡;  张希
Adobe PDF(1849Kb)  |  收藏  |  浏览/下载:146/49  |  提交时间:2023/06/29
强化学习  模仿学习  噪声对比估计  自适应权重  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:178/35  |  提交时间:2023/06/21
基于分布式协商的多无人机目标覆盖与连通保持算法 期刊论文
指挥与控制学报, 2022, 卷号: 8, 期号: 2, 页码: 9
作者:  陶忠良;  蒲志强;  王乐乐;  付清旭;  耿虎军
Adobe PDF(2704Kb)  |  收藏  |  浏览/下载:188/89  |  提交时间:2023/06/14
多无人机  多目标覆盖  ,连通保持  分布式协商  
Robot Subgoal-guided Navigation in Dynamic Crowded Environments with Hierarchical Deep Reinforcement Learning 期刊论文
International Journal of Control, Automation and Systems, 2023, 页码: 1-13
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Liang YY(梁延研);  Zhang D(张渡)
Adobe PDF(6472Kb)  |  收藏  |  浏览/下载:191/41  |  提交时间:2023/06/12
Peer Incentive Reinforcement Learning for Cooperative Multi-Agent Games 期刊论文
IEEE Transactions on Games, 2022, 页码: 1-14
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(18835Kb)  |  收藏  |  浏览/下载:136/34  |  提交时间:2023/06/12