CASIA OpenIR

浏览/检索结果: 共54条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Improved Self-Propelled Swarms Model with Enhanced Convergence Efficiency 会议论文
, Tianjing, China, 2020
作者:  Boyin Liu;  Zhiqiang Pu;  Shiguang Wu;  Lele Wang
Adobe PDF(210Kb)  |  收藏  |  浏览/下载:35/15  |  提交时间:2024/07/12
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:52/23  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:42/12  |  提交时间:2024/06/11
A cooperation and decision-making framework in dynamic confrontation for multi-agent systems 期刊论文
Computers and Electrical Engineering, 2024, 卷号: 118, 页码: 118
作者:  Lexing Wang;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:65/21  |  提交时间:2024/06/06
Multi-agent system  Target allocation  Decision making  Swarm motion control  
Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems 会议论文
, online, 2022
作者:  Qingxu Fu;  Tenghai Qiu;  Jianqiang Yi;  Zhiqiang Pu;  Shiguang Wu
Adobe PDF(5807Kb)  |  收藏  |  浏览/下载:50/18  |  提交时间:2024/06/05
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:71/23  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning 期刊论文
BIOMIMETICS, 2024, 卷号: 9, 期号: 1, 页码: 19
作者:  Wang, Yu;  Wang, Jian;  Kang, Song;  Yu, Junzhi
Adobe PDF(1553Kb)  |  收藏  |  浏览/下载:98/24  |  提交时间:2024/03/26
biomimetic motion  biomimetic autonomous system  target following  deep reinforcement learning  predictive control  
SOTVerse: A User-Defined Task Space of Single Object Tracking 期刊论文
International Journal of Computer Vision, 2023, 卷号: 132, 期号: 3, 页码: 1-59
作者:  Shiyu, Hu;  Xin, Zhao;  Kaiqi Huang
Adobe PDF(53048Kb)  |  收藏  |  浏览/下载:98/12  |  提交时间:2024/01/22
Single object tracking  Experimental environment  Evaluation system  Performance analysis  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:164/9  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system