CASIA OpenIR

Browse/Search results: 26 items in total, showing items 1-10

Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  Views/Downloads: 8/5  |  Submitted: 2024/06/25
M3: Modularization for Multi-task and Multi-agent Offline Pre-training (Conference paper)
London, United Kingdom, 2023.5.29-2023.6.2
Authors:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  Views/Downloads: 20/5  |  Submitted: 2024/06/11
Learning Heterogeneous Agent Cooperation via Multiagent League Training (Journal article)
IFAC World Congress, 2023, Pages: IFAC PapersOnLine 56-2 (2023) 3033-3040
Authors:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  Views/Downloads: 26/7  |  Submitted: 2024/06/05
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language (Conference paper)
Beijing Huateng Mercure Hotel (北京华腾美居酒店), 2023-12-9
Authors:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  Views/Downloads: 16/6  |  Submitted: 2024/06/03
Locomotion Optimization of a Tendon-Driven Robotic Fish with Variable Passive Tail Fin (Journal article)
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, Pages: 4983-4992
Authors:  Qiu CL(邱常林);  Wu ZX(吴正兴);  Wang J(王健);  Tan M(谭民);  Yu JZ(喻俊志)
Adobe PDF(1023Kb)  |  Views/Downloads: 41/17  |  Submitted: 2024/05/29
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning (Conference paper)
Queensland, Australia, 2023-6
Authors:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  Views/Downloads: 42/17  |  Submitted: 2024/05/29
SOTVerse: A User-Defined Task Space of Single Object Tracking (Journal article)
International Journal of Computer Vision, 2023, Volume: 132, Issue: 3, Pages: 1-59
Authors:  Shiyu, Hu;  Xin, Zhao;  Kaiqi Huang
Adobe PDF(53048Kb)  |  Views/Downloads: 73/6  |  Submitted: 2024/01/22
Single object tracking  Experimental environment  Evaluation system  Performance analysis  
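A Survey of Visual Intelligence Evaluation Techniques for Single Object Tracking (单目标跟踪中的视觉智能评估技术综述) (Journal article)
Journal of Image and Graphics (中国图象图形学报), 2023, Pages: 1-30
Authors:  Hu Shiyu (胡世宇);  Zhao Xin (赵鑫);  Huang Kaiqi (黄凯奇)
Adobe PDF(10669Kb)  |  Views/Downloads: 145/39  |  Submitted: 2024/01/22
Intelligent evaluation techniques  Competitions and datasets  Visual tracking ability  Single object tracking  Object tracking algorithms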
Multiagent-Reinforcement-Learning-Based Stable Path Tracking Control for a Bionic Robotic Fish With Reaction Wheel (Journal article)
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, Volume: 70, Issue: 12, Pages: 12670-12679
Authors:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(1587Kb)  |  Views/Downloads: 159/10  |  Submitted: 2023/11/17
Multiagent reinforcement learning (MARL)  path tracking control  reaction wheel  robotic fish  underwater robot  
Large sequence models for sequential decision-making: a survey (Journal article)
FRONTIERS OF COMPUTER SCIENCE, 2023, Volume: 17, Issue: 6, Pages: 18
Authors:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  Views/Downloads: 138/1  |  Submitted: 2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system