CASIA OpenIR

浏览/检索结果: 共31条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model 期刊论文
IEEE Transactions on Intelligent Transportation Systems, 2024, 页码: 10.1109/TITS.2024.3400227
作者:  Zeyu Gao;  Yao Mu;  Chen Chen;  Jingliang Duan;  Ping Luo;  Yanfeng Lu;  Shengbo Eben Li
Adobe PDF(3954Kb)  |  收藏  |  浏览/下载:42/16  |  提交时间:2024/06/06
End-to-end autonomous driving  deep reinforcement learning  world model  
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:44/8  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
MOT: A Mixture of Actors Reinforcement Learning Method by Optimal Transport for Algorithmic Trading 会议论文
, 台湾台北, 20240507-20240510
作者:  Cheng X(程曦);  Zhang JH(张景昊);  Ceng YN(曾宇楠);  Xue WF(薛文芳)
Adobe PDF(739Kb)  |  收藏  |  浏览/下载:44/12  |  提交时间:2024/06/03
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:54/19  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Robotics Dexterous Grasping: The Methods Based on Point Cloud and Deep Learning 期刊论文
Frontiers in Neurorobotics, 2021, 卷号: 15, 页码: 658280
作者:  Duan, Haonan;  Wang, Peng;  Huang, Yayu;  Xu, Guangyun;  Wei, Wei;  Shen, Xiaofei
Adobe PDF(3145Kb)  |  收藏  |  浏览/下载:42/14  |  提交时间:2024/05/29
Robotics  Dexterous grasping  Point Cloud  Deep learning  
Multi-task safe reinforcement learning for navigating intersections in dense traffic 期刊论文
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 卷号: 360, 期号: 17, 页码: 13737-13760
作者:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  收藏  |  浏览/下载:88/18  |  提交时间:2024/02/22
Hierarchical Policy Learning With Demonstration Learning for Robotic Multiple Peg-in-Hole Assembly Tasks 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 卷号: 19, 期号: 10, 页码: 10254-10264
作者:  Yan, Shaohua;  Xu, De;  Tao, Xian
Adobe PDF(4845Kb)  |  收藏  |  浏览/下载:134/13  |  提交时间:2023/11/17
Assembly model  demonstration learning (DL)  force-based control algorithm  hierarchical reinforcement learning (HRL)  peg-in-hole assembly  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:152/5  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
A Survey on Reinforcement Learning Methods in Bionic Underwater Robots 期刊论文
BIOMIMETICS, 2023, 卷号: 8, 期号: 2, 页码: 29
作者:  Tong, Ru;  Feng, Yukai;  Wang, Jian;  Wu, Zhengxing;  Tan, Min;  Yu, Junzhi
Adobe PDF(1260Kb)  |  收藏  |  浏览/下载:149/19  |  提交时间:2023/11/17
bionic underwater robot  reinforcement learning  robotic fish  intelligent control  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:62/3  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow