CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks 会议论文
, Queensland, Australia, 2023-6
作者:  Hu GZ(胡光政);  Li HR(李浩然);  Liu SS(刘莎莎);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(2785Kb)  |  收藏  |  浏览/下载:29/8  |  提交时间:2024/07/04
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473
作者:  Liu MS(刘民颂);  Li LT(李伦通);  Hao S(郝帅);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:33/11  |  提交时间:2024/06/24
Multi-task safe reinforcement learning for navigating intersections in dense traffic 期刊论文
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 卷号: 360, 期号: 17, 页码: 13737-13760
作者:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  收藏  |  浏览/下载:80/16  |  提交时间:2024/02/22
Conditional Goal-Oriented Trajectory Prediction for Interacting Vehicles 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Li, Ding;  Zhang, Qichao;  Lu, Shuai;  Pan, Yifeng;  Zhao, Dongbin
收藏  |  浏览/下载:161/0  |  提交时间:2023/12/21
Trajectory  Predictive models  Behavioral sciences  Pipelines  Task analysis  Feature extraction  Vehicle dynamics  Conditional prediction  goal-oriented trajectory prediction  hierarchical vectorized representation  joint trajectory prediction  marginal prediction  
ABCP: Automatic Blockwise and Channelwise Network Pruning via Joint Search 期刊论文
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 卷号: 15, 期号: 3, 页码: 1560-1573
作者:  Li, Jiaqi;  Li, Haoran;  Chen, Yaran;  Ding, Zixiang;  Li, Nannan;  Ma, Mingjun;  Duan, Zicheng;  Zhao, Dongbin
收藏  |  浏览/下载:143/0  |  提交时间:2023/12/21
Joint search  model compression  pruning  reinforcement learning  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:61/3  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Data Generation Feedback Relearning Control for Unmodeled Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 页码: 12
作者:  Zhang, Yong;  Mu, Chaoxu;  Zhao, Dongbin
收藏  |  浏览/下载:108/0  |  提交时间:2023/11/16
Data models  Real-time systems  Heuristic algorithms  Mathematical models  Adaptation models  Approximation algorithms  Cost function  Data generation model  feedback relearning control  delayed neural network  reinforcement learning  unmodeled nonlinear system  
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning 会议论文
, Kigali City, Rwanda, Africa, 2023-5-5
作者:  Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao,Zhang;  Dongbin, Zhao;  Yuzheng, Zhuang;  Ping, Luo;  Bin, Wang;  Jianye, Hao
Adobe PDF(3492Kb)  |  收藏  |  浏览/下载:171/46  |  提交时间:2023/06/29
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:253/81  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
Adaptive Search for Broad Attention based Vision Transformers 期刊论文
IEEE Transactions on Evolutionary Computation, 2023, 页码: 0-0
作者:  Nannan Li;  Yaran Chen;  Dongbin Zhao
Adobe PDF(824Kb)  |  收藏  |  浏览/下载:192/60  |  提交时间:2023/06/28