CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共65条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:39/0  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Data Generation Feedback Relearning Control for Unmodeled Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 页码: 12
作者:  Zhang, Yong;  Mu, Chaoxu;  Zhao, Dongbin
收藏  |  浏览/下载:59/0  |  提交时间:2023/11/16
Data models  Real-time systems  Heuristic algorithms  Mathematical models  Adaptation models  Approximation algorithms  Cost function  Data generation model  feedback relearning control  delayed neural network  reinforcement learning  unmodeled nonlinear system  
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning 会议论文
, Kigali City, Rwanda, Africa, 2023-5-5
作者:  Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao,Zhang;  Dongbin, Zhao;  Yuzheng, Zhuang;  Ping, Luo;  Bin, Wang;  Jianye, Hao
Adobe PDF(3492Kb)  |  收藏  |  浏览/下载:112/35  |  提交时间:2023/06/29
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文
, 昆士兰, 2023-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:196/67  |  提交时间:2023/06/29
multi-agent  reinforcement learning  policy gradient  
Multi-Agent Reinforcement Learning Based on Clustering in Two-Player Games 会议论文
, Xiamen, China, 2019-12-6
作者:  Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(488Kb)  |  收藏  |  浏览/下载:94/33  |  提交时间:2023/06/28
reinforcement learning  unsupervised clustering  matrix game  
Multi-Objective Neural Architecture Search for Light-Weight Model 会议论文
, Hangzhou, China, 22-24 November 2019
作者:  Nannan Li;  Yaran Chen;  Zixiang Ding;  Dongbin Zhao;  Zhonghua Pang;  Ruisheng Qin
Adobe PDF(430Kb)  |  收藏  |  浏览/下载:99/38  |  提交时间:2023/06/27
Neural architecture search  light-weight  multi-objective  reinforcement learning  image classification  
Benchmarking lane-changing decision-making for deep reinforcement learning 会议论文
, Guangzhou, China, 2021-11
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(1117Kb)  |  收藏  |  浏览/下载:97/36  |  提交时间:2023/05/30
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:131/53  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming 会议论文
, Budapest, Hungary, 2019-7-14
作者:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF(679Kb)  |  收藏  |  浏览/下载:52/27  |  提交时间:2023/05/22
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat 期刊论文
IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, 页码: DOI: 10.1109/TSMC.2023.3270444
作者:  Jiajun Chai;  Wenzhang Chen;  Yuanheng Zhu;  Zong-xin Yao,;  Dongbin Zhao
Adobe PDF(9249Kb)  |  收藏  |  浏览/下载:195/106  |  提交时间:2023/04/26