CASIA OpenIR

Browse/search results: 107 items in total, items 1-10 shown

An Empirical Study on Google Research Football Multi-agent Scenarios  |  Journal article
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 549-570
Authors:  Yan Song;  He Jiang;  Zheng Tian;  Haifeng Zhang;  Yingping Zhang;  Jiangcheng Zhu;  Zonghong Dai;  Weinan Zhang;  Jun Wang
Adobe PDF(24588Kb)  |  Views/Downloads: 0/0  |  Submitted: 2024/05/23
Multi-agent reinforcement learning (RL), distributed RL system, population-based training, reward shaping, game theory  
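One-Step Optimal Control under Electrode Current Saturation Constraints for the Fused Magnesia Smelting Process (电熔镁砂熔炼过程电极电流饱和约束一步最优控制)  |  Journal article
Acta Automatica Sinica (自动化学报), 2022, Volume: 48, Issue: 1, Pages: 239-248
Authors:  富月;  李宝
Adobe PDF(2482Kb)  |  Views/Downloads: 4/2  |  Submitted: 2024/05/20
Fused magnesia  saturation constraint  discrete-time nonlinear system  one-step optimal control  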
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms  |  Journal article
NEURAL NETWORKS, 2024, Volume: 169, Pages: 764-777
Authors:  Chen, Yurou;  Zhang, Fengyi;  Liu, Zhiyong
Views/Downloads: 26/0  |  Submitted: 2024/02/22
Reinforcement Learning  Policy gradient  Actor-critic  Value function  Bias-variance trade-off  
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications  |  Journal article
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 1, Pages: 18-36
Authors:  Ding Wang;  Ning Gao;  Derong Liu;  Jinna Li;  Frank L. Lewis
Adobe PDF(1945Kb)  |  Views/Downloads: 257/184  |  Submitted: 2024/01/02
Adaptive dynamic programming (ADP)  advanced control  complex environment  data-driven control  event-triggered design  intelligent control  neural networks  nonlinear systems  optimal control  reinforcement learning (RL)  
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems  |  Conference paper
, Queensland, Australia, June 18-23, 2023
Authors:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Ai XL(艾晓琳);  Yuan GM(袁莞迈)
Adobe PDF(612Kb)  |  Views/Downloads: 151/47  |  Submitted: 2023/06/12
Multiexperience-Assisted Efficient Multiagent Reinforcement Learning  |  Journal article
IEEE Transactions on Neural Networks and Learning Systems, 2023, Pages: 1-15
Authors:  Zhang TL(张天乐);  Liu Z(刘振);  Yi JQ(易建强);  Wu SG(吴士广);  Pu ZQ(蒲志强);  Zhao YJ(赵彦杰)
Adobe PDF(2718Kb)  |  Views/Downloads: 254/91  |  Submitted: 2023/06/02
Terrain-Adaptive Longitudinal Control for Autonomous Trucks  |  Conference paper
, Macau, China, 2022.10.08
Authors:  Xiaoyu Xiong;  Bin Tian;  Rui Zhang;  Yang Sun;  Long Chen
Adobe PDF(2543Kb)  |  Views/Downloads: 86/29  |  Submitted: 2023/05/06
A Data-Based Feedback Relearning Algorithm for Uncertain Nonlinear Systems  |  Journal article
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 5, Pages: 1288-1303
Authors:  Chaoxu Mu;  Yong Zhang;  Guangbin Cai;  Ruijun Liu;  Changyin Sun
Adobe PDF(4205Kb)  |  Views/Downloads: 216/108  |  Submitted: 2023/04/26
Data episodes  experience replay  neural networks  reinforcement learning (RL)  uncertain systems  
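Research on Multi-Agent Intelligent Game Decision-Making Algorithms for Wargaming (面向兵棋推演的多智能体智能博弈决策算法研究)  |  Thesis
, 2023
Author:  余照科
Adobe PDF(15273Kb)  |  Views/Downloads: 739/34  |  Submitted: 2023/01/31
Wargaming  intelligent decision-making  multi-agent  deep reinforcement learning  distributed training  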
Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features  |  Journal article
IEEE/CAA Journal of Automatica Sinica, 2022, Volume: 9, Issue: 9, Pages: 1673-1686
Authors:  Wenzhang Liu;  Lu Dong;  Dan Niu;  Changyin Sun
Adobe PDF(5554Kb)  |  Views/Downloads: 151/68  |  Submitted: 2022/08/19
Knowledge transfer  multi-agent systems  reinforcement learning  successor features