CASIA OpenIR

Browse/Search results: 15 items in total, showing items 1-10

Learning Heterogeneous Agent Cooperation via Multiagent League Training (Journal article)
IFAC World Congress, 2023, Pages: IFAC PapersOnLine 56-2 (2023) 3033-3040
Authors:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF (996Kb)  |  Views/Downloads: 18/4  |  Submitted: 2024/06/05
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning (Journal article)
Connection Science, 2023, Volume: 35, Issue: 1, Pages: 2174078
Authors:  Qiu JY (邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF (831Kb)  |  Views/Downloads: 20/6  |  Submitted: 2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Volume: 35, Issue: 3, Pages: 3251-3264
Authors:  Wei, Qinglai;  Li, Tao
Adobe PDF (8471Kb)  |  Views/Downloads: 27/9  |  Submitted: 2024/05/28
Adaptive dynamic programming  approximate dynamic programming  constrained cost  optimal control  reinforcement learning  
Dynamic Movement Primitives Based Robot Skills Learning (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 3, Pages: 396-407
Authors:  Ling-Huan Kong;  Wei He;  Wen-Shi Chen;  Hui Zhang;  Yao-Nan Wang
Adobe PDF (3181Kb)  |  Views/Downloads: 37/10  |  Submitted: 2024/04/23
Dynamic movement primitives (DMPs), trajectory tracking control, robot learning from demonstrations, neural networks (NNs), adaptive control  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Volume: 54, Issue: 3, Pages: 1414-1426
Authors:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF (784Kb)  |  Views/Downloads: 85/9  |  Submitted: 2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Network-Wide Traffic Signal Control Based on MARL With Hierarchical Nash-Stackelberg Game Model (Journal article)
IEEE ACCESS, 2023, Volume: 11, Pages: 145085-145100
Authors:  Shen, Hui;  Zhao, Hongxia;  Zhang, Zundong;  Yang, Xun;  Song, Yutong;  Liu, Xiaoming
Views/Downloads: 36/0  |  Submitted: 2024/02/22
Games  Roads  Approximation algorithms  Q-learning  Multi-agent systems  Process control  Optimization  Reinforcement learning  Traffic control  Network-wide traffic signal control  hierarchical game model  multi-agent reinforcement learning  
Event-Triggered-Based Consensus Neural Network Tracking Control for Nonlinear Pure-Feedback Multiagent Systems With Delayed Full-State Constraints (Journal article)
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, Pages: 11
Authors:  Wang, Xiao-An;  Zhang, Guang-Ju;  Niu, Ben;  Wang, Ding;  Wang, Xiao-Mei
Views/Downloads: 67/0  |  Submitted: 2024/02/22
Nonlinear multiagent systems  delayed full state constraints  event-triggered design  asymptotic tracking control  adaptive control  
Multiagent-Reinforcement-Learning-Based Stable Path Tracking Control for a Bionic Robotic Fish With Reaction Wheel (Journal article)
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, Volume: 70, Issue: 12, Pages: 12670-12679
Authors:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF (1587Kb)  |  Views/Downloads: 147/7  |  Submitted: 2023/11/17
Multiagent reinforcement learning (MARL)  path tracking control  reaction wheel  robotic fish  underwater robot  
Distributed Formation Control for a Multirobotic Fish System With Model-Based Event-Triggered Communication Mechanism (Journal article)
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, Volume: 70, Issue: 11, Pages: 11433-11442
Authors:  Dai, Shijie;  Wu, Zhengxing;  Zhang, Pengfei;  Tan, Min;  Yu, Junzhi
Views/Downloads: 107/0  |  Submitted: 2023/11/17
Formation control  model-based event-triggered mechanism (ETM)  robotic fish  underwater multiagent system
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 13
Authors:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF (2469Kb)  |  Views/Downloads: 52/0  |  Submitted: 2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow