CASIA OpenIR

浏览/检索结果: 共202条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:7/1  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Learning and Controlling Multiscale Dynamics in Spiking Neural Networks Using Recursive Least Square Modifications 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 页码: 14
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(8060Kb)  |  收藏  |  浏览/下载:37/0  |  提交时间:2024/03/27
Direct dynamic programming (DDP)  Lorenz system  multiscale dynamics  point-to-point control  recursive least square (RLS)  spiking neural network (SNN)  
Autonomous Recovery Control of Biomimetic Robotic Fish Based on Multi-Sensory System 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 卷号: 9, 期号: 2, 页码: 1420-1427
作者:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(4262Kb)  |  收藏  |  浏览/下载:35/1  |  提交时间:2024/03/26
Robots  Wires  Tail  Robot sensing systems  Task analysis  Springs  Sensors  Autonomous recovery  motion control  robotic fish  underwater perception  
Hedonic Coalition Formation for Distributed Task Allocation in Heterogeneous Multi-agent System 期刊论文
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 页码: 13
作者:  Wang, Lexing;  Qiu, Tenghai;  Pu, Zhiqiang;  Yi, Jianqiang;  Zhu, Jinying;  Yuan, Wanmai
Adobe PDF(2578Kb)  |  收藏  |  浏览/下载:75/0  |  提交时间:2024/03/13
Coalition formation  hedonic games  heterogeneous agents  Nash stable  task allocation  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:63/2  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Network-Wide Traffic Signal Control Based on MARL With Hierarchical Nash-Stackelberg Game Model 期刊论文
IEEE ACCESS, 2023, 卷号: 11, 页码: 145085-145100
作者:  Shen, Hui;  Zhao, Hongxia;  Zhang, Zundong;  Yang, Xun;  Song, Yutong;  Liu, Xiaoming
收藏  |  浏览/下载:27/0  |  提交时间:2024/02/22
Games  Roads  Approximation algorithms  Q-learning  Multi-agent systems  Process control  Optimization  Reinforcement learning  Traffic control  Network-wide traffic signal control  hierarchical game model  multi-agent reinforcement learning  
Event-Triggered-Based Consensus Neural Network Tracking Control for Nonlinear Pure-Feedback Multiagent Systems With Delayed Full-State Constraints 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 页码: 11
作者:  Wang, Xiao-An;  Zhang, Guang-Ju;  Niu, Ben;  Wang, Ding;  Wang, Xiao-Mei
收藏  |  浏览/下载:56/0  |  提交时间:2024/02/22
Nonlinear multiagent systems  delayed full state constraints  event-triggered design  asymptotic tracking control  adaptive control  
Multiagent-Reinforcement-Learning-Based Stable Path Tracking Control for a Bionic Robotic Fish With Reaction Wheel 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 卷号: 70, 期号: 12, 页码: 12670-12679
作者:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(1587Kb)  |  收藏  |  浏览/下载:131/1  |  提交时间:2023/11/17
Multiagent reinforcement learning (MARL)  path tracking control  reaction wheel  robotic fish  underwater robot  
Distributed Formation Control for a Multirobotic Fish System With Model-Based Event-Triggered Communication Mechanism 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 卷号: 70, 期号: 11, 页码: 11433-11442
作者:  Dai, Shijie;  Wu, Zhengxing;  Zhang, Pengfei;  Tan, Min;  Yu, Junzhi
收藏  |  浏览/下载:96/0  |  提交时间:2023/11/17
Formation control  model-based eventtriggered mechanism (ETM)  robotic fish  underwater multiagent system  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:48/0  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow