CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:  Wei, Qinglai;  Zhou, Tianmin;  Lu, Jingwei;  Liu, Yu;  Su, Shuai;  Xiao, Jun
收藏  |  浏览/下载:97/0  |  提交时间:2023/11/17
Adaptive dynamic programming (ADP)  Hamilton-Jacobi-Bellman equation (HJBE)  nonlinear stochastic system  stochastic policy iteration (PI)  
Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 卷号: 51, 期号: 6, 页码: 2929-2943
作者:  Song, Ruizhuo;  Wei, Qinglai;  Zhang, Huaguang;  Lewis, Frank L.
收藏  |  浏览/下载:194/0  |  提交时间:2021/08/15
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  discrete-time  nonzero-sum (NZS)  off-policy  reinforcement learning (RL)  
Consensus Control of Leader-Following Multi-Agent Systems in Directed Topology With Heterogeneous Disturbances 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2021, 卷号: 8, 期号: 2, 页码: 423-431
作者:  Wei, Qinglai;  Wang, Xin;  Zhong, Xiangnan;  Wu, Naiqi
Adobe PDF(4423Kb)  |  收藏  |  浏览/下载:278/42  |  提交时间:2021/03/08
Consensus control  directed topology  external disturbance  multi-agent (MA) systems  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:314/75  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Real-Sim-Real Transfer for Real-World Robot Control Policy Learning with Deep Reinforcement Learning 期刊论文
APPLIED SCIENCES-BASEL, 2020, 卷号: 10, 期号: 5, 页码: 16
作者:  Liu, Naijun;  Cai, Yinghao;  Lu, Tao;  Wang, Rui;  Wang, Shuo
浏览  |  Adobe PDF(6287Kb)  |  收藏  |  浏览/下载:247/60  |  提交时间:2020/06/02
robot  policy learning  reality gap  simulated environment  deep reinforcement learning  
Computational modeling of Emotion-motivated Decisions for Continuous Control of Mobile Robots 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2020, 卷号: 13, 期号: 2020, 页码: 1-14
作者:  Huang, Xiao;  Wu, Wei;  Qiao, Hong
浏览  |  Adobe PDF(5970Kb)  |  收藏  |  浏览/下载:239/85  |  提交时间:2020/06/09
Brain-inspired Computing  Emotion-motivated Learning  Emotion-memory Interactions  Decision-making  Reinforcement Learning  
Manipulation Skill Learning on Multi-step Complex Task Based on Explicit and Implicit Curriculum Learning 期刊论文
SCIENCE CHINA Information Sciences, 2020, 卷号: 0, 期号: 0, 页码: 0-0
作者:  Liu, Naijun;  Lu, Tao;  Cai, Yinghao;  Wang, Rui;  Wang, Shuo
浏览  |  Adobe PDF(2456Kb)  |  收藏  |  浏览/下载:160/66  |  提交时间:2020/09/27
robot  manipulation skill learning  multi-step complex task  curriculum learning  
Policy Iteration Algorithm Based Fault Tolerant Tracking Control: An Implementation on Reconfigurable Manipulators 期刊论文
JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2018, 卷号: 13, 期号: 4, 页码: 1739-1750
作者:  Li, Yuanchun;  Xia, Hongbing;  Zhao, Bo
浏览  |  Adobe PDF(708Kb)  |  收藏  |  浏览/下载:336/56  |  提交时间:2018/10/10
Adaptive dynamic programming  Policy iteration  Fault tolerant tracking control  Reconfigurable manipulators  Neural network