CASIA OpenIR

Browse/search results: 11 items in total, showing items 1-10

Stable Training of Bellman Error in Reinforcement Learning (Conference paper)
, Thailand, November 18–22
Authors:  Gong C (龚晨);  Bai YP (白云鹏);  Hou XW (侯新文);  Ji XH (季晓慧)
Adobe PDF (2416 KB)  |  Views/Downloads: 117/32  |  Submitted: 2023/06/27
Deep Behavioral Cloning for Traffic Control with Virtual Expert Demonstration Under a Parallel Learning Framework (Conference paper)
, Beijing, 2020-12
Authors:  Li Xiaoshuang;  Zhu Fenghua;  Wang Fei-Yue
Adobe PDF (770 KB)  |  Views/Downloads: 181/74  |  Submitted: 2022/06/16
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 12, Pages: 5245-5256
Authors:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF (4019 KB)  |  Views/Downloads: 353/81  |  Submitted: 2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor-critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)
Continuous-Time Time-Varying Policy Iteration (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 12, Pages: 4958-4971
Authors:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF (3149 KB)  |  Views/Downloads: 284/53  |  Submitted: 2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, Volume: 50, Issue: 11, Pages: 3959-3971
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF (2079 KB)  |  Views/Downloads: 184/4  |  Submitted: 2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
Parallel Control for Optimal Tracking via Adaptive Dynamic Programming (Journal article)
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, Volume: 7, Issue: 6, Pages: 1662-1674
Authors:  Lu, Jingwei;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF (7214 KB)  |  Views/Downloads: 343/60  |  Submitted: 2021/01/06
Adaptive dynamic programming (ADP)  nonlinear optimal control  parallel controller  parallel control theory  parallel system  tracking control  neural network (NN)  
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 6, Pages: 2064-2076
Authors:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF (4274 KB)  |  Views/Downloads: 388/118  |  Submitted: 2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation  
Research progress of parallel control and management (Journal article)
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, Volume: 7, Issue: 2, Pages: 355-367
Authors:  Xiong, Gang;  Dong, Xisong;  Lu, Hao;  Shen, Dayong
Adobe PDF (12496 KB)  |  Views/Downloads: 304/53  |  Submitted: 2020/06/02
ACP methodology  artificial systems  computational experiments  parallel control  parallel management  parallel systems  
Real-Sim-Real Transfer for Real-World Robot Control Policy Learning with Deep Reinforcement Learning (Journal article)
APPLIED SCIENCES-BASEL, 2020, Volume: 10, Issue: 5, Pages: 16
Authors:  Liu, Naijun;  Cai, Yinghao;  Lu, Tao;  Wang, Rui;  Wang, Shuo
Adobe PDF (6287 KB)  |  Views/Downloads: 264/64  |  Submitted: 2020/06/02
robot  policy learning  reality gap  simulated environment  deep reinforcement learning  
A Curiosity-Based Learning Method for Spiking Neural Networks (Journal article)
Frontiers in Computational Neuroscience, 2020, Volume: 14, Issue: 14, Pages: 7
Authors:  Shi, Mengting;  Zhang, Tielin;  Zeng, Yi
Adobe PDF (1349 KB)  |  Views/Downloads: 388/107  |  Submitted: 2020/04/27
Curiosity  Spiking Neural Network  Novelty  STDP  Voltage-driven Plasticity-centric SNN