CASIA OpenIR

Browse/search results: 11 items in total, showing items 1-10

Stable Training of Bellman Error in Reinforcement Learning (Conference paper)
, Thailand, November 18–22
Authors:  Gong C (龚晨);  Bai YP (白云鹏);  Hou XW (侯新文);  Ji XH (季晓慧)
Adobe PDF(2416Kb)  |  Views/Downloads: 132/37  |  Submitted: 2023/06/27
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 12, Pages: 5245-5256
Authors:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  Views/Downloads: 383/86  |  Submitted: 2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor–critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)
Continuous-Time Time-Varying Policy Iteration (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2020, Volume: 50, Issue: 12, Pages: 4958-4971
Authors:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  Views/Downloads: 302/56  |  Submitted: 2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, Volume: 50, Issue: 11, Pages: 3959-3971
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2079Kb)  |  Views/Downloads: 219/18  |  Submitted: 2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
Neuro-optimal control for discrete stochastic processes via a novel policy iteration algorithm (Journal article)
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, Volume: 50, Issue: 11, Pages: 3972-3985
Authors:  Liang, Mingming;  Wang, Ding;  Liu, Derong
Adobe PDF(1604Kb)  |  Views/Downloads: 224/74  |  Submitted: 2020/10/23
Adaptive critic designs  adaptive dynamic programming (ADP)  local policy iteration  neuro-dynamic programming  optimal control  stochastic processes  
Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, Volume: 31, Issue: 6, Pages: 2064-2076
Authors:  Li, Haoran;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(4274Kb)  |  Views/Downloads: 412/126  |  Submitted: 2020/08/03
Robot sensing systems  Navigation  Entropy  Neural networks  Task analysis  Planning  Automatic exploration  deep reinforcement learning (DRL)  optimal decision  partial observation  
WAGNN: A Weighted Aggregation Graph Neural Network for robot skill learning (Journal article)
ROBOTICS AND AUTONOMOUS SYSTEMS, 2020, Volume: 130, Pages: 9
Authors:  Zhang, Fengyi;  Liu, Zhiyong;  Xiong, Fangzhou;  Su, Jianhua;  Qiao, Hong
Adobe PDF(1550Kb)  |  Views/Downloads: 375/58  |  Submitted: 2020/07/20
Skill transfer learning  Serial structures  Robot skill learning  Graph Neural Network  
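Research on Knowledge Transfer Methods for Non-Stationary Environments (面向非平稳环境的知识迁移方法研究) (Thesis)
, Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2020
Author:  Li Huaiyu (李怀宇)
Adobe PDF(13633Kb)  |  Views/Downloads: 270/10  |  Submitted: 2020/06/11
Meta-learning  Continual learning  Knowledge transfer  Catastrophic forgetting  Generative adversarial networks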
Computational modeling of Emotion-motivated Decisions for Continuous Control of Mobile Robots (Journal article)
IEEE Transactions on Cognitive and Developmental Systems, 2020, Volume: 13, Issue: 2020, Pages: 1-14
Authors:  Huang, Xiao;  Wu, Wei;  Qiao, Hong
Adobe PDF(5970Kb)  |  Views/Downloads: 278/95  |  Submitted: 2020/06/09
Brain-inspired Computing  Emotion-motivated Learning  Emotion-memory Interactions  Decision-making  Reinforcement Learning  
Research progress of parallel control and management (Journal article)
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, Volume: 7, Issue: 2, Pages: 355-367
Authors:  Xiong, Gang;  Dong, Xisong;  Lu, Hao;  Shen, Dayong
Adobe PDF(12496Kb)  |  Views/Downloads: 321/58  |  Submitted: 2020/06/02
ACP methodology  artificial systems  computational experiments  parallel control  parallel management  parallel systems