CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文
, 线上, 2020-4
作者:  Zhao EM(赵恩民);  Deng SH(邓诗弘);  Zang YF(臧一凡);  Kang YX(康永欣);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:114/44  |  提交时间:2023/06/29
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:126/35  |  提交时间:2023/06/27
Deep Behavioral Cloning for Traffic Control with Virtual Expert Demonstration Under a Parallel Learning Framework 会议论文
, 北京, 2020-12
作者:  Li Xiaoshuang;  Zhu Fenghua;  Wang Fei-Yue
Adobe PDF(770Kb)  |  收藏  |  浏览/下载:195/82  |  提交时间:2022/06/16
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:373/85  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 3959-3971
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2079Kb)  |  收藏  |  浏览/下载:204/14  |  提交时间:2021/01/07
Optimal control  Discrete-time systems  Heuristic algorithms  Dynamic programming  Convergence  Artificial intelligence  Nonlinear systems  Adaptive dynamic programming  discrete-time systems  invariant admissibility  optimal control  policy iteration  sum of squares  
Manipulation Skill Learning on Multi-step Complex Task Based on Explicit and Implicit Curriculum Learning 期刊论文
SCIENCE CHINA Information Sciences, 2020, 卷号: 0, 期号: 0, 页码: 0-0
作者:  Liu, Naijun;  Lu, Tao;  Cai, Yinghao;  Wang, Rui;  Wang, Shuo
浏览  |  Adobe PDF(2456Kb)  |  收藏  |  浏览/下载:196/81  |  提交时间:2020/09/27
robot  manipulation skill learning  multi-step complex task  curriculum learning  
Computational modeling of Emotion-motivated Decisions for Continuous Control of Mobile Robots 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2020, 卷号: 13, 期号: 2020, 页码: 1-14
作者:  Huang, Xiao;  Wu, Wei;  Qiao, Hong
浏览  |  Adobe PDF(5970Kb)  |  收藏  |  浏览/下载:274/94  |  提交时间:2020/06/09
Brain-inspired Computing  Emotion-motivated Learning  Emotion-memory Interactions  Decision-making  Reinforcement Learning  
Parallel reinforcement learning-based energy efficiency improvement for a cyber-physical system 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 卷号: 7, 期号: 2, 页码: 617-626
作者:  Liu, Teng;  Tian, Bin;  Ai, Yunfeng;  Wang, Fei-Yue
Adobe PDF(5784Kb)  |  收藏  |  浏览/下载:256/3  |  提交时间:2020/06/02
Bidirectional long short-term memory (LSTM) network  cyber-physical system (CPS)  energy management  parallel system  reinforcement learning (RL)  
Real-Sim-Real Transfer for Real-World Robot Control Policy Learning with Deep Reinforcement Learning 期刊论文
APPLIED SCIENCES-BASEL, 2020, 卷号: 10, 期号: 5, 页码: 16
作者:  Liu, Naijun;  Cai, Yinghao;  Lu, Tao;  Wang, Rui;  Wang, Shuo
浏览  |  Adobe PDF(6287Kb)  |  收藏  |  浏览/下载:270/67  |  提交时间:2020/06/02
robot  policy learning  reality gap  simulated environment  deep reinforcement learning