CASIA OpenIR

浏览/检索结果: 共20条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Position Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文
, Liuzhou, China, 20-22 November 2020
作者:  Ma, Ruichen;  Wang, Yu;  Gao, Zisen;  Zhao, Tianzi;  Wang, Rui;  Wang, Shuo;  Zhou, Chao
Adobe PDF(927Kb)  |  收藏  |  浏览/下载:69/31  |  提交时间:2023/08/03
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:93/28  |  提交时间:2023/06/27
Wd3: Taming the estimation bias in deep reinforcement learning 会议论文
, Baltimore, MD, USA, 2020-12
作者:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:195/39  |  提交时间:2022/06/27
deep reinforcement learning  estimation bias  neural networks  
Motor-Cortex-Like Recurrent Neural Network and Multi-Tasks Learning for the Control of Musculoskeletal Systems 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2020, 卷号: 暂无, 期号: 暂无, 页码: 暂无
作者:  Jiahao Chen;  Hong Qiao
Adobe PDF(1958Kb)  |  收藏  |  浏览/下载:166/45  |  提交时间:2021/06/01
Biologically inspired  Musculoskeletal system  Neuromuscular control,  Motor cortex  Muscle synergy  Recurrent neural network  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:329/75  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Continuous-Time Time-Varying Policy Iteration 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 卷号: 50, 期号: 12, 页码: 4958-4971
作者:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  收藏  |  浏览/下载:264/49  |  提交时间:2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
智能机器人共享控制与操作技能学习方法研究 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2020
作者:  席宝
Adobe PDF(9051Kb)  |  收藏  |  浏览/下载:323/20  |  提交时间:2021/02/01
位姿检测  共享控制  强化学习  策略梯度  示教引导  
Model-Free H-infinity Optimal Tracking Control of Constrained Nonlinear Systems via an Iterative Adaptive Learning Algorithm 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 卷号: 50, 期号: 11, 页码: 4097-4108
作者:  Hou, Jiaxu;  Wang, Ding;  Liu, Derong;  Zhang, Yun
收藏  |  浏览/下载:205/0  |  提交时间:2021/01/07
Adaptive dynamic programming (ADP)  control constraints  convergence analysis  H-infinity tracking  neural network (NN)  optimal control  
Parallel Control for Optimal Tracking via Adaptive Dynamic Programming 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 卷号: 7, 期号: 6, 页码: 1662-1674
作者:  Lu, Jingwei;  Wei, Qinglai;  Wang, Fei-Yue
浏览  |  Adobe PDF(7214Kb)  |  收藏  |  浏览/下载:322/56  |  提交时间:2021/01/06
Adaptive dynamic programming (ADP)  nonlinear optimal control  parallel controller  parallel control theory  parallel system  tracking control  neural network (NN)  
Neuro-optimal control for discrete stochastic processes via a novel policy iteration algorithm 期刊论文
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, 卷号: 50, 期号: 11, 页码: 3972-3985
作者:  Liang, Mingming;  Wang, Ding;  Liu, Derong
浏览  |  Adobe PDF(1604Kb)  |  收藏  |  浏览/下载:195/66  |  提交时间:2020/10/23
Adaptive critic designs  adaptive dynamic programming (ADP)  local policy iteration  neuro-dynamic programming  optimal control  stochastic processes