CASIA OpenIR

浏览/检索结果: 共164条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou
Adobe PDF(1457Kb)  |  收藏  |  浏览/下载:8/2  |  提交时间:2024/05/30
Spiking Adaptive Dynamic Programming with Poisson Process 会议论文
, 中国山东省青岛市, 2021-07-18
作者:  Wei QL(魏庆来);  Han LY(韩立元);  Zhang TL(张铁林)
Adobe PDF(2334Kb)  |  收藏  |  浏览/下载:15/3  |  提交时间:2024/05/28
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 卷号: 35, 期号: 3, 页码: 3251 - 3264
作者:  Wei, Qinglai;  Li, Tao
Adobe PDF(8471Kb)  |  收藏  |  浏览/下载:8/2  |  提交时间:2024/05/28
Adaptive dynamic programming  approximate dynamic programming  constrained cost  optimal control  reinforcement learning  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:60/1  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:  Wei, Qinglai;  Zhou, Tianmin;  Lu, Jingwei;  Liu, Yu;  Su, Shuai;  Xiao, Jun
收藏  |  浏览/下载:126/0  |  提交时间:2023/11/17
Adaptive dynamic programming (ADP)  Hamilton-Jacobi-Bellman equation (HJBE)  nonlinear stochastic system  stochastic policy iteration (PI)  
Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information 会议论文
, Jinghong, China, 05-09 December 2022
作者:  Junhang Wei;  Shaowei Cui;  Peng Hao;  Shuo Wang
Adobe PDF(933Kb)  |  收藏  |  浏览/下载:157/56  |  提交时间:2023/10/25
Sample-Observed Soft Actor-Critic Learning for Path Following of a Biomimetic Underwater Vehicle 期刊论文
IEEE Transactions on Automation Science and Engineering, 2023, 页码: 1-10
作者:  Ma, Ruichen;  Wang, Yu;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Ming
Adobe PDF(2902Kb)  |  收藏  |  浏览/下载:183/59  |  提交时间:2023/08/03
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:105/31  |  提交时间:2023/06/27
A novel iterative adaptive critic design for smart home energy systems with solar energy 会议论文
, 中国厦门, 2022年11月
作者:  Liao ZH(廖泽华);  Wei, Qinglai;  Li, Hongyang
Adobe PDF(965Kb)  |  收藏  |  浏览/下载:164/72  |  提交时间:2023/06/06
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming 会议论文
, Budapest, Hungary, 2019-7-14
作者:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF(679Kb)  |  收藏  |  浏览/下载:63/31  |  提交时间:2023/05/22