CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:74/6  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 18-36
作者:  Ding Wang;  Ning Gao;  Derong Liu;  Jinna Li;  Frank L. Lewis
Adobe PDF(1945Kb)  |  收藏  |  浏览/下载:270/185  |  提交时间:2024/01/02
Adaptive dynamic programming (ADP)  advanced control  complex environment  data-driven control  event-triggered design  intelligent control  neural networks  nonlinear systems  optimal control  reinforcement learning (RL)  
Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 9, 页码: 1797-1809
作者:  Ding Wang;  Jiangyu Wang;  Mingming Zhao;  Peng Xin;  Junfei Qiao
Adobe PDF(5140Kb)  |  收藏  |  浏览/下载:150/60  |  提交时间:2023/08/10
Adaptive critic  artificial neural networks  Hamilton-Jacobi-Bellman (HJB) equation  multi-step heuristic dynamic programming  multi-step reinforcement learning  optimal control  
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:172/62  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Multitask Policy Adversarial Learning for Human-Level Control With Large State Spaces 期刊论文
IEEE Transactions on Industrial Informatics Information, 2019, 卷号: 15, 期号: 4, 页码: 2395-2404
作者:  Wang JP(王军平);  You Kang Shi;  Wen Sheng Zhang;  Ian Thomas;  Shi Hui Duan
Adobe PDF(2547Kb)  |  收藏  |  浏览/下载:118/40  |  提交时间:2023/05/05
Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 3, 页码: 781-791
作者:  Guangyu Zhu;  Xiaolu Li;  Ranran Sun;  Yiyuan Yang;  Peng Zhang
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:212/69  |  提交时间:2023/03/02
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  optimal control  policy iteration  time-varying  
Mixed-Supervised Scene Text Detection With Expectation-Maximization Algorithm 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 5513-5528
作者:  Zhao, Mengbiao;  Feng, Wei;  Yin, Fei;  Zhang, Xu-Yao;  Liu, Cheng-Lin
Adobe PDF(5999Kb)  |  收藏  |  浏览/下载:379/38  |  提交时间:2022/09/19
Costs  Annotations  Training  Labeling  Detectors  Data models  Benchmark testing  Mixed-supervised learning  scene text detection  weak supervision forms  expectation-maximization algorithm  
AHDet: A dynamic coarse-to-fine gaze strategy for active object detection 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 522-532
作者:  Xu, Nuo;  Huo, Chunlei;  Zhang, Xin;  Pan, Chunhong
Adobe PDF(2664Kb)  |  收藏  |  浏览/下载:315/61  |  提交时间:2022/09/19
Object detection  Active object detection  Deep reinforcement learning  Convolutional neural networks  
A Brain-Inspired Approach for Probabilistic Estimation and Efficient Planning in Precision Physical Interaction 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 页码: 15
作者:  Xing, Dengpeng;  Yang, Yiming;  Zhang, Tielin;  Xu, Bo
Adobe PDF(2960Kb)  |  收藏  |  浏览/下载:201/2  |  提交时间:2022/06/10
Task analysis  Robots  Force  Planning  Mathematical models  Brain modeling  Biology  Brain-inspired structure  precision physical interaction  spiking neural networks (SNNs)  
Multiagent Reinforcement Learning:Rollout and Policy Iteration 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 2, 页码: 249-272
作者:  Dimitri Bertsekas
Adobe PDF(2312Kb)  |  收藏  |  浏览/下载:144/31  |  提交时间:2021/04/09
Dynamic programming  multiagent problems  neuro-dynamic programming  policy iteration  reinforcement learning, rollout