CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 18-36
作者:  Ding Wang;  Ning Gao;  Derong Liu;  Jinna Li;  Frank L. Lewis
Adobe PDF(1945Kb)  |  收藏  |  浏览/下载:245/180  |  提交时间:2024/01/02
Adaptive dynamic programming (ADP)  advanced control  complex environment  data-driven control  event-triggered design  intelligent control  neural networks  nonlinear systems  optimal control  reinforcement learning (RL)  
Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 9, 页码: 1797-1809
作者:  Ding Wang;  Jiangyu Wang;  Mingming Zhao;  Peng Xin;  Junfei Qiao
Adobe PDF(5140Kb)  |  收藏  |  浏览/下载:130/55  |  提交时间:2023/08/10
Adaptive critic  artificial neural networks  Hamilton-Jacobi-Bellman (HJB) equation  multi-step heuristic dynamic programming  multi-step reinforcement learning  optimal control  
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:151/58  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Multitask Policy Adversarial Learning for Human-Level Control With Large State Spaces 期刊论文
IEEE Transactions on Industrial Informatics Information, 2019, 卷号: 15, 期号: 4, 页码: 2395-2404
作者:  Wang JP(王军平);  You Kang Shi;  Wen Sheng Zhang;  Ian Thomas;  Shi Hui Duan
Adobe PDF(2547Kb)  |  收藏  |  浏览/下载:98/35  |  提交时间:2023/05/05
Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 3, 页码: 781-791
作者:  Guangyu Zhu;  Xiaolu Li;  Ranran Sun;  Yiyuan Yang;  Peng Zhang
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:169/63  |  提交时间:2023/03/02
Adaptive critic designs  adaptive dynamic programming  approximate dynamic programming  optimal control  policy iteration  time-varying  
平行交通系统中的预测与控制关键技术研究 学位论文
工学博士, 中国科学院自动化研究所: 中国科学院自动化研究所, 2022
作者:  戴星原
Adobe PDF(14868Kb)  |  收藏  |  浏览/下载:281/12  |  提交时间:2022/10/09
平行交通系统  交通预测  交通控制  深度学习  强化学习  
Mixed-Supervised Scene Text Detection With Expectation-Maximization Algorithm 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 5513-5528
作者:  Zhao, Mengbiao;  Feng, Wei;  Yin, Fei;  Zhang, Xu-Yao;  Liu, Cheng-Lin
Adobe PDF(5999Kb)  |  收藏  |  浏览/下载:301/35  |  提交时间:2022/09/19
Costs  Annotations  Training  Labeling  Detectors  Data models  Benchmark testing  Mixed-supervised learning  scene text detection  weak supervision forms  expectation-maximization algorithm  
AHDet: A dynamic coarse-to-fine gaze strategy for active object detection 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 522-532
作者:  Xu, Nuo;  Huo, Chunlei;  Zhang, Xin;  Pan, Chunhong
Adobe PDF(2664Kb)  |  收藏  |  浏览/下载:294/57  |  提交时间:2022/09/19
Object detection  Active object detection  Deep reinforcement learning  Convolutional neural networks  
Multiagent Reinforcement Learning:Rollout and Policy Iteration 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 2, 页码: 249-272
作者:  Dimitri Bertsekas
Adobe PDF(2312Kb)  |  收藏  |  浏览/下载:128/27  |  提交时间:2021/04/09
Dynamic programming  multiagent problems  neuro-dynamic programming  policy iteration  reinforcement learning, rollout  
Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, 卷号: 49, 期号: 10, 页码: 2155-2165
作者:  Luo, Biao;  Liu, Derong;  Huang, Tingwen;  Liu, Jiangjiang
收藏  |  浏览/下载:225/0  |  提交时间:2019/12/16
Adaptive dynamic programming (ADP)  Bellman equation  heuristic dynamic programming  neural networks (NNs)  output tracking control