CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Data-efficient model-based reinforcement learning with trajectory discrimination 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2023, 页码: 10
作者:  Qu, Tuo;  Duan, Fuqing;  Zhang, Junge;  Zhao, Bo;  Huang, Wenzhen
收藏  |  浏览/下载:86/0  |  提交时间:2023/11/16
Reinforcement learning  Deep learning  Continuous control task  World model  
Adaptive Critic Designs for Optimal Event-Driven Control of a CSTR System 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 1, 页码: 484-493
作者:  Yang, Xiong;  Wei, Qinglai
收藏  |  浏览/下载:152/0  |  提交时间:2021/01/06
Chemical reactors  Optimal control  Nonlinear systems  Adaptive systems  Cost function  Informatics  Closed loop systems  Adaptive critic designs (ACDs)  continuous stirred tank reactor (CSTR)  discounted cost  event-driven control  reinforcement learning (RL)  
Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2016, 卷号: 10, 期号: 12, 页码: 1339-1347
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
浏览  |  Adobe PDF(976Kb)  |  收藏  |  浏览/下载:389/161  |  提交时间:2016/12/26
Nonlinear Control Systems  Continuous Time Systems  Learning (Artificial Intelligence)  Optimal Control  Dynamic Programming  Lyapunov Methods  Linear Systems  Reinforcement Learning  Continuous-time Problem  Nonlinear Optimal Tracking Problem  Adaptive Dynamic Programming  Model-free Adaptive Optimal Tracking Algorithm  Lyapunov Analysis  Linear System  
Dynamic dual adjustment of daily budgets and bids in sponsored search auctions 期刊论文
DECISION SUPPORT SYSTEMS, 2014, 卷号: 57, 期号: 0, 页码: 105-114
作者:  Zhang, Jie;  Yang, Yanwu;  Li, Xin;  Qin, Rui;  Zeng, Daniel
浏览  |  Adobe PDF(983Kb)  |  收藏  |  浏览/下载:333/92  |  提交时间:2015/08/12
Sponsored Search Auction  Budget Adjustment  Continuous Reinforcement Learning  Dynamic Adjustment  
Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2013, 卷号: 7, 期号: 17, 页码: 2037-2047
作者:  Yang, Xiong;  Liu, Derong;  Huang, Yuzhu
浏览  |  Adobe PDF(493Kb)  |  收藏  |  浏览/下载:322/86  |  提交时间:2015/08/12
Adaptive Control  Approximation Theory  Closed Loop Systems  Continuous Time Systems  Lyapunov Methods  Neurocontrollers  Nonlinear Control Systems  Optimal Control  Robust Control  Uncertain Systems  Neural Network-based Online Adaptive Optimal Control  Uncertain Nonlinear Continuous-time Systems  Control Constraints  Infinite-horizon Optimal Control Problem  Control Policy  Saturation Constraints  Identifier-critic Architecture  Hamilton-jacobi-bellman Equation Approximation  Uncertain System Dynamics  Critic Nn  Action-critic Dual Networks  Reinforcement Learning  Identifier Nn  Policy Iteration  Lyapunovaeuros Direct Method  Closed Loop System Stability