CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

已选(0)清除 条数/页:   排序方式:
Data-efficient model-based reinforcement learning with trajectory discrimination 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2023, 页码: 10
作者:  Qu, Tuo;  Duan, Fuqing;  Zhang, Junge;  Zhao, Bo;  Huang, Wenzhen
收藏  |  浏览/下载:126/0  |  提交时间:2023/11/16
Reinforcement learning  Deep learning  Continuous control task  World model  
Adaptive Critic Designs for Optimal Event-Driven Control of a CSTR System 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 1, 页码: 484-493
作者:  Yang, Xiong;  Wei, Qinglai
收藏  |  浏览/下载:182/0  |  提交时间:2021/01/06
Chemical reactors  Optimal control  Nonlinear systems  Adaptive systems  Cost function  Informatics  Closed loop systems  Adaptive critic designs (ACDs)  continuous stirred tank reactor (CSTR)  discounted cost  event-driven control  reinforcement learning (RL)  
Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2016, 卷号: 10, 期号: 12, 页码: 1339-1347
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
Adobe PDF(976Kb)  |  收藏  |  浏览/下载:440/178  |  提交时间:2016/12/26
Nonlinear Control Systems  Continuous Time Systems  Learning (Artificial Intelligence)  Optimal Control  Dynamic Programming  Lyapunov Methods  Linear Systems  Reinforcement Learning  Continuous-time Problem  Nonlinear Optimal Tracking Problem  Adaptive Dynamic Programming  Model-free Adaptive Optimal Tracking Algorithm  Lyapunov Analysis  Linear System  
连续状态空间的强化学习问题 学位论文
, 中国科学院自动化研究所: 中国科学院研究生院, 2007
作者:  何源
Adobe PDF(2826Kb)  |  收藏  |  浏览/下载:436/0  |  提交时间:2015/09/02
强化学习  连续状态空间  核方法  函数逼近  Reinforcement Learning  Continuous State Space  Kernel Method  Function  
连续状态系统的近似最优在线强化学习 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2015
作者:  朱圆恒
Adobe PDF(2679Kb)  |  收藏  |  浏览/下载:528/0  |  提交时间:2015/09/02
强化学习  最优控制  近似策略迭代  概率近似最优  连续状态系统  收敛性  在线学习  Kd树  Reinforcement Learning  Optimal Control  Approximate Policy Iteration  Probably Approximately Correct  Continuous-state System  Convergence  Online Learning  Kd-tree  
连续状态-动作空间下强化学习方法的研究 学位论文
, 中国科学院自动化研究所: 中国科学院研究生院, 2005
作者:  程玉虎
收藏  |  浏览/下载:540/0  |  提交时间:2015/09/02
强化学习  连续空间  函数逼近  Rbf 网络  模糊推理系统  Reinforcement Learning  Continuous Space  Function Approximation  Rbf Network  Fuzzy Inference System  
Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2013, 卷号: 7, 期号: 17, 页码: 2037-2047
作者:  Yang, Xiong;  Liu, Derong;  Huang, Yuzhu
浏览  |  Adobe PDF(493Kb)  |  收藏  |  浏览/下载:365/93  |  提交时间:2015/08/12
Adaptive Control  Approximation Theory  Closed Loop Systems  Continuous Time Systems  Lyapunov Methods  Neurocontrollers  Nonlinear Control Systems  Optimal Control  Robust Control  Uncertain Systems  Neural Network-based Online Adaptive Optimal Control  Uncertain Nonlinear Continuous-time Systems  Control Constraints  Infinite-horizon Optimal Control Problem  Control Policy  Saturation Constraints  Identifier-critic Architecture  Hamilton-jacobi-bellman Equation Approximation  Uncertain System Dynamics  Critic Nn  Action-critic Dual Networks  Reinforcement Learning  Identifier Nn  Policy Iteration  Lyapunovaeuros Direct Method  Closed Loop System Stability  
Dynamic dual adjustment of daily budgets and bids in sponsored search auctions 期刊论文
DECISION SUPPORT SYSTEMS, 2014, 卷号: 57, 期号: 0, 页码: 105-114
作者:  Zhang, Jie;  Yang, Yanwu;  Li, Xin;  Qin, Rui;  Zeng, Daniel
浏览  |  Adobe PDF(983Kb)  |  收藏  |  浏览/下载:356/100  |  提交时间:2015/08/12
Sponsored Search Auction  Budget Adjustment  Continuous Reinforcement Learning  Dynamic Adjustment