CASIA OpenIR

Browse/Search Results:  1-6 of 6 Help

Selected(0)Clear Items/Page:    Sort:
Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2016, 卷号: 10, 期号: 12, 页码: 1339-1347
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  Li, Xiangjun
View  |  Adobe PDF(976Kb)  |  Favorite  |  View/Download:154/69  |  Submit date:2016/12/26
Nonlinear Control Systems  Continuous Time Systems  Learning (Artificial Intelligence)  Optimal Control  Dynamic Programming  Lyapunov Methods  Linear Systems  Reinforcement Learning  Continuous-time Problem  Nonlinear Optimal Tracking Problem  Adaptive Dynamic Programming  Model-free Adaptive Optimal Tracking Algorithm  Lyapunov Analysis  Linear System  
连续状态系统的近似最优在线强化学习 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2015
Authors:  朱圆恒
Adobe PDF(2679Kb)  |  Favorite  |  View/Download:263/0  |  Submit date:2015/09/02
强化学习  最优控制  近似策略迭代  概率近似最优  连续状态系统  收敛性  在线学习  Kd树  Reinforcement Learning  Optimal Control  Approximate Policy Iteration  Probably Approximately Correct  Continuous-state System  Convergence  Online Learning  Kd-tree  
Dynamic dual adjustment of daily budgets and bids in sponsored search auctions 期刊论文
DECISION SUPPORT SYSTEMS, 2014, 卷号: 57, 期号: 0, 页码: 105-114
Authors:  Zhang, Jie;  Yang, Yanwu;  Li, Xin;  Qin, Rui;  Zeng, Daniel
View  |  Adobe PDF(983Kb)  |  Favorite  |  View/Download:94/27  |  Submit date:2015/08/12
Sponsored Search Auction  Budget Adjustment  Continuous Reinforcement Learning  Dynamic Adjustment  
Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2013, 卷号: 7, 期号: 17, 页码: 2037-2047
Authors:  Yang, Xiong;  Liu, Derong;  Huang, Yuzhu
View  |  Adobe PDF(493Kb)  |  Favorite  |  View/Download:60/24  |  Submit date:2015/08/12
Adaptive Control  Approximation Theory  Closed Loop Systems  Continuous Time Systems  Lyapunov Methods  Neurocontrollers  Nonlinear Control Systems  Optimal Control  Robust Control  Uncertain Systems  Neural Network-based Online Adaptive Optimal Control  Uncertain Nonlinear Continuous-time Systems  Control Constraints  Infinite-horizon Optimal Control Problem  Control Policy  Saturation Constraints  Identifier-critic Architecture  Hamilton-jacobi-bellman Equation Approximation  Uncertain System Dynamics  Critic Nn  Action-critic Dual Networks  Reinforcement Learning  Identifier Nn  Policy Iteration  Lyapunovaeuros Direct Method  Closed Loop System Stability  
连续状态空间的强化学习问题 学位论文
, 中国科学院自动化研究所: 中国科学院研究生院, 2007
Authors:  何源
Adobe PDF(2826Kb)  |  Favorite  |  View/Download:182/0  |  Submit date:2015/09/02
强化学习  连续状态空间  核方法  函数逼近  Reinforcement Learning  Continuous State Space  Kernel Method  Function  
连续状态-动作空间下强化学习方法的研究 学位论文
, 中国科学院自动化研究所: 中国科学院研究生院, 2005
Authors:  程玉虎
Favorite  |  View/Download:234/0  |  Submit date:2015/09/02
强化学习  连续空间  函数逼近  Rbf 网络  模糊推理系统  Reinforcement Learning  Continuous Space  Function Approximation  Rbf Network  Fuzzy Inference System