CASIA OpenIR

Browse/Search Results: 1-7 of 7

Manifold Regularized Reinforcement Learning (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, vol. 29, no. 4, pp. 932-943
Authors:  Li, Hongliang;  Liu, Derong;  Wang, Ding
Submit date: 2018/10/10
Adaptive Dynamic Programming  Approximate Dynamic Programming  Approximate Policy Iteration (API)  Manifold Regularization  Reinforcement Learning (RL)
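Research on Regional Road Network Traffic Signal Control Based on Computational Experiments (thesis)
Beijing: University of Chinese Academy of Sciences, 2016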
Authors:  刘裕良
Submit date: 2016/06/27
Computational Experiments  Traffic Signal Control  Regional Road Network  Integrated Adaptive Dynamic Programming
Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems (journal article)
COGNITIVE COMPUTATION, 2015, vol. 7, no. 6, pp. 763-771
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
Submit date: 2016/01/18
Approximate Policy Iteration  Approximation Error  Optimal Control  Fuzzy Approximator  
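Near-Optimal Online Reinforcement Learning for Continuous-State Systems (thesis)
Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2015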
Authors:  朱圆恒
Submit date: 2015/09/02
Reinforcement Learning  Optimal Control  Approximate Policy Iteration  Probably Approximately Correct  Continuous-state System  Convergence  Online Learning  Kd-tree
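Research on Data-Based Adaptive Dynamic Programming for Optimal Control and Differential Games (thesis)
Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2015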
Authors:  李宏亮
Submit date: 2015/09/02
Intelligent Control  Adaptive Dynamic Programming  Neural Networks  Optimal Control  Differential Games
Impact of Flavor on Electronic Cigarette Marketing in Social Media (conference paper)
USA, 11.17-11.18
Authors:  Yunji Liang;  Xiaolong Zheng;  Daniel Zeng;  Xingshe Zhou
Submit date: 2018/01/08
Data-based approximate policy iteration for affine nonlinear continuous-time optimal control design (journal article)
AUTOMATICA, 2014, vol. 50, no. 12, pp. 3281-3290
Authors:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
Submit date: 2015/08/12
Nonlinear Optimal Control  Reinforcement Learning  Off-policy  Data-based Approximate Policy Iteration  Neural Network  Hamilton-Jacobi-Bellman Equation