CASIA OpenIR

浏览/检索结果: 共29条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Cyber-Physical-Social System in Intelligent Transportation 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2015, 期号: 3, 页码: 320 - 333
作者:  Gang Xiong;  Fenghua Zhu;  Xiwei Liu;  Xisong Dong;  Wuling Huang;  Songhang Chen;  Zhao K(赵恺)
Adobe PDF(4004Kb)  |  收藏  |  浏览/下载:372/160  |  提交时间:2019/10/10
Cyber-physical-social System (Cpss)  Acp Approach  Intelligent Transportation System (Its)  Parallel Control And Management,  Internet Of Vehicles  Social Transportation Network  
Cyber-Physical-Social System in Intelligent Transportation 期刊论文
IEEE/CAA JOURNAL OF AUTOMATICA SINICA,, 2015, 卷号: 2, 期号: 3, 页码: 320-333
作者:  Gang XIONG;  Fenghua Zhu;  Xiwei Liu;  Xisong Dong;  Wuling Huang;  Songhang Chen;  Kai Zhao
浏览  |  Adobe PDF(4001Kb)  |  收藏  |  浏览/下载:388/155  |  提交时间:2017/12/31
Cyber-physical-social System (Cpss)  Acp Approach  Intelligent Transportation System (Its)  Parallel Control And Management  Internet Of Vehicles  
Clique-based cooperative multiagent reinforcement learning using factor graphs 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2015, 卷号: 3, 期号: 1, 页码: 248-256
作者:  Zhang,Zhen;  Zhao DB(赵冬斌)
浏览  |  Adobe PDF(707Kb)  |  收藏  |  浏览/下载:212/87  |  提交时间:2017/12/30
Reinforcement Learning  Factor Graphs  
Data-driven H∞ control for nonlinear distributed parameter systems 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2015, 卷号: 26, 期号: 11, 页码: 2949-2961
作者:  Luo, Biao;  Huang, Tingwen;  Wu, Huai-Ning;  Yang, Xiong
浏览  |  Adobe PDF(1844Kb)  |  收藏  |  浏览/下载:360/139  |  提交时间:2016/10/28
Data Driven  
Off-policy reinforcement learning for H∞ control design 期刊论文
IEEE Transactions on Cybernetics, 2015, 卷号: 45, 期号: 1, 页码: 65-76
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen
浏览  |  Adobe PDF(680Kb)  |  收藏  |  浏览/下载:227/62  |  提交时间:2016/04/08
Off-policy  
Adaptive Optimal Control of Highly Dissipative Nonlinear Spatially Distributed Processes With Neuro-Dynamic Programming 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 684-696
作者:  Luo, Biao;  Wu, Huai-Ning;  Li, Han-Xiong
浏览  |  Adobe PDF(2465Kb)  |  收藏  |  浏览/下载:340/106  |  提交时间:2016/03/30
Adaptive Optimal Control  Empirical Eigenfunction (Eef)  Highly Dissipative Partial Differential Equations (Pdes)  Neuro-dynamic Programming (Ndp)  Spatially Distributed Processes (Sdps)  
Reinforcement learning solution for HJB equation arising in constrained optimal control problem 期刊论文
NEURAL NETWORKS, 2015, 卷号: 71, 期号: 0, 页码: 150-158
作者:  Luo, Biao;  Wu, Huai-Ning;  Huang, Tingwen;  Liu, Derong
浏览  |  Adobe PDF(530Kb)  |  收藏  |  浏览/下载:466/202  |  提交时间:2016/03/30
Constrained Optimal Control  Data-based  Off-policy Reinforcement Learning  Hamilton-jacobi-bellman Equation  The Method Of Weighted Residuals  
A novel policy iteration based deterministic Q-learning for discrete-time nonlinear systems 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2015, 卷号: 58, 期号: 12, 页码: 122203:1–122203:15
作者:  Wei QingLai;  Liu DeRong;  Derong Liu
浏览  |  Adobe PDF(1215Kb)  |  收藏  |  浏览/下载:308/125  |  提交时间:2016/03/19
Adaptive Critic Designs  Adaptive Dynamic Programming  Approximate Dynamic Programming  Q-learning  Policy Iteration  Neural Networks  Nonlinear Systems  Optimal Control  
Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 卷号: 45, 期号: 12, 页码: 1577-1591
作者:  Liu, Derong;  Wei, Qinglai;  Yan, Pengfei
浏览  |  Adobe PDF(1540Kb)  |  收藏  |  浏览/下载:236/70  |  提交时间:2016/03/19
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration  Neural Networks  Neuro-dynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems 期刊论文
COGNITIVE COMPUTATION, 2015, 卷号: 7, 期号: 6, 页码: 763-771
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo;  Ji, Junhong
Adobe PDF(809Kb)  |  收藏  |  浏览/下载:255/38  |  提交时间:2016/01/18
Approximate Policy Iteration  Approximation Error  Optimal Control  Fuzzy Approximator