Institutional Repository of the Institute of Automation, Chinese Academy of Sciences

Knowledge Commons of Institute of Automation, CAS


(This search is based on users' claimed works.)

Browse/search results: 7 items in total, showing items 1-7

Filter	Author: 朱圆恒 (Zhu Yuanheng)

	A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat. Journal article. IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023. DOI: 10.1109/TSMC.2023.3270444. Authors: Jiajun Chai; Wenzhang Chen; Yuanheng Zhu; Zong-xin Yao; Dongbin Zhao. Adobe PDF (9249 KB) | Views/Downloads: 198/107 | Submitted: 2023/04/26
	Comprehensive comparison of online ADP algorithms for continuous-time optimal control. Journal article. ARTIFICIAL INTELLIGENCE REVIEW, 2018, Volume: 49, Issue: 4, Pages: 531-547. Authors: Zhu, Yuanheng; Zhao, Dongbin. Adobe PDF (766 KB) | Views/Downloads: 406/180 | Submitted: 2017/09/13. Keywords: Adaptive Dynamic Programming; Policy Iteration; Integral Reinforcement Learning; Experience Replay; Off-policy
	Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems. Journal article. IET CONTROL THEORY AND APPLICATIONS, 2017, Volume: 11, Issue: 14, Pages: 2307-2316. Authors: Yang, Xiong; He, Haibo; Liu, Derong; Zhu, Yuanheng. Adobe PDF (2123 KB) | Views/Downloads: 426/141 | Submitted: 2017/09/13. Keywords: Dynamic Programming; Robust Control; Neurocontrollers; Continuous Time Systems; Control System Synthesis; Nonlinear Control Systems; Optimal Control; Function Approximation; Monte Carlo Methods; Closed Loop Systems; Asymptotic Stability; Adaptive Dynamic Programming; Robust Neural Control Design; Unknown Continuous-time Nonlinear Systems; CT Nonlinear Systems; ADP-based Robust Neural Control Scheme; Robust Nonlinear Control Problem; Nonlinear Optimal Control Problem; Nominal System; ADP Algorithm; Actor-critic Dual Networks; Control Policy Approximation; Value Function Approximation; Actor Neural Network Weights; Critic NN Weights; Monte Carlo Integration Method; Closed-loop System; Asymptotically Stability
	Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs. Journal article. NEUROCOMPUTING, 2017, Volume: 238, Pages: 377-386. Authors: Zhang, Qichao; Zhao, Dongbin; Zhu, Yuanheng*. Adobe PDF (1508 KB) | Views/Downloads: 608/265 | Submitted: 2017/05/04. Keywords: Adaptive Dynamic Programming; Optimal Control; Neural Network; Fully Cooperative Games; Data-driven; Constrained Input
	连续状态系统的近似最优在线强化学习 (Near-optimal online reinforcement learning for continuous-state systems). Thesis. Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2015. Author: 朱圆恒 (Zhu Yuanheng). Adobe PDF (2679 KB) | Views/Downloads: 498/0 | Submitted: 2015/09/02. Keywords: Reinforcement Learning; Optimal Control; Approximate Policy Iteration; Probably Approximately Correct; Continuous-state System; Convergence; Online Learning; Kd-tree
	Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems. Journal article. NEUROCOMPUTING, 2015, Volume: 149, Pages: 124-131. Authors: Zhu, Yuanheng; Zhao, Dongbin; Liu, Derong. Adobe PDF (860 KB) | Views/Downloads: 266/99 | Submitted: 2015/10/13. Keywords: Discrete-time Nonlinear System; T-S Fuzzy System; HDP
	Online reinforcement learning for continuous-state systems. Book chapter / collected paper. In: Frontiers of Intelligent Control and Information Processing, Singapore: World Scientific, 2014. Authors: Yuanheng Zhu; Zhao DB (赵冬斌). Adobe PDF (24150 KB) | Views/Downloads: 242/27 | Submitted: 2017/09/13


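Copyright © 2018-2024 Institute of Automation, Chinese Academy of Sciences. Powered by CSpace

Address and postal code: 95 Zhongguancun East Road, Haidian District, Beijing (100190)
Phone: 010-82544495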