CASIA OpenIR

Browse/Search Results:  1-10 of 170

Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors:  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1021Kb)  |  View/Download:36/10  |  Submit date:2019/07/12
Integral reinforcement learning (IRL)  neural network (NN)  nonzero-sum (NZS) games  off-policy  single-critic  unknown drift dynamics  
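Brain-Inspired Neural Network Models for Autonomous Learning and Decision-Making (Thesis)
University of Chinese Academy of Sciences: Institute of Automation, Chinese Academy of Sciences, 2019
Authors:  Zhao, Feifei (赵菲菲)
Adobe PDF(16032Kb)  |  View/Download:114/6  |  Submit date:2019/06/05
Brain-inspired autonomous learning and decision-making  Multi-brain-region coordination  Spiking neural networks  Developmental neural networks  Microscopic plasticity  Visual fear response model  UAV autonomous decision-making  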
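Research on Cooperative Formation Control Methods for Unmanned Aerial Vehicles (Thesis)
Doctor of Engineering, Beijing: University of Chinese Academy of Sciences, 2019
Authors:  Xiong, Tianyi (熊天漪)
Adobe PDF(3911Kb)  |  View/Download:99/1  |  Submit date:2019/06/09
UAV cooperative formation control  Time-varying formation tracking  Time-varying delay  Switching communication topology  Minimal parameter learning  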
A Survey of Cognitive Architectures in the Past 20 Years (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2018, Volume: 48, Issue: 12, Pages: 3280-3290
Authors:  Ye, Peijun;  Wang, Tao;  Wang, Fei-Yue
Adobe PDF(734Kb)  |  View/Download:124/65  |  Submit date:2019/01/08
Agent  artificial intelligence (AI)  cognitive architecture (CA)  survey  
Adaptive dynamic programming-based stabilization of nonlinear systems with unknown actuator saturation (Journal article)
NONLINEAR DYNAMICS, 2018, Volume: 93, Issue: 4, Pages: 2089-2103
Authors:  Zhao, Bo;  Jia, Lihao;  Xia, Hongbing;  Li, Yuanchun
Adobe PDF(1069Kb)  |  View/Download:86/37  |  Submit date:2018/10/10
Adaptive Dynamic Programming  Unknown Actuator Saturation  Continuous-time Nonlinear Systems  Stabilizing Control  Neural Networks  
Research on Autonomous Maneuvering Decision of UCAV Based on Deep Reinforcement Learning (Conference paper)
CCDC2018, Shenyang, China, June 9-11, 2018
Authors:  Zhang, Yesheng;  Zu, Wei;  Gao, Yang;  Chang, Hongxing
Adobe PDF(1271Kb)  |  View/Download:243/89  |  Submit date:2018/05/09
Air Combat  Autonomous Maneuvering Decision  Deep Reinforcement Learning  
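Research on Coordinated Multi-Robot Hunting in Dynamic Unknown Environments (Thesis)
Beijing: Graduate University of Chinese Academy of Sciences, 2018
Authors:  Wu, Zhiyong (吴志勇)
Adobe PDF(8589Kb)  |  View/Download:97/1  |  Submit date:2018/05/30
Multi-robot systems  Hunting  Optimized selection of intruder prediction steps  Region partitioning of the hunting environment  Fuzzy coordinated hunting  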
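Research on the Application of Deep Reinforcement Learning to Tactical Decision-Making in Multi-Aircraft Combat (Thesis)
Beijing: University of Chinese Academy of Sciences, 2018
Authors:  Zhang, Yesheng (张业胜)
Adobe PDF(6414Kb)  |  View/Download:284/5  |  Submit date:2018/06/19
Deep reinforcement learning  Maneuvering decision  Tactical decision  Air combat simulation  Multi-aircraft cooperation  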
Adaptive Dynamic Programming for Discrete-Time Zero-Sum Games (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 4, Pages: 957-969
Authors:  Wei, Qinglai;  Liu, Derong;  Lin, Qiao;  Song, Ruizhuo
View/Download:73/0  |  Submit date:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Neurodynamic Programming  Optimal Control  Zero-sum Game  
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, Volume: 29, Issue: 4, Pages: 1226-1238
Authors:  Wei, Qinglai;  Li, Benkai;  Song, Ruizhuo
Adobe PDF(2475Kb)  |  View/Download:86/18  |  Submit date:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (ADP)  Approximate Dynamic Programming  Generalized Policy Iteration (GPI)  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning