CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

已选(0)清除 条数/页:   排序方式:
An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文
, 线上, 2020-5
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:34/14  |  提交时间:2024/07/04
A Bio-Inspired Integration Model of Basal Ganglia and Cerebellum for Motion Learning of a Musculoskeletal Robot 期刊论文
Journal of Systems Science and Complexity, 2024, 卷号: 37, 页码: 82-113
作者:  Jinhan Zhang;  Jiahao Chen;  Shanlin Zhong;  Hong Qiao
Adobe PDF(1513Kb)  |  收藏  |  浏览/下载:70/20  |  提交时间:2024/06/04
Recent Progress in Reinforcement Learning and Adaptive Dynamic Programming for Advanced Control Applications 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 18-36
作者:  Ding Wang;  Ning Gao;  Derong Liu;  Jinna Li;  Frank L. Lewis
Adobe PDF(1945Kb)  |  收藏  |  浏览/下载:317/203  |  提交时间:2024/01/02
Adaptive dynamic programming (ADP)  advanced control  complex environment  data-driven control  event-triggered design  intelligent control  neural networks  nonlinear systems  optimal control  reinforcement learning (RL)  
A Survey on Reinforcement Learning Methods in Bionic Underwater Robots 期刊论文
BIOMIMETICS, 2023, 卷号: 8, 期号: 2, 页码: 29
作者:  Tong, Ru;  Feng, Yukai;  Wang, Jian;  Wu, Zhengxing;  Tan, Min;  Yu, Junzhi
Adobe PDF(1260Kb)  |  收藏  |  浏览/下载:159/22  |  提交时间:2023/11/17
bionic underwater robot  reinforcement learning  robotic fish  intelligent control  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:259/15  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
基于值分解优化的多智能体深度强化学习方法研究 学位论文
工程硕士, 中国科学院自动化研究所: 中国科学院自动化研究所, 2021
作者:  王凌霄
Adobe PDF(13415Kb)  |  收藏  |  浏览/下载:216/9  |  提交时间:2021/06/15
深度强化学习  多智能体系统  价值函数分解算法  图神经网络  
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 1226-1238
作者:  Wei, Qinglai;  Li, Benkai;  Song, Ruizhuo
浏览  |  Adobe PDF(2475Kb)  |  收藏  |  浏览/下载:426/139  |  提交时间:2017/02/23
Adaptive Critic Designs  Adaptive Dynamic Programming (Adp)  Approximate Dynamic Programming  Generalized Policy Iteration (Gpi)  Neural Networks  Neurodynamic Programming  Nonlinear Systems  Optimal Control  Reinforcement Learning  
连续状态空间的强化学习问题 学位论文
, 中国科学院自动化研究所: 中国科学院研究生院, 2007
作者:  何源
Adobe PDF(2826Kb)  |  收藏  |  浏览/下载:446/0  |  提交时间:2015/09/02
强化学习  连续状态空间  核方法  函数逼近  Reinforcement Learning  Continuous State Space  Kernel Method  Function