CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, 卷号: 3, 期号: 1, 页码: 73-84
作者:  Kun Shao;  Yuanheng Zhu;  Dongbin Zhao
浏览  |  Adobe PDF(4125Kb)  |  收藏  |  浏览/下载:324/128  |  提交时间:2019/04/22
Reinforcement Learning, Transfer Learning, Curriculum Learning, Neural Network, Game Ai  
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs 期刊论文
NEUROCOMPUTING, 2017, 卷号: 238, 期号: *, 页码: 377-386
作者:  Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(1508Kb)  |  收藏  |  浏览/下载:591/262  |  提交时间:2017/05/04
Adaptive Dynamic Programming  Optimal Control  Neural Network  Fully Cooperative Games  Data-driven  Constrained Input  
深度强化学习进展: 从 AlphaGo 到 AlphaGo Zero 期刊论文
控 制 理 论 与 应 用, 2017, 卷号: 34, 期号: 12, 页码: 1529-1546
作者:  唐振韬;  邵 坤;  赵冬斌;  朱圆恒
Adobe PDF(8232Kb)  |  收藏  |  浏览/下载:206/33  |  提交时间:2021/07/05
深度强化学习  AlphaGo Zero  深度学习  强化学习  人工智能  
深度强化学习综述:兼论计算机围棋的发展 期刊论文
控制理论与应用, 2016, 卷号: 33, 期号: 6, 页码: 701-717
作者:  赵冬斌;  邵坤;  朱圆恒;  李栋;  陈亚冉;  王海涛;  刘德荣;  周彤;  王成红
浏览  |  Adobe PDF(2816Kb)  |  收藏  |  浏览/下载:1719/634  |  提交时间:2017/09/13
深度强化学习  初弈号  深度学习  强化学习  人工智能  
Neural-network-based decentralized control of continuous-time nonlinear interconnected systems with unknown dynamics 期刊论文
NEUROCOMPUTING, 2015, 期号: 165, 页码: 90-98
作者:  Liu, Derong;  Li, Chao;  Li, Hongliang;  Wang, Ding;  Ma, Hongwen
浏览  |  Adobe PDF(1120Kb)  |  收藏  |  浏览/下载:359/115  |  提交时间:2015/09/17
Adaptive Dynamic Programming  Decentralized Control  Optimal Control  Policy Iteration  Neural Networks  
Finite horizon optimal tracking control of partially unknown linear continuous-time systems using policy iteration 期刊论文
IET CONTROL THEORY AND APPLICATIONS, 2015, 卷号: 9, 期号: 12, 页码: 1791-1801
作者:  Li, Chao;  Liu, Derong;  Li, Hongliang
浏览  |  Adobe PDF(669Kb)  |  收藏  |  浏览/下载:283/89  |  提交时间:2015/09/23
Optimal Tracking Control  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
作者:  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2156Kb)  |  收藏  |  浏览/下载:254/105  |  提交时间:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation  
Self-teaching adaptive dynamic programming for Gomoku 期刊论文
NEUROCOMPUTING, 2012, 卷号: 78, 期号: 1, 页码: 23-29
作者:  Zhao, Dongbin;  Zhang, Zhen;  Dai, Yujie
收藏  |  浏览/下载:187/0  |  提交时间:2015/08/12
Gomoku  Reinforcement Learning  Adaptive Dynamic Programming  Temporal Difference Learning  Neural Network