CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共5条,第1-5条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming 会议论文
, Budapest, Hungary, 2019-7-14
作者:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF(679Kb)  |  收藏  |  浏览/下载:54/28  |  提交时间:2023/05/22
StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, 卷号: 3, 期号: 1, 页码: 73-84
作者:  Kun Shao;  Yuanheng Zhu;  Dongbin Zhao
浏览  |  Adobe PDF(4125Kb)  |  收藏  |  浏览/下载:332/131  |  提交时间:2019/04/22
Reinforcement Learning, Transfer Learning, Curriculum Learning, Neural Network, Game Ai  
Online Reinforcement Learning by Bayesian Inference 会议论文
Proceedings of International Joint Conference on Neural Networks 2015, Ireland, 2015年7月
作者:  Xia ZP(夏中谱);  Dongbin Zhao
浏览  |  Adobe PDF(751Kb)  |  收藏  |  浏览/下载:278/89  |  提交时间:2016/06/15
Reinforcement Learning  Bayesian Inference  Gaussian Processes  
Neural network based online traffic signal controller design with reinforcement training 会议论文
IEEE International Conference on Intelligent Transportation Systems (ITSC), 2011
作者:  Dai, Yujie;  Hu, Jinzong;  Zhao, Dongbin;  Zhu, Fenghua
Adobe PDF(241Kb)  |  收藏  |  浏览/下载:270/86  |  提交时间:2015/08/19
Adaptive dynamic programming for optimal control of unknown nonlinear discrete-time systems 会议论文
Symposium Series on Computational Intelligence (SSC) - IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2011
作者:  Liu, Derong;  Wang, Ding;  Zhao, Dongbin
Adobe PDF(1951Kb)  |  收藏  |  浏览/下载:181/42  |  提交时间:2015/08/19