CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共7条,第1-7条 帮助

限定条件                            
已选(0)清除 条数/页:   排序方式:
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:247/26  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
A data-based online reinforcement learning algorithm satisfying probably approximately correct principle 期刊论文
NEURAL COMPUTING & APPLICATIONS, 2015, 卷号: 26, 期号: 4, 页码: 775-787
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1331Kb)  |  收藏  |  浏览/下载:259/60  |  提交时间:2015/09/21
Reinforcement Learning  Probably Approximately Correct  Kd-tree  
MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 2, 页码: 346-356
作者:  Zhao, Dongbin;  Zhu, Yuanheng
浏览  |  Adobe PDF(2156Kb)  |  收藏  |  浏览/下载:271/110  |  提交时间:2015/09/18
Efficient Exploration  Probably Approximately Correct (Pac)  Reinforcement Learning (Rl)  State Aggregation  
Full-range adaptive cruise control based on supervised adaptive dynamic programming 期刊论文
NEUROCOMPUTING, 2014, 卷号: 125, 页码: 57-67
作者:  Zhao, Dongbin;  Hu, Zhaohui;  Xia, Zhongpu;  Alippi, Cesare;  Zhu, Yuanheng;  Wang, Ding
浏览  |  Adobe PDF(2228Kb)  |  收藏  |  浏览/下载:405/119  |  提交时间:2015/08/12
Adaptive Dynamic Programming  Supervised Reinforcement Learning  Neural Networks  Adaptive Cruise Control  Stop And Go  
Trajectory Tracking Control of Omnidirectional Wheeled Mobile Manipulators: Robust Neural Network-Based Sliding Mode Approach 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2009, 卷号: 39, 期号: 3, 页码: 788-799
作者:  Xu, Dong;  Zhao, Dongbin;  Yi, Jianqiang;  Tan, Xiangmin
浏览  |  Adobe PDF(443Kb)  |  收藏  |  浏览/下载:249/90  |  提交时间:2015/08/12
Omnidirectional Mobile Manipulators  Robust Neural Network (Nn)  Sliding Mode Control (Smc)  Trajectory Tracking Control  Uncertainties  
THE APPLICATION OF ADHDP(lambda) METHOD TO COORDINATED MULTIPLE RAMPS METERING 期刊论文
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 卷号: 5, 期号: 10B, 页码: 3471-3481
作者:  Bai, Xuerui;  Zhao, Dongbin;  Yi, Jianqiang
浏览  |  Adobe PDF(474Kb)  |  收藏  |  浏览/下载:226/72  |  提交时间:2015/08/12
Heuristic Dynamic Programming  Eligibility Traces  Multiple Ramps Metering  
DHP Method for Ramp Metering of Freeway Traffic 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2011, 卷号: 12, 期号: 4, 页码: 990-999
作者:  Zhao, Dongbin;  Bai, Xuerui;  Wang, Fei-Yue;  Xu, Jing;  Yu, Wensheng;  Fei-Yue Wang
Adobe PDF(827Kb)  |  收藏  |  浏览/下载:246/77  |  提交时间:2015/08/12
Congestion  Dual Heuristic Programming (Dhp)  Ramp Metering  Traffic Control