CASIA OpenIR

浏览/检索结果: 共37条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:296/75  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Continuous-Time Time-Varying Policy Iteration 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 卷号: 50, 期号: 12, 页码: 4958-4971
作者:  Wei, Qinglai;  Liao, Zehua;  Yang, Zhanyu;  Li, Benkai;  Liu, Derong
Adobe PDF(3149Kb)  |  收藏  |  浏览/下载:237/49  |  提交时间:2021/03/02
Optimal control  Nonlinear systems  Time-varying systems  Mathematical model  Dynamic programming  Approximation algorithms  Iterative algorithms  Adaptive critic designs  adaptive dynamic programming (ADP)  neuro-dynamic programming  nonlinear systems  optimal control  policy iteration  
Wd3: Taming the estimation bias in deep reinforcement learning 会议论文
, Baltimore, MD, USA, 2020-12
作者:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:173/33  |  提交时间:2022/06/27
deep reinforcement learning  estimation bias  neural networks  
STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-Agent Cooperation 会议论文
, 线上, 2020-11
作者:  Huimu Wang;  Zhen Liu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(916Kb)  |  收藏  |  浏览/下载:90/0  |  提交时间:2021/06/24
Multi-Agent Cooperation and Competition with Two-Level Ggraph Attention Network 会议论文
, 线上, 2020-11
作者:  Shiguang, Wu;  Zhiqiang, Pu;  Jianqiang, Yi;  Huimu, Wang
Adobe PDF(1185Kb)  |  收藏  |  浏览/下载:133/1  |  提交时间:2021/06/24
Adaptive Fuzzy Nonsmooth Backstepping Output-Feedback Control for Hypersonic Vehicles With Finite-Time Convergence 期刊论文
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2020, 卷号: 28, 期号: 10, 页码: 2320-2334
作者:  Sun, Jinlin;  Yi, Jianqiang;  Pu, Zhiqiang;  Liu, Zhen
Adobe PDF(3972Kb)  |  收藏  |  浏览/下载:177/0  |  提交时间:2021/01/07
Backstepping  Adaptive fuzzy control  Fuzzy logic  finite-time control  fixed-time state observer  
Decision-Making in Driver-Automation Shared Control: A Review and Perspectives 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2020, 卷号: 7, 期号: 5, 页码: 1289-1307
作者:  Wang, Wenshuo;  Na, Xiaoxiang;  Cao, Dongpu;  Gong, Jianwei;  Xi, Junqiang;  Xing, Yang;  Wang, Fei-Yue
浏览  |  Adobe PDF(14998Kb)  |  收藏  |  浏览/下载:204/26  |  提交时间:2020/09/07
Automated vehicle  decision-making  human driver  human-vehicle interaction  shared control  
A Brain-Inspired Model of Theory of Mind 期刊论文
FRONTIERS IN NEUROROBOTICS, 2020, 卷号: 14, 页码: 17
作者:  Zeng, Yi;  Zhao, Yuxuan;  Zhang, Tielin;  Zhao, Dongcheng;  Zhao, Feifei;  Lu, Enmeng
Adobe PDF(2347Kb)  |  收藏  |  浏览/下载:259/48  |  提交时间:2021/01/07
theory of mind  false-belief task  brain inspired model  self-experience  connection maturation  inhibitory control  
ACDER: Augmented Curiosity-Driven Experience Replay 会议论文
, Paris, France, 2020.05.31-2020.08.31
作者:  Li, Boyao;  Lu, Tao;  Li, Jiayi;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(3303Kb)  |  收藏  |  浏览/下载:239/75  |  提交时间:2020/08/27
A Soft Graph Attention Reinforcement Learning for Multi-Agent Cooperation 会议论文
, 线上, 2020-8
作者:  Huimu Wang;  Zhiqiang Pu;  Zhen Liu;  Jianqiang Yi;  Tenghai Qiu
Adobe PDF(815Kb)  |  收藏  |  浏览/下载:195/43  |  提交时间:2021/06/24