CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multi-Agent Cooperation and Competition with Two-Level Ggraph Attention Network 会议论文
, 线上, 2020-11
作者:  Shiguang, Wu;  Zhiqiang, Pu;  Jianqiang, Yi;  Huimu, Wang
Adobe PDF(1185Kb)  |  收藏  |  浏览/下载:158/1  |  提交时间:2021/06/24
STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-Agent Cooperation 会议论文
, 线上, 2020-11
作者:  Huimu Wang;  Zhen Liu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(916Kb)  |  收藏  |  浏览/下载:96/0  |  提交时间:2021/06/24
Approximate Dynamic Programming for Stochastic Resource Allocation Problems 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2020, 卷号: 7, 期号: 4, 页码: 975-990
作者:  Ali Forootani;  Raffaele Iervolino;  Massimo Tipaldi;  Joshua Neilson
浏览  |  Adobe PDF(3558Kb)  |  收藏  |  浏览/下载:129/38  |  提交时间:2021/03/11
Approximate dynamic programming (ADP)  dynamic programming (DP)  Markov decision processes (MDPs)  resource allocation problem  
Parallel Reinforcement Learning-Based Energy Efficiency Improvement for a Cyber-Physical System 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2020, 卷号: 7, 期号: 2, 页码: 617-626
作者:  Teng Liu;  Bin Tian;  Yunfeng Ai;  Fei-Yue Wang
浏览  |  Adobe PDF(5813Kb)  |  收藏  |  浏览/下载:216/61  |  提交时间:2021/03/11
Bidirectional long short-term memory (LSTM) network  cyber-physical system (CPS)  energy management  parallel system  reinforcement learning (RL)  
Optimal Neuro-Control Strategy for Nonlinear Systems With Asymmetric Input Constraints 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2020, 卷号: 7, 期号: 2, 页码: 575-583
作者:  Xiong Yang;  Bo Zhao
浏览  |  Adobe PDF(1346Kb)  |  收藏  |  浏览/下载:143/47  |  提交时间:2021/03/11
Adaptive critic designs (ACDs)  asymmetric input constraint  critic neural network (CNN)  nonlinear systems  optimal control  reinforcement learning (RL)  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:354/81  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
基于自适应动态规划算法的离散动态系统最优控制 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2020
作者:  梁明明
Adobe PDF(8751Kb)  |  收藏  |  浏览/下载:198/2  |  提交时间:2020/10/23
智能控制  神经网络  自适应动态规划  最优控制  离散随机过程  离散非线性系统  
Neuro-optimal control for discrete stochastic processes via a novel policy iteration algorithm 期刊论文
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, 卷号: 50, 期号: 11, 页码: 3972-3985
作者:  Liang, Mingming;  Wang, Ding;  Liu, Derong
浏览  |  Adobe PDF(1604Kb)  |  收藏  |  浏览/下载:208/68  |  提交时间:2020/10/23
Adaptive critic designs  adaptive dynamic programming (ADP)  local policy iteration  neuro-dynamic programming  optimal control  stochastic processes  
Real-world Robot Reaching Skill Learning Based on Deep Reinforcement Learning 会议论文
, Hefei, China, 2020
作者:  Liu, Naijun;  Lu, Tao;  Cai, Yinghao;  Wang, Rui;  Wang, Shuo
浏览  |  Adobe PDF(436Kb)  |  收藏  |  浏览/下载:171/57  |  提交时间:2020/09/27
基于生成式对抗网络的场景文字消除方法研究 学位论文
, 中国科学院自动化研究所: 中国科学院大学, 2020
作者:  边学伟
Adobe PDF(10619Kb)  |  收藏  |  浏览/下载:232/5  |  提交时间:2020/06/11
文字消除  图像修复  图像分割  生成式对抗网络  文字检测