CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:248/26  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
作者:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:287/51  |  提交时间:2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Unsupervised Video Summarization via Relation-Aware Assignment Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3203-3214
作者:  Gao, Junyu;  Yang, Xiaoshan;  Zhang, Yingying;  Xu, Changsheng
Adobe PDF(3649Kb)  |  收藏  |  浏览/下载:318/62  |  提交时间:2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 10, 页码: 6614-6623
作者:  Wei, Qinglai;  Liao, Zehua;  Shi, Guang
Adobe PDF(1229Kb)  |  收藏  |  浏览/下载:261/33  |  提交时间:2021/11/02
Optimal control  Process control  Smart homes  Dynamic programming  Numerical models  Iterative methods  Informatics  Actor-critic learning  adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  energy management  optimal control  smart grid  
Learning Control for Air Conditioning Systems via Human Expressions 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 卷号: 68, 期号: 8, 页码: 7662-7671
作者:  Wei, Qinglai;  Li, Tao;  Liu, Derong
Adobe PDF(1354Kb)  |  收藏  |  浏览/下载:222/1  |  提交时间:2021/06/15
Adaptive dynamic programming  air conditioning control  deep learning (DL)  deep Q-network (DQN)  human expressions  optimal control  reinforcement learning (RL)  Q-learning  
Neural network-based model predictive tracking control of an uncertain robotic manipulator with input constraints 期刊论文
ISA TRANSACTIONS, 2021, 卷号: 109, 页码: 89-101
作者:  Kang, Erlong;  Qiao, Hong;  Gao, Jie;  Yang, Wenjing
Adobe PDF(942Kb)  |  收藏  |  浏览/下载:360/69  |  提交时间:2021/03/29
Model predictive control  Neural network  Robotic manipulator  Unknown dynamics  Online learning estimation  Input constraints  
Consensus Control of Leader-Following Multi-Agent Systems in Directed Topology With Heterogeneous Disturbances 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2021, 卷号: 8, 期号: 2, 页码: 423-431
作者:  Wei, Qinglai;  Wang, Xin;  Zhong, Xiangnan;  Wu, Naiqi
Adobe PDF(4423Kb)  |  收藏  |  浏览/下载:299/45  |  提交时间:2021/03/08
Consensus control  directed topology  external disturbance  multi-agent (MA) systems