CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:248/26  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Learning to Assemble Noncylindrical Parts Using Trajectory Learning and Force Tracking 期刊论文
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2021, 页码: 12
作者:  Su, Jianhua;  Meng, Yan;  Wang, Lili;  Yang, Xu
Adobe PDF(4865Kb)  |  收藏  |  浏览/下载:327/54  |  提交时间:2022/01/27
Force  Trajectory  Robots  Task analysis  Hidden Markov models  Impedance  Training  Assembly skill  impedance control  learning from demonstration  movement primitives (MPs)  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
作者:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:287/51  |  提交时间:2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Multi-aspect self-supervised learning for heterogeneous information network 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2021, 卷号: 233, 页码: 14
作者:  Che, Feihu;  Tao, Jianhua;  Yang, Guohua;  Liu, Tong;  Zhang, Dawei
Adobe PDF(2661Kb)  |  收藏  |  浏览/下载:228/45  |  提交时间:2021/12/28
Heterogeneous information network  Self-supervised  Contrastive learning  Graph neural network  
Unsupervised Video Summarization via Relation-Aware Assignment Learning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3203-3214
作者:  Gao, Junyu;  Yang, Xiaoshan;  Zhang, Yingying;  Xu, Changsheng
Adobe PDF(3649Kb)  |  收藏  |  浏览/下载:318/62  |  提交时间:2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 10, 页码: 6614-6623
作者:  Wei, Qinglai;  Liao, Zehua;  Shi, Guang
Adobe PDF(1229Kb)  |  收藏  |  浏览/下载:261/33  |  提交时间:2021/11/02
Optimal control  Process control  Smart homes  Dynamic programming  Numerical models  Iterative methods  Informatics  Actor-critic learning  adaptive critic designs  adaptive dynamic programming (ADP)  approximate dynamic programming  energy management  optimal control  smart grid  
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation 期刊论文
Algorithms, 2021, 期号: 14, 页码: 98
作者:  Huimu Wang;  Zhen Liu;  Jianqiang Yi;  Zhiqiang Pu
Adobe PDF(1155Kb)  |  收藏  |  浏览/下载:242/49  |  提交时间:2021/06/24
multiagent system  deep reinforcement learning  variational autoencoder  attention mechanism  
Learning Control for Air Conditioning Systems via Human Expressions 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 卷号: 68, 期号: 8, 页码: 7662-7671
作者:  Wei, Qinglai;  Li, Tao;  Liu, Derong
Adobe PDF(1354Kb)  |  收藏  |  浏览/下载:222/1  |  提交时间:2021/06/15
Adaptive dynamic programming  air conditioning control  deep learning (DL)  deep Q-network (DQN)  human expressions  optimal control  reinforcement learning (RL)  Q-learning  
Continuous-Time Distributed Policy Iteration for Multicontroller Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 卷号: 51, 期号: 5, 页码: 2372-2383
作者:  Wei, Qinglai;  Li, Hongyang;  Yang, Xiong;  He, Haibo
Adobe PDF(1246Kb)  |  收藏  |  浏览/下载:232/40  |  提交时间:2021/06/07
Optimal control  Nonlinear systems  Decentralized control  Mathematical model  Convergence  Multi-agent systems  Adaptive dynamic programming (ADP)  approximate dynamic programming  distributed policy iteration  nonlinear systems  optimal control  
Neural network-based model predictive tracking control of an uncertain robotic manipulator with input constraints 期刊论文
ISA TRANSACTIONS, 2021, 卷号: 109, 页码: 89-101
作者:  Kang, Erlong;  Qiao, Hong;  Gao, Jie;  Yang, Wenjing
Adobe PDF(942Kb)  |  收藏  |  浏览/下载:360/69  |  提交时间:2021/03/29
Model predictive control  Neural network  Robotic manipulator  Unknown dynamics  Online learning estimation  Input constraints