CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Formation control with collision avoidance through deep reinforcement learning using model-guided demonstration 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2021, 卷号: 32, 期号: 6, 页码: 2358-2372
作者:  Zezhi Sui;  Zhiqiang Pu;  Jianqiang Yi;  Shiguang Wu
Adobe PDF(5344Kb)  |  收藏  |  浏览/下载:248/81  |  提交时间:2022/04/02
Collision avoidance  deep reinforcement learning (DRL)  formation control  leader–follower  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:233/5  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
作者:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:301/55  |  提交时间:2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Dynamic camera configuration learning for high-confidence active object detection 期刊论文
NEUROCOMPUTING, 2021, 卷号: 466, 页码: 113-127
作者:  Xu, Nuo;  Huo, Chunlei;  Zhang, Xin;  Cao, Yong;  Meng, Gaofeng;  Pan, Chunhong
Adobe PDF(4412Kb)  |  收藏  |  浏览/下载:310/59  |  提交时间:2021/12/28
Object detection  Active object detection  Deep reinforcement learning  Camera control  
HackRL: Reinforcement learning with hierarchical attention for cross-graph knowledge fusion and collaborative reasoning 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2021, 卷号: 233, 页码: 14
作者:  Yang, Linyao;  Wang, Xiao;  Dai, Yuxin;  Xin, Kejun;  Zheng, Xiaolong;  Ding, Weiping;  Zhang, Jun;  Wang, Fei-Yue
Adobe PDF(1159Kb)  |  收藏  |  浏览/下载:387/135  |  提交时间:2021/12/28
Knowledge fusion  Knowledge reasoning  Decision-making  Hierarchical graph attention  Reinforcement learning  
Underwater Target Tracking Control of an Untethered Robotic Fish With a Camera Stabilizer 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 卷号: 51, 期号: 10, 页码: 6523-6534
作者:  Yu, Junzhi;  Wu, Zhengxing;  Yang, Xiang;  Yang, Yueqi;  Zhang, Pengfei
收藏  |  浏览/下载:224/0  |  提交时间:2021/11/04
Cameras  Robot vision systems  Target tracking  Oscillators  Sports  Active visual tracking  camera stabilizer  reinforcement learning (RL)  robotic fish  underwater target tracking  
Hierarchical Reinforcement Learning With Automatic Sub-Goal Identification 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 10, 页码: 1686-1696
作者:  Chenghao Liu;  Fei Zhu;  Quan Liu;  Yuchen Fu
Adobe PDF(5095Kb)  |  收藏  |  浏览/下载:127/46  |  提交时间:2021/09/03
Hierarchical control  hierarchical reinforcement learning  option  sparse reward  sub-goal  
A Novel Heterogeneous Actor-critic Algorithm with Recent Emphasizing Replay Memory 期刊论文
International Journal of Automation and Computing, 2021, 卷号: 18, 期号: 4, 页码: 619-631
作者:  Bao Xi;  Rui Wang;  Ying-Hao Cai;  TaoLu;  Shuo Wang
Adobe PDF(2505Kb)  |  收藏  |  浏览/下载:187/55  |  提交时间:2021/07/20
Reinforcement learning (RL)  actor-critic  experience replay  training efficiency  manipulation skill learning  
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation 期刊论文
Algorithms, 2021, 期号: 14, 页码: 98
作者:  Huimu Wang;  Zhen Liu;  Jianqiang Yi;  Zhiqiang Pu
Adobe PDF(1155Kb)  |  收藏  |  浏览/下载:250/50  |  提交时间:2021/06/24
multiagent system  deep reinforcement learning  variational autoencoder  attention mechanism  
Real-time path planning and following of a gliding robotic dolphin within a hierarchical framework 期刊论文
IEEE Transactions on Vehicular Technology, 2021, 卷号: 70, 期号: 4, 页码: 3243-3255
作者:  Wang, Jian(王健);  Wu, Zhengxing;  Yan, Shuaizheng;  Tan, Min;  Yu, Junzhi
Adobe PDF(3837Kb)  |  收藏  |  浏览/下载:248/51  |  提交时间:2021/06/04
Adaptive backstepping  hierarchical deep q-network  path following  path planning  underwater robot