CASIA OpenIR

Browse/Search Results: 6 items in total, showing items 1-6

Multiexperience-Assisted Efficient Multiagent Reinforcement Learning (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2023, Pages: 1-15
Authors:  Zhang TL(张天乐);  Liu Z(刘振);  Yi JQ(易建强);  Wu SG(吴士广);  Pu ZQ(蒲志强);  Zhao YJ(赵彦杰)
Adobe PDF(2718Kb)  |  Views/Downloads: 231/85  |  Submitted: 2023/06/02
Dynamic-horizon model-based value estimation with latent imagination (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2022, Pages: 1-14
Authors:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  Views/Downloads: 141/55  |  Submitted: 2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  
Attention enhanced reinforcement learning for multi-agent cooperation (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2022, Issue: 2022, Pages: 1-15
Authors:  Zhiqiang Pu;  Huimu Wang;  Zhen Liu;  Jianqiang Yi;  Shiguang Wu
Adobe PDF(2967Kb)  |  Views/Downloads: 203/37  |  Submitted: 2022/04/02
Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi agent systems  
Formation control with collision avoidance through deep reinforcement learning using model-guided demonstration (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2021, Volume: 32, Issue: 6, Pages: 2358-2372
Authors:  Zezhi Sui;  Zhiqiang Pu;  Jianqiang Yi;  Shiguang Wu
Adobe PDF(5344Kb)  |  Views/Downloads: 216/72  |  Submitted: 2022/04/02
Collision avoidance  deep reinforcement learning (DRL)  formation control  leader–follower  
Unsupervised Network Quantization via Fixed-Point Factorization (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2020, Issue: 1, Pages: 1
Authors:  Wang, Peisong;  He, Xiangyu;  Chen, Qiang;  Cheng, Anda;  Liu, Qingshan;  Cheng, Jian
Adobe PDF(1998Kb)  |  Views/Downloads: 195/48  |  Submitted: 2020/10/20
Acceleration, compression, deep neural networks (DNNs), fixed-point quantization, unsupervised quantization  
Neural network based online simultaneous policy update algorithm for solving the HJI equation in nonlinear H∞ control (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2012, Volume: 23, Issue: 12, Pages: 1884-1895
Authors:  Wu, Huai-Ning;  Luo, Biao
Adobe PDF(530Kb)  |  Views/Downloads: 167/47  |  Submitted: 2016/04/08
Simultaneous Policy Update Algorithm