CASIA OpenIR

Browse/search results: 10 items in total, showing items 1-10

MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning [Conference paper]
Shenzhen, China, 18-22 July 2021
Authors: Zhiwei Xu; Dapeng Li; Yunpeng Bai; Guoliang Fan
Adobe PDF (3892 Kb) | Views/Downloads: 7/2 | Submitted: 2024/05/28
Hierarchical Cooperative Swarm Policy Learning with Role Emergence [Conference paper]
Online, 05-07 December 2021
Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Qiu TH(丘腾海); Yi JQ(易建强)
Adobe PDF (327 Kb) | Views/Downloads: 131/57 | Submitted: 2023/06/12
Semantic Perception Swarm Policy with Deep Reinforcement Learning [Conference paper]
Online, 05 December 2021
Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强)
Adobe PDF (523 Kb) | Views/Downloads: 106/43 | Submitted: 2023/06/12
Multi-agent Collaborative Learning with Relational Graph Reasoning in Adversarial Environments [Conference paper]
Online conference, 2021-9
Authors: Wu Shiguang; Qiu Tenghai; Pu Zhiqiang; Yi Jianqiang
Adobe PDF (1396 Kb) | Views/Downloads: 238/69 | Submitted: 2022/06/16
Formation control with collision avoidance through deep reinforcement learning using model-guided demonstration [Journal article]
IEEE Transactions on Neural Networks and Learning Systems, 2021, Volume: 32, Issue: 6, Pages: 2358-2372
Authors: Zezhi Sui; Zhiqiang Pu; Jianqiang Yi; Shiguang Wu
Adobe PDF (5344 Kb) | Views/Downloads: 234/76 | Submitted: 2022/04/02
Collision avoidance  deep reinforcement learning (DRL)  formation control  leader–follower  
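Research on Motion Control and Autonomous Docking and Charging of a Bionic Gliding Robotic Whale Shark (仿生滑翔机器鲸鲨的运动控制与自主对接充电研究) [Thesis]
Beijing: University of Chinese Academy of Sciences, 2021
Authors: Dong Huijie (董会杰)
Adobe PDF (7686 Kb) | Views/Downloads: 289/15 | Submitted: 2021/12/31
bionic gliding robotic whale shark  gliding efficiency optimization  gliding motion control  autonomous docking and charging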
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation [Journal article]
Algorithms, 2021, Issue: 14, Pages: 98
Authors: Huimu Wang; Zhen Liu; Jianqiang Yi; Zhiqiang Pu
Adobe PDF (1155 Kb) | Views/Downloads: 242/49 | Submitted: 2021/06/24
multiagent system  deep reinforcement learning  variational autoencoder  attention mechanism  
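Research on Imperfect-Information Game Algorithms Based on Evolutionary Learning and Opponent Strategies (基于演化学习与对手策略的不完美信息博弈算法研究) [Thesis]
Institute of Automation, Chinese Academy of Sciences: Institute of Automation, Chinese Academy of Sciences, 2021
Authors: Zhang Meng (张蒙)
Adobe PDF (2515 Kb) | Views/Downloads: 366/9 | Submitted: 2021/06/20
imperfect-information games  Texas Hold'em  evolutionary learning  online opponent modeling  population strategy ensemble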
Real-time path planning and following of a gliding robotic dolphin within a hierarchical framework [Journal article]
IEEE Transactions on Vehicular Technology, 2021, Volume: 70, Issue: 4, Pages: 3243-3255
Authors: Wang, Jian(王健); Wu, Zhengxing; Yan, Shuaizheng; Tan, Min; Yu, Junzhi
Adobe PDF (3837 Kb) | Views/Downloads: 237/51 | Submitted: 2021/06/04
Adaptive backstepping  hierarchical deep q-network  path following  path planning  underwater robot  
Object Reconstruction Based on Attentive Recurrent Network from Single and Multiple Images [Journal article]
NEURAL PROCESSING LETTERS, 2021, Issue: 53, Pages: 18
Authors: Gao, Zishu; Li, En; Wang, Zhe; Yang, Guodong; Lu, Jiwu; Ouyang, Bo; Xu, Dawei; Liang, Zize
Adobe PDF (1338 Kb) | Views/Downloads: 273/56 | Submitted: 2021/03/01
Object reconstruction  Convolutional LSTM  Visual attention  Robotic application