CASIA OpenIR

浏览/检索结果: 共102条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:35/14  |  提交时间:2024/06/25
强化学习,分层强化学习  
Hitch-Hiking Motion of Multiple Bionic Robotic Remoras with Enhanced Multimodal Locomotion 期刊论文
IEEE-ASME Transactions on Mechatronics, 2024, 页码: 1-11
作者:  Wu, Zhengxing;  Yu, Lianyi;  Wang, Jian;  Dai, Shijie;  Tan, Min;  Yu, Junzhi
Adobe PDF(4893Kb)  |  收藏  |  浏览/下载:63/32  |  提交时间:2024/06/24
Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks 会议论文
, Honolulu, Hawaii, USA, 2019.01.27 - 2019.02.01
作者:  Shizhu HE;  Kang Liu;  Weiting An
Adobe PDF(1562Kb)  |  收藏  |  浏览/下载:47/18  |  提交时间:2024/06/20
Controller Design and Stability Analysis for Spinning Missile Via Tensor Product 期刊论文
Aerospace Science and Technology, 2022, 页码: 107877
作者:  Zhiming Zhou;  Zhen Liu;  Yi Pan;  Jianqiang Yi
Adobe PDF(1047Kb)  |  收藏  |  浏览/下载:46/17  |  提交时间:2024/06/20
A cooperation and decision-making framework in dynamic confrontation for multi-agent systems 期刊论文
Computers and Electrical Engineering, 2024, 卷号: 118, 页码: 118
作者:  Lexing Wang;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:44/13  |  提交时间:2024/06/06
Multi-agent system  Target allocation  Decision making  Swarm motion control  
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文
, Padua, Italy, 2022年07月
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  收藏  |  浏览/下载:40/12  |  提交时间:2024/06/05
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:40/12  |  提交时间:2024/06/05
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:58/22  |  提交时间:2024/05/29
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:37/7  |  提交时间:2024/05/28
Integrated Tracking Control of an Underwater Bionic Robot Based on Multimodal Motions 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 卷号: 54, 期号: 3, 页码: 1599-1610
作者:  Wang, Jian;  Wu, Zhengxing;  Zhang, Yang;  Kong, Shihan;  Tan, Min;  Yu, Junzhi
Adobe PDF(5090Kb)  |  收藏  |  浏览/下载:97/20  |  提交时间:2024/03/27
Disturbance observer (DOB)  fuzzy system  model predictive control (MPC)  tracking control  underwater bionic robot