CASIA OpenIR

浏览/检索结果: 共56条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:45/17  |  提交时间:2024/06/25
强化学习,分层强化学习  
Interpreting Sentiment Composition with Latent Semantic Tree 会议论文
, Toronto, Canada, 2023-7-9
作者:  Zhongtao Jiang;  Yuanzhe Zhang;  Cao Liu;  Jiansong Chen;  Jun Zhao;  Kang Liu
Adobe PDF(509Kb)  |  收藏  |  浏览/下载:51/21  |  提交时间:2024/06/06
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:48/14  |  提交时间:2024/06/05
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:69/23  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:66/23  |  提交时间:2024/05/29
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:43/12  |  提交时间:2024/05/28
Performance Optimization for Bionic Robotic Dolphin with Active Variable Stiffness Control 期刊论文
BIOMIMETICS, 2023, 卷号: 8, 期号: 7, 页码: 16
作者:  Chen, Di;  Xiong, Yan;  Wang, Bo;  Tong, Ru;  Meng, Yan;  Yu, Junzhi
收藏  |  浏览/下载:67/0  |  提交时间:2024/02/22
robotic dolphin  torque control  variable stiffness mechanism  performance optimization  
Tight-Space Maneuvering of a Hybrid-Driven Robotic Fish Using Backstepping-Based Adaptive Control 期刊论文
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2023, 页码: 13
作者:  Li, Sijie;  Wu, Zhengxing;  Dai, Shijie;  Wang, Jian;  Tan, Min;  Yu, Junzhi
收藏  |  浏览/下载:109/0  |  提交时间:2024/02/22
Robots  Propellers  Robot kinematics  Robot sensing systems  Propulsion  Mechatronics  Hydrodynamics  3-D path following  backstepping-based adaptive control  depth control  hybrid drive  robotic fish  
3D deployment of UAV-mounted base stations for heterogeneous access requirements 期刊论文
AEROSPACE SCIENCE AND TECHNOLOGY, 2023, 卷号: 143, 页码: 15
作者:  Ai, Xiaolin;  Pu, Zhiqiang;  Chai, Xinghua;  Lei, Jinlin;  Yi, Jianqiang
收藏  |  浏览/下载:73/0  |  提交时间:2024/02/22
Three-dimensional (3D) deployment  Multi-UAV-mounted base stations (BSs)  Non-uniform quality of service (QoS)  requirements  Obstacle avoidance  
Peer Incentive Reinforcement Learning for Cooperative Multiagent Games 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 15, 期号: 4, 页码: 623-636
作者:  Zhang, Tianle;  Liu, Zhen;  Pu, Zhiqiang;  Yi, Jianqiang
收藏  |  浏览/下载:75/0  |  提交时间:2024/02/22
Cooperative multiagent games  intrinsic reward  multiagent reinforcement learning (MARL)  Starcraft II Micromanagement