CASIA OpenIR

浏览/检索结果: 共246条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning 期刊
创刊日期: 2018,
主办者:  Liu BY(刘博寅)
Adobe PDF(5797Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/07/12
Learning to Play Football from Sports Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework 期刊论文
IEEE Transactions on Games, 2022, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/07/12
QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:16/2  |  提交时间:2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:30/12  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/06/25
强化学习,分层强化学习  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:19/9  |  提交时间:2024/06/25
Hitch-Hiking Motion of Multiple Bionic Robotic Remoras with Enhanced Multimodal Locomotion 期刊论文
IEEE-ASME Transactions on Mechatronics, 2024, 页码: 1-11
作者:  Wu, Zhengxing;  Yu, Lianyi;  Wang, Jian;  Dai, Shijie;  Tan, Min;  Yu, Junzhi
Adobe PDF(4893Kb)  |  收藏  |  浏览/下载:54/30  |  提交时间:2024/06/24
BioDrone: A Bionic Drone-Based Single Object Tracking Benchmark for Robust Vision 期刊论文
International Journal of Computer Vision, 2024, 卷号: 132, 页码: 1659-1684
作者:  Xin Zhao;  Shiyu Hu;  Yipei Wang;  Zhang Jing;  Yimin Hu;  Rongshuai Liu;  Haibin Ling;  Yin Li;  Renshu Li;  Kun Liu;  Jiadong Li
Adobe PDF(9076Kb)  |  收藏  |  浏览/下载:27/7  |  提交时间:2024/06/21
A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance 会议论文
, online, 2022-2
作者:  Xu YF(徐一凡);  Pu ZQ(蒲志强);  Wu SG(吴士广);  Liu BY(刘博寅);  Yi JQ(易建强);  Geng HJ(耿虎军);  Chai XH(柴兴华)
Adobe PDF(9582Kb)  |  收藏  |  浏览/下载:20/5  |  提交时间:2024/06/21