CASIA OpenIR

Browse/Search Results: 1-6 of 6

Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination [Conference paper]
Japan, 2024-6
Authors:  Zhang Qingyang;  Xu Bo
Adobe PDF (8862 Kb) | View/Download: 14/5 | Submit date: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs [Conference paper]
Australia, 2023-6
Authors:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF (7948 Kb) | View/Download: 19/7 | Submit date: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning [Journal article]
Machine Intelligence Research, 2023, Pages: 158
Authors:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF (9639 Kb) | View/Download: 13/6 | Submit date: 2024/06/25
Research on Representation-Enhanced Deep Reinforcement Learning Algorithms (表示增强的深度强化学习算法研究) [Thesis]
2024
Authors:  Zhang Qingyang (张清扬)
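Adobe PDF (37765 Kb) | View/Download: 73/7 | Submit date: 2024/06/04
Deep reinforcement learning, representation learning, hierarchical reinforcement learning, multi-agent reinforcement learning, large language models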
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery [Conference paper]
Changsha, 2023-11
Authors:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF (2044 Kb) | View/Download: 30/17 | Submit date: 2024/05/28
Enhancing Multi-agent Coordination via Dual-channel Consensus [Journal article]
Machine Intelligence Research, 2024, Volume: 21, Issue: 2, Pages: 349-368
Authors:  Qingyang Zhang;  Kaishen Wang;  Jingqing Ruan;  Yiming Yang;  Dengpeng Xing;  Bo Xu
Adobe PDF (4997 Kb) | View/Download: 57/19 | Submit date: 2024/04/23
Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency