Centralized Cooperative Exploration Policy for Continuous Control Tasks
Chao Li1; Chen Gong1; Qiang He2; Xinwen Hou1; Yu Liu1
2023-05
会议名称the 2023 International Conference on Autonomous Agents and Multiagent Systems
会议录名称Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems
页码2454–2456
会议日期May 29–June 2, 2023
会议地点London, United Kingdom
出版地Richland, SC
出版者International Foundation for Autonomous Agents and Multiagent Systems
摘要

Despite recent works making great progress in continuous control tasks, exploration in these tasks has remained insufficiently investigated. This paper proposes CCEP (C entralized C ooperative E xploration P olicy), which utilizes estimation biases of value functions to contribute to the exploration capacity. CCEP keeps two value functions initialized with different parameters, and generates diverse policies with multiple exploration styles from a pair of value functions. In addition, a centralized policy framework ensures that CCEP achieves message delivery between multiple policies, furthermore contributing to exploring the environment cooperatively. Extensive experimental results demonstrate that CCEP achieves higher exploration capacity. Empirical analysis shows diverse exploration styles in the learned policies by CCEP, reaping benefits in more exploration regions. Besides, the exploration capabilities of CCEP have been demonstrated to outperform current state-of-the-art methods on multiple continuous control tasks.

关键词continuous control tasks cooperative exploration
DOI10.5555/3545946.3598965
收录类别EI
语种英语
是否为代表性论文
七大方向——子方向分类强化与进化学习
国重实验室规划方向分类认知决策知识体系
是否有论文关联数据集需要存交
引用统计
文献类型会议论文
条目标识符http://ir.ia.ac.cn/handle/173211/56696
专题多模态人工智能系统全国重点实验室_机器人理论与应用
通讯作者Xinwen Hou; Yu Liu
作者单位1.Institute of Automation, Chinese Academy of Sciences, Beijing, China
2.University of Tubingen, Tubingen, Germany
第一作者单位中国科学院自动化研究所
通讯作者单位中国科学院自动化研究所
推荐引用方式
GB/T 7714
Chao Li,Chen Gong,Qiang He,et al. Centralized Cooperative Exploration Policy for Continuous Control Tasks[C]. Richland, SC:International Foundation for Autonomous Agents and Multiagent Systems,2023:2454–2456.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
p2454.pdf(2175KB)会议论文 开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Chao Li]的文章
[Chen Gong]的文章
[Qiang He]的文章
百度学术
百度学术中相似的文章
[Chao Li]的文章
[Chen Gong]的文章
[Qiang He]的文章
必应学术
必应学术中相似的文章
[Chao Li]的文章
[Chen Gong]的文章
[Qiang He]的文章
相关权益政策
暂无数据
收藏/分享
文件名: p2454.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。