验证码:

换一张

忘记密码？记住我

取消登录

切换中国科技网通行证登录

切换中国科技网通行证登录

取消

中文版 | English

中国科学院自动化研究所机构知识库

Knowledge Commons of Institute of Automation，CAS

登录注册

图片搜索

粘贴图片网址

首页
研究单元&专题
作者
文献类型
知识图谱
新闻&公告

在结果中检索

研究单元&专题

学术期刊 [14]

作者

文献类型

期刊论文 [14]

发表日期

2024 [14]

语种

出处

IEEE/CAA ... [10]

Machine In... [3]

自动化学报 [1]

资助项目

收录类别

导师

资助机构

知识图谱

CASIA OpenIR

已提交作品

待认领作品

已认领作品

未提交全文

浏览/检索结果: 共14条，第1-10条

帮助

限定条件	发表日期：2024 文献类型：期刊论文

已选(0)清除条数/页：排序方式：
	An Empirical Study on Google Research Football Multi-agent Scenarios 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 549-570 作者: Yan Song; He Jiang; Zheng Tian; Haifeng Zhang; Yingping Zhang; Jiangcheng Zhu; Zonghong Dai; Weinan Zhang; Jun Wang Adobe PDF(24588Kb) \| 收藏 \| 浏览/下载：8/4 \| 提交时间：2024/05/23 Multi-agent reinforcement learning (RL), distributed RL system, population-based training, reward shaping, game theory
	Distributed Deep Reinforcement Learning: A Survey and a Multi-player Multi-agent Learning Toolbox 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 411-430 作者: Qiyue Yin; Tongtong Yu; Shengqi Shen; Jun Yang; Meijing Zhao; Wancheng Ni; Kaiqi Huang; Bin Liang; Liang Wang Adobe PDF(2923Kb) \| 收藏 \| 浏览/下载：5/4 \| 提交时间：2024/05/23 Deep reinforcement learning, distributed machine learning, self-play, population-play, toolbox
	Attention Markets of Blockchain-based Decentralized Autonomous Organizations 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 6, 页码: 1370-1380 作者: Juanjuan Li; Rui Qin; Sangtian Guan; Wenwen Ding; Fei Lin; Fei-Yue Wang Adobe PDF(1878Kb) \| 收藏 \| 浏览/下载：5/1 \| 提交时间：2024/05/22 Attention decentralized autonomous organizations Harberger tax Stackelberg game
	Enhancing Multi-agent Coordination via Dual-channel Consensus 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 349-368 作者: Qingyang Zhang; Kaishen Wang; Jingqing Ruan; Yiming Yang; Dengpeng Xing; Bo Xu Adobe PDF(4997Kb) \| 收藏 \| 浏览/下载：18/7 \| 提交时间：2024/04/23 Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency
	基于优先采样模型的离线强化学习期刊论文自动化学报, 2024, 卷号: 50, 期号: 1, 页码: 143-153 作者: 顾扬; 程玉虎; 王雪松 Adobe PDF(2677Kb) \| 收藏 \| 浏览/下载：66/17 \| 提交时间：2024/04/12 离线强化学习优先采样模型时序差分误差鞅批约束深度Q学习
	Computational Experiments for Complex Social Systems: Experiment Design and Generative Explanation 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 4, 页码: 1022-1038 作者: Xiao Xue; Deyu Zhou; Xiangning Yu; Gang Wang; Juanjuan Li; Xia Xie; Lizhen Cui; Fei-Yue Wang Adobe PDF(7239Kb) \| 收藏 \| 浏览/下载：37/8 \| 提交时间：2024/03/18 Agent-based modeling computational experiments cyber-physical-social systems (CPSS) generative deduction generative experiments meta model
	Value Iteration-Based Cooperative Adaptive Optimal Control for Multi-Player Differential Games With Incomplete Information 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 3, 页码: 690-697 作者: Yun Zhang; Lulu Zhang; Yunze Cai Adobe PDF(6850Kb) \| 收藏 \| 浏览/下载：83/34 \| 提交时间：2024/02/19 Adaptive dynamic programming incomplete information multi-player differential game value iteration
	Adaptive Optimal Output Regulation of Interconnected Singularly Perturbed Systems With Application to Power Systems 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 3, 页码: 595-607 作者: Jianguo Zhao; Chunyu Yang; Weinan Gao; Linna Zhou; Xiaomin Liu Adobe PDF(2409Kb) \| 收藏 \| 浏览/下载：48/22 \| 提交时间：2024/02/19 Adaptive optimal control decentralized control output regulation reinforcement learning (RL) singularly perturbed systems (SPSs)
	Advancements in Humanoid Robots: A Comprehensive Review and Future Prospects 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 2, 页码: 301-328 作者: Yuchuang Tong; Haotian Liu; Zhengtao Zhang Adobe PDF(7587Kb) \| 收藏 \| 浏览/下载：95/17 \| 提交时间：2024/01/23 Future trends and challenges humanoid robots human-robot interaction key technologies potential applications
	Reinforcement Learning in Process Industries: Review and Perspective 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 2, 页码: 283-300 作者: Oguzhan Dogru; Junyao Xie; Om Prakash; Ranjith Chiplunkar; Jansen Soesanto; Hongtian Chen; Kirubakaran Velswamy; Fadi Ibrahim; Biao Huang Adobe PDF(1275Kb) \| 收藏 \| 浏览/下载：44/15 \| 提交时间：2024/01/23 Process control process systems engineering reinforcement learning

首页
研究单元产出分布图
收录类型分布图
论文引用排行
作者
文献类型
学科分类
关于网站
使用帮助
联系我们

条目量24606
全文量12459
访问量5153705
下载量794298

版权所有 @2018 - 2024 中国科学院自动化研究所 - Powered by CSpace

地址邮编: 北京市海淀区中关村东路95号（100190）
电话: 010－82544495