CASIA OpenIR

Browse/Search results: 14 items in total, showing items 1-10

An Empirical Study on Google Research Football Multi-agent Scenarios (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 549-570
Authors: Yan Song; He Jiang; Zheng Tian; Haifeng Zhang; Yingping Zhang; Jiangcheng Zhu; Zonghong Dai; Weinan Zhang; Jun Wang
Adobe PDF(24588Kb) | Views/Downloads: 8/4 | Submitted: 2024/05/23
Multi-agent reinforcement learning (RL), distributed RL system, population-based training, reward shaping, game theory
Distributed Deep Reinforcement Learning: A Survey and a Multi-player Multi-agent Learning Toolbox (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 411-430
Authors: Qiyue Yin; Tongtong Yu; Shengqi Shen; Jun Yang; Meijing Zhao; Wancheng Ni; Kaiqi Huang; Bin Liang; Liang Wang
Adobe PDF(2923Kb) | Views/Downloads: 5/4 | Submitted: 2024/05/23
Deep reinforcement learning, distributed machine learning, self-play, population-play, toolbox
Attention Markets of Blockchain-based Decentralized Autonomous Organizations (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 6, Pages: 1370-1380
Authors: Juanjuan Li; Rui Qin; Sangtian Guan; Wenwen Ding; Fei Lin; Fei-Yue Wang
Adobe PDF(1878Kb) | Views/Downloads: 5/1 | Submitted: 2024/05/22
Attention, decentralized autonomous organizations, Harberger tax, Stackelberg game
Enhancing Multi-agent Coordination via Dual-channel Consensus (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 2, Pages: 349-368
Authors: Qingyang Zhang; Kaishen Wang; Jingqing Ruan; Yiming Yang; Dengpeng Xing; Bo Xu
Adobe PDF(4997Kb) | Views/Downloads: 18/7 | Submitted: 2024/04/23
Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency
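Offline Reinforcement Learning Based on a Prioritized Sampling Model (基于优先采样模型的离线强化学习) (Journal article)
Acta Automatica Sinica (自动化学报), 2024, Volume: 50, Issue: 1, Pages: 143-153
Authors: 顾扬 (Gu Yang); 程玉虎 (Cheng Yuhu); 王雪松 (Wang Xuesong)
Adobe PDF(2677Kb) | Views/Downloads: 66/17 | Submitted: 2024/04/12
Offline reinforcement learning, prioritized sampling model, temporal-difference error, batch-constrained deep Q-learning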
Computational Experiments for Complex Social Systems: Experiment Design and Generative Explanation (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 4, Pages: 1022-1038
Authors: Xiao Xue; Deyu Zhou; Xiangning Yu; Gang Wang; Juanjuan Li; Xia Xie; Lizhen Cui; Fei-Yue Wang
Adobe PDF(7239Kb) | Views/Downloads: 37/8 | Submitted: 2024/03/18
Agent-based modeling, computational experiments, cyber-physical-social systems (CPSS), generative deduction, generative experiments, meta model
Value Iteration-Based Cooperative Adaptive Optimal Control for Multi-Player Differential Games With Incomplete Information (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 3, Pages: 690-697
Authors: Yun Zhang; Lulu Zhang; Yunze Cai
Adobe PDF(6850Kb) | Views/Downloads: 83/34 | Submitted: 2024/02/19
Adaptive dynamic programming, incomplete information, multi-player differential game, value iteration
Adaptive Optimal Output Regulation of Interconnected Singularly Perturbed Systems With Application to Power Systems (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 3, Pages: 595-607
Authors: Jianguo Zhao; Chunyu Yang; Weinan Gao; Linna Zhou; Xiaomin Liu
Adobe PDF(2409Kb) | Views/Downloads: 48/22 | Submitted: 2024/02/19
Adaptive optimal control, decentralized control, output regulation, reinforcement learning (RL), singularly perturbed systems (SPSs)
Advancements in Humanoid Robots: A Comprehensive Review and Future Prospects (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 2, Pages: 301-328
Authors: Yuchuang Tong; Haotian Liu; Zhengtao Zhang
Adobe PDF(7587Kb) | Views/Downloads: 95/17 | Submitted: 2024/01/23
Future trends and challenges, humanoid robots, human-robot interaction, key technologies, potential applications
Reinforcement Learning in Process Industries: Review and Perspective (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 2, Pages: 283-300
Authors: Oguzhan Dogru; Junyao Xie; Om Prakash; Ranjith Chiplunkar; Jansen Soesanto; Hongtian Chen; Kirubakaran Velswamy; Fadi Ibrahim; Biao Huang
Adobe PDF(1275Kb) | Views/Downloads: 44/15 | Submitted: 2024/01/23
Process control, process systems engineering, reinforcement learning