CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

已选(0)清除 条数/页:   排序方式:
DRL-Based Adaptive Sharding for Blockchain-Based Federated Learning 期刊论文
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 卷号: 71, 期号: 10, 页码: 5992-6004
作者:  Lin, Yijing;  Gao, Zhipeng;  Du, Hongyang;  Kang, Jiawen;  Niyato, Dusit;  Wang, Qian;  Ruan, Jingqing;  Wan, Shaohua
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Blockchain sharding  federated learning  reputation  deep reinforcement learning  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:45/17  |  提交时间:2024/06/25
强化学习,分层强化学习  
Learning in bi-level markov games 会议论文
, Padua, Italy, 2022.7.18-2022.7.23
作者:  Meng Linghui;  Ruan Jingqing;  Xing Dengpeng;  Xu Bo
Adobe PDF(1450Kb)  |  收藏  |  浏览/下载:54/23  |  提交时间:2024/06/11
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:42/12  |  提交时间:2024/06/11
An End-to-End Deep Reinforcement Learning Based Modular Task Allocation Framework for Autonomous Mobile Systems 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 页码: 15
作者:  Ma, Song;  Ruan, Jingqing;  Du, Yali;  Bucknall, Richard;  Liu, Yuanchang
收藏  |  浏览/下载:35/0  |  提交时间:2024/05/30
Deep reinforcement learning  task allocation  multi-agent planning  field robotics  
Explainable Reinforcement Learning via a Causal World Model 会议论文
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22
作者:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  收藏  |  浏览/下载:64/29  |  提交时间:2024/05/28
强化学习  可解释人工智能  因果推理  
Learning Causal Dynamics Models in Object-Oriented Environments 会议论文
Proceedings of the 41st International Conference on Machine Learning, 奥地利, 维也纳, 2024-07-21
作者:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(2176Kb)  |  收藏  |  浏览/下载:53/19  |  提交时间:2024/05/28
强化学习  因果模型  
Mixture of personality improved spiking actor network for efficient multi-agent cooperation 期刊论文
FRONTIERS IN NEUROSCIENCE, 2023, 卷号: 17, 页码: 14
作者:  Li, Xiyun;  Ni, Ziyi;  Ruan, Jingqing;  Meng, Linghui;  Shi, Jing;  Zhang, Tielin;  Xu, Bo
收藏  |  浏览/下载:104/0  |  提交时间:2023/11/17
multi-agent cooperation  personality theory  spiking actor networks  multi-agent reinforcement learning  theory of mind