CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

已选(0)清除 条数/页:   排序方式:
Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 782-800
作者:  Jingqing Ruan;   Kaishen Wang;   Qingyang Zhang;   Dengpeng Xing;   Bo Xu
Adobe PDF(4577Kb)  |  收藏  |  浏览/下载:14/6  |  提交时间:2024/07/18
Reinforcement learning  representation learning  subtask planning  task decomposition  pretraining.  
DRL-Based Adaptive Sharding for Blockchain-Based Federated Learning 期刊论文
IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 卷号: 71, 期号: 10, 页码: 5992-6004
作者:  Lin, Yijing;  Gao, Zhipeng;  Du, Hongyang;  Kang, Jiawen;  Niyato, Dusit;  Wang, Qian;  Ruan, Jingqing;  Wan, Shaohua
收藏  |  浏览/下载:4/0  |  提交时间:2024/07/03
Blockchain sharding  federated learning  reputation  deep reinforcement learning  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:37/14  |  提交时间:2024/06/25
强化学习,分层强化学习  
Learning in bi-level markov games 会议论文
, Padua, Italy, 2022.7.18-2022.7.23
作者:  Meng Linghui;  Ruan Jingqing;  Xing Dengpeng;  Xu Bo
Adobe PDF(1450Kb)  |  收藏  |  浏览/下载:46/19  |  提交时间:2024/06/11
M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文
, London, United Kingdom, 2023.5.29-2023.6.2
作者:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:37/11  |  提交时间:2024/06/11
An End-to-End Deep Reinforcement Learning Based Modular Task Allocation Framework for Autonomous Mobile Systems 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 页码: 15
作者:  Ma, Song;  Ruan, Jingqing;  Du, Yali;  Bucknall, Richard;  Liu, Yuanchang
收藏  |  浏览/下载:28/0  |  提交时间:2024/05/30
Deep reinforcement learning  task allocation  multi-agent planning  field robotics  
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文
, 长沙, 2023-11
作者:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  收藏  |  浏览/下载:42/24  |  提交时间:2024/05/28
Enhancing Multi-agent Coordination via Dual-channel Consensus 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 349-368
作者:  Qingyang Zhang;  Kaishen Wang;  Jingqing Ruan;  Yiming Yang;  Dengpeng Xing;  Bo Xu
Adobe PDF(4997Kb)  |  收藏  |  浏览/下载:77/27  |  提交时间:2024/04/23
Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency  
Mixture of personality improved spiking actor network for efficient multi-agent cooperation 期刊论文
FRONTIERS IN NEUROSCIENCE, 2023, 卷号: 17, 页码: 14
作者:  Li, Xiyun;  Ni, Ziyi;  Ruan, Jingqing;  Meng, Linghui;  Shi, Jing;  Zhang, Tielin;  Xu, Bo
收藏  |  浏览/下载:100/0  |  提交时间:2023/11/17
multi-agent cooperation  personality theory  spiking actor networks  multi-agent reinforcement learning  theory of mind