CASIA OpenIR

Browse/Search results: 84 items in total, showing items 1-10

Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination (Conference paper)
Japan, 2024-6
Authors:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  Views/Downloads: 14/5  |  Submitted: 2024/06/25
Keywords: Reinforcement Learning, Hierarchical Reinforcement Learning
User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
Singapore, May 13 - 17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 20/8  |  Submitted: 2024/06/25
Keywords: Ads Allocation, Reinforcement Learning, User Response Modeling
FeaCo: Reaching Robust Feature-Level Consensus in Noisy Pose Conditions (Conference paper)
Ottawa, Canada, 2023.10.27-2023.11.2
Authors:  Gu JM(谷佳铭);  Jingyu Zhang;  Zhang MY(张沐阳);  Meng WL(孟维亮);  Xu SB(徐士彪);  Zhang JG(张吉光);  Zhang XP(张晓鹏)
Adobe PDF(5119Kb)  |  Views/Downloads: 24/6  |  Submitted: 2024/06/11
Joint caching and transmission in the mobile edge network: An multi-agent learning approach (Conference paper)
Madrid, Spain, 2021-12-7
Authors:  Mi, Qirui;  Yang, Ning;  Zhang, Haifeng;  Zhang, Haijun;  Wang, Jun
Adobe PDF(1724Kb)  |  Views/Downloads: 34/10  |  Submitted: 2024/06/05
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning (Conference paper)
Washington, DC, USA, February 7-14, 2023
Authors:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  Views/Downloads: 34/12  |  Submitted: 2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism (Conference paper)
Washington, DC, USA, February 7-14, 2023
Authors:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  Views/Downloads: 28/7  |  Submitted: 2024/05/28
Parallel Learning Based Foundation Model for Networked Traffic Signal Control (Conference paper)
Bilbao, Bizkaia, Spain, 2022-9-24
Authors:  Zhao, Chen;  Dai, Xingyuan;  Chen, Yuanyuan;  Yilun, Lin;  Lv, Yisheng;  Wang, Fei-Yue
Adobe PDF(1112Kb)  |  Views/Downloads: 25/10  |  Submitted: 2024/05/28
Learning Transformer-based Cooperation for Networked Traffic Signal Control (Conference paper)
Macau, China, 2022-10
Authors:  Zhao, Chen;  Dai, Xingyuan;  Wang, Xiao;  Li, Lingxi;  Lv, Yisheng;  Wang, Fei-Yue
Adobe PDF(1431Kb)  |  Views/Downloads: 31/10  |  Submitted: 2024/05/28
Second-Order Global Attention Networks for Graph Classification and Regression (Conference paper)
Beijing, China, August 27-28, 2022
Authors:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  Views/Downloads: 218/71  |  Submitted: 2023/07/06
DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning (Conference paper)
Indianapolis, IN, USA, 19-22 September 2021
Authors:  Jinhao Xi;  Fenghua Zhu;  Yuanyuan Chen;  Yisheng Lv;  Chang Tan;  Feiyue Wang
Adobe PDF(1652Kb)  |  Views/Downloads: 132/24  |  Submitted: 2023/06/26