CASIA OpenIR

Browse/Search results: 39 items in total, showing items 1-10

Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction (Conference paper)
, Greece, 2023-5
Authors: Xu YF(徐一凡); Pu ZQ(蒲志强); Cai QA(蔡奇昂); Li FM(李非墨); Chai XH(柴兴华)
Adobe PDF(7610Kb)  |  Views/Downloads: 33/13  |  Submitted: 2024/06/21
Learning Robust Communication by Adversarial Training in Networked System Control (Journal article)
Lecture Notes in Electrical Engineering, 2024, Pages: Chapter 52 978-981-97-3335-4
Authors: Runji, Lin; Haifeng, Zhang
Adobe PDF(8334Kb)  |  Views/Downloads: 58/22  |  Submitted: 2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Multi-objective Deep Reinforcement Learning for Mobile Edge Computing (Conference paper)
, Singapore, 2023/8/24-27
Authors: Yang,Ning; Wen,Junrui; Zhang,Meng; Tang,Ming
Adobe PDF(499Kb)  |  Views/Downloads: 62/21  |  Submitted: 2024/06/05
mobile edge computing  multi-objective reinforcement learning  resource scheduling  
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process (Conference paper)
, Singapore, 2023-12
Authors: Zhao Yang; Yuanzhe Zhang; Dianbo Sui; Cao Liu; Jun Zhao; Kang Liu
Adobe PDF(592Kb)  |  Views/Downloads: 62/27  |  Submitted: 2024/05/30
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning (Conference paper)
, New Orleans, LA, USA, November 28 - December 9, 2022
Authors: Zhiwei Xu; Dapeng Li; Bin Zhang; Yuan Zhan; Yunpeng Bai; Guoliang Fan
Adobe PDF(4367Kb)  |  Views/Downloads: 44/9  |  Submitted: 2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning (Conference paper)
, Auckland, New Zealand, May 9-13, 2022
Authors: Zhiwei Xu; Yunpeng Bai; Dapeng Li; Bin Zhang; Guoliang Fan
Adobe PDF(2965Kb)  |  Views/Downloads: 45/10  |  Submitted: 2024/05/28
UC-OWOD: Unknown-Classified Open World Object Detection (Conference paper)
, Tel Aviv, Israel, 2022-10
Authors: Zhiheng Wu; Yue Lu; Xingyu Chen; Zhengxing Wu; Liwen Kang; Junzhi Yu
Adobe PDF(2702Kb)  |  Views/Downloads: 140/27  |  Submitted: 2023/06/29
Potential Driven Reinforcement Learning for Hard Exploration Tasks (Conference paper)
, Online, 2020-4
Authors: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  Views/Downloads: 131/50  |  Submitted: 2023/06/29
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning (Conference paper)
, Online, 2022-02-22
Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  Views/Downloads: 221/82  |  Submitted: 2023/06/29
MiaoSuan Wargame: A Multi-Mode Integrated Platform for Imperfect Information Game (Conference paper)
, Beijing, China, August 21-24, 2022
Authors: Jiale Xu; Jian Hu; Shixian Wang; Xuyang Yang; Wancheng Ni
Adobe PDF(726Kb)  |  Views/Downloads: 133/40  |  Submitted: 2023/06/28
open platform  human-computer gaming  AI evaluation  Turing test  imperfect information game  wargame