CASIA OpenIR

浏览/检索结果: 共18条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:23/12  |  提交时间:2024/05/28
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:216/81  |  提交时间:2023/06/29
Optimal defense resource allocation and geographically feasible hexagonal topology construction for power grid security 会议论文
Communications in Computer and Information Science, Hangzhou, China, 2021 22-24 October
作者:  Liu, Yifa;  Cheng, Long
Adobe PDF(756Kb)  |  收藏  |  浏览/下载:160/57  |  提交时间:2023/06/28
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:186/66  |  提交时间:2023/06/28
Filter Bank Adversarial Domain Adaptation For Motor Imagery Brain Computer Interface 会议论文
, Online, 18-22 July 2021
作者:  Yukun Zhang;  Shuang Qiu;  Wei Wei;  Xuelin Ma;  Huiguang He
Adobe PDF(602Kb)  |  收藏  |  浏览/下载:150/42  |  提交时间:2023/06/26
brain-computer interface  motor imagery  transfer learning  domain adaptation  filter bank  calibration reduction  
Hierarchical Cooperative Swarm Policy Learning with Role Emergence 会议论文
, Online, 05-07 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Qiu TH(丘腾海);  Yi JQ(易建强)
Adobe PDF(327Kb)  |  收藏  |  浏览/下载:166/70  |  提交时间:2023/06/12
Semantic Perception Swarm Policy with Deep Reinforcement Learning 会议论文
, Online, 05 December 2021
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(523Kb)  |  收藏  |  浏览/下载:142/57  |  提交时间:2023/06/12
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文
, Shenzhen, China, 05-09 July 2021
作者:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:254/45  |  提交时间:2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance  
Tactical intention recognition in Wargame 会议论文
, Chengdu, China, 23-26 April 2021
作者:  Xuan Liu;  Meijing Zhao;  Song Dai;  Qiyue Yin;  Wancheng Ni
Adobe PDF(2771Kb)  |  收藏  |  浏览/下载:198/64  |  提交时间:2022/06/17
wargame  tactical intention recognition  feature fusion  time series prediction model  
A Collaborative Communication-Qmix Approach for Large-scale Networked Traffic Signal Control 会议论文
, Indianapolis, IN, United States, 2021-9-19
作者:  Chen, Xiaoyu;  Xiong, Gang;  Lv, Yisheng;  Chen, yuanyuan;  Song, bing;  Wang, Feiyue
Adobe PDF(1208Kb)  |  收藏  |  浏览/下载:285/75  |  提交时间:2022/06/16