CASIA OpenIR

Browse/search results: 21 items in total, showing items 1-10

Pseudo Value Network Distillation for High-Performance Exploration (conference paper)
Australia, 2023-06
Authors: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品)
Adobe PDF (5874 Kb) | Views/Downloads: 126/37 | Submitted: 2023/06/28
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration (journal article)
Machine Intelligence Research, 2023, Pages: 1-12
Authors: Wang Xin; Wei Qinglai; Li Tao; Zhang Jie
Adobe PDF (1556 Kb) | Views/Downloads: 155/57 | Submitted: 2023/06/26
Second-Order Global Attention Networks for Graph Classification and Regression (conference paper)
Beijing, China, August 27-28, 2022
Authors: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu
Adobe PDF (69424 Kb) | Views/Downloads: 174/67 | Submitted: 2023/07/06
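A Survey of Collective Intelligence Decision-Making Methods Driven Jointly by Knowledge and Data (知识和数据协同驱动的群体智能决策方法研究综述) (journal article)
Acta Automatica Sinica (自动化学报), 2022, Volume: 48, Issue: 3, Pages: 1-17
Authors: Pu Zhiqiang(蒲志强); Yi Jianqiang(易建强); Liu Zhen(刘振); Qiu Tenghai(丘腾海); Sun Jinlin(孙金林); Li Feimo(李非墨)
Adobe PDF (1352 Kb) | Views/Downloads: 260/63 | Submitted: 2022/04/02
collective intelligence  knowledge-data collaboration  multi-agent  decision intelligence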
Attention Calibration for Transformer in Neural Machine Translation (conference paper)
Online, 2021-8
Authors: Yu, Lu; Jiali Zeng; Jiajun, Zhang; Shuangzhi Wu; Mu, Li
Adobe PDF (749 Kb) | Views/Downloads: 91/26 | Submitted: 2023/05/31
neural machine translation
Open-book Video Captioning with Retrieve-Copy-Generate Network (conference paper)
2021, Online, 2021.6.19-25
Authors: Zhang, Ziqi; Qi, Zhongang; Yuan, Chunfeng; Shan, Ying; Li, Bing; Deng, Ying; Hu, Weiming
Adobe PDF (1925 Kb) | Views/Downloads: 213/53 | Submitted: 2022/06/16
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation (journal article)
Algorithms, 2021, Issue: 14, Page: 98
Authors: Huimu Wang; Zhen Liu; Jianqiang Yi; Zhiqiang Pu
Adobe PDF (1155 Kb) | Views/Downloads: 224/44 | Submitted: 2021/06/24
multiagent system  deep reinforcement learning  variational autoencoder  attention mechanism
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning (conference paper)
Online, 2022-02-22
Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮)
Adobe PDF (2593 Kb) | Views/Downloads: 101/39 | Submitted: 2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation (conference paper)
Online, 2021-02
Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮)
Adobe PDF (413 Kb) | Views/Downloads: 122/50 | Submitted: 2023/06/28
Conservative Policy Gradient in Multi-critic Setting (conference paper)
Hangzhou, China, 2019.11.22-24
Authors: Xi, Bao; Wang, Rui; Wang, Shuo; Lu, Tao; Cai, Yinghao
Adobe PDF (379 Kb) | Views/Downloads: 185/63 | Submitted: 2021/02/02
inconsistency  stability  Q-learning  policy gradient