CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:128/52  |  提交时间:2023/06/29
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:141/53  |  提交时间:2023/06/28
Attention Calibration for Transformer in Neural Machine Translation 会议论文
, 线上, 2021-8
作者:  Yu, Lu;  Jiali Zeng;  Jiajun, Zhang;  Shuangzhi Wu;  Mu, Li
Adobe PDF(749Kb)  |  收藏  |  浏览/下载:103/29  |  提交时间:2023/05/31
神经机器翻译  
Open-book Video Captioning with Retrieve-Copy-Generate Network 会议论文
2021, 线上, 2021.6.19-25
作者:  Zhang,Ziqi;  Qi,Zhongang;  Yuan,Chunfeng;  Shan,Ying;  Li,Bing;  Deng,Ying;  Hu,Weiming
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:227/56  |  提交时间:2022/06/16
Multi-Agent Hierarchical Cognition Difference Policy for Multi-Agent Cooperation 期刊论文
Algorithms, 2021, 期号: 14, 页码: 98
作者:  Huimu Wang;  Zhen Liu;  Jianqiang Yi;  Zhiqiang Pu
Adobe PDF(1155Kb)  |  收藏  |  浏览/下载:238/48  |  提交时间:2021/06/24
multiagent system  deep reinforcement learning  variational autoencoder  attention mechanism