CASIA OpenIR

浏览/检索结果: 共13条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文
, 线上, 2020-4
作者:  Zhao EM(赵恩民);  Deng SH(邓诗弘);  Zang YF(臧一凡);  Kang YX(康永欣);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:77/28  |  提交时间:2023/06/29
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:102/30  |  提交时间:2023/06/27
Chinese Named Entity Recognition via Adaptive Multi-pass Memory Network with Hierarchical Tagging Mechanism 会议论文
, Hainan, China, October 30 - Novermber 1, 2020
作者:  Pengfei Cao;  Yubo Chen;  Kang Liu;  Jun Zhao
Adobe PDF(987Kb)  |  收藏  |  浏览/下载:105/46  |  提交时间:2023/06/27
Wd3: Taming the estimation bias in deep reinforcement learning 会议论文
, Baltimore, MD, USA, 2020-12
作者:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:201/39  |  提交时间:2022/06/27
deep reinforcement learning  estimation bias  neural networks  
Deep Imitation Learning for Traffic Signal Control and Operations Based on Graph Convolutional Neural Networks 会议论文
, Rhodes, Greece, 2020-9
作者:  Li Xiaoshuang;  Guo Zhongzheng;  Dai Xingyuan;  Lin Yilun;  Jin Junchen;  Zhu Fenghua;  Wang Fei-Yue
Adobe PDF(314Kb)  |  收藏  |  浏览/下载:211/72  |  提交时间:2022/06/16
Multi-robot cooperative target encirclement through learning distributed transferable policy 会议论文
, Online, July 19-24
作者:  Zhang Tianle;  Liu Zhen;  Wu Shiguang;  Pu Zhiqiang;  Yi Jianqiang
Adobe PDF(949Kb)  |  收藏  |  浏览/下载:182/57  |  提交时间:2022/06/16
Multi-Agent Cooperation and Competition with Two-Level Ggraph Attention Network 会议论文
, 线上, 2020-11
作者:  Shiguang, Wu;  Zhiqiang, Pu;  Jianqiang, Yi;  Huimu, Wang
Adobe PDF(1185Kb)  |  收藏  |  浏览/下载:150/1  |  提交时间:2021/06/24
STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-Agent Cooperation 会议论文
, 线上, 2020-11
作者:  Huimu Wang;  Zhen Liu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(916Kb)  |  收藏  |  浏览/下载:94/0  |  提交时间:2021/06/24
Multi-Agent Formation Control with Obstacles Avoidance under Restricted Communication through Graph Reinforcement Learning 会议论文
, 线上, 2020.06
作者:  Huimu, Wang;  Tenghai, Qiu;  Zhen, Liu;  Zhiqiang, Pu;  Jianqiang, Yi
Adobe PDF(1461Kb)  |  收藏  |  浏览/下载:183/37  |  提交时间:2021/06/24
A Soft Graph Attention Reinforcement Learning for Multi-Agent Cooperation 会议论文
, 线上, 2020-8
作者:  Huimu Wang;  Zhiqiang Pu;  Zhen Liu;  Jianqiang Yi;  Tenghai Qiu
Adobe PDF(815Kb)  |  收藏  |  浏览/下载:221/46  |  提交时间:2021/06/24