CASIA OpenIR

浏览/检索结果: 共20条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:126/35  |  提交时间:2023/06/27
Learning Individual Features to Decompose State Space for Robotic Skill Learning 会议论文
, Online, 2020-8
作者:  Fengyi Zhang;  Fangzhou Xiong;  Zhiyong Liu
Adobe PDF(622Kb)  |  收藏  |  浏览/下载:177/64  |  提交时间:2023/01/12
Robotic Skill Learning  Graph Neural Networks  State Decomposition  
A KG-based Enhancement Framework for Fact Checking Using Category Information 会议论文
, 线上, 2020年11月
作者:  Wang S(王帅);  Wang L(王磊);  Mao WJ(毛文吉)
Adobe PDF(1222Kb)  |  收藏  |  浏览/下载:204/50  |  提交时间:2022/07/01
fact checking knowledge graph  
Wd3: Taming the estimation bias in deep reinforcement learning 会议论文
, Baltimore, MD, USA, 2020-12
作者:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:229/46  |  提交时间:2022/06/27
deep reinforcement learning  estimation bias  neural networks  
Multi-Agent Cooperation and Competition with Two-Level Ggraph Attention Network 会议论文
, 线上, 2020-11
作者:  Shiguang, Wu;  Zhiqiang, Pu;  Jianqiang, Yi;  Huimu, Wang
Adobe PDF(1185Kb)  |  收藏  |  浏览/下载:170/1  |  提交时间:2021/06/24
A Soft Graph Attention Reinforcement Learning for Multi-Agent Cooperation 会议论文
, 线上, 2020-8
作者:  Huimu Wang;  Zhiqiang Pu;  Zhen Liu;  Jianqiang Yi;  Tenghai Qiu
Adobe PDF(815Kb)  |  收藏  |  浏览/下载:247/56  |  提交时间:2021/06/24
Revisiting Parameter Sharing for Automatic Neural Channel Number Search 会议论文
, Online, 2020.12.06-2020.12.12
作者:  Wang JX(王家兴);  Bo HL(柏昊立);  Wu JX(吴家祥);  Shi XP(史旭鹏);  Huang JZ(黄俊洲);  Michael Lyu;  Irwin King;  Cheng J(程健)
Adobe PDF(2004Kb)  |  收藏  |  浏览/下载:207/45  |  提交时间:2021/06/16
Neural Architecture Search  Model Compression  Parameter Sharing  
Page Segmentation Using Convolutional Neural Network and Graphical Model 会议论文
, 视频会议, 2020-7
作者:  Li, Xiao-Hui;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(6979Kb)  |  收藏  |  浏览/下载:205/57  |  提交时间:2021/06/02
Page segmentation  Conditional random field  Feature pyramid network  Graph attention network  
Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 31, 期号: 12, 页码: 5245-5256
作者:  Wei, Qinglai;  Wang, Lingxiao;  Liu, Yu;  Polycarpou, Marios M.
Adobe PDF(4019Kb)  |  收藏  |  浏览/下载:372/85  |  提交时间:2021/03/08
Elevators  Optimal control  Backpropagation  Machine learning  Neural networks  Learning (artificial intelligence)  Actor  –critic  adaptive dynamic programming  deep learning (DL)  elevator group control (EGC)  optimal control  reinforcement learning (RL)  
Real-world Robot Reaching Skill Learning Based on Deep Reinforcement Learning 会议论文
, Hefei, China, 2020
作者:  Liu, Naijun;  Lu, Tao;  Cai, Yinghao;  Wang, Rui;  Wang, Shuo
浏览  |  Adobe PDF(436Kb)  |  收藏  |  浏览/下载:181/63  |  提交时间:2020/09/27