CASIA OpenIR

Browse/Search results: 24 items in total, showing items 1-10

PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction (Conference paper)
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  Views/Downloads: 141/34  |  Submitted: 2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration (Journal article)
Machine Intelligence Research, 2023, pages: 1-12
Authors:  Wang Xin;  Wei Qinglai;  Li Tao;  Zhang Jie
Adobe PDF(1556Kb)  |  Views/Downloads: 145/56  |  Submitted: 2023/06/26
Second-Order Global Attention Networks for Graph Classification and Regression (Conference paper)
Beijing, China, August 27-28, 2022
Authors:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  Views/Downloads: 168/67  |  Submitted: 2023/07/06
POPO: Pessimistic Offline Policy Optimization (Conference paper)
Singapore, Singapore, 23-27 May 2022
Authors:  He Q(何强);  Hou XW(侯新文);  Liu Y(刘禹)
Adobe PDF(1200Kb)  |  Views/Downloads: 165/34  |  Submitted: 2022/06/27
reinforcement learning  offline optimization  out-of-distribution  
Multi-Granularity Pruning for Model Acceleration on Mobile Devices (Conference paper)
Online, 2022-07
Authors:  Zhao TL(赵天理);  Zhang X(张希);  Zhu WT(朱文涛);  Wang JX(王家兴);  Yang S(杨森);  Liu J(刘季);  Cheng J(程健)
Adobe PDF(1919Kb)  |  Views/Downloads: 84/35  |  Submitted: 2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data  
Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning (Conference paper)
Online, 2021-2
Authors:  Huang, Wenzhen;  Yin Qiyue;  Zhang Junge;  Huang, Kaiqi
Adobe PDF(5676Kb)  |  Views/Downloads: 158/36  |  Submitted: 2022/01/11
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning (Conference paper)
Online, 2022-02-22
Authors:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  Views/Downloads: 96/37  |  Submitted: 2023/06/29
Active Pushing for Better Grasping in Dense Clutter with Deep Reinforcement Learning (Conference paper)
Shanghai, China, 6-8 Nov. 2020
Authors:  Lu, Ning;  Lu, Tao;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(1435Kb)  |  Views/Downloads: 171/65  |  Submitted: 2021/06/01
Stable Training of Bellman Error in Reinforcement Learning (Conference paper)
Thailand, November 18–22
Authors:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  Views/Downloads: 86/28  |  Submitted: 2023/06/27
Potential Driven Reinforcement Learning for Hard Exploration Tasks (Conference paper)
Online, 2020-4
Authors:  Zhao EM(赵恩民);  Deng SH(邓诗弘);  Zang YF(臧一凡);  Kang YX(康永欣);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  Views/Downloads: 66/22  |  Submitted: 2023/06/29