已选(0)清除
条数/页: 排序方式: |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:141/34  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration 期刊论文 Machine Intelligence Research, 2023, 页码: 1-12 作者: Wang Xin; Wei Qinglai; Li Tao; Zhang Jie Adobe PDF(1556Kb)  |  收藏  |  浏览/下载:145/56  |  提交时间:2023/06/26 |
| Second-Order Global Attention Networks for Graph Classification and Regression 会议论文 , Beijing, China, August 27-28, 2022 作者: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:168/67  |  提交时间:2023/07/06 |
| POPO: Pessimistic Offline Policy Optimization 会议论文 , Singapore, Singapore, 23-27 May 2022 作者: He Q(何强); Hou XW(侯新文); Liu Y(刘禹) Adobe PDF(1200Kb)  |  收藏  |  浏览/下载:165/34  |  提交时间:2022/06/27 reinforcement learning offline optimization out-of-distribution |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文 , 线上, 2022-07 作者: Zhao TL(赵天理); Zhang X(张希); Zhu WT(朱文涛); Wang JX(王家兴); Yang S(杨森); Liu J(刘季); Cheng J(程健) Adobe PDF(1919Kb)  |  收藏  |  浏览/下载:84/35  |  提交时间:2023/06/21 Deep Neural Networks Network Pruning Structured Pruning Non-structured Pruning Single Instruction Multiple Data |
| Learning to Reweight Imaginary Transitions for Model-Based Reinforcement Learning 会议论文 , online, 2021-2 作者: Huang, Wenzhen; Yin Qiyue; Zhang Junge; Huang, Kaiqi Adobe PDF(5676Kb)  |  收藏  |  浏览/下载:158/36  |  提交时间:2022/01/11 |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文 , 线上, 2022-02-22 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮) Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:96/37  |  提交时间:2023/06/29 |
| Active Pushing for Better Grasping in Dense Clutter with Deep Reinforcement Learning 会议论文 , Shanghai, China, 6-8 Nov. 2020 作者: Lu, Ning; Lu, Tao; Cai, Yinghao; Wang, shuo Adobe PDF(1435Kb)  |  收藏  |  浏览/下载:171/65  |  提交时间:2021/06/01 |
| Stable Training of Bellman Error in Reinforcement Learning 会议论文 , Thailand, November 18–22 作者: Gong C(龚晨); Bai YP(白云鹏); Hou XW(侯新文); Ji XH(季晓慧) Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:86/28  |  提交时间:2023/06/27 |
| Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文 , 线上, 2020-4 作者: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮) Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:66/22  |  提交时间:2023/06/29 |