| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction. Conference paper. Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14. Authors: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博). Adobe PDF(1663Kb)  |  Views/Downloads: 172/39  |  Submitted: 2023/07/05. Keywords: Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Potential Driven Reinforcement Learning for Hard Exploration Tasks. Conference paper. Online, 2020-4. Authors: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮). Adobe PDF(1999Kb)  |  Views/Downloads: 82/30  |  Submitted: 2023/06/29 |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning. Conference paper. Online, 2022-02-22. Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮). Adobe PDF(2593Kb)  |  Views/Downloads: 136/55  |  Submitted: 2023/06/29 |
| Pseudo Value Network Distillation for High-Performance Exploration. Conference paper. Australia, 2023-06. Authors: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品). Adobe PDF(5874Kb)  |  Views/Downloads: 138/41  |  Submitted: 2023/06/28 |
| Learning to Play Hard Exploration Games Using Graph-guided Self-navigation. Conference paper. Online, 2021-02. Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮). Adobe PDF(413Kb)  |  Views/Downloads: 141/53  |  Submitted: 2023/06/28 |
| Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network. Conference paper. Online, June 6–11, 2021. Authors: Wu HR(吴浩然); Chen W(陈炜); Xu S(徐爽); Xu B(徐波). Adobe PDF(1394Kb)  |  Views/Downloads: 164/58  |  Submitted: 2023/06/26 |
| Joint Modeling of Document and Label with Clause Interaction Hypergraph for ICD Medical Code Assignment. Conference paper. Padua, Italy, 18-23 July 2022. Authors: Wu HR(吴浩然); Meng LH(孟令辉); Xu S(徐爽); Xu B(徐波). Adobe PDF(612Kb)  |  Views/Downloads: 90/35  |  Submitted: 2023/06/26 |
| Disturbance Observer Based Control for an Underwater Biomimetic Vehicle-Manipulator System with Mismatched Disturbances. Conference paper. Suzhou, China, 2021.5.14-2021.5.16. Authors: Lv, Jiaqi; Wang, Yu; Wang, Shuo; Cheng, Long; Tan, Min. Adobe PDF(1182Kb)  |  Views/Downloads: 96/33  |  Submitted: 2023/06/25. Keywords: Underwater biomimetic vehicle-manipulator system (UBVMS) disturbance observer arctangent non-singularity sliding mode controller (ANTSMC) mismatched disturbances |
| Stacking More Linear Operations with Orthogonal Regularization to Learn Better. Conference paper. Online conference, 2022-7. Authors: Xu WX(许伟翔); Cheng J(程健). Adobe PDF(1126Kb)  |  Views/Downloads: 96/32  |  Submitted: 2023/06/21 |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices. Conference paper. Online, 2022-07. Authors: Zhao TL(赵天理); Zhang X(张希); Zhu WT(朱文涛); Wang JX(王家兴); Yang S(杨森); Liu J(刘季); Cheng J(程健). Adobe PDF(1919Kb)  |  Views/Downloads: 101/46  |  Submitted: 2023/06/21. Keywords: Deep Neural Networks Network Pruning Structured Pruning Non-structured Pruning Single Instruction Multiple Data |