CASIA OpenIR

Browse/search results: 93 items in total, showing items 1-10

PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction (Conference paper)
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博)
Adobe PDF(1663Kb) | Views/Downloads: 172/39 | Submitted: 2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Potential Driven Reinforcement Learning for Hard Exploration Tasks (Conference paper)
Online, 2020-4
Authors: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(1999Kb) | Views/Downloads: 82/30 | Submitted: 2023/06/29
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning (Conference paper)
Online, 2022-02-22
Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(2593Kb) | Views/Downloads: 136/55 | Submitted: 2023/06/29
Pseudo Value Network Distillation for High-Performance Exploration (Conference paper)
Australia, 2023-06
Authors: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品)
Adobe PDF(5874Kb) | Views/Downloads: 138/41 | Submitted: 2023/06/28
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation (Conference paper)
Online, 2021-02
Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮)
Adobe PDF(413Kb) | Views/Downloads: 141/53 | Submitted: 2023/06/28
Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network (Conference paper)
Online, June 6–11, 2021
Authors: Wu HR(吴浩然); Chen W(陈炜); Xu S(徐爽); Xu B(徐波)
Adobe PDF(1394Kb) | Views/Downloads: 164/58 | Submitted: 2023/06/26
Joint Modeling of Document and Label with Clause Interaction Hypergraph for ICD Medical Code Assignment (Conference paper)
Padua, Italy, 18-23 July 2022
Authors: Wu HR(吴浩然); Meng LH(孟令辉); Xu S(徐爽); Xu B(徐波)
Adobe PDF(612Kb) | Views/Downloads: 90/35 | Submitted: 2023/06/26
Disturbance Observer Based Control for an Underwater Biomimetic Vehicle-Manipulator System with Mismatched Disturbances (Conference paper)
Suzhou, China, 2021.5.14-2021.5.16
Authors: Lv, Jiaqi; Wang, Yu; Wang, Shuo; Cheng, Long; Tan, Min
Adobe PDF(1182Kb) | Views/Downloads: 96/33 | Submitted: 2023/06/25
Underwater biomimetic vehicle-manipulator system (UBVMS)  disturbance observer  arctangent non-singularity sliding mode controller (ANTSMC)  mismatched disturbances  
Stacking More Linear Operations with Orthogonal Regularization to Learn Better (Conference paper)
Online conference, 2022-7
Authors: Xu WX(许伟翔); Cheng J(程健)
Adobe PDF(1126Kb) | Views/Downloads: 96/32 | Submitted: 2023/06/21
Multi-Granularity Pruning for Model Acceleration on Mobile Devices (Conference paper)
Online, 2022-07
Authors: Zhao TL(赵天理); Zhang X(张希); Zhu WT(朱文涛); Wang JX(王家兴); Yang S(杨森); Liu J(刘季); Cheng J(程健)
Adobe PDF(1919Kb) | Views/Downloads: 101/46 | Submitted: 2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data