CASIA OpenIR

浏览/检索结果: 共67条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:161/37  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Curiosity-Driven and Victim-Aware Adversarial Policies 会议论文
, Austin TX, USA, December 5-9, 2022
作者:  Gong C(龚晨);  Yang Z(杨洲);  Bai YP(白云鹏);  Shi JK(史杰克);  Sinha Arunesh;  Xu BW(徐博文);  Lo David;  Hou XW(侯新文);  Fan GL(范国梁)
Adobe PDF(4090Kb)  |  收藏  |  浏览/下载:110/46  |  提交时间:2023/06/27
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:176/67  |  提交时间:2023/07/06
Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture 会议论文
, 美国路易斯安那州新奥尔良, 2022.06.19
作者:  Zhang, Chenghao;  Tian, Kun;  Fan, Bin;  Meng, Gaofeng;  Zhang, Zhaoxiang;  Pan, Chunhong
Adobe PDF(2647Kb)  |  收藏  |  浏览/下载:157/58  |  提交时间:2023/04/25
L2E: Learning to Exploit Your Opponent 会议论文
, 意大利 帕多瓦, 2022.07.18-2022.07.23
作者:  Wu Zhe;  Li Kai;  Xu Hang;  Zang Yifan;  An Bo;  Xing Junliang
Adobe PDF(5676Kb)  |  收藏  |  浏览/下载:192/38  |  提交时间:2022/06/17
POPO: Pessimistic Offline Policy Optimization 会议论文
, Singapore, Singapore, 23-27 May 2022
作者:  He Q(何强);  Hou XW(侯新文);  Liu Y(刘禹)
Adobe PDF(1200Kb)  |  收藏  |  浏览/下载:177/36  |  提交时间:2022/06/27
reinforcement learning  offline optimization  out-of-distribution  
Empirical Learning of Decision Parameters for Agent-Based Model 会议论文
, Macau, China, 2022
作者:  Song B(宋冰);  Xiong G(熊刚);  Zhu F(朱凤华);  Wu X(武许可);  Lv Y(吕宜生);  Ye P(叶佩军)
Adobe PDF(1359Kb)  |  收藏  |  浏览/下载:129/46  |  提交时间:2023/06/26
Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文
, 线上, 2022-07
作者:  Zhao TL(赵天理);  Zhang X(张希);  Zhu WT(朱文涛);  Wang JX(王家兴);  Yang S(杨森);  Liu J(刘季);  Cheng J(程健)
Adobe PDF(1919Kb)  |  收藏  |  浏览/下载:88/37  |  提交时间:2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data  
Traffic Signal Control Using Offline Reinforcement Learning 会议论文
, Beijing, 2021-10
作者:  Dai, Xingyuan;  Zhao, Chen;  Li, Xiaoshuang;  Wang, Xiao;  Wang, Fei-Yue
Adobe PDF(1130Kb)  |  收藏  |  浏览/下载:174/49  |  提交时间:2022/10/11
A Multi-Task MRC Framework for Chinese Emotion Cause and Experiencer Extraction 会议论文
, Bratislava, Slovakia, 2021-09
作者:  Haoda Qian;  Qiudan Li;  Zaichuan Tang
Adobe PDF(79001Kb)  |  收藏  |  浏览/下载:323/124  |  提交时间:2022/06/14