已选(0)清除
条数/页: 排序方式: |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文 Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14 作者: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博) Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:161/37  |  提交时间:2023/07/05 Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Curiosity-Driven and Victim-Aware Adversarial Policies 会议论文 , Austin TX, USA, December 5-9, 2022 作者: Gong C(龚晨); Yang Z(杨洲); Bai YP(白云鹏); Shi JK(史杰克); Sinha Arunesh; Xu BW(徐博文); Lo David; Hou XW(侯新文); Fan GL(范国梁) Adobe PDF(4090Kb)  |  收藏  |  浏览/下载:110/46  |  提交时间:2023/06/27 |
| Second-Order Global Attention Networks for Graph Classification and Regression 会议论文 , Beijing, China, August 27-28, 2022 作者: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:176/67  |  提交时间:2023/07/06 |
| Continual Stereo Matching of Continuous Driving Scenes with Growing Architecture 会议论文 , 美国路易斯安那州新奥尔良, 2022.06.19 作者: Zhang, Chenghao; Tian, Kun; Fan, Bin; Meng, Gaofeng; Zhang, Zhaoxiang; Pan, Chunhong Adobe PDF(2647Kb)  |  收藏  |  浏览/下载:157/58  |  提交时间:2023/04/25 |
| L2E: Learning to Exploit Your Opponent 会议论文 , 意大利 帕多瓦, 2022.07.18-2022.07.23 作者: Wu Zhe; Li Kai; Xu Hang; Zang Yifan; An Bo; Xing Junliang Adobe PDF(5676Kb)  |  收藏  |  浏览/下载:192/38  |  提交时间:2022/06/17 |
| POPO: Pessimistic Offline Policy Optimization 会议论文 , Singapore, Singapore, 23-27 May 2022 作者: He Q(何强); Hou XW(侯新文); Liu Y(刘禹) Adobe PDF(1200Kb)  |  收藏  |  浏览/下载:177/36  |  提交时间:2022/06/27 reinforcement learning offline optimization out-of-distribution |
| Empirical Learning of Decision Parameters for Agent-Based Model 会议论文 , Macau, China, 2022 作者: Song B(宋冰); Xiong G(熊刚); Zhu F(朱凤华); Wu X(武许可); Lv Y(吕宜生); Ye P(叶佩军) Adobe PDF(1359Kb)  |  收藏  |  浏览/下载:129/46  |  提交时间:2023/06/26 |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文 , 线上, 2022-07 作者: Zhao TL(赵天理); Zhang X(张希); Zhu WT(朱文涛); Wang JX(王家兴); Yang S(杨森); Liu J(刘季); Cheng J(程健) Adobe PDF(1919Kb)  |  收藏  |  浏览/下载:88/37  |  提交时间:2023/06/21 Deep Neural Networks Network Pruning Structured Pruning Non-structured Pruning Single Instruction Multiple Data |
| Traffic Signal Control Using Offline Reinforcement Learning 会议论文 , Beijing, 2021-10 作者: Dai, Xingyuan; Zhao, Chen; Li, Xiaoshuang; Wang, Xiao; Wang, Fei-Yue Adobe PDF(1130Kb)  |  收藏  |  浏览/下载:174/49  |  提交时间:2022/10/11 |
| A Multi-Task MRC Framework for Chinese Emotion Cause and Experiencer Extraction 会议论文 , Bratislava, Slovakia, 2021-09 作者: Haoda Qian; Qiudan Li; Zaichuan Tang Adobe PDF(79001Kb)  |  收藏  |  浏览/下载:323/124  |  提交时间:2022/06/14 |