| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction. Conference paper, Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14. Authors: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博). Adobe PDF(1663Kb)  |  Views/Downloads: 161/37  |  Submitted: 2023/07/05  |  Keywords: Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks. Conference paper, Washington DC, USA, 2023-2-7. Authors: Pei Xu; Junge Zhang; Qiyue Yin; Chao Yu; Yaodong Yang; Kaiqi Huang. Adobe PDF(2037Kb)  |  Views/Downloads: 196/61  |  Submitted: 2023/06/19  |  Keywords: deep reinforcement learning sparse reward exploration multi-agent |
| ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation. Conference paper, Queensland, Australia, 2023-6-18. Authors: Liu, Jiawei; Wang, Weining; Liu, Wei; He, Qian; Liu, Jing. Adobe PDF(4537Kb)  |  Views/Downloads: 165/36  |  Submitted: 2023/05/04 |
| Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems. Conference paper, Queensland, Australia, June 18-23, 2023. Authors: Zhang TL(张天乐); Liu Z(刘振); Pu ZQ(蒲志强); Yi JQ(易建强); Ai XL(艾晓琳); Yuan GM(袁莞迈). Adobe PDF(612Kb)  |  Views/Downloads: 139/45  |  Submitted: 2023/06/12 |
| Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information. Conference paper, Jinghong, China, 05-09 December 2022. Authors: Junhang Wei; Shaowei Cui; Peng Hao; Shuo Wang. Adobe PDF(933Kb)  |  Views/Downloads: 143/51  |  Submitted: 2023/10/25 |
| Improving the Data Quality for Credit Card Fraud Detection. Conference paper, Arlington, VA, USA, 2022-11. Authors: Rongrong Jing; Hu Tian; Yidi Li; Xingwei Zhang; Xiaolong Zheng; Zhu Zhang; Daniel Dajun Zeng. Adobe PDF(472Kb)  |  Views/Downloads: 342/72  |  Submitted: 2022/06/17 |
| LEARN EFFECTIVE REPRESENTATION FOR DEEP REINFORCEMENT LEARNING. Conference paper, Taipei, Taiwan, 26 August 2022. Authors: Zhan Yuan; Xu Zhiwei; Fan Guoliang. Adobe PDF(2093Kb)  |  Views/Downloads: 133/43  |  Submitted: 2023/06/08 |
| POPO: Pessimistic Offline Policy Optimization. Conference paper, Singapore, Singapore, 23-27 May 2022. Authors: He Q(何强); Hou XW(侯新文); Liu Y(刘禹). Adobe PDF(1200Kb)  |  Views/Downloads: 176/36  |  Submitted: 2022/06/27  |  Keywords: reinforcement learning offline optimization out-of-distribution |
| A motion based measurement method for monocular vision system. Conference paper, Hefei, Anhui, China, 2022.7. Authors: De Xu; Di Zhang. Adobe PDF(326Kb)  |  Views/Downloads: 115/35  |  Submitted: 2022/12/20 |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices. Conference paper, online, 2022-07. Authors: Zhao TL(赵天理); Zhang X(张希); Zhu WT(朱文涛); Wang JX(王家兴); Yang S(杨森); Liu J(刘季); Cheng J(程健). Adobe PDF(1919Kb)  |  Views/Downloads: 87/36  |  Submitted: 2023/06/21  |  Keywords: Deep Neural Networks Network Pruning Structured Pruning Non-structured Pruning Single Instruction Multiple Data |