CASIA OpenIR

浏览/检索结果: 共113条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:161/37  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文
, Washington DC, USA, 2023-2-7
作者:  Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang;  Kaiqi Huang
Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:196/61  |  提交时间:2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent  
ED-T2V: An Efficient Training Framework for Diffusion-based Text-to-Video Generation 会议论文
, Queensland, Australia, 2023-6-18
作者:  Liu, Jiawei;  Wang, Weining;  Liu, Wei;  He, Qian;  Liu, Jing
Adobe PDF(4537Kb)  |  收藏  |  浏览/下载:165/36  |  提交时间:2023/05/04
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems 会议论文
, Queensland, Australia, June 18-23, 2023
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Ai XL(艾晓琳);  Yuan GM(袁莞迈)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:139/45  |  提交时间:2023/06/12
Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information 会议论文
, Jinghong, China, 05-09 December 2022
作者:  Junhang Wei;  Shaowei Cui;  Peng Hao;  Shuo Wang
Adobe PDF(933Kb)  |  收藏  |  浏览/下载:143/51  |  提交时间:2023/10/25
Improving the Data Quality for Credit Card Fraud Detection 会议论文
, Arlington, VA, USA, 2022-11
作者:  Rongrong Jing;  Hu Tian;  Yidi Li;  Xingwei Zhang;  Xiaolong Zheng;  Zhu Zhang;  Daniel Dajun Zeng
Adobe PDF(472Kb)  |  收藏  |  浏览/下载:342/72  |  提交时间:2022/06/17
LEARN EFFECTIVE REPRESENTATION FOR DEEP REINFORCEMENT LEARNING 会议论文
, Taipei, Taiwan, 26 August 2022
作者:  Zhan Yuan;  Xu Zhiwei;  Fan Guoliang
Adobe PDF(2093Kb)  |  收藏  |  浏览/下载:133/43  |  提交时间:2023/06/08
POPO: Pessimistic Offline Policy Optimization 会议论文
, Singapore, Singapore, 23-27 May 2022
作者:  He Q(何强);  Hou XW(侯新文);  Liu Y(刘禹)
Adobe PDF(1200Kb)  |  收藏  |  浏览/下载:176/36  |  提交时间:2022/06/27
reinforcement learning  offline optimization  out-of-distribution  
A motion based measurement method for monocular vision system 会议论文
, Hefei, Anhui, China, 2022.7
作者:  De Xu;  Di Zhang
Adobe PDF(326Kb)  |  收藏  |  浏览/下载:115/35  |  提交时间:2022/12/20
Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文
, 线上, 2022-07
作者:  Zhao TL(赵天理);  Zhang X(张希);  Zhu WT(朱文涛);  Wang JX(王家兴);  Yang S(杨森);  Liu J(刘季);  Cheng J(程健)
Adobe PDF(1919Kb)  |  收藏  |  浏览/下载:87/36  |  提交时间:2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data