CASIA OpenIR

浏览/检索结果: 共29条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:161/37  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文
, 线上, 2020-4
作者:  Zhao EM(赵恩民);  Deng SH(邓诗弘);  Zang YF(臧一凡);  Kang YX(康永欣);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:69/24  |  提交时间:2023/06/29
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:104/41  |  提交时间:2023/06/29
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:132/38  |  提交时间:2023/06/28
Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文
, 线上, 2021-02
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li K(李凯);  Li LJ(李丽娟);  Xing JL(兴军亮)
Adobe PDF(413Kb)  |  收藏  |  浏览/下载:128/50  |  提交时间:2023/06/28
Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文
, 线上, 2022-07
作者:  Zhao TL(赵天理);  Zhang X(张希);  Zhu WT(朱文涛);  Wang JX(王家兴);  Yang S(杨森);  Liu J(刘季);  Cheng J(程健)
Adobe PDF(1919Kb)  |  收藏  |  浏览/下载:86/36  |  提交时间:2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data  
Improving Extreme Low-bit Quantization with Soft Threshold 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2022, 页码: 1549 - 1563
作者:  Xu WX(许伟翔);  Wang PS(王培松);  Cheng J(程健)
Adobe PDF(2414Kb)  |  收藏  |  浏览/下载:74/26  |  提交时间:2023/06/20
DPNAS: Neural Architecture Search for Deep Learning with Differential Privacy 会议论文
, 线上, 2022-2
作者:  Cheng AD(程安达);  Wang JX(王家兴);  Zhang X(张希);  Chen Q(谌强);  Wang PS(王培松);  Cheng J(程健)
Adobe PDF(1135Kb)  |  收藏  |  浏览/下载:88/21  |  提交时间:2023/06/05
Efficient cooperative structured control for a multi-joint biomimetic robotic fish 期刊论文
IEEE/ASME Transactions on Mechatronics, 2020, 卷号: 26, 期号: 5, 页码: 2506-2516
作者:  Yan Shuaizheng;  Wu Zhengxing;  Wang Jian;  Tan Min;  Yu Junzhi
Adobe PDF(2394Kb)  |  收藏  |  浏览/下载:73/23  |  提交时间:2023/05/31
Generative Zero-shot Network Quantization 会议论文
, Virtual Event, 2021-6
作者:  Xiangyu, He;  Jiahao, Lu;  Weixiang, Xu;  Qinghao, Hu;  Peisong, Wang;  Jian, Cheng
Adobe PDF(947Kb)  |  收藏  |  浏览/下载:225/54  |  提交时间:2022/06/29