CASIA OpenIR

Browse/Search results: 8 items in total, showing items 1-8

Multi-Granularity Pruning for Model Acceleration on Mobile Devices (Conference paper)
, Online, 2022-07
Authors:  Zhao TL (赵天理);  Zhang X (张希);  Zhu WT (朱文涛);  Wang JX (王家兴);  Yang S (杨森);  Liu J (刘季);  Cheng J (程健)
Adobe PDF(1919Kb)  |  Views/Downloads: 149/60  |  Submitted: 2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data  
TBERT: Dynamic BERT Inference with Top-k Based Predictors (Conference paper)
, Antwerp, Belgium, 2023-4-17
Authors:  Liu, Zejian;  Zhao, Kun;  Cheng, Jian
Adobe PDF(3426Kb)  |  Views/Downloads: 122/33  |  Submitted: 2023/06/19
Transformer  Dynamic Inference  Pruning  
Towards Binarized MobileNet via Structured Sparsity (Conference paper)
, Hainan, China, 2021-12-26
Authors:  Zhenmeng, Zuo;  Zhexin, Li;  Peisong, Wang;  Weihan, Chen;  Jian, Cheng
Adobe PDF(476Kb)  |  Views/Downloads: 244/76  |  Submitted: 2022/06/15
Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA (Conference paper)
, Dresden, Germany, 2018
Authors:  Li, Gang;  Li, Fanrong;  Zhao, Tianli;  Cheng, Jian
Adobe PDF(244Kb)  |  Views/Downloads: 150/55  |  Submitted: 2022/06/14
EBERT: Efficient BERT Inference with Dynamic Structured Pruning (Conference paper)
, Online, 2021
Authors:  Liu, Zejian;  Li, Fanrong;  Li, Gang;  Cheng, Jian
Adobe PDF(1219Kb)  |  Views/Downloads: 169/56  |  Submitted: 2022/06/14
DPFPS: Dynamic and Progressive Filter Pruning for Compressing Convolutional Neural Networks from Scratch (Conference paper)
, virtual conference, 2021.2.2-2021.2.9
Authors:  Ruan, Xiaofeng;  Liu, Yufan;  Li, Bing;  Yuan, Chunfeng;  Hu, Weiming
Adobe PDF(652Kb)  |  Views/Downloads: 298/62  |  Submitted: 2021/06/17
Compression of Acoustic Model via Knowledge Distillation and Pruning (Conference paper)
, Beijing, 2018-8
Authors:  Li, Chenxing;  Zhu, Lei;  Xu, Shuang;  Gao, Peng;  Xu, Bo
Adobe PDF(237Kb)  |  Views/Downloads: 233/87  |  Submitted: 2020/07/20
Towards Compact and Fast Neural Machine Translation Using a Combined Method (Conference paper)
, Copenhagen, Denmark, 2017-9
Authors:  Xiaowei Zhang;  Wei Chen;  Feng Wang;  Shuang Xu;  Bo Xu
Adobe PDF(257Kb)  |  Views/Downloads: 429/111  |  Submitted: 2018/06/11
Machine Translation  Neural Network  Model Compression  Decoding Speedup