CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共18条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文
, 线上, 2022-07
作者:  Zhao TL(赵天理);  Zhang X(张希);  Zhu WT(朱文涛);  Wang JX(王家兴);  Yang S(杨森);  Liu J(刘季);  Cheng J(程健)
Adobe PDF(1919Kb)  |  收藏  |  浏览/下载:134/53  |  提交时间:2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data  
Improving Extreme Low-bit Quantization with Soft Threshold 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2022, 页码: 1549 - 1563
作者:  Xu WX(许伟翔);  Wang PS(王培松);  Cheng J(程健)
Adobe PDF(2414Kb)  |  收藏  |  浏览/下载:81/29  |  提交时间:2023/06/20
TBERT: Dynamic BERT Inference with Top-k Based Predictors 会议论文
, Antwerp, Belgium, 2023-4-17
作者:  Liu, Zejian;  Zhao, Kun;  Cheng, Jian
Adobe PDF(3426Kb)  |  收藏  |  浏览/下载:108/27  |  提交时间:2023/06/19
Transformer  Dynamic Inference  Pruning  
Efficient Accelerator/Network Co-Search with Circular Greedy Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, 2023, 页码: 1-5
作者:  Liu, Zejian;  Li, Gang;  Cheng, Jian
Adobe PDF(1982Kb)  |  收藏  |  浏览/下载:132/41  |  提交时间:2023/06/19
Accelerator/Network Co-Search  Reinforcement Learning  Performance Estimation  Multi-objective Optimization  
Hardware Acceleration of Fully Quantized BERT for Efficient Natural Language Processing 会议论文
Proceedings of the 2021 Design, Automation and Test in Europe, DATE 2021, Virtual, Online, 2021-2
作者:  Liu, Zejian;  Li, Gang;  Cheng, Jian
Adobe PDF(593Kb)  |  收藏  |  浏览/下载:63/29  |  提交时间:2023/06/19
Optimization-Based Post-Training Quantization With Bit-Split and Stitching 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 2, 页码: 2119-2135
作者:  Wang, Peisong;  Chen, Weihan;  He, Xiangyu;  Chen, Qiang;  Liu, Qingshan;  Cheng, Jian
Adobe PDF(921Kb)  |  收藏  |  浏览/下载:213/62  |  提交时间:2023/03/20
Deep neural networks  compression  quantization  post-training quantization  
Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA 会议论文
, Dresden, Germany, 2018
作者:  Li, Gang;  Li, Fanrong;  Zhao, Tianli;  Cheng, Jian
Adobe PDF(244Kb)  |  收藏  |  浏览/下载:142/50  |  提交时间:2022/06/14
Dynamic Dual Gating Neural Networks 会议论文
, Online, 2021
作者:  Li, Fanrong;  Li, Gang;  He, Xiangyu;  Cheng, Jian
Adobe PDF(1988Kb)  |  收藏  |  浏览/下载:189/57  |  提交时间:2022/06/14
A System-Level Solution for Low-Power Object Detection 会议论文
, Seoul, Korea, 2019
作者:  Li, Fanrong;  Mo, Zitao;  Wang, Peisong;  Liu, Zejian;  Zhang, Jiayun;  Li, Gang;  Hu, Qinghao;  He, Xiangyu;  Leng, Cong;  Zhang, Yang;  Cheng, Jian
Adobe PDF(869Kb)  |  收藏  |  浏览/下载:204/63  |  提交时间:2022/06/14
Block Convolution: Toward Memory-Efficient Inference of Large-Scale CNNs on FPGA 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 卷号: 41, 期号: 5, 页码: 1436-1447
作者:  Li, Gang;  Liu, Zejian;  Li, Fanrong;  Cheng, Jian
Adobe PDF(4046Kb)  |  收藏  |  浏览/下载:295/32  |  提交时间:2022/06/10
Convolution  Field programmable gate arrays  System-on-chip  Task analysis  Random access memory  Tensors  Memory management  Block convolution  convolutional neural network (CNN) accelerator  field-programmable gate array (FPGA)  memory efficient  off-chip transfer