CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共36条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
TinyNeRF: Towards 100 times Compression of Volume Radiance Fields 会议论文
, 线上, 2023-02
作者:  Zhao TL(赵天理);  Chen JY(陈嘉园);  Leng C(冷聪);  Cheng J(程健)
Adobe PDF(2855Kb)  |  收藏  |  浏览/下载:142/34  |  提交时间:2023/06/21
Neural Radiance Fields  Discrete Cosine Transformation  Frequency Domain  
TBERT: Dynamic BERT Inference with Top-k Based Predictors 会议论文
, Antwerp, Belgium, 2023-4-17
作者:  Liu, Zejian;  Zhao, Kun;  Cheng, Jian
Adobe PDF(3426Kb)  |  收藏  |  浏览/下载:80/21  |  提交时间:2023/06/19
Transformer  Dynamic Inference  Pruning  
Optimization-Based Post-Training Quantization With Bit-Split and Stitching 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 2, 页码: 2119-2135
作者:  Wang, Peisong;  Chen, Weihan;  He, Xiangyu;  Chen, Qiang;  Liu, Qingshan;  Cheng, Jian
Adobe PDF(921Kb)  |  收藏  |  浏览/下载:151/40  |  提交时间:2023/03/20
Deep neural networks  compression  quantization  post-training quantization  
Towards Automatic Model Compression via A Unified Two-Stage Framework 期刊论文
Pattern Recognition (PR), 2023, 卷号: 140, 页码: 109527
作者:  Weihan Chen;  Peisong Wang;  Jian Cheng
Adobe PDF(765Kb)  |  收藏  |  浏览/下载:88/25  |  提交时间:2023/06/20
Deep Neural Networks  Model Compression  Quantization  Pruning  
Block Convolution: Toward Memory-Efficient Inference of Large-Scale CNNs on FPGA 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 卷号: 41, 期号: 5, 页码: 1436-1447
作者:  Li, Gang;  Liu, Zejian;  Li, Fanrong;  Cheng, Jian
Adobe PDF(4046Kb)  |  收藏  |  浏览/下载:227/21  |  提交时间:2022/06/10
Convolution  Field programmable gate arrays  System-on-chip  Task analysis  Random access memory  Tensors  Memory management  Block convolution  convolutional neural network (CNN) accelerator  field-programmable gate array (FPGA)  memory efficient  off-chip transfer  
Fixed-point Quantization for Vision Transformer 会议论文
, Beijing, China, 2021-10-22
作者:  Zhexin, Li;  Peisong Wang;  Zhiyuan Wang;  Jian Cheng
Adobe PDF(940Kb)  |  收藏  |  浏览/下载:217/65  |  提交时间:2022/06/15
Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文
, 线上, 2022-07
作者:  Zhao TL(赵天理);  Zhang X(张希);  Zhu WT(朱文涛);  Wang JX(王家兴);  Yang S(杨森);  Liu J(刘季);  Cheng J(程健)
Adobe PDF(1919Kb)  |  收藏  |  浏览/下载:82/34  |  提交时间:2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data  
Improving Extreme Low-bit Quantization with Soft Threshold 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2022, 页码: 1549 - 1563
作者:  Xu WX(许伟翔);  Wang PS(王培松);  Cheng J(程健)
Adobe PDF(2414Kb)  |  收藏  |  浏览/下载:68/24  |  提交时间:2023/06/20
Towards Fully Sparse Training: Information Restoration with Spatial Similarity 会议论文
, Vancouver, British Columbia, Canada, 2022-04
作者:  Xu WX(许伟翔);  Wang PS(王培松);  Cheng J(程健)
Adobe PDF(556Kb)  |  收藏  |  浏览/下载:74/24  |  提交时间:2023/06/20
Towards Mixed-Precision Quantization of Neural Networks via Constrained Optimization 会议论文
, 线上举办, 2021-10-11
作者:  Weihan Chen;  Peisong Wang;  Jian Cheng
Adobe PDF(696Kb)  |  收藏  |  浏览/下载:86/30  |  提交时间:2023/06/20