Institutional Repository of Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
EDP: An Efficient Decomposition and Pruning Scheme for Convolutional Neural Network Compression | |
Ruan, Xiaofeng1,2![]() ![]() ![]() ![]() | |
Source Publication | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
![]() |
ISSN | 2162-237X |
2020 | |
Volume | 32Issue:0Pages:0 |
Corresponding Author | Yuan, Chunfeng(cfyuan@nlpr.ia.ac.cn) ; Li, Bing(bli@nlpr.ia.ac.cn) |
Abstract | Model compression methods have become popular in recent years, which aim to alleviate the heavy load of deep neural networks (DNNs) in real-world applications. However, most of the existing compression methods have two limitations: 1) they usually adopt a cumbersome process, including pertaining, training with a sparsity constraint, pruning/decomposition, and fine-tuning. Moreover, the last three stages are usually iterated multiple times. 2) The models are pretrained under explicit sparsity or low-rank assumptions, which are difficult to guarantee wide appropriateness. In this article, we propose an efficient decomposition and pruning (EDP) scheme via constructing a compressed-aware block that can automatically minimize the rank of the weight matrix and identify the redundant channels. Specifically, we embed the compressed-aware block by decomposing one network layer into two layers: a new weight matrix layer and a coefficient matrix layer. By imposing regularizers on the coefficient matrix, the new weight matrix learns to become a low-rank basis weight, and its corresponding channels become sparse. In this way, the proposed compressedaware block simultaneously achieves low-rank decomposition and channel pruning by only one single data-driven training stage. Moreover, the network of architecture is further compressed and optimized by a novel Pruning & Merging (PM) module which prunes redundant channels and merges redundant decomposed layers. Experimental results (17 competitors) on different data sets and networks demonstrate that the proposed EDP achieves a high compression ratio with acceptable accuracy degradation and outperforms state-of-the-arts on compression rate, accuracy, inference time, and run-time memory. |
Keyword | Data-driven low-rank decomposition model compression and acceleration structured pruning |
DOI | 10.1109/TNNLS.2020.3018177 |
Indexed By | SCI |
Language | 英语 |
Funding Project | National Key Research and Development Program of China[2018AAA0102802] ; National Key Research and Development Program of China[2018AAA0102803] ; National Key Research and Development Program of China[2018AAA0102800] ; National Key Research and Development Program of China[2018YFC0823003] ; National Key Research and Development Program of China[2017YFB1002801] ; Natural Science Foundation of China[61902401] ; Natural Science Foundation of China[61972071] ; Natural Science Foundation of China[61751212] ; Natural Science Foundation of China[61721004] ; Natural Science Foundation of China[61972397] ; Natural Science Foundation of China[61772225] ; Natural Science Foundation of China[61906052] ; Natural Science Foundation of China[U1803119] ; NSFC-General Technology Collaborative Fund for basic research[U1636218] ; NSFC-General Technology Collaborative Fund for basic research[U1936204] ; NSFC-General Technology Collaborative Fund for basic research[U1736106] ; Beijing Natural Science Foundation[L172051] ; Beijing Natural Science Foundation[JQ18018] ; Beijing Natural Science Foundation[L182058] ; CAS Key Research Program of Frontier Sciences[QYZDJSSW-JSC040] ; CAS External Cooperation Key Project ; NSF of Guangdong[2018B030311046] ; Youth Innovation Promotion Association, CAS |
Funding Organization | National Key Research and Development Program of China ; Natural Science Foundation of China ; NSFC-General Technology Collaborative Fund for basic research ; Beijing Natural Science Foundation ; CAS Key Research Program of Frontier Sciences ; CAS External Cooperation Key Project ; NSF of Guangdong ; Youth Innovation Promotion Association, CAS |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Artificial Intelligence ; Computer Science, Hardware & Architecture ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic |
WOS ID | WOS:000704111000021 |
Publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC |
Sub direction classification | 机器学习 |
Citation statistics | |
Document Type | 期刊论文 |
Identifier | http://ir.ia.ac.cn/handle/173211/44804 |
Collection | 模式识别国家重点实验室_视频内容安全 |
Corresponding Author | Yuan, Chunfeng; Li, Bing |
Affiliation | 1.National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences 2.School of Artificial Intelligence, University of Chinese Academy of Sciences 3.CAS Center for Excellence in Brain Science and Intelligence Technology 4.PeopleAI Inc. 5.National Computer Network Emergency Response Technical Team/Coordination Center of China 6.Department of Computer Science and Information Systems, Birkbeck College, University of London |
First Author Affilication | Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China |
Corresponding Author Affilication | Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China |
Recommended Citation GB/T 7714 | Ruan, Xiaofeng,Liu, Yufan,Yuan, Chunfeng,et al. EDP: An Efficient Decomposition and Pruning Scheme for Convolutional Neural Network Compression[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS,2020,32(0):0. |
APA | Ruan, Xiaofeng.,Liu, Yufan.,Yuan, Chunfeng.,Li, Bing.,Hu, Weiming.,...&Maybank, Stephen.(2020).EDP: An Efficient Decomposition and Pruning Scheme for Convolutional Neural Network Compression.IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS,32(0),0. |
MLA | Ruan, Xiaofeng,et al."EDP: An Efficient Decomposition and Pruning Scheme for Convolutional Neural Network Compression".IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 32.0(2020):0. |
Files in This Item: | Download All | |||||
File Name/Size | DocType | Version | Access | License | ||
EDP_An Efficient Dec(3625KB) | 期刊论文 | 作者接受稿 | 开放获取 | CC BY-NC-SA | View Download |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment