CASIA OpenIR  > 视频内容安全团队
Asymmetric 3D Convolutional Neural Networks for action recognition
Yang H(杨浩)1,3; Yuan CF(原春峰)1; Li B(李兵)1; Du Y(杜杨)1,3; Xing JL(兴军亮)1; Hu WM(胡卫明)1,2; Stephen J. Maybank4
Source PublicationPattern Recognition
2019-01
Volume85Issue:1Pages:1-12
Abstract

 

Convolutional Neural Network based action recognition methods have achieved significant improvements in recent years. The 3D convolution extends the 2D convolution to the spatial-temporal domain for better analysis of human activities in videos. The 3D convolution, however, involves many more parameters than the 2D convolution. Thus, it is much more expensive on computation, costly on storage, and difficult to learn. This work proposes efficient asymmetric one-directional 3D convolutions to approximate the traditional 3D convolution. To improve the feature learning capacity of asymmetric 3D convolutions, a set of local 3D convolutional networks, called MicroNets, are proposed by incorporating multi-scale 3D convolution branches. Then, an asymmetric 3D-CNN deep model is constructed by MicroNets for the action recognition task. Moreover, to avoid training two networks on the RGB and Flow frames separately as most works do, a simple but effective multi-source enhanced input is proposed, which fuses useful information of the RGB and Flow frame at the pre-processing stage.


The asymmetric 3D-CNN model is evaluated on two of the most challenging action recognition benchmarks, UCF-101 and HMDB-51. The asymmetric 3D-CNN model outperforms all the traditional 3D-CNN
models in both effectiveness and efficiency, and its performance is comparable with that of recent stateof-the-art action recognition methods on both benchmarks.

KeywordAsymmetric 3d Convolution Micronets 3d-cnn Action Recognition
Indexed BySCI
Language英语
Document Type期刊论文
Identifierhttp://ir.ia.ac.cn/handle/173211/23553
Collection视频内容安全团队
Corresponding AuthorYuan CF(原春峰)
Affiliation1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
2.Chinese Acad Sci, CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing 100190, Peoples R China
3.Univ Chinese Acad Sci, Beijing 100190, Peoples R China
4.Birkbeck Coll, Dept Comp Sci & Informat Syst, London WC1E 7HX, England
First Author AffilicationChinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
Corresponding Author AffilicationChinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
Recommended Citation
GB/T 7714
Yang H,Yuan CF,Li B,et al. Asymmetric 3D Convolutional Neural Networks for action recognition[J]. Pattern Recognition,2019,85(1):1-12.
APA Yang H.,Yuan CF.,Li B.,Du Y.,Xing JL.,...&Stephen J. Maybank.(2019).Asymmetric 3D Convolutional Neural Networks for action recognition.Pattern Recognition,85(1),1-12.
MLA Yang H,et al."Asymmetric 3D Convolutional Neural Networks for action recognition".Pattern Recognition 85.1(2019):1-12.
Files in This Item: Download All
File Name/Size DocType Version Access License
Asymmetric 3D CNN 发表(2689KB)期刊论文作者接受稿开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Yang H(杨浩)]'s Articles
[Yuan CF(原春峰)]'s Articles
[Li B(李兵)]'s Articles
Baidu academic
Similar articles in Baidu academic
[Yang H(杨浩)]'s Articles
[Yuan CF(原春峰)]'s Articles
[Li B(李兵)]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Yang H(杨浩)]'s Articles
[Yuan CF(原春峰)]'s Articles
[Li B(李兵)]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: Asymmetric 3D CNN 发表版.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.