A Unified Framework for Multi-Modal Isolated Gesture Recognition
Jiali Duan1; Jun Wan1; Shuai Zhou2; Xiaoyuan Guo3; Stan Z. Li1
2017
发表期刊ACM Transactions on Multimedia Computing, Communications, and Applications
卷号9期号:4页码:39:1-39:17
摘要  ; In this paper, we focus on isolated gesture recognition and explore different modalities by involving RGB stream, depth stream and saliency stream for inspection. Our goal is to push the boundary of this realm even further by proposing a unified framework which exploits the advantages of multi-modality fusion. Specifically, a spatial-temporal network architecture based on consensus-voting has been proposed to explicitly model the long term structure of the video sequence and to reduce estimation variance when confronted with comprehensive inter-class variations. In addition, a 3D depth-saliency convolutional network is aggregated in parallel to capture subtle motion characteristics. Extensive experiments are done to analyze the performance of each component and our proposed approach achieves the best results on two public benchmarks–ChaLearn IsoGD and RGBD-HuDaAct, outperforming the closest competitor by a margin of over 10% and 15% respectively. We will release our codes to facilitate future research.
关键词Multi-modal Consensus-voting 3d Convolution Isolated Gesture Recognition
WOS记录号WOS:000433517100007
引用统计
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/15302
专题模式识别国家重点实验室_生物识别与安全技术研究
通讯作者Jun Wan
作者单位1.Center for Biometrics and Security Research & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
2.Macau University of Science and Technology
3.School of Engineering Science, University of Chinese Academy of Sciences
推荐引用方式
GB/T 7714
Jiali Duan,Jun Wan,Shuai Zhou,et al. A Unified Framework for Multi-Modal Isolated Gesture Recognition[J]. ACM Transactions on Multimedia Computing, Communications, and Applications,2017,9(4):39:1-39:17.
APA Jiali Duan,Jun Wan,Shuai Zhou,Xiaoyuan Guo,&Stan Z. Li.(2017).A Unified Framework for Multi-Modal Isolated Gesture Recognition.ACM Transactions on Multimedia Computing, Communications, and Applications,9(4),39:1-39:17.
MLA Jiali Duan,et al."A Unified Framework for Multi-Modal Isolated Gesture Recognition".ACM Transactions on Multimedia Computing, Communications, and Applications 9.4(2017):39:1-39:17.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
TOMM2017_isogesture.(6349KB)期刊论文作者接受稿开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Jiali Duan]的文章
[Jun Wan]的文章
[Shuai Zhou]的文章
百度学术
百度学术中相似的文章
[Jiali Duan]的文章
[Jun Wan]的文章
[Shuai Zhou]的文章
必应学术
必应学术中相似的文章
[Jiali Duan]的文章
[Jun Wan]的文章
[Shuai Zhou]的文章
相关权益政策
暂无数据
收藏/分享
文件名: TOMM2017_isogesture.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。