ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition
Wan, Jun1,2; Lin, Chi3; Wen, Longyin4; Li, Yunan5,6; Miao, Qiguang5,6; Escalera, Sergio7; Anbarjafari, Gholamreza8,9,10; Guyon, Isabelle11,12; Guo, Guodong13,14; Li, Stan Z.15,16
发表期刊IEEE TRANSACTIONS ON CYBERNETICS
ISSN2168-2267
2022-05-01
卷号52期号:5页码:3422-3433
通讯作者Wan, Jun(jun.wan@ia.ac.cn)
摘要The ChaLearn large-scale gesture recognition challenge has run twice in two workshops in conjunction with the International Conference on Pattern Recognition (ICPR) 2016 and International Conference on Computer Vision (ICCV) 2017, attracting more than 200 teams around the world. This challenge has two tracks, focusing on isolated and continuous gesture recognition, respectively. It describes the creation of both benchmark datasets and analyzes the advances in large-scale gesture recognition based on these two datasets. In this article, we discuss the challenges of collecting large-scale ground-truth annotations of gesture recognition and provide a detailed analysis of the current methods for large-scale isolated and continuous gesture recognition. In addition to the recognition rate and mean Jaccard index (MJI) as evaluation metrics used in previous challenges, we introduce the corrected segmentation rate (CSR) metric to evaluate the performance of temporal segmentation for continuous gesture recognition. Furthermore, we propose a bidirectional long short-term memory (Bi-LSTM) method, determining video division points based on skeleton points. Experiments show that the proposed Bi-LSTM outperforms state-of-the-art methods with an absolute improvement of 8.1% (from 0.8917 to 0.9639) of CSR.
关键词Gesture recognition Measurement Task analysis Training Conferences Computer vision Bidirectional long short-term memory (Bi-LSTM) gesture recognition RGB-D
DOI10.1109/TCYB.2020.3012092
关键词[WOS]FUSION
收录类别SCI
语种英语
资助项目Chinese National Natural Science Foundation[61961160704] ; Chinese National Natural Science Foundation[61876179] ; Key Project of the General Logistics Department[ASW17C001] ; Science and Technology Development Fund of Macau[0010/2019/AFJ] ; Science and Technology Development Fund of Macau[0025/2019/AKP] ; Science and Technology Development Fund of Macau[PID2019-105093GB-I00] ; (MINECO/FEDER, UE) ; (CERCA Programme/Generalitat de Catalunya) ; ICREA through the ICREA Academia Programme - European Regional Development Fund
项目资助者Chinese National Natural Science Foundation ; Key Project of the General Logistics Department ; Science and Technology Development Fund of Macau ; (MINECO/FEDER, UE) ; (CERCA Programme/Generalitat de Catalunya) ; ICREA through the ICREA Academia Programme - European Regional Development Fund
WOS研究方向Automation & Control Systems ; Computer Science
WOS类目Automation & Control Systems ; Computer Science, Artificial Intelligence ; Computer Science, Cybernetics
WOS记录号WOS:000798227800076
出版者IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
引用统计
被引频次:13[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/49483
专题多模态人工智能系统全国重点实验室_生物识别与安全技术
通讯作者Wan, Jun
作者单位1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100190, Peoples R China
3.Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
4.JD Finance, Mountain View, CA 94043 USA
5.Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Peoples R China
6.Xidian Univ, Xian Key Lab Big Data & Intelligent Vis, Xian 710071, Peoples R China
7.Univ Barcelona, Comp Vis Ctr, Barcelona 08007, Spain
8.Univ Tartu, Inst Technol, iCV Lab, EE-50090 Tartu, Estonia
9.PwC Finland, Helsinki 00180, Finland
10.Hasan Kalyoncu Univ, Fac Engn, TR-27100 Gaziantep, Turkey
11.ChaLearn, San Francisco, CA 94115 USA
12.Univ Paris Saclay, F-91190 St Aubin, France
13.Baidu Res, Inst Deep Learning, Beijing 100193, Peoples R China
14.Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100193, Peoples R China
15.Westlake Univ, Hangzhou 310024, Peoples R China
16.Macau Univ Sci & Technol, Taipa, Macau, Peoples R China
第一作者单位模式识别国家重点实验室
通讯作者单位模式识别国家重点实验室
推荐引用方式
GB/T 7714
Wan, Jun,Lin, Chi,Wen, Longyin,et al. ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition[J]. IEEE TRANSACTIONS ON CYBERNETICS,2022,52(5):3422-3433.
APA Wan, Jun.,Lin, Chi.,Wen, Longyin.,Li, Yunan.,Miao, Qiguang.,...&Li, Stan Z..(2022).ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition.IEEE TRANSACTIONS ON CYBERNETICS,52(5),3422-3433.
MLA Wan, Jun,et al."ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition".IEEE TRANSACTIONS ON CYBERNETICS 52.5(2022):3422-3433.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Wan, Jun]的文章
[Lin, Chi]的文章
[Wen, Longyin]的文章
百度学术
百度学术中相似的文章
[Wan, Jun]的文章
[Lin, Chi]的文章
[Wen, Longyin]的文章
必应学术
必应学术中相似的文章
[Wan, Jun]的文章
[Lin, Chi]的文章
[Wen, Longyin]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。