Knowledge Commons of Institute of Automation,CAS
ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition | |
Wan, Jun1,2; Lin, Chi3; Wen, Longyin4; Li, Yunan5,6; Miao, Qiguang5,6; Escalera, Sergio7; Anbarjafari, Gholamreza8,9,10; Guyon, Isabelle11,12; Guo, Guodong13,14; Li, Stan Z.15,16 | |
发表期刊 | IEEE TRANSACTIONS ON CYBERNETICS |
ISSN | 2168-2267 |
2022-05-01 | |
卷号 | 52期号:5页码:3422-3433 |
通讯作者 | Wan, Jun(jun.wan@ia.ac.cn) |
摘要 | The ChaLearn large-scale gesture recognition challenge has run twice in two workshops in conjunction with the International Conference on Pattern Recognition (ICPR) 2016 and International Conference on Computer Vision (ICCV) 2017, attracting more than 200 teams around the world. This challenge has two tracks, focusing on isolated and continuous gesture recognition, respectively. It describes the creation of both benchmark datasets and analyzes the advances in large-scale gesture recognition based on these two datasets. In this article, we discuss the challenges of collecting large-scale ground-truth annotations of gesture recognition and provide a detailed analysis of the current methods for large-scale isolated and continuous gesture recognition. In addition to the recognition rate and mean Jaccard index (MJI) as evaluation metrics used in previous challenges, we introduce the corrected segmentation rate (CSR) metric to evaluate the performance of temporal segmentation for continuous gesture recognition. Furthermore, we propose a bidirectional long short-term memory (Bi-LSTM) method, determining video division points based on skeleton points. Experiments show that the proposed Bi-LSTM outperforms state-of-the-art methods with an absolute improvement of 8.1% (from 0.8917 to 0.9639) of CSR. |
关键词 | Gesture recognition Measurement Task analysis Training Conferences Computer vision Bidirectional long short-term memory (Bi-LSTM) gesture recognition RGB-D |
DOI | 10.1109/TCYB.2020.3012092 |
关键词[WOS] | FUSION |
收录类别 | SCI |
语种 | 英语 |
资助项目 | Chinese National Natural Science Foundation[61961160704] ; Chinese National Natural Science Foundation[61876179] ; Key Project of the General Logistics Department[ASW17C001] ; Science and Technology Development Fund of Macau[0010/2019/AFJ] ; Science and Technology Development Fund of Macau[0025/2019/AKP] ; Science and Technology Development Fund of Macau[PID2019-105093GB-I00] ; (MINECO/FEDER, UE) ; (CERCA Programme/Generalitat de Catalunya) ; ICREA through the ICREA Academia Programme - European Regional Development Fund |
项目资助者 | Chinese National Natural Science Foundation ; Key Project of the General Logistics Department ; Science and Technology Development Fund of Macau ; (MINECO/FEDER, UE) ; (CERCA Programme/Generalitat de Catalunya) ; ICREA through the ICREA Academia Programme - European Regional Development Fund |
WOS研究方向 | Automation & Control Systems ; Computer Science |
WOS类目 | Automation & Control Systems ; Computer Science, Artificial Intelligence ; Computer Science, Cybernetics |
WOS记录号 | WOS:000798227800076 |
出版者 | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/49483 |
专题 | 多模态人工智能系统全国重点实验室_生物识别与安全技术 |
通讯作者 | Wan, Jun |
作者单位 | 1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci, Beijing 100190, Peoples R China 3.Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA 4.JD Finance, Mountain View, CA 94043 USA 5.Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Peoples R China 6.Xidian Univ, Xian Key Lab Big Data & Intelligent Vis, Xian 710071, Peoples R China 7.Univ Barcelona, Comp Vis Ctr, Barcelona 08007, Spain 8.Univ Tartu, Inst Technol, iCV Lab, EE-50090 Tartu, Estonia 9.PwC Finland, Helsinki 00180, Finland 10.Hasan Kalyoncu Univ, Fac Engn, TR-27100 Gaziantep, Turkey 11.ChaLearn, San Francisco, CA 94115 USA 12.Univ Paris Saclay, F-91190 St Aubin, France 13.Baidu Res, Inst Deep Learning, Beijing 100193, Peoples R China 14.Natl Engn Lab Deep Learning Technol & Applicat, Beijing 100193, Peoples R China 15.Westlake Univ, Hangzhou 310024, Peoples R China 16.Macau Univ Sci & Technol, Taipa, Macau, Peoples R China |
第一作者单位 | 模式识别国家重点实验室 |
通讯作者单位 | 模式识别国家重点实验室 |
推荐引用方式 GB/T 7714 | Wan, Jun,Lin, Chi,Wen, Longyin,et al. ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition[J]. IEEE TRANSACTIONS ON CYBERNETICS,2022,52(5):3422-3433. |
APA | Wan, Jun.,Lin, Chi.,Wen, Longyin.,Li, Yunan.,Miao, Qiguang.,...&Li, Stan Z..(2022).ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition.IEEE TRANSACTIONS ON CYBERNETICS,52(5),3422-3433. |
MLA | Wan, Jun,et al."ChaLearn Looking at People: IsoGD and ConGD Large-Scale RGB-D Gesture Recognition".IEEE TRANSACTIONS ON CYBERNETICS 52.5(2022):3422-3433. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论