Joint spatial-temporal attention for action recognition

doi:10.1016/j.patrec.2018.07.034

CASIA OpenIR > 多模态人工智能系统全国重点实验室 > 先进时空数据分析与学习

	Joint spatial-temporal attention for action recognition
	Yu, Tingzhao1,2 ; Guo, Chaoxu1,2 ; Wang, Lingfeng1 ; Gu, Huxiang1 ; Xiang, Shiming1 ; Pan, Chunhong1
发表期刊	PATTERN RECOGNITION LETTERS
ISSN	0167-8655
	2018-09-01
卷号	112 页码:226-233
文章类型	Article
摘要	In this paper, we propose a novel high-level action representation using joint spatial-temporal attention model, with application to video-based human action recognition. Specifically, to extract robust motion representations of videos, a new spatial attention module based on 3D convolution is proposed, which can pay attention to the salient parts of the spatial areas. For better dealing with long-duration videos, a new bidirectional LSTM based temporal attention module is introduced, which aims to focus on the key video cubes instead of the key video frames of a given video. The spatial-temporal attention network can be jointly trained via a two-stage strategy, which enables us to simultaneously explore the correlation both in spatial and temporal domain. Experimental results on benchmark action recognition datasets demonstrate the effectiveness of our network. (c) 2018 Elsevier B.V. All rights reserved.
关键词	Action Recognition Spatial-temporal Attention Two-stage
WOS标题词	Science & Technology ; Technology
DOI	10.1016/j.patrec.2018.07.034
关键词[WOS]	REPRESENTATION
收录类别	SCI
语种	英语
项目资助者	National Natural Science Foundation of China(61773377 ; 61573352 ; 91646207 ; 91438105)
WOS研究方向	Computer Science
WOS类目	Computer Science, Artificial Intelligence
WOS记录号	WOS:000443950800033
出版者	ELSEVIER SCIENCE BV
引用统计	被引频次：23[WOS] [WOS记录] [WOS相关记录]
文献类型	期刊论文
条目标识符	http://ir.ia.ac.cn/handle/173211/27907
专题	多模态人工智能系统全国重点实验室_先进时空数据分析与学习
通讯作者	Yu, Tingzhao
作者单位	1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 101408, Peoples R China
第一作者单位	模式识别国家重点实验室
通讯作者单位	模式识别国家重点实验室
推荐引用方式 GB/T 7714	Yu, Tingzhao,Guo, Chaoxu,Wang, Lingfeng,et al. Joint spatial-temporal attention for action recognition[J]. PATTERN RECOGNITION LETTERS,2018,112:226-233.
APA	Yu, Tingzhao,Guo, Chaoxu,Wang, Lingfeng,Gu, Huxiang,Xiang, Shiming,&Pan, Chunhong.(2018).Joint spatial-temporal attention for action recognition.PATTERN RECOGNITION LETTERS,112,226-233.
MLA	Yu, Tingzhao,et al."Joint spatial-temporal attention for action recognition".PATTERN RECOGNITION LETTERS 112(2018):226-233.