Anchor-free temporal action localization via Progressive Boundary-aware Boosting
Tang, Yepeng (1); Wang, Weining (2); Yang, Yanwu (3); Zhang, Chunjie (1); Liu, Jing (2)
Journal: Information Processing & Management
Publication date: 2022-11
Volume: 60; Issue: 1; Pages: 103141
Abstract

The enormous volume of untrimmed real-world video is difficult to analyze and manage. Temporal action localization algorithms help locate and recognize human activity clips in untrimmed videos. Recently, anchor-free temporal action localization methods have gained increasing attention because they have low computational cost and require no complex hyperparameters for pre-set anchors. Although their performance has improved significantly, most existing anchor-free methods still suffer from inaccurate action boundary predictions. In this paper, we aim to alleviate this problem through boundary refinement and temporal context aggregation. To this end, we propose a novel Progressive Boundary-aware Boosting Network (PBBNet) for anchor-free temporal action localization. PBBNet consists of three main modules: the Temporal Context-aware Module (TCM), the Instance-wise Boundary-aware Module (IBM), and the Frame-wise Progressive Boundary-aware Module (FPBM). The TCM aggregates temporal context information and provides features for the IBM and the FPBM. The IBM generates multi-scale video features to predict coarse action results. Compared with the IBM, the FPBM focuses on the instance features corresponding to action predictions and uses more supervision information for boundary regression. Given the action results from the IBM, the FPBM uses a progressive boosting strategy to refine the boundary predictions multiple times, with supervision from weak to strong. Extensive experiments on three benchmark datasets, THUMOS14, ActivityNet-v1.3, and HACS, show that our PBBNet outperforms all existing anchor-free methods. Further, our PBBNet achieves state-of-the-art performance (72.5% mAP at tIoU = 0.5) on the THUMOS14 dataset.
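The abstract reports results at a temporal IoU (tIoU) threshold and describes anchor-free boundary prediction. As a minimal sketch of those two ideas, the Python below decodes a segment from a temporal location plus predicted distances to its start and end, which is a common generic anchor-free formulation and is assumed here rather than taken from PBBNet, and then scores it against a ground-truth segment with tIoU. All function names and the sample numbers are hypothetical and chosen only for illustration.

```python
from typing import Tuple

Segment = Tuple[float, float]  # (start, end), e.g. in seconds


def decode_segment(t: float, d_start: float, d_end: float) -> Segment:
    """Generic anchor-free decoding (assumed, not PBBNet-specific):
    a temporal location t plus predicted distances to both boundaries."""
    return (t - d_start, t + d_end)


def temporal_iou(pred: Segment, gt: Segment) -> float:
    """Temporal IoU: overlap length divided by the length of the union."""
    inter = max(0.0, min(pred[1], gt[1]) - max(pred[0], gt[0]))
    union = (pred[1] - pred[0]) + (gt[1] - gt[0]) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred: Segment, gt: Segment, threshold: float = 0.5) -> bool:
    """A prediction counts as a match when its tIoU reaches the threshold."""
    return temporal_iou(pred, gt) >= threshold


if __name__ == "__main__":
    pred = decode_segment(t=10.0, d_start=2.0, d_end=3.0)  # hypothetical values
    gt = (9.0, 14.0)                                        # hypothetical ground truth
    print(pred, temporal_iou(pred, gt), is_true_positive(pred, gt))
```

Because tIoU penalizes errors at either boundary, even a one-second shift in the start or end of a short action can push a prediction below the 0.5 threshold, which is why boundary refinement matters for the reported mAP.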

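Keywords: Temporal action localization; Anchor-free; Video understanding
Discipline: Engineering; Engineering::Computer Science and Technology (degrees in engineering or science may be conferred)
Indexed by: SCI
Language: English
Seven major directions, sub-direction classification: Image and video processing and analysis
State Key Laboratory planned research direction: Visual information processing
Associated dataset requiring deposit: (not stated)
Document type: Journal article
Item identifier: http://ir.ia.ac.cn/handle/173211/51598
Collection: 紫东太初大模型研究中心 (Zidong Taichu Large Model Research Center)
Corresponding author: Zhang, Chunjie
Author affiliations: 1. Beijing Jiaotong University
2. Institute of Automation, Chinese Academy of Sciences
3. Huazhong University of Science and Technology
Recommended citation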
GB/T 7714
Tang, Yepeng, Wang, Weining, Yang, Yanwu, et al. Anchor-free temporal action localization via Progressive Boundary-aware Boosting[J]. Information Processing & Management, 2022, 60(1): 103141.
APA: Tang, Yepeng, Wang, Weining, Yang, Yanwu, Zhang, Chunjie, & Liu, Jing. (2022). Anchor-free temporal action localization via Progressive Boundary-aware Boosting. Information Processing & Management, 60(1), 103141.
MLA: Tang, Yepeng, et al. "Anchor-free temporal action localization via Progressive Boundary-aware Boosting." Information Processing & Management 60.1 (2022): 103141.
Files in this item
File name/size: Anchor-free temporal (1559KB); Document type: Journal article; Version: Author accepted manuscript; Access: Open access; License: CC BY-NC-SA
File name: Anchor-free temporal action localization via Progressive Boundary-aware Boosting.pdf
Format: Adobe PDF
Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.