Rethinking prediction alignment in one-stage object detection
Xiao, Junrui1,2; Jiang, He1,2; Li, Zhikai1,2; Gu, Qingyi1
发表期刊NEUROCOMPUTING
ISSN0925-2312
2022-12-01
卷号514页码:58-69
通讯作者Gu, Qingyi(qingyi.gu@ia.ac.cn)
摘要Owing to their excellent performance and efficiency, one-stage detectors have been widely used in mul-timedia tasks, such as temporal action detection, object tracking, and video detection. However, misalign-ment between classification and regression branches limits the accuracy of the detector. Most existing works add an auxiliary branch or adopt a specific sample assignment strategy to alleviate this problem, but with little effect. In this paper, we attribute this to incomplete branch interactions and propose a comprehensive Predictive Aligned Object Detector (PAOD), which can better correlate two subtasks. Specifically, our proposed PAOD achieves a better trade-off between prediction-interactive and prediction-specific by adopting an Iterative Aggregation Module (IAM) and a Mutual Constraint Module (MCM). We also design an aligned label assignment with an adaptive metric and re-weighting mechanism to further narrow the misalignment between prediction heads. With negligible additional overhead, PAOD achieves 50.4 AP at single-model single-scale testing on the MS-COCO branch, which demonstrates the effectiveness of our proposal. Notably, PAOD consistently outperforms previous sota such as ATSS (47.7 AP), BorderDet (48.0 AP) and GFL (48.2 AP) by a large margin on COCO test-dev data -set, and achieves better performance than various dense detectors on Pascal VOC and CrowdHuman data -sets. Code is available at https://github.com/JunruiXiao/PAOD.(c) 2022 Elsevier B.V. All rights reserved.
关键词Deep learning Object detection Detection head Label assignment
DOI10.1016/j.neucom.2022.09.132
收录类别SCI
语种英语
资助项目Scientific Instrument Developing Project of the Chinese Academy of Sciences ; [YJKYYQ20200045]
项目资助者Scientific Instrument Developing Project of the Chinese Academy of Sciences
WOS研究方向Computer Science
WOS类目Computer Science, Artificial Intelligence
WOS记录号WOS:000869405900004
出版者ELSEVIER
引用统计
被引频次:8[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/50261
专题中国科学院工业视觉智能装备工程实验室_精密感知与控制
通讯作者Gu, Qingyi
作者单位1.Chinese Acad Sci, Inst Automat, East Zhongguancun Rd, Beijing, Peoples R China
2.Univ Chinese Acad Sci, Sch Artificial Intelligence, Jingjia Rd, Beijing, Peoples R China
第一作者单位中国科学院自动化研究所
通讯作者单位中国科学院自动化研究所
推荐引用方式
GB/T 7714
Xiao, Junrui,Jiang, He,Li, Zhikai,et al. Rethinking prediction alignment in one-stage object detection[J]. NEUROCOMPUTING,2022,514:58-69.
APA Xiao, Junrui,Jiang, He,Li, Zhikai,&Gu, Qingyi.(2022).Rethinking prediction alignment in one-stage object detection.NEUROCOMPUTING,514,58-69.
MLA Xiao, Junrui,et al."Rethinking prediction alignment in one-stage object detection".NEUROCOMPUTING 514(2022):58-69.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Xiao, Junrui]的文章
[Jiang, He]的文章
[Li, Zhikai]的文章
百度学术
百度学术中相似的文章
[Xiao, Junrui]的文章
[Jiang, He]的文章
[Li, Zhikai]的文章
必应学术
必应学术中相似的文章
[Xiao, Junrui]的文章
[Jiang, He]的文章
[Li, Zhikai]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。