Knowledge Commons of Institute of Automation,CAS
A Weakly-Supervised Crowd Density Estimation Method Based on Two-Stage Linear Feature Calibration | |
Yong-Chao Li; Rui-Sheng Jia; Ying-Xiang Hu; Hong-Mei Sun | |
发表期刊 | IEEE/CAA Journal of Automatica Sinica
![]() |
ISSN | 2329-9266 |
2024 | |
卷号 | 11期号:4页码:965-981 |
摘要 | In a crowd density estimation dataset, the annotation of crowd locations is an extremely laborious task, and they are not taken into the evaluation metrics. In this paper, we aim to reduce the annotation cost of crowd datasets, and propose a crowd density estimation method based on weakly-supervised learning, in the absence of crowd position supervision information, which directly reduces the number of crowds by using the number of pedestrians in the image as the supervised information. For this purpose, we design a new training method, which exploits the correlation between global and local image features by incremental learning to train the network. Specifically, we design a parent-child network (PC-Net) focusing on the global and local image respectively, and propose a linear feature calibration structure to train the PC-Net simultaneously, and the child network learns feature transfer factors and feature bias weights, and uses the transfer factors and bias weights to linearly feature calibrate the features extracted from the Parent network, to improve the convergence of the network by using local features hidden in the crowd images. In addition, we use the pyramid vision transformer as the backbone of the PC-Net to extract crowd features at different levels, and design a global-local feature loss function (L2). We combine it with a crowd counting loss (LC) to enhance the sensitivity of the network to crowd features during the training process, which effectively improves the accuracy of crowd density estimation. The experimental results show that the PC-Net significantly reduces the gap between fully-supervised and weakly-supervised crowd density estimation, and outperforms the comparison methods on five datasets of ShanghaiTech Part A, ShanghaiTech Part B, UCF_CC_50, UCF_QNRF and JHU-CROWD++. |
关键词 | Crowd density estimation linear feature calibration vision transformer weakly-supervision learning |
DOI | 10.1109/JAS.2023.123960 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/55370 |
专题 | 学术期刊_IEEE/CAA Journal of Automatica Sinica |
推荐引用方式 GB/T 7714 | Yong-Chao Li,Rui-Sheng Jia,Ying-Xiang Hu,et al. A Weakly-Supervised Crowd Density Estimation Method Based on Two-Stage Linear Feature Calibration[J]. IEEE/CAA Journal of Automatica Sinica,2024,11(4):965-981. |
APA | Yong-Chao Li,Rui-Sheng Jia,Ying-Xiang Hu,&Hong-Mei Sun.(2024).A Weakly-Supervised Crowd Density Estimation Method Based on Two-Stage Linear Feature Calibration.IEEE/CAA Journal of Automatica Sinica,11(4),965-981. |
MLA | Yong-Chao Li,et al."A Weakly-Supervised Crowd Density Estimation Method Based on Two-Stage Linear Feature Calibration".IEEE/CAA Journal of Automatica Sinica 11.4(2024):965-981. |
条目包含的文件 | 下载所有文件 | |||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
JAS-2023-0589.pdf(10448KB) | 期刊论文 | 出版稿 | 开放获取 | CC BY-NC-SA | 浏览 下载 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论