复杂场景下的行人再识别方法研究 | |
徐博强 | |
2023-05-19 | |
页数 | 126 |
学位类型 | 博士 |
中文摘要 | 随着智慧城市建设的不断推进,大量的监控设备已经被部署在各种公共场所,形成了庞大的分布式监控网络,从而产生了大量的视频监控数据。行人再识别技术结合了计算机视觉、机器学习和模式识别等领域的方法,通过提取监控画面中行人图像的外观特征并比较特征相似度,实现了关联不同摄像机中同一行人的轨迹图像。在公安刑侦、人物检索和人机交互等场景中,行人再识别技术具有广泛的应用前景。 与手工特征相比,基于深度学习的行人再识别方法利用深度神经网络提取出了更具判别性的行人特征,并在公开的学术数据集上展现了优异的再识别性能。然而,在实际应用过程中,行人再识别模型经常面临跨域、低光照、遮挡等复杂环境,具体而言可能遇到以下问题:(1)常规的行人再识别模型通常基于行人的衣着服饰提取特征,进而对行人身份进行判断。然而,在学校、工厂、银行等场景中,由于人们穿着相似的衣服,导致无法再根据衣着特征对行人身份进行判断,因此常规的行人再识别模型的准确率会大幅度下降。如何在这种情况下进行准确的行人再识别,对行人再识别模型的判别能力提出了挑战。(2)行人再识别模型在实际应用中涵盖学校、商场、车站、机场等多个场景。但是,当行人再识别模型在未经训练的领域上进行部署时,由于不同领域之间的差异较大,使得行人再识别模型在跨场景应用时性能显著下降,甚至无法使用,这对行人再识别模型的泛化性和域适应性提出了挑战。(3)实际应用场景中,人遮人和物遮人的情况经常出现。遮挡物的存在会在特征提取和特征匹配过程中引入噪声,并使得行人的部分特征缺失,从而导致遮挡行人再识别的准确率下降,这对行人再识别模型的鲁棒性提出了挑战。 本文针对上述挑战对行人再识别任务展开了研究,主要工作和创新点包括: 针对人们穿着相似衣服的细粒度行人再识别问题,由于这种情况下衣着特征不再可靠,本文提出了一种同时利用头肩特征和与颜色无关的特征辅助进行行人身份判别的行人再识别算法。该算法会通过一个轻量级的头肩分割层对头肩区域进行定位并提取对应特征。同时利用实例归一化来学习与颜色无关的特征,并且通过注意力机制来挖掘更加丰富的判别线索,以确保提取出的与颜色无关的特征拥有较强的表征性。本文建立了用于研究细粒度行人再识别的数据集FG-reID,并通过与其他方法的对比,验证了该行人再识别算法的有效性。 总而言之,本文针对行人再识别模型在实际复杂场景中可能面临的穿着相似衣服、跨域和遮挡的问题进行了深入的研究,并提出了多种有效的方法,提升了行人再识别模型的判别能力、泛化能力和鲁棒性,增强了模型的实际应用能力,推动了行人再识别课题的发展。 |
英文摘要 | With the continuous promotion of the construction of smart cities, a large number of monitoring devices have been deployed in various public places, forming a massive distributed monitoring network, which has generated a large amount of video surveillance data. Person re-identification technology combines methods from the fields of computer vision, machine learning, and pattern recognition to extract appearance features from pedestrian images in surveillance footage and compare feature similarity to achieve association of trajectory images of the same pedestrian in different cameras. Person re-identification technology has a wide range of application prospects in scenarios such as public security criminal investigation, character retrieval, and human-machine interaction. Compared with handcrafted features, person re-identification methods based on deep learning utilize deep neural networks to extract more discriminative pedestrian features and have demonstrated outstanding re-identification performance on publicly available academic datasets. However, in practical applications, person re-identification models often face complex environments such as cross-domain, low lighting, and occlusion, which may encounter the following issues: (1) Conventional person re-identification models usually extract features based on a pedestrian's clothing and apparel, and then make judgments on the pedestrian's identity. However, in scenarios such as schools, factories, and banks, people may wear similar clothing, resulting in a significant decrease in the accuracy of conventional person re-identification models based on clothing features. This poses a challenge to the discriminative ability of person re-identification models in accurately recognizing pedestrians under such circumstances. (2) person re-identification models cover multiple scenarios such as schools, shopping malls, stations, and airports in practical applications. However, when deploying person re-identification models in untrained domains, significant differences between different domains may cause a significant decline in the model's performance in cross-domain applications, or even make it unusable, which poses a challenge to the generalization and domain adaptation of person re-identification models. (3) In real-world applications, situations where people or objects occlude pedestrians often occur. The presence of occlusions introduces noise in the feature extraction and matching processes and causes some features of the pedestrian to be missing, resulting in a decrease in the accuracy of occluded person re-identification, which poses a challenge to the robustness of person re-identification models. This paper addresses the aforementioned challenges in person re-identification and presents research in this field. The main contributions and innovations of this work include:
In summary, this paper conducts in-depth research on the fine-grained person re-identification problem, cross-domain, and occlusion that person re-identification models may face in practical complex scenarios, and proposes various effective methods to enhance the discriminative ability, generalization ability, and robustness of person re-identification models, enhancing their practical application capabilities and promoting the development of person re-identification. |
关键词 | 行人再识别 细粒度检索 跨域检索 遮挡行人再识别 |
语种 | 中文 |
七大方向——子方向分类 | 目标检测、跟踪与识别 |
国重实验室规划方向分类 | 视觉信息处理 |
是否有论文关联数据集需要存交 | 否 |
文献类型 | 学位论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/52316 |
专题 | 毕业生_博士学位论文 |
推荐引用方式 GB/T 7714 | 徐博强. 复杂场景下的行人再识别方法研究[D],2023. |
条目包含的文件 | ||||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
博士论文.pdf(8990KB) | 学位论文 | 限制开放 | CC BY-NC-SA |
个性服务 |
推荐该条目 |
保存到收藏夹 |
查看访问统计 |
导出为Endnote文件 |
谷歌学术 |
谷歌学术中相似的文章 |
[徐博强]的文章 |
百度学术 |
百度学术中相似的文章 |
[徐博强]的文章 |
必应学术 |
必应学术中相似的文章 |
[徐博强]的文章 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论