基于特征增强的车辆重识别问题研究

CASIA OpenIR > 毕业生 > 博士学位论文

	基于特征增强的车辆重识别问题研究
	钱文
	2023-06-02
页数	140
学位类型	博士
中文摘要	进入21世纪以来，我国人均汽车保有量和汽车总量逐年增长。在这一过程中，城市安防需求也随之增加，其中涉及到了许多与汽车有关的问题。为了解决这些问题，车辆重识别技术应运而生。其目的在于对同一车辆在不同角度和摄像头下的图像进行检索和匹配。借助现代化的视频监控系统，基于车辆重识别的技术被广泛应用于道路拥堵预测、车辆追踪和轨迹还原等领域。然而，由于同一车辆在不同的视角下具有很大的结构和背景差异性，而在相同视角下又存在外观相似的车辆，这给车辆重识别带来了挑战。研究者们将这些问题总结为类间相似性和类内差异性，该问题严重阻碍了车辆重识别技术的发展。近年来，随着深度学习技术的不断发展，基于神经网络进行高效特征提取的车辆重识别方法也得到了长足的进步。然而，这些方法虽然提高了模型的精度，但却各自存在一定局限性，导致重识别模型无法在真实的城市道路中进行有效的应用和部署。经过详细的调研，本文将现有车辆重识别中的相关问题总结为以下四对矛盾：1）借用附加模块进行特征增强和部署时对实时低算力需求之间的矛盾；2）传统方法提取特征表达单一和重识别任务特征多样性需求之间的矛盾；3）低对齐性的特征和基于对齐假设的相似度计算方法之间的矛盾；4）无监督重识别方法和低实验精度之间的矛盾。本文从特征增强的角度提出一系列方法来解决上述矛盾，相关创新点总结如下： 1）基于迁移学习的特征增强方法：对于车辆重识别中难样本特征质量低、分辨困难的问题，研究者们提出了包括多分支网络特征融合、利用先验信息对关键部位或者属性进行特征增强等方法，并取得了一定成果。但是这些方法会在部署过程中导致额外的计算和时间消耗，无法应用于对实时性和低功耗需求高的场景。本文将上述现象总结为“借用附加模块进行特征增强和部署时对实时低算力需求之间的矛盾”，并提出基于迁移学习的特征增强方法来解决上述问题。基于迁移学习的特征增强方法通过多分支信息交互实现车辆特征增强，而且部署过程中只需要主干分支。基于不同的技术路线，本文提出了两种基于知识迁移的方法：“伙伴学习”和“伪图卷积网络”。“伙伴学习”通过基于样本关系的迁移、基于特征的迁移以及基于训练集软标签的迁移来实现全局分支和局部分支的闭环互补式特征增强。伪图卷积模型对由卷积和图卷积组成的异构双分支网络进行端到端的训练，并使用图网络迁移将图卷积的样本联想能力迁移到卷积分支中变成对关键位置或者属性的关注能力。本文提出的两类模型既通过学生和教师分支之间的信息交互进行特征的增强，又兼顾了部署过程中模型对于实时性和低功耗的需求。 2）基于多样性的特征增强方法：主流的车辆重识别框架会较多的关注于全局特征且忽略了那些细微但具有判别能力的特征，导致该类模型的特征表达单一，无法满足重识别任务对于特征多样性表示的需求。而常见的解决方法又都存在一些弊端：基于先验信息的方法无法动态的捕获车辆信息；基于自注意力机制的方法由于间接的监督、特征提取不充分。为了满足车辆特征多样性表达，本文进一步提出基于显著性挖掘的车辆重识别方法，通过模仿人类辨别困难样本的过程来挖掘充分的多阶段特征。该网络从框架和特征空间两个维度进行设计来获得充分的显著性特征。首先，该方法设计了一种显著性引导的金字塔网络，该网络通过特征挖掘-抑制-挖掘的过程获得充分的特征；紧接着，该方法又提出了跨空间的特征约束来保证特征的多样性。最后，本文通过可视化分析证明了来自不同阶段的特征所关注区域的多样性。 3）基于隐式对齐的特征增强方法：在车辆重识别问题的研究中，特征对齐也是一种有效的特征增强手段。传统重识别模型无法保证不同图片提取出来的特征是对齐的，而相似度的计算又是依赖于对齐性假设进行的。本文将上述问题总结为“低对齐性的特征和基于对齐假设的相似度计算方法之间的矛盾”，并提出基于隐式对齐的特征增强方法来解决该矛盾。本文提出一种基于聚类的隐式特征对齐方法，通过非结构化的特征解耦过程无监督的获得粗粒度可对齐的车辆特征。在解耦过程使用自注意力网络进行特征的重组，并通过聚类过程来实现对应位置的解耦特征在语义上的对齐性。在聚类的过程中，额外的设计了多样性约束和对齐性约束，来保证不同图片解耦出来的相同位置特征信息的有序性以及不同位置特征信息的多样性。该方法在主流数据集上获得了有竞争力的实验效果，并通过可视化证明了解耦特征的多样性和对齐性。 4）结合生成范式的无监督特征增强方法：针对车辆重识别“数据多、标注难”的特点，研究者提出利用无监督方法结合大数据来挖掘学习完善的车辆特征。这些方法大多是基于对比学习进行设计，目前的实验精度和有监督模型差距较大。本文进一步总结基于对比学习的重识别方法中两个重要因素：伪标签质量和正负样本的特征表达。近来基于掩码建模的生成式方法在无监督视觉预训练领域发展迅猛，在大量的下游任务获得了媲美对比学习方法的实验效果。本文发现对比范式和生成范式获得的特征各有侧重，且互为补充。本文提出结合生成范式的无监督特征增强方法，该方法构建了一个包括无监督重识别和掩码重建的双分支网络。通过结合生成范式产生低频信号对重识别网络产生的高频信号做特征增强，提升伪标签的质量且获得更好的正负样本表达。最终结合生成范式的无监督特征增强方法在车辆、行人的无监督重识别以及跨域实验上均取得了有竞争力的实验效果。本文提出的系列方法还在车辆重识别主流的公开数据集上以及公安部相关的安防数据上进行了相关的测试，均取得了不错的成效。最后需要指出，由于问题的特异性和互斥性以及受研究期间计算资源的限制，该系列方法很难整合成一个完整的系统，研究者可以依据实际问题去寻找对应的解决方法。
英文摘要	Since the beginning of the 21st century, the per capita automobile ownership and the total number of automobiles in China have been increasing annually. During this process, the demand for urban security has also increased, involving various issues related to automobiles. Vehicle Re-identification (ReID) has been proposed for solving the above problems, which aims to retrieve and identify designated vehicles from a large number of vehicle images across non-overlapping cameras. With the development of the video monitoring system, vehicle ReID has been widely applied in regions like the congestion prediction of roads, vehicle tracking, and track recovery. However, vehicle re-identification faces challenges due to significant structural and background variations of the same vehicle from different viewpoints, as well as the presence of visually similar vehicles from similar viewpoints. The above problems are summarized as inter-class similarity and intra-class difference, which make the vehicle re-identification problem challenging. In recent years, with the continuous development of deep learning technology, the vehicle ReID methods have also made great progress. Although the above methods have improved the performance of Vehicle ReID, there still exist some limitations to them and make it impossible to deploy them in real applications: 1) The contradiction between feature enhancement through auxiliary modules and the real-time low computational power requirements during deployment; 2) The contradiction between the single expression of extracted features and the diversity and sufficient feature requirements of the ReID task; 3) The contradiction between low alignment in re-identification features and similarity calculation methods based on alignment assumptions; 4) The contradiction between unsupervised re-identification methods and low experimental accuracy. This paper aims to solve the above shortcomings from the perspective of feature enhancement. The main works and innovations of the dissertation can be summarized as follows: 1) Feature enhancement scheme based on transfer learning. For improving the low-quality and indistinguishable features of the difficult samples, researchers have proposed several methods including feature fusion based on a multi-branch network and employing the prior information from key parts or attributes. However, these methods will increase additional calculation and time consumption when deployed, which causes the model to fail to meet the requirement of real-time and low power consumption. This paper proposes that vehicle features can be enhanced by knowledge transfer, which not only enhances the features through the information interaction between the branches but also takes into account the real-time requirements of the model during the deployment process. This paper proposes two vehicle re-identification models based on knowledge transfer: partner learning and pseudo-graph convolution network (pseudo-GCN) for vehicle ReID. Partner learning aims to enhance the global feature by a complementary feature enhancement process among branches, which composes of relation-based knowledge transfer, attention-based knowledge transfer, and soft label-based knowledge transfer. The pseudo-GCN network proposes to train a dual-branch network with a CNN branch and a GCN branch under an end-to-end mode, and thus the topological optimization ability of GCN branch can be transferred to the attention of key locations or attributes in the CNN branch by the graph-based distillation. 2) Feature enhancement scheme based on diversity requirement. Mainstream vehicle ReID frameworks pay more attention to global features and ignore those subtle but discriminative features. As a result, the feature representation proposed by this kind of model is single and cannot meet the diversity representation requirement of the ReID task. However, the common solutions have some drawbacks: the method based on prior information cannot dynamically capture vehicle information, and the self-attention methods are usually insufficient due to indirect supervision during the training process. In order to satisfy the diverse expression of vehicle features, this paper tackles the above limitation by proposing a novel Salience-Navigated Vehicle Re-identification Network (SVRN) which explores diverse salient features at multi-scale. For mining sufficient salient features, we design SVRN from two aspects: a) network architecture: we propose a novel salience-navigated vehicle re-identification network, which mines diverse features under a cascaded suppress-and-explore mode. b) feature space: cross-space constraint enables the diversity from feature space, which restrains the cross-space features by vehicle and image identifications (IDs). 3) Feature enhancement scheme based on implicit alignment. Apart from directly using the teacher model or prior information for feature enhancement in vehicle ReID, feature alignment can also be viewed as an effective means of feature enhancement. Traditional vehicle re-identification methods cannot guarantee the alignment of the features extracted from different images, and the unaligned features will lead to inaccurate matching scores calculated by cosine similarity. We summarize the above problem as "the contradiction between the feature with low alignment and the similarity calculation method based on alignment hypothesis", and a feature enhancement scheme based on implicit feature alignment is proposed to solve the contradiction----the unstructured feature decoupling network. The unstructured feature decoupling network is a cluster-based feature alignment method, which aims to achieve more flexible unstructured and aligned decoupled features with diverse discriminative information. In the clustering process, this paper incorporates diversity constraints and alignment constraints to ensure the orderliness of identical positional feature information decoupled from different images, as well as the diversity of feature information from different positions. 4) Unsupervised feature enhancement scheme combined with generative paradigm. In order to further improve the performance of the model, this paper further proposes a data-driven feature enhancement approach for vehicle re-identification. Different from the above three model-driven feature enhancement methods, data-driven feature enhancement aims to use large-scale data to learn complete vehicle features. But as we all know, although massive vehicle data can be obtained from urban transportation systems, the cost of data labeling has always been high. Based on the above observations, we propose that an unsupervised re-identification method can be used to utilize large-scale data for effective feature enhancement. However, most of the existing unsupervised methods have the problem of low accuracy, and this paper points out that is caused by the low quality of the pseudo-labels. How to use large-scale data at a low cost for effective feature enhancement has become the core issue of this paper. This paper proposes that by introducing a generative paradigm into the contrastive-based unsupervised network, the low-frequency features generated by the masked image modeling network (MIM) can complement the high-frequency feature generated by the re-identification network, thereby improving the quality of the pseudo-label and the performance of ReID. The series of methods proposed in this paper are tested on the mainstream vehicle datasets and the security data from the Ministry of Public Security, all of which have achieved good results. Finally, it should be pointed out that due to the specificity and mutual exclusion of the problems and the limitation of computing resources during the study period, it is difficult to integrate the series of methods into a complete system. Researchers can find corresponding solutions according to actual problems.
关键词	车辆重识别特征增强知识迁移隐式特征对齐生成范式
学科领域	计算机感知
学科门类	工学
语种	中文
七大方向——子方向分类	图像视频处理与分析
国重实验室规划方向分类	视觉信息处理
是否有论文关联数据集需要存交	否
文献类型	学位论文
条目标识符	http://ir.ia.ac.cn/handle/173211/51901
专题	毕业生_博士学位论文
推荐引用方式 GB/T 7714	钱文. 基于特征增强的车辆重识别问题研究[D],2023.

条目包含的文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
基于特征增强的车辆重识别问题研究_钱文.（7526KB）	学位论文		限制开放	CC BY-NC-SA