基于深度学习的遥感影像变化检测研究

CASIA OpenIR > 毕业生 > 硕士学位论文

	基于深度学习的遥感影像变化检测研究
	周圆
	2023-05-25
页数	96
学位类型	硕士
中文摘要	变化检测作为遥感技术中的一个重要研究领域，可以帮助分析决策地表物体的变化情况，广泛应用于国土资源调查、农业生产监测、城市扩张分析和自然灾害评估等方面。由于光照变化、成像条件改变、非物理变化的影响，发展高精度、多场景泛化的高分辨率遥感影像变化检测方法一直是一个难点研究问题。当前，以深度学习为代表的人工智能技术迅速发展，在遥感影像处理领域掀起了新的浪潮。深度学习技术可以从输入数据中高效、自适应的提取层次化的代表性特征，十分适合处理高分辨率遥感影像。但是，目前基于深度学习的方法在变化检测领域仍面临困难和挑战，存在精度低、泛化性差、适用场景单一等问题。本文在充分调研和分析现有方法的基础上，总结缺陷与经验，从多个角度出发，围绕深度学习技术在高分辨率遥感影像中的多类变化检测、单类变化检测中的应用进行了深入的研究。主要研究内容如下： (1) 针对多类别的遥感影像变化检测问题，提出了一种多任务学习的处理框架。多任务学习体现在可以同时完成语义分割和变化检测两个任务，其中，变化检测分支判别发生变化的区域，从而限定了语义分割的作用范围，使得网络对变化类型的识别仅针对发生变化的区域。此外，考虑到多类别的变化检测任务中常常存在某一类占据主导地位，产生的类别不均衡的问题会导致网络倾向于多类而忽视少类。提出的针对非变化区域的语义约束函数可以使网络集中在变化区域的语义对比，将非变化区域语义尽量约束为一致，使得网络的特征提取能力和对变化情况的判别能力得到提升。实验表明，所提出的方法在检测精度上优于其它多类变化检测方法。 (2) 针对单个类别的变化检测问题，本文主要聚焦于建筑物的变化检测，提出了一种基于双交叉注意力的变化检测模型。首先，从变化的本质出发提出交叉注意力，交叉注意力机制受人眼视觉观察变化的启发，通过从左到右的交替比较和从前往后的连续匹配来检测变化。其次，为了充分挖掘图像中的高频和低频信息，提出了结合高频和低频信息的混合器，以取代传统的自注意力混合器。通过频率斜坡结构划分高频与低频分支，耦合基于卷积的和基于 Transformer 的交叉注意力混合器，使模型能够扩展其感知能力并从输入的成对数据中捕获更丰富的特征。此外，在特征提取阶段摒弃了纯粹的孪生结构，而是通过交叉注意力联系起来实现信息的交互，并通过层次化的特征提取与融合来实现细节信息的保护与恢复。大量实验表明，本文提出的方法在四个数据集在上均取得了变化检测精度与效率的平衡，综合性能超过同类方法，具有作为变化检测通用主干的潜力。
英文摘要	As an important research field in remote sensing technology, change detection can help analyze and determine changes in surface objects, and is widely used in land and resource surveys, agricultural production monitoring, urban expansion analysis, and natural disaster assessment. Due to the influence of illumination changes, changes in imaging conditions, and non-physical changes, it has always been a difficult research problem to develop high-resolution remote sensing image change detection methods with high precision and multi-scene generalization. At present, the rapid development of artificial intelligence technology represented by deep learning has set off a new wave in the field of remote sensing image processing. Deep learning technology can efficiently and adaptively extract hierarchical representative features from input data, which is very suitable for processing high-resolution remote sensing images. However, deep learning-based methods in the field of change detection still face difficulties and challenges, including low accuracy, poor generalization, and limited applicability to specific scenes. This thesis takes this opportunity, based on full investigation and analysis of existing methods, summarizes shortcomings and experiences, and conducts in-depth research from multiple perspectives around the application of depth learning technology in multi-class change detection and single-class change detection in high-resolution remote sensing images. The main research contributions are as follows: (1) Aiming at the change detection problem of multi-category remote sensing images, a processing framework of multi-task learning is proposed. Multi-task learning is reflected in the fact that two tasks of semantic segmentation and change detection can be completed at the same time. Among them, the change detection branch identifies the changed area, thus limiting the scope of semantic segmentation, so that the network's identification of the change type is only for the changed area. In addition, considering that in the multi-category change detection task, there is often a certain category that dominates, and the resulting category imbalance will cause the network to tend to multi-category while ignoring few categories. The proposed semantic constraint function for the non-changing area can make the network focus on the semantic comparison of the changing area, constrain the semantics of the non-changing area to be consistent as much as possible, and improve the feature extraction ability of the network and the ability to distinguish changes. Experiments show that the proposed method outperforms other multi-class change detection methods in detection accuracy. (2) For the change detection problem of a single category, this paper mainly focuses on the change detection of buildings, and proposes a change detection model based on double cross attention. First, starting from the essence of change, we propose cross-attention. The cross-attention mechanism is inspired by changes in human visual observation, and detects changes through alternate comparison from left to right and continuous matching from front to back. Second, in order to fully mine the high-frequency and low-frequency information in images, a mixer combining high-frequency and low-frequency information is proposed to replace the traditional self-attention mixer. The high-frequency and low-frequency branches are divided by the frequency ramp structure, and the convolution-based and Transformer-based cross-attention hybrids are coupled, enabling the model to expand its perceptual capabilities and capture more rich features from the input paired data. In addition, the pure twin structure is abandoned in the feature extraction stage, and the interaction of information is achieved through cross-attention connections, and the protection and restoration of detailed information is achieved through hierarchical feature extraction and fusion. A large number of experiments show that the method proposed in this paper has achieved a balance between change detection accuracy and efficiency on the four datasets, and its comprehensive performance exceeds similar methods, and has the potential to be a general backbone for change detection.
关键词	变化检测，遥感，交叉注意力，Transformer，深度学习
语种	中文
七大方向——子方向分类	模式识别基础
国重实验室规划方向分类	其他
是否有论文关联数据集需要存交	否
文献类型	学位论文
条目标识符	http://ir.ia.ac.cn/handle/173211/52026
专题	毕业生_硕士学位论文多模态人工智能系统全国重点实验室
推荐引用方式 GB/T 7714	周圆. 基于深度学习的遥感影像变化检测研究[D],2023.

条目包含的文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
基于深度学习的遥感影像变化检测研究.pd（24523KB）	学位论文		限制开放	CC BY-NC-SA