图像特征的相似性度量

CASIA OpenIR > 毕业生 > 博士学位论文

	图像特征的相似性度量
其他题名	Similarity between Image Features
	严望
	2009-05-31
学位类型	工学博士
中文摘要	相似性度量是模式识别的基本问题，很多算法的性能都极大地依赖于相似性度量的好坏。例如排序、最近邻分类、聚类和各种基于图的半监督学习算法均以样本间的距离度量为基础。另外，核学习方法的基础——核函数也是一种特殊的相似性度量。在与模式识别紧密联系的计算机视觉中，相似性度量也一直是重要的研究方向。与前者相比，计算机视觉要求相似性度量集成相关的领域知识，并希望其符合人类的感知。如何设计满足上述要求的相似性度量是计算机视觉中的重要问题。本文以上述要求为准则，研究了一般视觉特征向量和直方图的相似性度量，提出了新的距离度量和核函数，并提出了同时利用多种类型特征上相似度的学习算法。本文取得的主要成果包括如下四点。 1. 证明了核主成分分析可以分为两个独立的步骤，并提出了一种非线性降维算法——多相似度主成分分析。核主成分分析可以分为核特征向量的提取和在核特征向量上的主成分分析两步。一个样本的核特征向量由这个样本和所有训练样本的核函数值组成。多相似度主成分分析使用多个核函数分别计算样本在不同类型特征上的相似度，从而将核特征向量扩展为核特征矩阵，并使用已有的二维主成分分析对核特征矩阵降维。由于该算法分别利用了不同类型特征上的相似度信息，因此其性能优于经典的子空间方法。 2. 提出了动态相似度核函数。认知心理学中的相似度理论表明，人类根据两个物体最相似的部分推断两者整体上的相似性。依据该理论，该方法动态地选择特征向量中距离最小的部分维度进行比较，并以此计算核函数。该核函数能显著提高图像分类和人脸识别，特别是有遮挡情况下人脸识别的性能。 3. 提出了动态部分泥土搬运距离。依据同样的心理学理论，该方法动态地选择泥土搬运距离中搬运距离最小的部分泥土，并以此计算直方图之间的距离。相对泥土搬运距离，该方法更好地模拟了人类的感知。 4. 提出了保持拓扑的扩散距离。多数距离将直方图简单地看作多维区间甚至一维向量，因此破坏了直方图分量（bin）的近邻关系，影响了距离的性能。该方法通过测量定义在直方图定义域上的热扩散过程来计算直方图距离，考虑了直方图的拓扑结构，更加符合人类的感知。
英文摘要	Similarity is a basic problem of pattern recognition, and the performances of many algorithms greatly rely on the goodness of the underlying similarity, such as ranking, nearest neighbor classification, clustering and various graph-based semi-supervised learning algorithms. Moreover, the kernel function in the kernel method is also a special similarity measure. Similarity is also an important issue in computer vision, which is closely related to pattern recognition. However, the similarity in computer vision should integrate the domain knowledge, and should be consistent with human perception. How to design the similarity measures which meet the two requirements is an important open question in computer vision. Upholding these two requirements as guiding principles, the thesis investigates the similarity measures for general visual feature vector and histogram, respectively, proposes several new distances and a kernel function, and also proposes a classification scheme which utilizes multiple similarity measures on different kinds of features. The main contributions are as follows. 1. It is proved that Kernel Principal Component Analysis (KPCA) consists of two independent steps, and Multiple Similarity Principal Component Analysis (MSPCA) is proposed as a nonlinear dimensionality reduction algorithm. The two steps of KPCA are kernel feature vector extraction and Principal Component Analysis (PCA). The kernel feature vector consists of the kernel values between the sample and the training set. MSPCA uses multiple kernels to measure the similarities on different kinds of features respectively, and therefore extends the kernel feature vector to the kernel feature matrix. It then utilizes previously proposed Two-Dimensional Principal Component Analysis (2DPCA) to reduce the dimensionality of the kernel feature matrix. MSPCA’s performance is superior to other classic subspace methods since it utilizes the similarities on different kinds of features respectively. 2. The Dynamic Similarity Kernel (DSK) is proposed. The similarity theories in cognitive psychology show that human infers the overall similarity based on the similar aspects among the compared items. Inspired by this evidence, DSK dynamically selects the dimensions with the smallest distances among the compared feature vectors, and computes the kernel value based on them. It improves the accuracies of image classification and face recognition significantly, especially for face recognition with oc...
关键词	相似性度量直方图距离核函数核子空间分析 Similarity Measure Histogram Distance Kernel Function Kernel Subspace Analysis
语种	中文
文献类型	学位论文
条目标识符	http://ir.ia.ac.cn/handle/173211/6194
专题	毕业生_博士学位论文
推荐引用方式 GB/T 7714	严望. 图像特征的相似性度量[D]. 中国科学院自动化研究所. 中国科学院研究生院,2009.

条目包含的文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
CASIA_20051801462806（5830KB）			暂不开放	CC BY-NC-SA