人脸识别关键问题和算法研究

CASIA OpenIR > 毕业生 > 博士学位论文

	人脸识别关键问题和算法研究
其他题名	Study on Issues in Face Recognition
	雷震
	2010-05-23
学位类型	工学博士
中文摘要	人脸识别是模式识别与计算机视觉学科的热点研究内容，在个人身份识别、安检、智能人机交互等领域有着广泛的应用前景。现有的人脸识别算法深受表情、光照、姿态、低分辨率、异质图像模态等因素的影响，限制了其在实际场合中的推广和应用。本文对人脸识别算法进行了深入研究和分析，针对四个关键问题和难点，提出了相应的解决方法，主要的工作和贡献有： 1. 针对图像光照变化的问题，提出了两种光照鲁棒的特征提取方法。其中一种是基于局部Gabor纹理单元的人脸表征和识别方法。该方法结合了Gabor特征和纹理结构单元分析方法对光照鲁棒的优点，针对人脸这一特定图像结构，提出了局部纹理结构单元字典，能够更加高效地得到对人脸图像的整体描述，并且具有良好的光照适应性。另一种是基于Gabor体局部二元模式描述子的人脸表征和人脸识别方法，该方法充分挖掘了图像的空域和频域信息，使得人脸图像特征更加完备。并进一步地采用了基于互信息的特征选择机制和线性判别分析方法，提取紧致互补的特征表示，降低特征冗余度，提高特征的鉴别能力，从而提升人脸识别性能。 2. 针对低分辨率图像这一问题，提出了局部频域描述方法。该方法通过对图像低频空间的幅度和相位信息进行有效建模，提取具有良好的模糊不变性的图像特征，适用于解决低分辨率的人脸图像识别问题。 3. 针对异质图像匹配问题（如可见光-近红外，视频-数码照片等），提出了对偶谱回归的异质图像匹配框架。与传统判别分析方法不同的是，本文采用不同的投影向量将异质图像数据分别投影到一个共同的判别子空间进行分类，从而能够更好地挖掘异质数据中的共有鉴别信息。利用图嵌入与谱回归的思想，提出了线性和非线性的高效率求解方法，并进一步引入正则化技术，提高了解的泛化能力，取得了比现有方法更好的识别性能。 4. 针对人脸图像姿态变化问题，提出了从单幅二维图像中恢复人脸三维形状信息的方法，从而能够在系统注册时利用恢复得到的人脸三维信息生成多个姿态下的虚拟样本，提高系统对姿态的鲁棒性。在重构算法实现中，提出了张量分析的方法分别对二维和三维图像空间进行建模，尽可能地保留和利用图像的结构信息，应用典型相关分析（CCA）挖掘二维和三维空间中的最相关子分量，建立映射关系，减少噪声因素的影响。实验结果表明，该方法能够快速有效地重建人脸三维模型，降低重构误差，为提高人脸识别系统对姿态变化的鲁棒性打下坚实的基础。总的来说，本文针对人脸识别的几个关键难点：光照，姿态，低分辨率，异质图像匹配等问题进行了有益的探索和尝试，提出了一系列的改进方法，改善了现有的人脸识别算法性能。
英文摘要	Face recognition is an important problem in pattern recognition and computer vision area. Due to various adverse factors such as expression, lighting, pose, low-resolution and heterogeneous modality variations, the performance of most existing face recognition algorithms is far behind satisfactory and hence limits its application in real world. In this thesis, we study on some issues of face recognition and propose several novel descriptors and recognition methods to improve the face recognition performance. The main contributions of this thesis include following issues: 1. To reduce the effect of illumination variation, we propose two feature extraction methods. One is a local Gabor texton based face representation method, which incorporates the advantages of Gabor and texton and therefore is robust to illumination variation. Another one is named Gabor volume based local binary pattern descriptor. Different from the existing method, it exploits the face information in space, frequency and orientation domains simultaneously to make the face representation more sufficient. Further, it utilizes CMI and LDA method to reduce the feature dimension to improve the face recognition performance efficiently and effectively. 2. In order to improve the performance of low resolution face recognition, we propose a local frequency descriptor (LMD) for face representation. The LMD properly formulates the magnitude and phase information in low frequency band of image. Due to its nearly blur-invariant property, it is suitable for low resolution face recognition. 3. To deal with the heterogeneous image matching problem, we propose coupled spectral regression framework which utilizes two projections to project the heterogeneous data respectively onto the common discriminative subspace to be classified. An efficient solution by using graph embedding and spectral regression is presented. Thanks to the proper regularization technique, it demonstrates good generalization performance. 4. To improve the robustness of system to pose variation, we propose a 3D face shape recovery method from a single 2D face image. By utilizing tensor analysis and canonical correlation analysis (CCA) method, we successfully build the 2D and 3D space and learn the relationship between them. Experimental results show the proposed method is effective and efficient for 3D face shape recovery. With the help of this technique, one could produce virtual samples in registration phase so as to improve the...
关键词	人脸识别光照变化低分辨率异质图像匹配三维形状恢复 Face Recognition Illumination Variation Low Resolution Heterogeneous Image Matching 3d Shape Recovery
语种	中文
文献类型	学位论文
条目标识符	http://ir.ia.ac.cn/handle/173211/6240
专题	毕业生_博士学位论文
推荐引用方式 GB/T 7714	雷震. 人脸识别关键问题和算法研究[D]. 中国科学院自动化研究所. 中国科学院研究生院,2010.

条目包含的文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
CASIA_20071801462804（8275KB）			限制开放	CC BY-NC-SA