复杂场景下目标的跟踪与姿态估计

CASIA OpenIR > 毕业生 > 博士学位论文

	复杂场景下目标的跟踪与姿态估计
其他题名	Object tracking and pose estimation under complex scenes
	张笑钦
	2010-04-22
学位类型	工学博士
中文摘要	目标跟踪在目标运动的视觉分析中占有重要的地位，属于视觉的中层部分。利用对目标的跟踪，可以方便地获得目标的运动参数，为后续的高层的行为理解和识别奠定了基础。目标跟踪虽然是计算机视觉领域一个重要的研究方向和研究热点，但是目前仍然有很多理论与技术问题尚待解决，特别是跟踪过程中噪声干扰、剧烈运动、光照变化、遮挡等复杂问题。目标姿态估计在目标运动的视觉分析中属于高层部分，它是根据图像信息推断出目标物体（主要针对人体，手势）的三维方位指向。姿态估计在智能视觉监控、人机交互和视频运动分析等很多领域都有着重要的应用。本文的工作以复杂场景下鲁棒的目标跟踪与姿态估计为目标，分别对以下三个问题进行了深入的探讨和分析：（1）基于样本分布的表观建模；（2）跟踪过程中样本的退化；（3）姿态估计中高维状态空间的解析。大量的实验表明我们方法的有效性和鲁棒性。论文的主要工作和贡献如下: 1.提出一种基于图嵌入判别学习的框架来学习出正样本的模式空间，以及正负样本之间的判别空间。基于这两个模式空间的学习，我们构建起鲁棒的目标表观模型。与传统基于子空间学习方法相比，该方法有效地嵌入了样本的分布特性，能够得到更符合实际的目标模式子空间，因此对噪声、局部遮挡更为鲁棒。 2.提出两种结合确定性跟踪框架和随机性跟踪框架的方法：（1）核贝叶斯跟踪框架；（2）基于奇异值分解的卡尔曼粒子滤波框架。（1）首先将核方法应用于当前帧，获得对运动方向以及偏移量的估计，再将这些信息作为启发式先验信息加入到贝叶斯滤波框架中，从而有效结合确定性方法与随机性方法的优点，并且该框架是一种通用的跟踪框架，任何形式的表观模型都可以在此框架下应用。与基于核方法的跟踪器（确定性跟踪方法）以及基于贝叶斯方法的跟踪器（随机性跟踪方法）相比，该跟踪框架有效地提高了跟踪的效率和精确度。（2）首先对前一帧跟踪样本的协方差矩阵进行奇异值分解，再利用Unsented 变化（一种确定性的变化）获得预测样本，然后用卡尔曼滤波器对预测样本进行滤波，为最终的粒子滤波提供运动估计分布。与粒子滤波和基于Unsented 的卡尔曼粒子滤波相比，我们的方法更为鲁棒。 3.提出了一种基于序列粒子群优化的跟踪算法框架。该框架首先通过初始化一系列随机粒子样本，然后对这些粒子集进行时序上的传播以及优化迭代，最后对进化的粒子进行收敛性判断。我们从理论上证明了，序列粒子群优化框架实质上是一个基于分层重要性采样的粒子滤波过程，使得采样结果逼近了直接采样于最佳重要性分布的结果。我们还将序列粒子群优化的概念扩展到多种群上，将多目标跟踪中目标物体的交互和遮挡用种群间的竞争和排斥来模拟，实现了有效的多目标跟踪。与其他两种多目标跟踪的算法相比，我们的算法在PETS2004 数据库上获得了最佳的实验结果。 4.提出一种新颖的姿态估计算法，该方法从以下三个方面出发来解决姿态估计所面临的维数灾难问题：（1）采用树形结构来对人体拓扑结构进行建模并进行解析；（2）优化过程采用两种模式：扩散和跳跃；（3）提出一种数据驱动的马尔可夫链蒙特卡...
英文摘要	Object tracking plays an important role in visual analysis and understanding of human motion, it is the intermediate-level vision part. Object tracking is to detect, locate and track moving objects in video sequences captured by cameras. Through object tracking, we obtain the motion parameters and the trajectories of the object, which are the foundation of the high-level activity understanding and recognition. Recent years have witness great advance in tracking literature, it still remains a great challenging task due to clutter background, dynamical motion, appearance variations, and especially the occlusion problems. Pose estimation, a high-level vision task, is the task of determining the location, orientation and scale of each object. It is important for many vision understanding applications, e.g. visual interactive gaming, immersive virtual reality, visual surveillance, content-based image retrieval, etc. This thesis focuses on robust object tracking and pose estimation under complex scenes. Specifically, we mainly discuss the following three sub-topics: (1) graph embedding learning based appearance modeling; (2) sample impoverishment in tracking process; (3) high dimensional state space parsing in pose estimation. The main contributions of our work are summarized as follows: 1.We propose a graph embedding framework to simultaneously learn the subspace of the target and its local discriminative structure against the background. Then a robust appearance model is constructed based on the learnt subspaces. Experimental results demonstrate that, compared with two state-of-art methods, the proposed tracking algorithm is more effective, especially in dynamically changing and clutter scenes. 2.We propose two tracking algorithms, which combines of deterministic and stochastic tracking frameworks: (1) kernel Bayesian tracking framework; (2) singular value decomposition (SVD) based Kalman particle filter framework. (1) First, the kernel method is applied to the current frame to obtain the offset of the object state, and then these information are incorporated in Bayesian filtering framework as a heuristic priori. This algorithm can effectively combine the advantages of deterministic and stochastic tracking frameworks. And this algorithm is general to any form of appearance models. Experimental results demonstrate that, compared with Bayesian and kernel based tracking frameworks, the proposed algorithm is more efficient and effective. (2) Firstly, a s...
关键词	图嵌入学习粒子群优化核贝叶斯卡尔曼滤波粒子滤波目标跟踪姿态估计 Graph Embedding Learning Particle Swarm Optimization Kernel Bayesian Kalman Filter Particle Filter Object Tracking Pose Estimation
语种	中文
文献类型	学位论文
条目标识符	http://ir.ia.ac.cn/handle/173211/6233
专题	毕业生_博士学位论文
推荐引用方式 GB/T 7714	张笑钦. 复杂场景下目标的跟踪与姿态估计[D]. 中国科学院自动化研究所. 中国科学院研究生院,2010.

条目包含的文件
文件名称/大小	文献类型	版本类型	开放类型	使用许可
CASIA_20071801462807（3133KB）			限制开放	CC BY-NC-SA