路面车辆运动的语义理解

CASIA OpenIR > 毕业生 > 博士学位论文

	路面车辆运动的语义理解
其他题名	Semantic Interpretation of Vehicle Motion in Traffic Scenes
	楼建光
	2003-05-01
学位类型	工学博士
中文摘要	动态图像序列理解就是要赋予计算机类似于人一样的观察和理解动态场景的视觉能力，通过对图像数据的分析与处理来获取对动态景物中运动物体行为及其相互关系的高层次语义上的解释，实现从图像空间的数值描述到概念空间的语义描述的转换。传统的计算机视觉研究工作主要集中在恢复场景的三维几何结构，计算摄像机的运动和象素的运动信息等，很少涉及到图像的高层语义信息。近年来，在高层语义与行为理解方面的工作受到了越来越多的关注。计算机视觉发展的历史表明，研究领域不限的通用计算机视觉系统是不现实的，本文结合交通视觉监控，对路面交通场景动态图像序列中的车辆运动语义理解进行了深入的研究，涉及到了许多动态图像语义理解的基本问题，包括摄像机标定、运动检测和分割、目标定位、时空推理、场景恢复与表示、行为分析和建模、语义理解等等。本文的主要工作有： ①提出了一种简单方便的适用于路面交通场景的摄像机参数求解方法；改进了运动检测的算法模块，使得其对光照有更好的鲁棒性；还提出了一种车辆定位的算法。 ②提出了利用一种改进的Kalman滤波器的车辆视觉跟踪算法（包括对车辆的运动建立动态模型），并与其他常用跟踪滤波器进行了比较。实验证明，由于引入了合理的正交性约束，使得这种改进的Kalman滤波器能够在复杂运动下具有更好的跟踪能力。 ③发展了一种高层语义理解的框架，引入了概念空间来实现从定量的几何描述到语义概念之间的映射，从而在传统计算机视觉和高层语义理解与推理之间架起一座桥梁。 ④研究了人类运动概念的抽象层次，合理地区分了普遍性程度（Generality）和复杂性程度（Complexity）两种人类语义概念中的不同角度的抽象层次，并在不同的处理过程中对其建立模型。 ⑤提出了采用时间间隔模型来对行为建模和识别的方法。这种方法可以非常方便地为多个运动目标之问的交互行为建模，并利用建立的模型进行行为识别。最后还利用简单的语法规则，自动产生对语义概念的自然语言描述。 ⑥研究并实现了一个基于三维线框模型的路面交通监控与行为理解系统平台，平台已经能够在PIV1.7G的PC机256M内存配置下近乎实时（十几帧每秒）地跟踪单辆车，初步实现对特定场景下的物体行为作出简单的自然语言描述。
英文摘要	Semantic interpretation of dynamic image sequences attempts to automatically interpret the motions and behaviors of the tracked objects by the analysis of the image sequences captured by cameras from wide-area,real-world scenes in natural conditions．Traditional computer vision research mainly focuses on recovering the geometry of the scene(structure from motion),the camera motion (ego-motion),and the motion of the pixels themselves(such as optical flow).In recent years,semantic interpretation of image and video has become an active research topic in computer vision. In this thesis,we study the semantic interpretation of vehicle motion in traffic scenes which involves many basic problems in computer vision,e.g.motion detection,object localization,spatial-temporal reasoning,behavior analysis and semantic interpretation,etc.The main contributions of this thesis include the following: ①We have proposed a simple but convenient method for camera calibration in traffic scenes,an improved motion detection algorithm with lower sensitivity to lighting,and an efficient and robust vehicle localization algorithm. ②We have described a modified extended Kalman filter with 8 novel kinematics model for visual vehicle tracking.By imposing an additional orthogonality condition,the filter is less sensitivity to the temporal variations of the system model.Experiments show that the filter has a good performance when the tracked car is in complex motion. ③A framework for semantic interpretation of vehicles'motion has been proposed in the context of visual traffic surveillance.We introduce a conceptual space to bridge the gap between quantitative low-level processing and qualitative high-level processing. ④From human's mental experiences,there are two aspects of abstraction: "generality"and"complexity".We deal with them in two different computational stages named"conceptual processing"and"symbolic processing" to simplify the modelling and inference of them. ⑤We have presented a new interval-based model of action and a temporal analvzer to model and recognize the objects'behaviors in traffic scenes A single object’sbehaviors and its interactions with other objects can be handled in the same framework.Finally,some of the recognized actions can be selected and translated into natural language descriptions by some simple grammar rules. ⑥We have developed a demo platform for further research which can work at a speed of 17 frames per second on a computer with PIV 1.7G CPU and Windows operating system.The system can give some simple semantic interpretations of vehicle’s behaviors.
关键词	计算机视觉动态图像序列理解视觉跟踪行为分析与识别自然语言描述 Computer Vision Semantic Interpretation Of Dynamic Image Sequences Visual Tracking Behavior Analysis And Recognition Natural Lan
语种	中文
文献类型	学位论文
条目标识符	http://ir.ia.ac.cn/handle/173211/5758
专题	毕业生_博士学位论文
推荐引用方式 GB/T 7714	楼建光. 路面车辆运动的语义理解[D]. 中国科学院自动化研究所. 中国科学院研究生院,2003.