Other Abstract | Visual tracking has been a popular problem in computer vision with widespread applications in robotics, human-computer interaction and video surveillance. In recent years, various tracking algorithms have been developed and considerable progress has been made in terms of speed and accuracy thanks to the effort of researchers at home and abroad. Correlation filters perform dense-sampling based on ridge regression with Fast Fourier Transform by exploiting circulant structure. Correlation filters based trackers perform favourably in terms of speed and robustness and have become mainstream branch in the field of visual tracking. To further enhance the performance of correlation filter based trackers, various types of regularization are introduced. However, the introduction of regularization breaks the closed-form solution of correlation filters and reduce the speed, which restricts applications in some computation restricted scenarios such as UAV and mobile robots. This thesis focuses on visual tracking based on correlation filters in computation restricted applications to improve tracker performance with real-time speed and achieve balance between performance and efficiency. Our work mainly includes the following four parts:
(1)Research on correlation filter and scale adaptive methods. Considering the problem of scale variation in correlation filter tracking, we review current scale adaptive methods in correlation filter tracking and analyse their advantages and disadvantages in terms of performance and efficiency. By conducting comparative experiments on OTB-2015, we determine the scale adaptive method of this thesis.
(2)Research on efficient feature representation and aggregation. Discriminative tracking requires to extract efficient and robust feature representation from target and background. Considering the visual tracking demand in computation restricted applications, we borrow the essential idea of weight image in mean-shift based trackers and propose 1-channel color ratio(CR) feature to replace 10-channel color-naming(CN) feature and use 13-channel HOG feature to replace common 31-channel HOG feature to improve efficiency. In addition, during feature concatenation process, we weight between different types of features to control their importance in final feature representation in order to better exert the discriminative power of the features and further improve the tracker performance. Experimental results demonstrate that the proposed method(denoted as CRCF) brings notable improvement in performance and efficiency.
(3)Research on model update scheme. Trackers require to update the model in tracking process to adapt to target appearance variation and reserve historical target appearance. Most correlation filter based trackers update the model with moving average. However, this update scheme usually leads to tracker drift due to model contamination under occlusion and is prone to fail under large target appearance variation due to limited update. Considering latest progress in correlation filter tracking, we propose an update scheme with online Gaussian mixture model and difference hashing distance measurement to maintain a training set of representative target historical appearance. Correlation filter with better discriminative power and generalization ability is trained on the training set. Experimental results demonstrate that the proposed method(denoted as CRCF_ATU) brings performance gain compared with CRCF without reducing efficiency.
(4)Research on long-term tracking. In long-term tracking, the tracker is required to accurately detect tracking failure and target disappearance. When the target reappears, the tracker is required to localize the target. With aforementioned model update scheme, we propose a more accurate tracking status judgement scheme. When tracking failure is detected, background-aware correlation filter(BACF) is trained on the training set to perform global re-detection. Experimental results demonstrate that the proposed method(denoted as CRCF_LCT) can accurately re-detect target after target disappearance and brings notable performance gain in long-term sequences.
CRCF, CRCF_ATU and CRCF_LCT respectively apply to short-term tracking, middle-term tracking and long-term tracking, providing adaptable visual tracking modules for various applications such as detection and tracking fusion, reliable single target tracking and active visual tracking with considerable prospect. |
Edit Comment