英文摘要 | Object detection has important significance and is widely used in the fields such as robot navigation, intelligent transportation, industrial inspection and aerospace. It is also an important research topic in the field of computer vision. Limited by the computing complexity of algorithms and the computing power of processors, the frame rate of object detection based on CPU, GPU or embedded processors is often dozens of frames per second or even only more than ten frames per second, which is near to that of human eyes. However, high-speed-vision object detection system has a strong demand under many situations, especially in advanced manufacturing and military industry. The traditional high-speed-vision object detection algorithms often rely on color and brightness information, which is only suitable for the target detection under simple background in specific scenes. This dissertation focuses on FPGA-based high-speed object detection algorithms to improve the accuracy of high-speed object detection and expand the application scope of the related algorithms. The main work and contributions of this dissertation are as follows:
(1) Aiming at the problem that the existing high-speed vision algorithms are only suitable for target detection under simple backgrounds, a novel high-speed vision object detection algorithm is proposed, which can detect objects in complex background at 10,000 frames per second. The proposed algorithm draws on the idea of classical image descriptor HOG. After gradient histogram calculation and normalization within detection windows, the obtained vectors are regarded as the image features. Through multiplication with parameters after fixed-point quantization, the summarized values are used to judge whether objects exist in the detection windows. After hardware-oriented algorithm optimization and fixed-point parameter quantization, the proposed algorithm is successfully implemented in the high-speed-vision platform with limited on-chip resources in the FPGA. In order to verify the feasibility of the processed algorithm, a system verification platform is built with a high-frame-rate projector and a high-speed camera. Preset patterns are projected with the high-frame-rate projector and the high-speed camera is triggered to shoot synchronously. The effectiveness of the algorithm is verified through rotating single-object and multi-object detection experiments.
(2) Aiming at the problem that large positioning errors are brought by the large stride between detection windows in the former algorithm, a high-speed object detection method combining with projection information is proposed to improve the positioning accuracy. A pixel projection module is added in the former algorithm, to obtain the projection information in horizontal and vertical directions of the detection windows. The projection information is converted into binary vectors through preset thresholds, and the distance between the center of objects and detection windows can be calculated for detection result compensation. The experimental result of specific projection pattern detection and fan speed measurement shows that, through combination of both gradient feature and projection information in the detection windows, the positioning deviation of the detected objects can be reduced to about 30% of that with the former algorithm.
(3) Aiming at the limited accuracy of high-speed object detection system based on handcraft descriptors, a convolutional neural network-based high-speed object detection method is proposed to improve the detection accuracy and scene applicability. A lightweight network structure and a hardware friendly quantization model are proposed to solve the contradiction between the limited hardware resource of FPGA and the huge number of network parameters. Based on the idea of two-stage object detection method R-CNN, bounding boxes are proposed with traditional image features in the proposed method, and convolutional neural network is used for object classification upon the proposed bounding boxes. Different strategies are designed according to the characteristics of weights and intermediate results of the network, and all parameters are stored in on-chip memories in form of fixed points. The simulation results based on software Vivado shows that, 2000 fps object detection can be achieved with the proposed method. Meanwhile, the simulation result of accuracy evaluation shows that the proposed method performs better than the former algorithms.
(4) Facing at the application requirement of vision-based cell screening, a high-speed vision microsphere detection system is proposed based on a high-speed camera, a lab-on-a-chip and a microscope, which achieves real-time detection of microspheres with high-speed and high-throughput and verifies the effectiveness of high-speed object detection algorithm in actual application scenarios. The injection needle is driven by the electric injection pump and moves at a constant speed to ensure that the polystyrene microspheres flow at a constant speed in the microfluidic path. Images are transmitted to the master computer after collection and feature extraction in the high-speed camera. Through analyzing the image feature contained in the additional information line, the location and quantity of microspheres can be obtained. Based on the development framework Qt and open source computer vision library, the software interface is built for relevant information display. The experimental result shows that, the proposed high-speed vision object detection methods can be applied to cell screening and meet the accuracy requirement. |
修改评论