英文摘要 | With the development of Internet, the image information on the web has grown exponentially, and people’s demand for image retrieval is rapidly increasing. Especially, the search for trademark images has great research and application value both in scientific research and in commerce.
In this paper, a series of researches have been carried out on content-based trademark retrieval, and an efficient search algorithm for large-scale trademark image data sets has been proposed. We have conducted the experiments both on public dataset and our lab’s dataset with more than 10 million trademark pictures. What’s more, we implemented a trademark retrieval system in C++ on Windows platform. Specifically, the main content of this paper is as follows:
- For the problem of partial matching of trademark images, this paper uses a series of preprocessing methods to segment the image edge into several components, extract proposals, extract RIDE SIFT features on proposals, and use FV algorithm to get aggregation. The experimental results on the public dataset METU v2 show that this method is state-of-the-art and even performs better than some deep learning methods.
- For the semantic similarity problem and the problem of matching text trademark images, this paper uses deep learning method to extract CNN features. Mainstream CNN networks and traditional SIFT features do not characterize pure text-based trademark maps. This article adopts a HCCR CNN for handwritten Chinese character recognition, which enhances the network's textual representation capability.
- For the problem of the difficulty low efficiency when searching the large-scale datasets, this paper uses the method of IMI index with OPQ feature coding. OPQ coding can reduce the dimension of features and quantize them into Hamming space. IMI index converts traditional KNN search into ANN search, which has greatly improved the retrieval speed while assuring the accuracy.
- A trademark retrieval system was implemented using C++ on the Windows platform, and the search architecture proposed in this paper has been put into commercial applications for trademark infringement detection (see http://www.biaozhanggui.com/), for the time being, it has launched for half a year, and it is working well.
|
修改评论