Knowledge Commons of Institute of Automation, CAS
Image Classification Using Spatial Pyramid Coding and Visual Word Reweighting | |
Chunjie Zhang | |
2010 | |
Conference name | Asian Conference on Computer Vision |
Proceedings title | None |
Pages | 239-249 |
Conference date | November 8-12, 2010 |
Conference location | Queenstown, New Zealand |
Abstract | The bag-of-visual-words (BoW) method for image classification is limited mainly by its neglect of spatial information and of the semantics of visual words. To address these two obstacles, we present an improved BoW representation using spatial pyramid coding (SPC) and visual word reweighting. In the SPC procedure, we adopt the sparse coding technique to encode visual features under a spatial constraint: visual features from the same spatial sub-region of the images are collected to generate the visual vocabulary. Additionally, we propose a relaxed but simple solution for embedding semantics into visual words. We relax the semantic embedding from ideal semantic correspondence to naive semantic purity of visual words, and reweight each visual word according to its semantic purity. Higher weights are given to semantically distinctive visual words, and lower weights to semantically general ones. Experiments on a public dataset demonstrate the effectiveness of the proposed method. |
Keywords | Bag-of-visual-words (BoW); Image classification; Reweighting; Spatial pyramid coding |
Indexed by | ISTP |
Document type | Conference paper |
Item identifier | http://ir.ia.ac.cn/handle/173211/4629 |
Collection | Zidong Taichu Large Model Research Center_Image and Video Analysis |
Corresponding author | Jing Liu |
Recommended citation (GB/T 7714) | Chunjie Zhang, Jing Liu, Jinqiao Wang, et al. Image Classification Using Spatial Pyramid Coding and Visual Word Reweighting[C], 2010: 239-249. |
Files in this item |
File name/size | Document type | Version type | Access type | License |
paper 正文.pdf (385KB) | Conference paper | | Open access | CC BY-NC-SA |
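
The reweighting idea in the abstract (higher weights for semantically distinctive visual words, lower weights for semantically general ones) can be illustrated with a minimal sketch. This record does not give the paper's formula for semantic purity, so the definition used here, one minus the normalized entropy of a word's class distribution, is an assumption made only for illustration. The function names `purity_weights` and `reweight_histogram` are likewise hypothetical.

```python
import numpy as np


def purity_weights(word_class_counts, eps=1e-12):
    """Return one weight in [0, 1] per visual word.

    word_class_counts[k, c] is how often visual word k occurs in training
    images of class c. ASSUMPTION: purity is taken to be 1 minus the
    normalized entropy of the word's class distribution; the paper's own
    definition may differ.
    """
    counts = np.asarray(word_class_counts, dtype=float)
    n_classes = counts.shape[1]
    if n_classes < 2:
        raise ValueError("at least two classes are needed")
    totals = counts.sum(axis=1, keepdims=True)
    # Words that never occur get a uniform distribution, hence weight 0.
    probs = np.where(totals > 0,
                     counts / np.maximum(totals, eps),
                     1.0 / n_classes)
    entropy = -(probs * np.log(probs + eps)).sum(axis=1)
    return np.clip(1.0 - entropy / np.log(n_classes), 0.0, 1.0)


def reweight_histogram(hist, weights):
    """Scale a BoW histogram by per-word weights, then L2-normalize."""
    weighted = np.asarray(hist, dtype=float) * np.asarray(weights, dtype=float)
    norm = np.linalg.norm(weighted)
    return weighted / norm if norm > 0 else weighted


if __name__ == "__main__":
    # Toy counts: word 0 is class-specific, word 1 is spread evenly,
    # word 2 is mostly but not entirely class-specific.
    counts = [[10, 0, 0], [5, 5, 5], [8, 1, 1]]
    w = purity_weights(counts)
    print("weights:", w)
    print("reweighted histogram:", reweight_histogram([1, 1, 1], w))
```

Under this assumed definition, a word that appears in only one class receives a weight close to 1, while a word spread evenly over all classes receives a weight close to 0, so it contributes little to the final image representation.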