Learning representative and discriminative image representation by deep appearance and spatial coding
Bingyuan Liu; Jing Liu; Hanqing Lu
2015
发表期刊Computer Vision and Image Understanding
卷号136期号:1页码:23-31
摘要How to build a suitable image representation remains a critical problem in computer vision. Traditional Bag-of-Feature (BoF) based models build image representation by the pipeline of local feature extraction, feature coding and spatial pooling. However, three major shortcomings hinder the performance, i.e., the limitation of hand-designed features, the discrimination loss in local appearance coding and the lack of spatial information. To overcome the above limitations, in this paper, we propose a generalized BoF-based framework, which is hierarchically learned by exploring recently developed deep learning methods. First, with raw images as input, we densely extract local patches and learn local features by stacked Independent Subspace Analysis network. The learned features are then transformed to appearance codes by sparse Restricted Boltzmann Machines. Second, we perform spatial max-pooling on a set of over-complete spatial regions, which is generated by covering various spatial distributions, to incorporate more flexible spatial information. Third, a structured sparse Auto-encoder is proposed to explore the region representations into the image-level signature. To learn the proposed hierarchy, we layerwise pre-train the network in unsupervised manner, followed by supervised fine-tuning with image labels. Extensive experiments on different benchmarks, i.e., UIUC-Sports, Caltech-101, Caltech-256, Scene-15 and MIT Indoor-67, demonstrate the effectiveness of our proposed model.
关键词Image Classification Deep Learning Structured Sparsity
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/13436
专题模式识别国家重点实验室_图像与视频分析
通讯作者Jing Liu
推荐引用方式
GB/T 7714
Bingyuan Liu,Jing Liu,Hanqing Lu. Learning representative and discriminative image representation by deep appearance and spatial coding[J]. Computer Vision and Image Understanding,2015,136(1):23-31.
APA Bingyuan Liu,Jing Liu,&Hanqing Lu.(2015).Learning representative and discriminative image representation by deep appearance and spatial coding.Computer Vision and Image Understanding,136(1),23-31.
MLA Bingyuan Liu,et al."Learning representative and discriminative image representation by deep appearance and spatial coding".Computer Vision and Image Understanding 136.1(2015):23-31.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Learning representat(1491KB)期刊论文作者接受稿开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Bingyuan Liu]的文章
[Jing Liu]的文章
[Hanqing Lu]的文章
百度学术
百度学术中相似的文章
[Bingyuan Liu]的文章
[Jing Liu]的文章
[Hanqing Lu]的文章
必应学术
必应学术中相似的文章
[Bingyuan Liu]的文章
[Jing Liu]的文章
[Hanqing Lu]的文章
相关权益政策
暂无数据
收藏/分享
文件名: Learning representative and discriminative image representation by deep appearance and spatial coding.pdf
格式: Adobe PDF
此文件暂不支持浏览
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。