Enhanced 3-D Modeling for Landmark Image Classification
Xiao, Xian1,2; Xu, Changsheng1,2; Wang, Jinqiao1,2; Xu, Min1,2,3
发表期刊IEEE TRANSACTIONS ON MULTIMEDIA
2012-08-01
卷号14期号:4页码:1246-1258
文章类型Article
摘要Landmark image classification is a challenging task due to the various circumstances, e. g., illumination, viewpoint, zoom in/out and occlusion under which landmark images are taken. Most existing approaches utilize features extracted from the whole image including both landmark and non-landmark areas. However, non-landmark areas introduce redundant and noisy information. In this paper, we propose a novel approach to improve landmark image classification consisting of three steps. First, an attention-based 3-D reconstruction method is proposed to reconstruct sparse 3-D landmark models. Second, the sparse 3-D models are projected onto iconic images in order to identify images of the hot regions. For a landmark, hot regions are parts of a landmark which attract photographers' attention and are popularly captured in photos. These hot region images are later used to enhance reconstructed sparse 3-D models. Third, the landmark regions are obtained through mapping the enhanced 3-D models to landmark images. A k-dimensional tree (kd-tree) is then constructed for each landmark based on scale invariant feature transform (SIFT) features [5] extracted from the landmark area to classify unlabeled images into pre-defined landmark categories. The proposed method is evaluated using 291 661 images of 51 landmarks. Experiments of comparison indicate that our method outperforms bag-of-words (BoW) based approach [25] 18.5% and method of spatial-pyramid-matching using sparse-coding (ScSPM) [3] 8.4%.
关键词Attention Analysis Attention-based 3-d Reconstruction Landmark Image Classification 3-d Model Enhancement
WOS标题词Science & Technology ; Technology
关键词[WOS]VISUAL-ATTENTION ; SCENE ; COLLECTIONS ; FEATURES
收录类别SCI
语种英语
WOS研究方向Computer Science ; Telecommunications
WOS类目Computer Science, Information Systems ; Computer Science, Software Engineering ; Telecommunications
WOS记录号WOS:000306599400011
引用统计
被引频次:18[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/3352
专题紫东太初大模型研究中心_图像与视频分析
通讯作者Wang, Jinqiao
作者单位1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit NLPR, Beijing 100190, Peoples R China
2.China Singapore Inst Digital Media, Singapore, Singapore
3.Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia
第一作者单位模式识别国家重点实验室
通讯作者单位模式识别国家重点实验室
推荐引用方式
GB/T 7714
Xiao, Xian,Xu, Changsheng,Wang, Jinqiao,et al. Enhanced 3-D Modeling for Landmark Image Classification[J]. IEEE TRANSACTIONS ON MULTIMEDIA,2012,14(4):1246-1258.
APA Xiao, Xian,Xu, Changsheng,Wang, Jinqiao,&Xu, Min.(2012).Enhanced 3-D Modeling for Landmark Image Classification.IEEE TRANSACTIONS ON MULTIMEDIA,14(4),1246-1258.
MLA Xiao, Xian,et al."Enhanced 3-D Modeling for Landmark Image Classification".IEEE TRANSACTIONS ON MULTIMEDIA 14.4(2012):1246-1258.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Enhanced 3-D Modelin(2935KB)期刊论文作者接受稿开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Xiao, Xian]的文章
[Xu, Changsheng]的文章
[Wang, Jinqiao]的文章
百度学术
百度学术中相似的文章
[Xiao, Xian]的文章
[Xu, Changsheng]的文章
[Wang, Jinqiao]的文章
必应学术
必应学术中相似的文章
[Xiao, Xian]的文章
[Xu, Changsheng]的文章
[Wang, Jinqiao]的文章
相关权益政策
暂无数据
收藏/分享
文件名: Enhanced 3-D Modeling for Landmark Image Classification.pdf
格式: Adobe PDF
此文件暂不支持浏览
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。