CASIA OpenIR  > 模式识别实验室
DF2Net: A Discriminative Feature Learning and Fusion Network for RGB-D Indoor Scene Classification.
Li Yabei1,2; Zhang Junge1,2; Cheng Yanhua3; Huang Kaiqi1,2,4; Tan Tieniu1,2,4
2018
会议名称The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI)
会议日期2018.2.2-2.8
会议地点New Orleans, USA
产权排序1
摘要

This paper focuses on the task of RGB-D indoor scene classification. It is a very challenging task due to two folds. 1)
Learning robust representation for indoor scene is difficult
because of various objects and layouts. 2) Fusing the complementary cues in RGB and Depth is nontrivial since there
are large semantic gaps between the two modalities. Most existing works learn representation for classification by training
a deep network with softmax loss and fuse the two modalities by simply concatenating the features of them. However,
these pipelines do not explicitly consider intra-class and interclass similarity as well as inter-modal intrinsic relationships.
To address these problems, this paper proposes a Discriminative Feature Learning and Fusion Network (DF2Net) with
two-stage training. In the first stage, to better represent scene
in each modality, a deep multi-task network is constructed to
simultaneously minimize the structured loss and the softmax
loss. In the second stage, we design a novel discriminative
fusion network which is able to learn correlative features of
multiple modalities and distinctive features of each modality.
Extensive analysis and experiments on SUN RGB-D Dataset
and NYU Depth Dataset V2 show the superiority of DF2Net
over other state-of-the-art methods in RGB-D indoor scene
classification task.
 

收录类别EI
文献类型会议论文
条目标识符http://ir.ia.ac.cn/handle/173211/19711
专题模式识别实验室
作者单位1.中国科学院自动化研究所
2.University of Chinese Academy of Sciences
3.Tencent Wechat AI
4.CAS Center for Excellence in Brain Science and Intelligence Technology
第一作者单位中国科学院自动化研究所
推荐引用方式
GB/T 7714
Li Yabei,Zhang Junge,Cheng Yanhua,et al. DF2Net: A Discriminative Feature Learning and Fusion Network for RGB-D Indoor Scene Classification.[C],2018.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Li-Zhang.pdf(24122KB)会议论文 开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Li Yabei]的文章
[Zhang Junge]的文章
[Cheng Yanhua]的文章
百度学术
百度学术中相似的文章
[Li Yabei]的文章
[Zhang Junge]的文章
[Cheng Yanhua]的文章
必应学术
必应学术中相似的文章
[Li Yabei]的文章
[Zhang Junge]的文章
[Cheng Yanhua]的文章
相关权益政策
暂无数据
收藏/分享
文件名: Li-Zhang.pdf
格式: Adobe PDF
此文件暂不支持浏览
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。