Institutional Repository of Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
Deep neural network based image annotation | |
Zhu, Songhao1; Shi, Zhe1; Sun, Chengjian1; Shen, Shuhan2 | |
Journal | PATTERN RECOGNITION LETTERS |
2015-11-01 | |
Volume | 65, Pages: 103-108 |
Article type | Article |
Abstract | Multilabel image annotation is one of the most important open problems in the computer vision field. Unlike existing works that usually use conventional visual features to annotate images, features based on deep learning have shown potential to achieve outstanding performance. In this work, we propose a multimodal deep learning framework, which aims to optimally integrate multiple deep neural networks pretrained with convolutional neural networks. In particular, the proposed framework explores a unified two-stage learning scheme that consists of (i) learning to fine-tune the parameters of the deep neural network with respect to each individual modality, and (ii) learning to find the optimal combination of diverse modalities simultaneously in a coherent process. Experiments conducted on a variety of public datasets evaluate the performance of the proposed framework for multilabel image annotation, in which the encouraging results validate the effectiveness of the proposed algorithms. (C) 2015 Elsevier B.V. All rights reserved. |
Keywords | Deep Learning ; Multi-label ; Multi-modal ; Image Annotation |
WOS headings | Science & Technology ; Technology |
DOI | 10.1016/j.patrec.2015.07.037 |
Indexed by | SCI |
Language | English |
WOS research area | Computer Science |
WOS category | Computer Science, Artificial Intelligence |
WOS accession number | WOS:000362187000015 |
Document type | Journal article |
Identifier | http://ir.ia.ac.cn/handle/173211/10733 |
Collection | National Laboratory of Pattern Recognition_Robot Vision |
Author affiliations | 1. Nanjing Univ Posts & Telecommun, Sch Automat, Nanjing 210046, Jiangsu, Peoples R China 2. Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China |
Recommended citation (GB/T 7714) | Zhu, Songhao,Shi, Zhe,Sun, Chengjian,et al. Deep neural network based image annotation[J]. PATTERN RECOGNITION LETTERS,2015,65:103-108. |
APA | Zhu, Songhao,Shi, Zhe,Sun, Chengjian,&Shen, Shuhan.(2015).Deep neural network based image annotation.PATTERN RECOGNITION LETTERS,65,103-108. |
MLA | Zhu, Songhao,et al."Deep neural network based image annotation".PATTERN RECOGNITION LETTERS 65(2015):103-108. |
Files in this item | No files are associated with this item. |
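
The abstract describes a second stage that learns how to combine several per-modality networks. The following is only a minimal sketch of that general idea (late fusion with learned weights), not the authors' method. It assumes that stage (i) has already produced per-modality label scores, and it assumes a squared-error loss and softmax-parameterized convex weights; the function names `fuse` and `learn_fusion_weights` are hypothetical.

```python
import math


def softmax(zs):
    """Map unconstrained logits to non-negative weights that sum to 1."""
    m = max(zs)
    es = [math.exp(z - m) for z in zs]
    total = sum(es)
    return [e / total for e in es]


def fuse(scores_per_modality, weights):
    """Weighted sum of per-modality label scores for a single image."""
    n_labels = len(scores_per_modality[0])
    return [
        sum(w * s[k] for w, s in zip(weights, scores_per_modality))
        for k in range(n_labels)
    ]


def learn_fusion_weights(samples, targets, n_modalities, lr=0.5, steps=200):
    """Fit fusion weights by gradient descent on a squared-error loss.

    samples[i][m][k] is the score modality m gives label k on image i
    (assumed to come from already fine-tuned networks);
    targets[i][k] is 1 if label k applies to image i, else 0.
    """
    logits = [0.0] * n_modalities  # start from uniform weights
    n = len(samples)
    for _ in range(steps):
        w = softmax(logits)
        # Gradient of the summed loss with respect to each weight.
        grad_w = [0.0] * n_modalities
        for scores, y in zip(samples, targets):
            fused = fuse(scores, w)
            for m in range(n_modalities):
                grad_w[m] += sum(
                    2.0 * (f - t) * scores[m][k]
                    for k, (f, t) in enumerate(zip(fused, y))
                )
        # Chain rule through softmax: dL/dz_j = w_j * (g_j - sum_m w_m * g_m).
        dot = sum(wm * gm for wm, gm in zip(w, grad_w))
        logits = [
            z - lr * wj * (gj - dot) / n
            for z, wj, gj in zip(logits, w, grad_w)
        ]
    return softmax(logits)


if __name__ == "__main__":
    # Toy data (made up): modality 0 tracks the labels, modality 1 does not.
    samples = [
        [[0.9, 0.1, 0.8], [0.4, 0.6, 0.5]],
        [[0.2, 0.9, 0.1], [0.5, 0.5, 0.6]],
    ]
    targets = [[1, 0, 1], [0, 1, 0]]
    weights = learn_fusion_weights(samples, targets, n_modalities=2)
    print("fusion weights:", weights)
    print("fused scores:", fuse(samples[0], weights))
```

On toy data like this, the learned weights should shift toward whichever modality agrees more closely with the labels. A joint scheme that fine-tunes the networks and the combination together, as the abstract suggests, would need an actual deep learning framework and is outside the scope of this sketch.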