KM4: Visual reasoning via Knowledge EmbeddingMemoryModel with MutualModulation
Zheng, Wenbo1,2; Yan, Lan2,3; Gou, Chao4; Wang, Fei-Yue2
发表期刊INFORMATION FUSION
ISSN1566-2535
2021-03-01
卷号67页码:14-28
通讯作者Wang, Fei-Yue(feiyue.wang@ia.ac.cn)
摘要Visual reasoning is a special kind of visual question answering, which is essentially multi-step and compositional, and also requires intensive text-visual interaction. The most important and challenging problem of visual reasoning is to design an effective and robust visual reasoning model. To this end, there are two challenges to overcome. The first is that textual and visual information must be jointly considered to make accurate inferences about reasoning. The second is that existing deep learning-based works are often too specific to a particular task. To address these issues, we propose a knowledge memory embedding model with mutual modulation for visual reasoning. This approach learns not only knowledge-based embeddings derived from key-value memory network to make the full and joint of textual and visual information, but also exploits the prior knowledge to improve the performance with knowledge-based representation learning for applying other general reasoning tasks. Experimental results on four benchmarks show that the proposed approach significantly improves performance compared with other state-of-the-art methods, guarantees the robustness with our model. Most importantly, we apply our model to four reasoning tasks, and experimentally show that our model effectively supports relational reasoning and improves performance in several tasks and datasets.
关键词Visual reasoning Knowledge-based representation learning Memory network Knowledge embedding
DOI10.1016/j.inffus.2020.10.007
关键词[WOS]FUSION ; MEMORY
收录类别SCI
语种英语
资助项目National Natural Science Foundation of China[61806198] ; National Natural Science Foundation of China[61533019] ; National Natural Science Foundation of China[U1811463] ; Key Research and Development Program of Guangzhou[202007050002] ; National Key Research and Development Program of China[2018 AAA0101502]
项目资助者National Natural Science Foundation of China ; Key Research and Development Program of Guangzhou ; National Key Research and Development Program of China
WOS研究方向Computer Science
WOS类目Computer Science, Artificial Intelligence ; Computer Science, Theory & Methods
WOS记录号WOS:000598348400003
出版者ELSEVIER
七大方向——子方向分类多模态智能
引用统计
被引频次:14[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/42539
专题多模态人工智能系统全国重点实验室_平行智能技术与系统团队
通讯作者Wang, Fei-Yue
作者单位1.Xi An Jiao Tong Univ, Sch Software Engn, Xian 710049, Peoples R China
2.Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
3.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China
4.Sun Yat Sen Univ, Sch Intelligent Syst Engn, Guangzhou 510275, Peoples R China
第一作者单位中国科学院自动化研究所
通讯作者单位中国科学院自动化研究所
推荐引用方式
GB/T 7714
Zheng, Wenbo,Yan, Lan,Gou, Chao,et al. KM4: Visual reasoning via Knowledge EmbeddingMemoryModel with MutualModulation[J]. INFORMATION FUSION,2021,67:14-28.
APA Zheng, Wenbo,Yan, Lan,Gou, Chao,&Wang, Fei-Yue.(2021).KM4: Visual reasoning via Knowledge EmbeddingMemoryModel with MutualModulation.INFORMATION FUSION,67,14-28.
MLA Zheng, Wenbo,et al."KM4: Visual reasoning via Knowledge EmbeddingMemoryModel with MutualModulation".INFORMATION FUSION 67(2021):14-28.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Zheng, Wenbo]的文章
[Yan, Lan]的文章
[Gou, Chao]的文章
百度学术
百度学术中相似的文章
[Zheng, Wenbo]的文章
[Yan, Lan]的文章
[Gou, Chao]的文章
必应学术
必应学术中相似的文章
[Zheng, Wenbo]的文章
[Yan, Lan]的文章
[Gou, Chao]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。