Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval
Wang, Wei1,2,3; Gao, Junyu1,2,3; Yang, Xiaoshan1,2,3; Xu, Changsheng1,2,3
发表期刊IEEE TRANSACTIONS ON MULTIMEDIA
ISSN1520-9210
2021
卷号23页码:2386-2397
摘要

We address the problem of video-text retrieval that searches videos via natural language description or vice versa. Most state-of-the-art methods only consider cross-modal learning for two or three data points in isolation, ignoring to get benefit from the structural information of other data points from a global view. In this paper, we propose to exploit the comprehensive relationships among cross-modal samples via Graph Neural Networks (GNN). To improve the discriminative ability for accurately finding the positive sample, a Coarse-to-Fine GNN is constructed, which can progressively optimize the retrieval results via multi-step reasoning. Specifically, we first adopt heuristic edge features to represent relationships. Then we design a scoring module in each layer to rank the edges connected to the query node and drop the edges with lower scores. Finally, to alleviate the class imbalance issue, we propose a random-drop focal loss to optimize the whole framework. Extensive experimental results show that our method consistently outperforms the state-of-the-arts on four benchmarks.

关键词Feature extraction Encoding Task analysis Semantics Data models Cognition Focusing Video-text retrieval graph neural network coarse-to-fine strategy
DOI10.1109/TMM.2020.3011288
关键词[WOS]FEATURES ; IMAGE
收录类别SCI
语种英语
资助项目National Key Research and Development Program of China[2018AAA0102200] ; National Natural Science Foundation of China[61720106006] ; National Natural Science Foundation of China[61721004] ; National Natural Science Foundation of China[61832002] ; National Natural Science Foundation of China[61702511] ; National Natural Science Foundation of China[61751211] ; National Natural Science Foundation of China[61532009] ; National Natural Science Foundation of China[U1836220] ; National Natural Science Foundation of China[U1705262] ; National Natural Science Foundation of China[61872424] ; National Natural Science Foundation of China[61936005] ; Key Research Program of Frontier Sciences of CAS[QYZDJSSWJSC039] ; Research Program of National Laboratory of Pattern Recognition[Z-2018007]
项目资助者National Key Research and Development Program of China ; National Natural Science Foundation of China ; Key Research Program of Frontier Sciences of CAS ; Research Program of National Laboratory of Pattern Recognition
WOS研究方向Computer Science ; Telecommunications
WOS类目Computer Science, Information Systems ; Computer Science, Software Engineering ; Telecommunications
WOS记录号WOS:000679533800018
出版者IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
七大方向——子方向分类多模态智能
国重实验室规划方向分类多模态协同认知
是否有论文关联数据集需要存交
引用统计
被引频次:23[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/45591
专题多模态人工智能系统全国重点实验室_多媒体计算
多模态人工智能系统全国重点实验室
通讯作者Xu, Changsheng
作者单位1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
3.PengCheng Lab, Shenzhen, Peoples R China
第一作者单位模式识别国家重点实验室
通讯作者单位模式识别国家重点实验室
推荐引用方式
GB/T 7714
Wang, Wei,Gao, Junyu,Yang, Xiaoshan,et al. Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval[J]. IEEE TRANSACTIONS ON MULTIMEDIA,2021,23:2386-2397.
APA Wang, Wei,Gao, Junyu,Yang, Xiaoshan,&Xu, Changsheng.(2021).Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval.IEEE TRANSACTIONS ON MULTIMEDIA,23,2386-2397.
MLA Wang, Wei,et al."Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval".IEEE TRANSACTIONS ON MULTIMEDIA 23(2021):2386-2397.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
CF-GNN.pdf(2165KB)期刊论文作者接受稿开放获取CC BY-NC-SA浏览
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Wang, Wei]的文章
[Gao, Junyu]的文章
[Yang, Xiaoshan]的文章
百度学术
百度学术中相似的文章
[Wang, Wei]的文章
[Gao, Junyu]的文章
[Yang, Xiaoshan]的文章
必应学术
必应学术中相似的文章
[Wang, Wei]的文章
[Gao, Junyu]的文章
[Yang, Xiaoshan]的文章
相关权益政策
暂无数据
收藏/分享
文件名: CF-GNN.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。