CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 8 items in total, showing items 1-8

Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning (Conference paper)
Chengdu, China, 2021-10
Authors: Zhang X (张熙); Feifei Zhang; Changsheng Xu
Adobe PDF (5740Kb) | Views/Downloads: 29/7 | Submitted: 2024/07/08
Multimodal Global Relation Knowledge Distillation for Egocentric Action Anticipation (Conference paper)
MM '21: Proceedings of the 29th ACM International Conference on Multimedia, Chengdu, China, 2021.10.20-2021.10.24
Authors: Huang Yi; Yang Xiaoshan; Xu Changsheng
Adobe PDF (1162Kb) | Views/Downloads: 189/74 | Submitted: 2023/06/21
Weakly-Supervised Video Object Grounding via Stable Context Learning (Conference paper)
New York, USA, 2021-10-20
Authors: Wang, Wei; Gao, Junyu; Xu, Changsheng
Adobe PDF (2062Kb) | Views/Downloads: 55/26 | Submitted: 2023/04/25
The Model May Fit You: User-Generalized Cross-Modal Retrieval (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 24, Pages: 2998-3012
Authors: Ma, Xinhong; Yang, Xiaoshan; Gao, Junyu; Xu, Changsheng
Adobe PDF (6549Kb) | Views/Downloads: 285/55 | Submitted: 2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
Transformers in computational visual media: A survey (Journal article)
Computational Visual Media, 2021, Volume: 8, Issue: 1, Pages: 33-62
Authors: Xu, Yifan; Wei, Huapeng; Lin, Minxuan; Deng, Yingying; Sheng, Kekai; Zhang, Mengdan; Tang, Fan; Dong, Weiming; Huang, Feiyue; Xu, Changsheng
Adobe PDF (5366Kb) | Views/Downloads: 327/47 | Submitted: 2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Unsupervised Video Summarization via Relation-Aware Assignment Learning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 3203-3214
Authors: Gao, Junyu; Yang, Xiaoshan; Zhang, Yingying; Xu, Changsheng
Adobe PDF (3649Kb) | Views/Downloads: 347/69 | Submitted: 2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 2386-2397
Authors: Wang, Wei; Gao, Junyu; Yang, Xiaoshan; Xu, Changsheng
Adobe PDF (2165Kb) | Views/Downloads: 355/47 | Submitted: 2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Adversarial Multimodal Network for Movie Story Question Answering (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 1744-1756
Authors: Yuan, Zhaoquan; Sun, Siyuan; Duan, Lixin; Li, Changsheng; Wu, Xiao; Xu, Changsheng
Views/Downloads: 195/0 | Submitted: 2021/08/15
Knowledge discovery  Motion pictures  Visualization  Task analysis  Generators  Gallium nitride  Natural languages  Movie question answering  adversarial network  multimodal understanding