CASIA OpenIR

Browse/Search Results: 45 items in total, showing items 1-10

Part-aware Prompt Tuning For Weakly Supervised Referring Expression Grounding (Conference paper)
Amsterdam, 2024-1-29
Authors:  Chenlin, Zhao;  Jiabo, Ye;  Yaguang, Song;  Ming, Yan;  Xiaoshan, Yang;  Changsheng, Xu
Adobe PDF(6114Kb)  |  Views/Downloads: 35/11  |  Submitted: 2024/06/21
Relative Alignment Network for Source-Free Multimodal Video Domain Adaptation (Conference paper)
MM '22: Proceedings of the 30th ACM International Conference on Multimedia, Lisboa, Portugal, 2022.10.10-2022.10.14
Authors:  Huang Yi;  Yang Xiaoshan;  Zhang Ji;  Xu Changsheng
Adobe PDF(1264Kb)  |  Views/Downloads: 226/88  |  Submitted: 2023/06/21
The Model May Fit You: User-Generalized Cross-Modal Retrieval (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 24, Pages: 2998-3012
Authors:  Ma, Xinhong;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(6549Kb)  |  Views/Downloads: 289/56  |  Submitted: 2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
Transformers in computational visual media: A survey (Journal article)
Computational Visual Media, 2021, Volume: 8, Issue: 1, Pages: 33-62
Authors:  Xu, Yifan;  Wei, Huapeng;  Lin, Minxuan;  Deng, Yingying;  Sheng, Kekai;  Zhang, Mengdan;  Tang, Fan;  Dong, Weiming;  Huang, Feiyue;  Xu, Changsheng
Adobe PDF(5366Kb)  |  Views/Downloads: 331/47  |  Submitted: 2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Unsupervised Video Summarization via Relation-Aware Assignment Learning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 3203-3214
Authors:  Gao, Junyu;  Yang, Xiaoshan;  Zhang, Yingying;  Xu, Changsheng
Adobe PDF(3649Kb)  |  Views/Downloads: 351/69  |  Submitted: 2021/11/03
Feature extraction  Training  Optimization  Semantics  Recurrent neural networks  Task analysis  Graph neural network  unsupervised learning  video summarization  
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 2386-2397
Authors:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  Views/Downloads: 356/47  |  Submitted: 2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Multi-modal Knowledge-aware Event Memory Network for Social Media Rumor Detection (Conference paper)
MM, Nice, France, October 21 - 25, 2019
Authors:  Huaiwen Zhang;  Quan Fang;  Shengsheng Qian;  Changsheng Xu
Adobe PDF(2626Kb)  |  Views/Downloads: 214/70  |  Submitted: 2021/06/30
Social Media  Rumor Detection  Multi-Modal  Knowledge Graph  Memory Network  
Knowledge-driven Egocentric Multimodal Activity Recognition (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, Volume: 16, Issue: 4, Pages: 21
Authors:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyu;  Sang, Jitao;  Xu, Changsheng
Adobe PDF(1875Kb)  |  Views/Downloads: 430/69  |  Submitted: 2021/03/08
Egocentric videos  wearable sensors  graph neural networks  
Self-Supervised Feature Augmentation for Large Image Object Detection (Journal article)
IEEE Transactions on Image Processing, 2020, Volume: 29, Issue: 0, Pages: 6745-6758
Authors:  Pan, Xingjia;  Tang, Fan;  Dong, Weiming;  Gu, Yang;  Song, Zhichao;  Meng, Yiping;  Xu, Pengfei;  Oliver, Deussen;  Xu, Changsheng
Adobe PDF(5411Kb)  |  Views/Downloads: 331/76  |  Submitted: 2020/12/21
object detection  large image  self-supervise  feature augmentation  
Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution Modeling (Conference paper)
Seoul, Korea, 2018-10
Authors:  Gao, Junyu;  Zhang, Tianzhu;  Xu, Changsheng
Adobe PDF(3416Kb)  |  Views/Downloads: 200/69  |  Submitted: 2020/06/11