CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共13条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 4334-4347
作者:  Xiao, Linhui;  Yang, Xiaoshan;  Peng, Fang;  Yan, Ming;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:31/0  |  提交时间:2024/05/30
Grounding  Reliability  Adaptation models  Task analysis  Visualization  Data models  Annotations  Visual grounding  curriculum learning  pseudo-language label  and vision-language models  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:167/26  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation 期刊论文
ACM Transactions on Multimedia Computing, Communications, and Applications, 2023, 卷号: 19, 期号: 1s, 页码: 1-23
作者:  Song, Yaguang;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(1381Kb)  |  收藏  |  浏览/下载:218/67  |  提交时间:2023/06/12
Food recommendation  recipe calories  heterogeneous graph  selfsupervised learning  
The Model May Fit You: User-Generalized Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 24, 页码: 2998-3012
作者:  Ma, Xinhong;  Yang, Xiaoshan;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(6549Kb)  |  收藏  |  浏览/下载:289/56  |  提交时间:2022/06/17
cross-modal retrieval  domain generalization  meta-learning  
Transformers in computational visual media: A survey 期刊论文
Computational Visual Media, 2021, 卷号: 8, 期号: 1, 页码: 33-62
作者:  Xu,Yifan;  Wei,Huapeng;  Lin,Minxuan;  Deng,Yingying;  Sheng,Kekai;  Zhang,Mengdan;  Tang,Fan;  Dong,Weiming;  Huang,Feiyue;  Xu,Changsheng
Adobe PDF(5366Kb)  |  收藏  |  浏览/下载:330/47  |  提交时间:2021/12/28
visual transformer  computational visual media (CVM)  high-level vision  low-level vision  image generation  multi-modal learning  
Cross-domain personalized image captioning 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 卷号: 79, 期号: 45-46, 页码: 33333-33348
作者:  Long, Cuirong;  Yang, Xiaoshan;  Xu, Changsheng
收藏  |  浏览/下载:201/0  |  提交时间:2021/03/02
Personalization  Image captioning  Domain adaptation  
Deep Structured Event Modeling for User-Generated Photos 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2018, 卷号: 20, 期号: 8, 页码: 2100-2113
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng
浏览  |  Adobe PDF(1164Kb)  |  收藏  |  浏览/下载:406/93  |  提交时间:2018/01/03
Event Analysis  Unusual Event Detection  Deep Learning  
Semantic Feature Mining for Video Event Understanding 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2016, 卷号: 12, 期号: 4, 页码: 55:1-55:22
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng
浏览  |  Adobe PDF(910Kb)  |  收藏  |  浏览/下载:510/166  |  提交时间:2016/12/26
Video Recognition  Event  
Cross-Domain Feature Learning in Multimedia 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 卷号: 17, 期号: 1, 页码: 64-78
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng;  Xu CS(徐常胜)
浏览  |  Adobe PDF(3097Kb)  |  收藏  |  浏览/下载:415/115  |  提交时间:2016/06/27
Cross-domain  Deep Learning  Feature Learning  Multi-modal  
Automatic Visual Concept Learning for Social Event Understanding 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 卷号: 17, 期号: 3, 页码: 346-358
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng;  Hossain, M. Shamim
浏览  |  Adobe PDF(2027Kb)  |  收藏  |  浏览/下载:403/124  |  提交时间:2015/09/21
Event Analysis  Video Recognition