CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共21条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:88/11  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Weakly-supervised video object grounding via causal intervention 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 卷号: 45, 期号: 3, 页码: 3933 - 3948
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(4558Kb)  |  收藏  |  浏览/下载:121/51  |  提交时间:2023/04/25
Many Hands Make Light Work: Transferring Knowledge from Auxiliary Tasks for Video-Text Retrieval 期刊论文
IEEE Transactions on Multimedia, 2022, 页码: 1-15
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(3679Kb)  |  收藏  |  浏览/下载:94/17  |  提交时间:2023/04/25
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:301/42  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Learning Multimodal Taxonomy via Variational Deep Graph Embedding and Clustering 会议论文
, Seoul, Republic of Korea, October 22 - 26, 2018
作者:  Huaiwen Zhang;  Quan Fang;  Shengsheng Qian;  Changsheng Xu
浏览  |  Adobe PDF(3204Kb)  |  收藏  |  浏览/下载:281/87  |  提交时间:2019/09/26
Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution Modeling 会议论文
, Seoul, Korea, 2018-10
作者:  Gao, Junyu;  Zhang, Tianzhu;  Xu, Changsheng
浏览  |  Adobe PDF(3416Kb)  |  收藏  |  浏览/下载:168/59  |  提交时间:2020/06/11
Multi-modal max-margin supervised topic model for social event analysis 期刊论文
Multimedia Tools and Applications, 2018, 期号: 99, 页码: 1–20
作者:  Feng Xue;  Jianwei Wang;  Shengsheng Qian;  Tianzhu Zhang;  Xueliang Liu;  Changsheng Xu
浏览  |  Adobe PDF(2883Kb)  |  收藏  |  浏览/下载:383/125  |  提交时间:2018/02/07
Social Event Classification  Multi-modal  Max-margin  Social Media  Topic Model  
Multi-Modal Knowledge Representation Learning via Webly-Supervised Relationships Mining 会议论文
, Mountain View, CA, USA, 2017-10
作者:  Fudong Nian (年福东);  Bing-Kun BAO;  Teng Li;  Changsheng Xu
浏览  |  Adobe PDF(7830Kb)  |  收藏  |  浏览/下载:316/110  |  提交时间:2018/01/04
多媒体社会事件分析的研究与展望 期刊论文
南京信息工程大学学报(自然科学版), 2017, 期号: 6, 页码: 1-14
作者:  Shengsheng Qian;  Tianzhu Zhang;  Changsheng Xu
浏览  |  Adobe PDF(1114Kb)  |  收藏  |  浏览/下载:482/203  |  提交时间:2018/02/07
多媒体  社会事件  多模态  跨平台  
Relational User Attribute Inference in Social Media 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 卷号: 17, 期号: 7, 页码: 1031-1044
作者:  Fang, Quan;  Sang, Jitao;  Xu, Changsheng;  Hossain, M. Shamim
Adobe PDF(2825Kb)  |  收藏  |  浏览/下载:364/118  |  提交时间:2015/09/17
Attribute Relation  Latent Svm (lSvm)  User Attribute Inference