CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共23条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:105/15  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Many Hands Make Light Work: Transferring Knowledge from Auxiliary Tasks for Video-Text Retrieval 期刊论文
IEEE Transactions on Multimedia, 2022, 页码: 1-15
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(3679Kb)  |  收藏  |  浏览/下载:110/20  |  提交时间:2023/04/25
Weakly-supervised video object grounding via causal intervention 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 卷号: 45, 期号: 3, 页码: 3933 - 3948
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(4558Kb)  |  收藏  |  浏览/下载:126/53  |  提交时间:2023/04/25
Learning Coarse-to-Fine Graph Neural Networks for Video-Text Retrieval 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 2386-2397
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(2165Kb)  |  收藏  |  浏览/下载:322/44  |  提交时间:2021/11/02
Feature extraction  Encoding  Task analysis  Semantics  Data models  Cognition  Focusing  Video-text retrieval  graph neural network  coarse-to-fine strategy  
Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution Modeling 会议论文
, Seoul, Korea, 2018-10
作者:  Gao, Junyu;  Zhang, Tianzhu;  Xu, Changsheng
浏览  |  Adobe PDF(3416Kb)  |  收藏  |  浏览/下载:170/61  |  提交时间:2020/06/11
Learning Multimodal Taxonomy via Variational Deep Graph Embedding and Clustering 会议论文
, Seoul, Republic of Korea, October 22 - 26, 2018
作者:  Huaiwen Zhang;  Quan Fang;  Shengsheng Qian;  Changsheng Xu
浏览  |  Adobe PDF(3204Kb)  |  收藏  |  浏览/下载:293/92  |  提交时间:2019/09/26
Multi-modal max-margin supervised topic model for social event analysis 期刊论文
Multimedia Tools and Applications, 2018, 期号: 99, 页码: 1–20
作者:  Feng Xue;  Jianwei Wang;  Shengsheng Qian;  Tianzhu Zhang;  Xueliang Liu;  Changsheng Xu
浏览  |  Adobe PDF(2883Kb)  |  收藏  |  浏览/下载:395/128  |  提交时间:2018/02/07
Social Event Classification  Multi-modal  Max-margin  Social Media  Topic Model  
Multi-Modal Knowledge Representation Learning via Webly-Supervised Relationships Mining 会议论文
, Mountain View, CA, USA, 2017-10
作者:  Fudong Nian (年福东);  Bing-Kun BAO;  Teng Li;  Changsheng Xu
浏览  |  Adobe PDF(7830Kb)  |  收藏  |  浏览/下载:326/110  |  提交时间:2018/01/04
Graph-Guided Fusion Penalty Based Sparse Coding for Image Classification 会议论文
PCM, 南京, 2013
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng;  Xu CS(徐常胜)
浏览  |  Adobe PDF(404Kb)  |  收藏  |  浏览/下载:288/77  |  提交时间:2016/06/27
Image Classification  Sparse Coding  Smoothing Proximal Gradient  
Locality Discriminative Coding for Image Classification 会议论文
ICMICS, 安徽黄山, 2013-8
作者:  Yang, Xiaoshan;  Zhang, Tianzhu;  Xu, Changsheng;  Xu CS(徐常胜)
浏览  |  Adobe PDF(636Kb)  |  收藏  |  浏览/下载:303/72  |  提交时间:2016/06/27
Bag-of-words  Feature Coding  Discriminative