CASIA OpenIR

浏览/检索结果: 共8条,第1-8条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:103/15  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Weakly-Supervised Video Object Grounding Via Learning Uni-Modal Associations 期刊论文
IEEE Transactions on Multimedia, 2022, 卷号: 25, 页码: 1-12
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(5406Kb)  |  收藏  |  浏览/下载:99/29  |  提交时间:2023/04/25
Visualization  Grounding  Task analysis  Prototypes  Annotations  Uncertainty  Proposals  Cross-modal retrieval  weakly-supervised learning  video object grounding  uni-modal association  
Holographic Feature Learning of Egocentric-Exocentric Videos for Multi-Domain Action Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 卷号: 24, 页码: 2273-2286
作者:  Huang, Yi;  Yang, Xiaoshan;  Gao, Junyun;  Xu, Changsheng
Adobe PDF(2409Kb)  |  收藏  |  浏览/下载:331/64  |  提交时间:2022/07/25
Videos  Feature extraction  Visualization  Task analysis  Computational modeling  Target recognition  Prototypes  Egocentric videos  exocentric videos  holographic feature  multi-domain  action recognition  
Visual Question Answering With Dense Inter- and Intra-Modality Interactions 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 23, 页码: 3518-3529
作者:  Liu, Fei;  Liu, Jing;  Fang, Zhiwei;  Hong, Richang;  Lu, Hanqing
Adobe PDF(2891Kb)  |  收藏  |  浏览/下载:285/60  |  提交时间:2021/12/28
Visualization  Knowledge discovery  Connectors  Encoding  Task analysis  Image coding  Stacking  Visual question answering  attention  dense interactions  
EDP: An Efficient Decomposition and Pruning Scheme for Convolutional Neural Network Compression 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 卷号: 32, 期号: 0, 页码: 0
作者:  Ruan, Xiaofeng;  Liu, Yufan;  Yuan, Chunfeng;  Li, Bing;  Hu, Weiming;  Li, Yangxi;  Maybank, Stephen
Adobe PDF(3625Kb)  |  收藏  |  浏览/下载:297/43  |  提交时间:2021/06/17
Data-driven  low-rank decomposition  model compression and acceleration  structured pruning  
Show, Tell, and Polish: Ruminant Decoding for Image Captioning 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 卷号: 22, 期号: 8, 页码: 2149-2162
作者:  Guo, Longteng;  Liu, Jing;  Lu, Shichen;  Lu, Hanqing
Adobe PDF(4378Kb)  |  收藏  |  浏览/下载:204/31  |  提交时间:2020/08/31
Image captioning  Multi-pass decoding  Rumination  
Deep Self-Evolution Clustering 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 卷号: 42, 期号: 4, 页码: 809-823
作者:  Chang, Jianlong;  Meng, Gaofeng;  Wang, Lingfeng;  Xiang, Shiming;  Pan, Chunhong
浏览  |  Adobe PDF(4817Kb)  |  收藏  |  浏览/下载:408/89  |  提交时间:2020/06/02
Task analysis  Unsupervised learning  Training  Clustering methods  Pattern analysis  Clustering  deep self-evolution clustering  self-evolution clustering training  deep unsupervised learning  
Dense semantic embedding network for image captioning 期刊论文
PATTERN RECOGNITION, 2019, 卷号: 90, 页码: 285-296
作者:  Xiao, Xinyu;  Wang, Lingfeng;  Ding, Kun;  Xiang, Shiming;  Pan, Chunhong
收藏  |  浏览/下载:365/0  |  提交时间:2019/04/23
Image captioning  Retrieval  High-level semantic information  Visual concept  Densely embedding  Long short-term memory