CASIA OpenIR

浏览/检索结果: 共19条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Towards Prior Gap and Representation Gap for Long-tailed Recognition, Pattern Recognition 期刊论文
Pattern Recognition, 2023, 卷号: 133, 期号: 109012, 页码: 109012
作者:  Zhang Ming-Liang;  Zhang Xu-Yao;  Wang Chang;  Liu Cheng-Lin
Adobe PDF(2258Kb)  |  收藏  |  浏览/下载:66/13  |  提交时间:2024/04/03
Long-tailed learning  Prior gap  Representation gap  Image recognition  
A Multi-Modal Neural Geometric Solver with Textual Clauses Parsed from Diagram 会议论文
, 中国 澳门, 2023-7-19
作者:  Zhang Ming-Liang;  Yin Fei;  Liu Cheng-Lin
Adobe PDF(1110Kb)  |  收藏  |  浏览/下载:52/13  |  提交时间:2024/04/03
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:42/13  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3622-3633
作者:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
收藏  |  浏览/下载:99/0  |  提交时间:2023/11/17
Index Terms-Image-text retrieval  multimodal transformer  multimodal contrastive training  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:103/15  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
A Closer Look at Self-Supervised Lightweight Vision Transformers 会议论文
, Honolulu, Hawaii, USA, 2023-7
作者:  Wang, Shaoru;  Gao, Jin;  Li, Zeming;  Zhang, Xiaoqin;  Weiming, Hu
Adobe PDF(3478Kb)  |  收藏  |  浏览/下载:204/64  |  提交时间:2023/09/20
Vision Transformer  Self-supervised Learning  Lightweight Networks  Knowledge Distillation  
Cross-Stream Contrastive Learning for Self-Supervised Skeleton-Based Action Recognition 期刊论文
Image and Vision Computing, 2023, 期号: 135, 页码: 15-26
作者:  Ding Li;  Yongqiang Tang;  Wensheng Zhang
Adobe PDF(2330Kb)  |  收藏  |  浏览/下载:128/38  |  提交时间:2023/06/28
Adaptive Search for Broad Attention based Vision Transformers 期刊论文
IEEE Transactions on Evolutionary Computation, 2023, 页码: 0-0
作者:  Nannan Li;  Yaran Chen;  Dongbin Zhao
Adobe PDF(824Kb)  |  收藏  |  浏览/下载:159/48  |  提交时间:2023/06/28
Contrastive Adversarial Training for Multi-Modal Machine Translation 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, 卷号: 22, 期号: 6, 页码: 157:1-18
作者:  Huang X(黄鑫);  Zhang JJ(张家俊);  Zong CQ(宗成庆)
Adobe PDF(2387Kb)  |  收藏  |  浏览/下载:208/59  |  提交时间:2023/06/26
contrastive learning  adversarial training  multi-modal machine translation  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:138/28  |  提交时间:2023/06/21