CASIA OpenIR

Browse/Search results: 208 items in total, items 1-10

Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors: Ma, Chengcheng; Liu, Yang; Deng, Jiankang; Xie, Lingxi; Dong, Weiming; Xu, Changsheng
Adobe PDF (1644 KB) | Views/Downloads: 81/11 | Submitted: 2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Medical visual question answering with symmetric interaction attention and cross-modal gating (Journal article)
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, Volume: 85, Pages: 10
Authors: Chen, Zhi; Zou, Beiji; Dai, Yulan; Zhu, Chengzhang; Kong, Guilan; Zhang, Wensheng
Views/Downloads: 63/0 | Submitted: 2023/11/17
Medical visual question answering  Self-attention  Information interaction  Cross-modal gating  
Contrastive Adversarial Training for Multi-Modal Machine Translation (Journal article)
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, Volume: 22, Issue: 6, Pages: 157:1-18
Authors: Huang X(黄鑫); Zhang JJ(张家俊); Zong CQ(宗成庆)
Adobe PDF (2387 KB) | Views/Downloads: 172/55 | Submitted: 2023/06/26
contrastive learning  adversarial training  multi-modal machine translation  
Instance-aware Prompt Learning for Language Understanding and Generation (Journal article)
TALLIP, 2023, Pages: 19
Authors: Jin Feihu; Lu Jinliang; Zhang Jiajun; Zong Chengqing
Adobe PDF (1091 KB) | Views/Downloads: 148/45 | Submitted: 2023/06/14
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors: Song, Yaguang; Yang, Xiaoshan; Wang, Yaowei; Xu, Changsheng
Adobe PDF (2397 KB) | Views/Downloads: 152/41 | Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
BViT: Broad Attention-Based Vision Transformer (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2023, Pages: 1-12
Authors: Nannan Li; Yaran Chen; Weifan Li; Zixiang Ding; Dongbin Zhao; Shuai Nie
Adobe PDF (2171 KB) | Views/Downloads: 160/46 | Submitted: 2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer  
Texts as points: Scene text detection with point supervision (Journal article)
Pattern Recognition Letters, 2023, Volume: 170, Pages: 1-8
Authors: Mengbiao Zhao; Wei Feng; Fei Yin; Cheng-Lin Liu
Adobe PDF (1670 KB) | Views/Downloads: 119/37 | Submitted: 2023/06/28
Scene text detection  Point supervision  Mixed-supervised learning  
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation (Journal article)
ACM Transactions on Multimedia Computing, Communications, and Applications, 2023, Volume: 19, Issue: 1s, Pages: 1-23
Authors: Song, Yaguang; Yang, Xiaoshan; Xu, Changsheng
Adobe PDF (1381 KB) | Views/Downloads: 148/53 | Submitted: 2023/06/12
Food recommendation  recipe calories  heterogeneous graph  self-supervised learning  
XANet: An Efficient Remote Sensing Image Segmentation Model Using Element-Wise Attention Enhancement and Multi-Scale Attention Fusion (Journal article)
REMOTE SENSING, 2023, Volume: 15, Issue: 1, Pages: 25
Authors: Liang, Chenbin; Xiao, Baihua; Cheng, Bo; Dong, Yunyun
Adobe PDF (63859 KB) | Views/Downloads: 239/25 | Submitted: 2023/02/22
semantic segmentation  attention mechanism  cross-attention  feature fusion  
Graph-Enhanced Emotion Neural Decoding (Journal article)
IEEE Transactions on Medical Imaging, 2023, Pages: 1-1
Authors: Huang ZY(黄中昱); Du CD(杜长德); Wang YH; Fu KC(付铠成); He HG(何晖光)
Adobe PDF (6049 KB) | Views/Downloads: 228/51 | Submitted: 2023/05/05