CASIA OpenIR

浏览/检索结果: 共77条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:84/7  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Protecting by attacking: A personal information protecting method with cross-modal adversarial examples 期刊论文
NEUROCOMPUTING, 2023, 卷号: 551, 页码: 11
作者:  Zhao, Mengnan;  Wang, Bo;  Guo, Weikuo;  Wang, Wei
收藏  |  浏览/下载:48/0  |  提交时间:2023/11/17
Security  Cross-modal  Image captioning  Adversarial attacks  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:79/11  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:151/41  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
BViT: Broad Attention-Based Vision Transformer 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: 1 - 12
作者:  Nannan Li;  Yaran Chen;  Weifan Li;  Zixiang Ding;  Dongbin Zhao;  Shuai Nie
Adobe PDF(2171Kb)  |  收藏  |  浏览/下载:160/46  |  提交时间:2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer  
Generating Emotion Descriptions for Fine Art Paintings via Multiple Painting Representations 期刊论文
IEEE Intelligent Systems, 2023, 卷号: 38, 期号: 3, 页码: 31-40
作者:  Lu, Yue;  Guo, Chao;  Dai, Xingyuan;  Wang, Fei-Yue
Adobe PDF(1045Kb)  |  收藏  |  浏览/下载:87/13  |  提交时间:2023/06/25
painting captioning  
Temporal Action Detection with Dynamic Weights Based on Curriculum Learning 期刊论文
Neurocomputing, 2023, 页码: 106-116
作者:  Chen YZ(陈云泽);  He jiang;  Junrui Xiao;  Ding Li;  Qingyi Gu
Adobe PDF(1252Kb)  |  收藏  |  浏览/下载:132/44  |  提交时间:2023/08/25
Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 页码: 1-17
作者:  Du CD(杜长德);  Fu KC(付铠成);  Li JP(李劲鹏);  He HG(何晖光)
Adobe PDF(4669Kb)  |  收藏  |  浏览/下载:373/64  |  提交时间:2023/05/05
ArtCap: A Dataset for Image Captioning of Fine Art Paintings 期刊论文
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2022, 页码: 12
作者:  Lu, Yue;  Guo, Chao;  Dai, Xingyuan;  Wang, Fei-Yue
Adobe PDF(5137Kb)  |  收藏  |  浏览/下载:209/42  |  提交时间:2023/02/22
Dataset construction  image captioning  painting captioning  
Using Pre-trained Language Model to Enhance Active Learning for Sentence Matching 期刊论文
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 卷号: 21, 期号: 2, 页码: 19
作者:  Bai, Guirong;  He, Shizhu;  Liu, Kang;  Zhao, Jun
Adobe PDF(4097Kb)  |  收藏  |  浏览/下载:235/51  |  提交时间:2022/06/10
Sentence matching  active learning  pre-trained language model