CASIA OpenIR

Browse/search results: 169 items in total, showing items 1-10

AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, Pages: 1-15
Authors: Sun, Jianxin; Deng, Qiyao; Li, Qi; Sun, Muyi; Liu, Yunfan; Sun, Zhenan
Adobe PDF (16839Kb)  |  Views/Downloads: 39/7  |  Submitted: 2024/02/23
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors: Ma, Chengcheng; Liu, Yang; Deng, Jiankang; Xie, Lingxi; Dong, Weiming; Xu, Changsheng
Adobe PDF (1644Kb)  |  Views/Downloads: 91/12  |  Submitted: 2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Contrastive Adversarial Training for Multi-Modal Machine Translation (Journal article)
ACM Transactions on Asian and Low-Resource Language Information Processing, 2023, Volume: 22, Issue: 6, Pages: 157:1-18
Authors: Huang X (黄鑫); Zhang JJ (张家俊); Zong CQ (宗成庆)
Adobe PDF (2387Kb)  |  Views/Downloads: 185/56  |  Submitted: 2023/06/26
contrastive learning  adversarial training  multi-modal machine translation  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors: Song, Yaguang; Yang, Xiaoshan; Wang, Yaowei; Xu, Changsheng
Adobe PDF (2397Kb)  |  Views/Downloads: 158/42  |  Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
BViT: Broad Attention-Based Vision Transformer (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2023, Pages: 1-12
Authors: Nannan Li; Yaran Chen; Weifan Li; Zixiang Ding; Dongbin Zhao; Shuai Nie
Adobe PDF (2171Kb)  |  Views/Downloads: 173/47  |  Submitted: 2023/06/27
Broad attention  broad connection  image classification  parameter-free attention  vision transformer  
Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation (Conference paper)
Dublin, Ireland, 2023-8-20
Authors: Minglun Han; Feilong Chen; Jing Shi; Shuang Xu; Bo Xu
Adobe PDF (563Kb)  |  Views/Downloads: 135/50  |  Submitted: 2023/06/20
Self-supervised Calorie-aware Heterogeneous Graph Networks for Food Recommendation (Journal article)
ACM Transactions on Multimedia Computing, Communications, and Applications, 2023, Volume: 19, Issue: 1s, Pages: 1-23
Authors: Song, Yaguang; Yang, Xiaoshan; Xu, Changsheng
Adobe PDF (1381Kb)  |  Views/Downloads: 162/53  |  Submitted: 2023/06/12
Food recommendation  recipe calories  heterogeneous graph  self-supervised learning
Zero-shot language extension for dialogue state tracking via pre-trained models and multi-auxiliary-tasks fine-tuning (Journal article)
KNOWLEDGE-BASED SYSTEMS, 2023, Volume: 259, Pages: 14
Authors: Xiang, Lu; Zhao, Yang; Zhu, Junnan; Zhou, Yu; Zong, Chengqing
Views/Downloads: 122/0  |  Submitted: 2023/03/20
Dialogue state tracking  Zero-shot language extension  Multilingual DST  Pre-trained models  Multi-auxiliary-tasks fine-tuning
RiDDLE: Reversible and Diversified De-identification with Latent Encryptor (Conference paper)
Vancouver Convention Center, Jun 18th - 22nd 2023
Authors: Dongze Li; Wei Wang; Kang Zhao; Jing Dong; Tieniu Tan
Adobe PDF (3690Kb)  |  Views/Downloads: 175/58  |  Submitted: 2023/04/26
Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, Pages: 1-17
Authors: Du CD (杜长德); Fu KC (付铠成); Li JP (李劲鹏); He HG (何晖光)
Adobe PDF (4669Kb)  |  Views/Downloads: 375/64  |  Submitted: 2023/05/05