CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:35/18  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Exploring Explicitly Disentangled Features for Domain Generalization 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6360-6373
作者:  Li, Jingwei;  Li, Yuan;  Wang, Huanjie;  Liu, Chengbao;  Tan, Jie
Adobe PDF(2432Kb)  |  收藏  |  浏览/下载:134/14  |  提交时间:2023/12/21
Domain generalization  feature disentanglement  Fourier transform  data augmentation  
ScoreMix: A Scalable Augmentation Strategy for Training GANs With Limited Data 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 7, 页码: 8920-8935
作者:  Cao, Jie;  Luo, Mandi;  Yu, Junchi;  Yang, Ming-Hsuan;  He, Ran
Adobe PDF(1823Kb)  |  收藏  |  浏览/下载:132/9  |  提交时间:2023/11/17
Generative adversarial networks  image synthesis  data augmentation  few-shot image-to-image translation  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:204/50  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation