CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:24/12  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Molecular Contrastive Pretraining with Collaborative Featurizations 期刊论文
Journal of Chemical Information and Modeling (JCIM), 2024, 卷号: 64, 期号: 4, 页码: 1112–1122
作者:  Yanqiao Zhu;  Dingshuo Chen;  Yuanqi Du;  Yingze Wang;  Qiang Liu;  Shu Wu
Adobe PDF(1868Kb)  |  收藏  |  浏览/下载:23/9  |  提交时间:2024/06/21
Diff-pcg: diffusion point cloud generation conditioned on continuous normalizing flow 期刊论文
The Visual Computer, 2024, 页码: 1-15
作者:  Yu T(余挺);  Meng WL(孟维亮);  Wu ZQ(吴仲琦);  Guo JW(郭建伟);  Zhang XP(张晓鹏)
Adobe PDF(2471Kb)  |  收藏  |  浏览/下载:35/10  |  提交时间:2024/06/11
3D shape generation  Diffusion model  Continuous normalizing flow  Point cloud  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:60/18  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
SpatioTemporal Inference Network for Precipitation Nowcasting With Multimodal Fusion 期刊论文
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 卷号: 17, 页码: 1299-1314
作者:  Jin, Qizhao;  Zhang, Xinbang;  Xiao, Xinyu;  Wang, Ying;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(8766Kb)  |  收藏  |  浏览/下载:83/7  |  提交时间:2024/02/21
Data mining  multimodal knowledge discovery  precipitation nowcasting  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:199/50  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation