CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Stage-Aware Hierarchical Attentive Relational Network for Diagnosis Prediction 期刊论文
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023, 卷号: 36, 期号: 4, 页码: 1773-1784
作者:  Liping Wang;  Qiang Liu;  Mengqi Zhang;  Yaxuan Hu;  Shu Wu;  Liang Wang
Adobe PDF(2088Kb)  |  收藏  |  浏览/下载:22/7  |  提交时间:2024/06/21
Medical diagnostic imaging  Knowledge graphs  Ontologies  Codes  Data models  Predictive models  Graph neural networks  Diagnosis prediction  electronic health record  knowledge graph  relational graph neural network  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:52/17  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Bird's-Eye-View Semantic Segmentation With Two-Stream Compact Depth Transformation and Feature Rectification 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 11, 页码: 4546-4558
作者:  Liu, Jierui;  Cao, Zhiqiang;  Yang, Jing;  Liu, Xilong;  Yang, Yuequan;  Qu, Zhiyou
Adobe PDF(21890Kb)  |  收藏  |  浏览/下载:76/11  |  提交时间:2024/03/27
Bird's-eye-view  semantic segmentation  two-stream compact depth transformation  feature rectification  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:196/49  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: Early Access, 期号: Early Access, 页码: Early Access
作者:  Shentong Mo;  Xin M(辛淼)
Adobe PDF(2209Kb)  |  收藏  |  浏览/下载:102/18  |  提交时间:2023/04/25
long-term forecasting  spatial-temporal graph transformer  Bayesian transformer  uncertainty estimation