CASIA OpenIR

浏览/检索结果: 共7条,第1-7条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:18/9  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Stage-Aware Hierarchical Attentive Relational Network for Diagnosis Prediction 期刊论文
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2023, 卷号: 36, 期号: 4, 页码: 1773-1784
作者:  Liping Wang;  Qiang Liu;  Mengqi Zhang;  Yaxuan Hu;  Shu Wu;  Liang Wang
Adobe PDF(2088Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/06/21
Medical diagnostic imaging  Knowledge graphs  Ontologies  Codes  Data models  Predictive models  Graph neural networks  Diagnosis prediction  electronic health record  knowledge graph  relational graph neural network  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:49/16  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Bird's-Eye-View Semantic Segmentation With Two-Stream Compact Depth Transformation and Feature Rectification 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 11, 页码: 4546-4558
作者:  Liu, Jierui;  Cao, Zhiqiang;  Yang, Jing;  Liu, Xilong;  Yang, Yuequan;  Qu, Zhiyou
Adobe PDF(21890Kb)  |  收藏  |  浏览/下载:72/10  |  提交时间:2024/03/27
Bird's-eye-view  semantic segmentation  two-stream compact depth transformation  feature rectification  
3D Semantic Segmentation of Aerial Photogrammetry Models Based on Orthographic Projection 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 33, 期号: 12, 页码: early-access
作者:  Mengqi Rong;  Shuhan Shen
Adobe PDF(5811Kb)  |  收藏  |  浏览/下载:142/43  |  提交时间:2023/09/25
3D scenes  semantic segmentation  orthographic projection  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:195/48  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Temporal sparse adversarial attack on sequence-based gait recognition 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 133, 页码: 11
作者:  He, Ziwen;  Wang, Wei;  Dong, Jing;  Tan, Tieniu
Adobe PDF(1435Kb)  |  收藏  |  浏览/下载:356/64  |  提交时间:2022/11/21
Adversarial attack  Gait recognition  Temporal sparsity