CASIA OpenIR

浏览/检索结果: 共30条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:18/9  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
The survey on multi-source data fusion in cyber-physical-social systems: Foundational infrastructure for industrial metaverses and industries 5.0 期刊论文
Information Fusion, 2024, 卷号: 107, 页码: 1-16
作者:  Xiao Wang;  Yutong Wang;  Jing Yang;  Xiaofeng Jia;  Lijun Li;  Weiping Ding;  Fei-Yue Wang
Adobe PDF(4446Kb)  |  收藏  |  浏览/下载:33/3  |  提交时间:2024/06/06
Multi-source data fusion  CPSS  Industrial metaverses  Parallel manufacturing  Social manufacturing  
Online biomedical named entities recognition by data and knowledge-driven model 期刊论文
Artificial Intelligence In Medicine, 2024, 卷号: 150, 页码: 102813
作者:  Lulu Cao;  Chaochen Wu;  Guan Luo;  Chao Guo;  Anni Zheng
Adobe PDF(879Kb)  |  收藏  |  浏览/下载:12/6  |  提交时间:2024/06/05
Biomedical named entity recognition  Neural network  Pre-training  Knowledge representation  Online text  
Tri-relational multi-faceted graph neural networks for automatic question tagging 期刊论文
Neurocomputing, 2024, 卷号: 576, 页码: 127250
作者:  Nuojia Xu;  Jun Hu;  Quan Fang;  Dizhan Xue;  Yongxi Li;  Shengsheng Qian
Adobe PDF(2105Kb)  |  收藏  |  浏览/下载:32/13  |  提交时间:2024/06/04
Graph Neural Networks  Community Question Answering  Question Tagging  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:50/17  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
Coarse Mask Guided Interactive Object Segmentation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 5808-5822
作者:  Li, Jing;  Fan, Junsong;  Wang, Yuxi;  Yang, Yuran;  Zhang, Zhaoxiang
Adobe PDF(4323Kb)  |  收藏  |  浏览/下载:53/2  |  提交时间:2024/02/22
Segmentation  interactive  transformer  annotation tool  
SpatioTemporal Inference Network for Precipitation Nowcasting With Multimodal Fusion 期刊论文
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 卷号: 17, 页码: 1299-1314
作者:  Jin, Qizhao;  Zhang, Xinbang;  Xiao, Xinyu;  Wang, Ying;  Meng, Gaofeng;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(8766Kb)  |  收藏  |  浏览/下载:70/5  |  提交时间:2024/02/21
Data mining  multimodal knowledge discovery  precipitation nowcasting  
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  收藏  |  浏览/下载:82/1  |  提交时间:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:195/48  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Weakly-Supervised Video Object Grounding Via Learning Uni-Modal Associations 期刊论文
IEEE Transactions on Multimedia, 2022, 卷号: 25, 页码: 1-12
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(5406Kb)  |  收藏  |  浏览/下载:132/39  |  提交时间:2023/04/25
Visualization  Grounding  Task analysis  Prototypes  Annotations  Uncertainty  Proposals  Cross-modal retrieval  weakly-supervised learning  video object grounding  uni-modal association