CASIA OpenIR

浏览/检索结果: 共97条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:39/19  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
The survey on multi-source data fusion in cyber-physical-social systems: Foundational infrastructure for industrial metaverses and industries 5.0 期刊论文
Information Fusion, 2024, 卷号: 107, 页码: 1-16
作者:  Xiao Wang;  Yutong Wang;  Jing Yang;  Xiaofeng Jia;  Lijun Li;  Weiping Ding;  Fei-Yue Wang
Adobe PDF(4446Kb)  |  收藏  |  浏览/下载:52/9  |  提交时间:2024/06/06
Multi-source data fusion  CPSS  Industrial metaverses  Parallel manufacturing  Social manufacturing  
Online biomedical named entities recognition by data and knowledge-driven model 期刊论文
Artificial Intelligence In Medicine, 2024, 卷号: 150, 页码: 102813
作者:  Lulu Cao;  Chaochen Wu;  Guan Luo;  Chao Guo;  Anni Zheng
Adobe PDF(879Kb)  |  收藏  |  浏览/下载:22/12  |  提交时间:2024/06/05
Biomedical named entity recognition  Neural network  Pre-training  Knowledge representation  Online text  
Tri-relational multi-faceted graph neural networks for automatic question tagging 期刊论文
Neurocomputing, 2024, 卷号: 576, 页码: 127250
作者:  Nuojia Xu;  Jun Hu;  Quan Fang;  Dizhan Xue;  Yongxi Li;  Shengsheng Qian
Adobe PDF(2105Kb)  |  收藏  |  浏览/下载:56/24  |  提交时间:2024/06/04
Graph Neural Networks  Community Question Answering  Question Tagging  
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:52/2  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Towards Better Quantity Representations for Solving Math Word Problems 期刊论文
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2024, 页码: -
作者:  Sun, Runxin;  He, Shizhu;  Zhao, Jun;  Liu, Kang
Adobe PDF(417Kb)  |  收藏  |  浏览/下载:54/21  |  提交时间:2024/05/28
Multi-modal spatial relational attention networks for visual question answering 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
收藏  |  浏览/下载:95/0  |  提交时间:2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre -training strategy  
PIDray: A Large-Scale X-ray Benchmark for Real-World Prohibited Item Detection (Aug, 10.1007/s11263-023-01855-1, 2023) 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 卷号: 131, 页码: 3170–3192
作者:  Zhang, Libo;  Jiang, Lutao;  Ji, Ruyi;  Fan, Heng
Adobe PDF(3227Kb)  |  收藏  |  浏览/下载:53/4  |  提交时间:2023/11/17
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  收藏  |  浏览/下载:99/9  |  提交时间:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Hierarchical Attention Networks for Fact-based Visual Question Answering 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 页码: 18
作者:  Yao, Haibo;  Luo, Yongkang;  Zhang, Zhi;  Yang, Jianhang;  Cai, Chengtao
收藏  |  浏览/下载:99/0  |  提交时间:2023/11/17
Fact-based Visual Question Answering  Hierarchical attention networks  Self-attention  Multiple attention interaction  Positional encoding