CASIA OpenIR

浏览/检索结果: 共10条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multi-teacher Knowledge Distillation for End-to-End Text Image Machine Translation 会议论文
Proceedings of the 17th Document Analysis and Recognition (ICDAR 2023), San José, California, USA, August 21-26, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1478Kb)  |  收藏  |  浏览/下载:31/13  |  提交时间:2024/06/26
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:31/16  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation 会议论文
, 加拿大温哥华市, 6.18-6.22
作者:  Jie Qin;  Jie Wu;  Pengxiang Yan;  Ming Li;  Ren Yuxi;  Xuefeng Xiao;  Yitong Wang;  Rui Wang;  Shilei Wen;  Xin Pan;  Xingang Wang
Adobe PDF(5688Kb)  |  收藏  |  浏览/下载:49/13  |  提交时间:2024/06/03
Pre-training in Medical Data: A Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 147-149
作者:  Yixuan Qiu;  Feng Lin;  Weitong Chen;  Miao Xu
Adobe PDF(2262Kb)  |  收藏  |  浏览/下载:54/16  |  提交时间:2024/04/23
Medical data  pre-training  transfer learning  self-supervised learning  medical image data  electrocardiograms (ECG) data  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:55/17  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
SignParser: An End-to-End Framework for Traffic Sign Understanding 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 卷号: 132, 期号: 2, 页码: 805-821
作者:  Guo, Yunfei;  Feng, Wei;  Yin, Fei;  Liu, Cheng-Lin
Adobe PDF(7011Kb)  |  收藏  |  浏览/下载:133/7  |  提交时间:2023/12/21
Traffic sign understanding  Content reasoning  Semantic description generation  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:152/5  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
基于结构信息增强的图神经网络研究 学位论文
, 2023
作者:  呼奋宇
Adobe PDF(4023Kb)  |  收藏  |  浏览/下载:100/4  |  提交时间:2023/07/03
图神经网络  结构信息  邻居交互  注意力机制  多专家融合  多任务学习  
融合图片信息的神经机器翻译方法研究 学位论文
, 2023
作者:  黄鑫
Adobe PDF(10395Kb)  |  收藏  |  浏览/下载:180/12  |  提交时间:2023/06/26
神经机器翻译  跨模态信息融合  多任务学习  对比学习  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:176/34  |  提交时间:2023/06/21