CASIA OpenIR

浏览/检索结果: 共130条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
多尺度视觉语义增强的多模态命名实体识别方法 期刊论文
自动化学报, 2024, 卷号: 50, 期号: 6, 页码: 1234-1245
作者:  王海荣;  徐玺;  王彤;  陈芳萍
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:17/7  |  提交时间:2024/07/02
多模态命名实体识别  多任务学习  多模态融合  Transformer  
Born a BabyNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation 会议论文
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italia, 20-25 May, 2024
作者:  Ma, Cong;  Zhang, Yaping;  Zhang, Zhiyang;  Liang, Yupu;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(891Kb)  |  收藏  |  浏览/下载:14/7  |  提交时间:2024/06/27
Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation 会议论文
, 新奥尔良, 2023-12-9 至 2023-12-15
作者:  Keji He;  Chenyang Si;  Zhihe Lu;  Yan Huang;  Liang Wang;  Xinchao Wang
Adobe PDF(2505Kb)  |  收藏  |  浏览/下载:25/8  |  提交时间:2024/06/26
VECTOR QUANTIZATION KNOWLEDGE TRANSFER FOR END-TO-END TEXT IMAGE MACHINE TRANSLATION 会议论文
Proceedings of 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Korea, 14-19 April 2024
作者:  Ma, Cong;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1090Kb)  |  收藏  |  浏览/下载:27/11  |  提交时间:2024/06/26
E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation 会议论文
Proceedings of the 17th Document Analysis and Recognition (ICDAR 2023), San José, California, USA, August 21-26, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1430Kb)  |  收藏  |  浏览/下载:19/5  |  提交时间:2024/06/26
Multi-teacher Knowledge Distillation for End-to-End Text Image Machine Translation 会议论文
Proceedings of the 17th Document Analysis and Recognition (ICDAR 2023), San José, California, USA, August 21-26, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1478Kb)  |  收藏  |  浏览/下载:23/10  |  提交时间:2024/06/26
跨模态信息融合的文本图像翻译方法研究 学位论文
, 2024
作者:  马聪
Adobe PDF(11285Kb)  |  收藏  |  浏览/下载:38/5  |  提交时间:2024/06/26
文本图像翻译  跨模态信息融合  多任务学习  跨模态对比学习  参数高效微调  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Improving End-to-End Text Image Translation From the Auxiliary Text Translation Task 会议论文
Proceedings of the 26th International Conference on Pattern Recognition (ICPR 2022), Montréal, Québec, Canada, August 21-25, 2022
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Han, Xu;  Wu, Linghui;  Zhao, Yang;  Zhou, Yu
Adobe PDF(1891Kb)  |  收藏  |  浏览/下载:23/10  |  提交时间:2024/06/26
CCIM: Cross-modal Cross-lingual Interactive Image Translation 会议论文
Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore, December 6-10, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(373Kb)  |  收藏  |  浏览/下载:19/7  |  提交时间:2024/06/26