CASIA OpenIR

浏览/检索结果: 共40条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition 会议论文
, 中国, 2023.06.08
作者:  Jinzhi Zheng;  Ruyi Ji;  Libo Zhang;  Yanjun Wu;  Chen Zhao
Adobe PDF(1516Kb)  |  收藏  |  浏览/下载:35/12  |  提交时间:2024/07/08
E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation 会议论文
Proceedings of the 17th Document Analysis and Recognition (ICDAR 2023), San José, California, USA, August 21-26, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1430Kb)  |  收藏  |  浏览/下载:36/10  |  提交时间:2024/06/26
Multi-teacher Knowledge Distillation for End-to-End Text Image Machine Translation 会议论文
Proceedings of the 17th Document Analysis and Recognition (ICDAR 2023), San José, California, USA, August 21-26, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1478Kb)  |  收藏  |  浏览/下载:38/16  |  提交时间:2024/06/26
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:43/20  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
CCIM: Cross-modal Cross-lingual Interactive Image Translation 会议论文
Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore, December 6-10, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(373Kb)  |  收藏  |  浏览/下载:37/13  |  提交时间:2024/06/26
Improved Learning for Online Handwritten Chinese Text Recognition with Convolutional Prototype Network 期刊论文
ICDAR2023, 2023, 页码: 1
作者:  Chen Y(陈懿);  Zhang H(张恒);  Liu CL(刘成林)
Adobe PDF(1058Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/06/24
RC-Net: Row and Column Net with Text Feature for Deep Parsing Floor Plan Images 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 页码: 526-539
作者:  Wang T(王腾);  Meng WL(孟维亮);  Lu ZD(卢政达);  Guo JW(郭建伟);  Xiao J(肖俊);  Zhang XP(张晓鹏)
Adobe PDF(2370Kb)  |  收藏  |  浏览/下载:35/8  |  提交时间:2024/06/11
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation 会议论文
, 加拿大温哥华市, 6.18-6.22
作者:  Jie Qin;  Jie Wu;  Pengxiang Yan;  Ming Li;  Ren Yuxi;  Xuefeng Xiao;  Yitong Wang;  Rui Wang;  Shilei Wen;  Xin Pan;  Xingang Wang
Adobe PDF(5688Kb)  |  收藏  |  浏览/下载:60/17  |  提交时间:2024/06/03
Improved Video Emotion Recognition with Alignment of CNN and Human Brain Representations 期刊论文
IEEE Transactions on Affective Computing, 2023, 页码: 1-15
作者:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF(3907Kb)  |  收藏  |  浏览/下载:81/27  |  提交时间:2024/05/28
CNN-brain Alignment  Brain-guided Deep Learning  Video Emotion Recognition  Representation Similarity Analysis  
基于单字符注意力的全品类鲁棒车牌识别 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 122-134
作者:  穆世义;  徐树公
Adobe PDF(5048Kb)  |  收藏  |  浏览/下载:95/22  |  提交时间:2024/05/09
车牌识别  注意力机制  字符分割  字符分类