CASIA OpenIR

浏览/检索结果: 共380条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Image captioning: Semantic selection unit with stacked residual attention 期刊论文
IMAGE AND VISION COMPUTING, 2024, 卷号: 144, 页码: 12
作者:  Song, Lifei;  Li, Fei;  Wang, Ying;  Liu, Yu;  Wang, Yuanhua;  Xiang, Shiming
收藏  |  浏览/下载:0/0  |  提交时间:2024/07/03
Image captioning  Semantic attributes  Semantic selection unit  Transformer  Stacked residual attention  
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 4, 页码: 1913-1931
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
收藏  |  浏览/下载:0/0  |  提交时间:2024/07/03
Visualization  Feature extraction  Benchmark testing  Correlation  Predictive models  Cognition  Training  Benchmark  bias  cross-sample information  debias learning  multiple-choice VQA  
Autonomy Evaluation of Unmanned Systems Based on Task Models 期刊论文
Machine Intelligence Research, 2024, 页码: 1-16
作者:  Yi Zou;  Zehao Ni;  Xun Lei;  Chi Zhang
Adobe PDF(1801Kb)  |  收藏  |  浏览/下载:20/6  |  提交时间:2024/06/27
Towards Automated Ultrasound Scanning Using Vision-Based Navigation: From Physician Skill Learning to Robotic Reproduction 会议论文
, 北京北人亦创国际会展中心, 2023.8.19
作者:  Hao Mingrui;  Zhang Pengcheng;  Chen Chen;  Wang Shuangyi
Adobe PDF(734Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/06/27
Born a BabyNet with Hierarchical Parental Supervision for End-to-End Text Image Machine Translation 会议论文
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italia, 20-25 May, 2024
作者:  Ma, Cong;  Zhang, Yaping;  Zhang, Zhiyang;  Liang, Yupu;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(891Kb)  |  收藏  |  浏览/下载:10/6  |  提交时间:2024/06/27
E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation 会议论文
Proceedings of the 17th Document Analysis and Recognition (ICDAR 2023), San José, California, USA, August 21-26, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1430Kb)  |  收藏  |  浏览/下载:11/3  |  提交时间:2024/06/26
Multi-teacher Knowledge Distillation for End-to-End Text Image Machine Translation 会议论文
Proceedings of the 17th Document Analysis and Recognition (ICDAR 2023), San José, California, USA, August 21-26, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1478Kb)  |  收藏  |  浏览/下载:16/9  |  提交时间:2024/06/26
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:14/7  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
CCIM: Cross-modal Cross-lingual Interactive Image Translation 会议论文
Findings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore, December 6-10, 2023
作者:  Ma, Cong;  Zhang, Yaping;  Tu, Mei;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(373Kb)  |  收藏  |  浏览/下载:15/6  |  提交时间:2024/06/26
Conditional Diffusion Guided by Part-level Latent for Dental Crown Point Cloud Generation 会议论文
, 昆明, 2024-3
作者:  Ao,Zhang;  Zhen,Shen;  Jian,Yang;  Qihang,Fang;  Gang,Xiong;  Xisong,Dong
Adobe PDF(9444Kb)  |  收藏  |  浏览/下载:18/6  |  提交时间:2024/06/25