CASIA OpenIR

浏览/检索结果: 共109条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 页码: 1913-1931
作者:  Zhang Xi(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(4719Kb)  |  收藏  |  浏览/下载:26/6  |  提交时间:2024/07/08
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:31/16  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval? 会议论文
, 美国西雅图, 2024-6
作者:  chen yuxin;  ma zongyang;  zhang ziqi;  qi zhongang;  yuan chunfeng;  li bing;  pu junfu;  shan ying;  qi xiaojuan;  hu weiming
Adobe PDF(1070Kb)  |  收藏  |  浏览/下载:43/11  |  提交时间:2024/06/25
Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video 会议论文
, Vienna Austria, May 7th, 2024 to May 11th, 2024
作者:  Jiang, Yanqin;  Zhang, Li;  Gao, Jin;  Hu, Weiming;  Yao, Yao
Adobe PDF(3186Kb)  |  收藏  |  浏览/下载:32/9  |  提交时间:2024/06/21
Part-aware Prompt Tuning For Weakly Supervised Referring Expression Grounding 会议论文
, Amsterdam, 2024-1-29
作者:  Chenlin, Zhao;  Jiabo, Ye;  Yaguang, Song;  Ming, Yan;  Xiaoshan, Yang;  Changsheng, Xu
Adobe PDF(6114Kb)  |  收藏  |  浏览/下载:35/11  |  提交时间:2024/06/21
Learning to Understand Traffic Signs 会议论文
, 四川成都, 2021年10月20日-24日
作者:  Guo, Yunfei;  Feng, Wei;  Yin, Fei;  Xue, Tao;  Mei, Shuqi;  Liu, Cheng-Lin
Adobe PDF(3271Kb)  |  收藏  |  浏览/下载:50/21  |  提交时间:2024/06/13
traffic sign understanding  semantic description  multi-task learning  
Health and Senior Care Video Moment Localization With Procedure Knowledge Distillation 会议论文
, Istanbul, Turkiye, Dec 5-8
作者:  Chaochen Wu;  Meiyun Zuo;  Guan Luo;  Yuna Jiang
Adobe PDF(3140Kb)  |  收藏  |  浏览/下载:43/18  |  提交时间:2024/06/05
UniGen: Unified Generative Pre-training for Multilingual Multimodal Representation 会议论文
, Waseda University, Tokyo, Japan, 2024.03.15-2024.03.18
作者:  Zheyuan, Tian;  Guan, Luo;  Bo, Wang;  Bing, Li;  Weiming, Hu
Adobe PDF(975Kb)  |  收藏  |  浏览/下载:63/16  |  提交时间:2024/05/31
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:45/1  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Cross-Modal Prototype Learning for Zero-Shot Handwritten Character Recognition 期刊论文
Pattern Recognition, 2022, 卷号: 131, 页码: 108859
作者:  Ao, Xiang;  Zhang, Xu-Yao;  Liu, Cheng-Lin
Adobe PDF(3111Kb)  |  收藏  |  浏览/下载:58/25  |  提交时间:2024/05/30