CASIA OpenIR

浏览/检索结果: 共31条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:43/20  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
UniGen: Unified Generative Pre-training for Multilingual Multimodal Representation 会议论文
, Waseda University, Tokyo, Japan, 2024.03.15-2024.03.18
作者:  Zheyuan, Tian;  Guan, Luo;  Bo, Wang;  Bing, Li;  Weiming, Hu
Adobe PDF(975Kb)  |  收藏  |  浏览/下载:75/19  |  提交时间:2024/05/31
DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 4, 页码: 2041-2055
作者:  Chen, Yuxin;  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Wang, Jie;  Shan, Ying;  Li, Bing;  Hu, Weiming;  Qie, Xiaohu;  Wu, Jianping
Adobe PDF(13765Kb)  |  收藏  |  浏览/下载:58/5  |  提交时间:2024/05/30
Chinese video captioning evaluation  dual-reconstruction transformer  
Semantic Policy Network for Zero-Shot Object Goal Visual Navigation 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 卷号: 8, 期号: 11, 页码: 7655-7662
作者:  Zhao, Qianfan;  Zhang, Lu;  He, Bin;  Liu, Zhiyong
Adobe PDF(1888Kb)  |  收藏  |  浏览/下载:159/22  |  提交时间:2023/12/21
Deep learning  path planning  reinforcement learning  vision-based navigation  
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning 会议论文
, Kigali City, Rwanda, Africa, 2023-5-5
作者:  Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao,Zhang;  Dongbin, Zhao;  Yuzheng, Zhuang;  Ping, Luo;  Bin, Wang;  Jianye, Hao
Adobe PDF(3492Kb)  |  收藏  |  浏览/下载:183/47  |  提交时间:2023/06/29
标注受限视频人体行为理解模型与算法研究 学位论文
, 2023
作者:  李定
Adobe PDF(8391Kb)  |  收藏  |  浏览/下载:192/8  |  提交时间:2023/06/28
标注受限  人体行为理解  主动学习  视频片段检索  自监督学习  
基于深度强化学习的超车换道决策方法 学位论文
, 2023
作者:  王俊杰
Adobe PDF(17475Kb)  |  收藏  |  浏览/下载:200/3  |  提交时间:2023/06/26
深度强化学习,自动驾驶,换道决策,基于模型值扩展,动力学泛化  
Weakly-Supervised Video Object Grounding via Stable Context Learning 会议论文
, New York, USA, 2021-10-20
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(2062Kb)  |  收藏  |  浏览/下载:61/28  |  提交时间:2023/04/25
Many Hands Make Light Work: Transferring Knowledge from Auxiliary Tasks for Video-Text Retrieval 期刊论文
IEEE Transactions on Multimedia, 2022, 页码: 1-15
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(3679Kb)  |  收藏  |  浏览/下载:154/35  |  提交时间:2023/04/25
Weakly-supervised video object grounding via causal intervention 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 卷号: 45, 期号: 3, 页码: 3933 - 3948
作者:  Wang, Wei;  Gao, Junyu;  Xu, Changsheng
Adobe PDF(4558Kb)  |  收藏  |  浏览/下载:156/64  |  提交时间:2023/04/25