CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共3条,第1-3条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:40/12  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:163/32  |  提交时间:2023/06/21
Speech-Transformer: A No-Recurrence Sequence-to-Sequence Model for Speech Recognition 会议论文
, Calgary, Canada, 2018-04
作者:  Dong, Linhao;  Xu, Shuang;  Xu, Bo
浏览  |  Adobe PDF(640Kb)  |  收藏  |  浏览/下载:850/489  |  提交时间:2020/06/13
speech recognition  sequence-to-sequence  attention  transformer