CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共3条,第1-3条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 9
作者:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
收藏  |  浏览/下载:106/0  |  提交时间:2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 1340-1351
作者:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Tian, Zhengkun;  Zhang, Shuai
收藏  |  浏览/下载:197/0  |  提交时间:2021/06/07
End-to-End  language modeling  speech recognition  teacher-student learning  transfer learning  
A Public Chinese Dataset for Language Model Adaptation 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 页码: 13
作者:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Fan, Cunhang
收藏  |  浏览/下载:295/0  |  提交时间:2019/12/16
Chinese dataset  Language model adaptation  Speech recognition  N-gram  RNNLM