CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共6条,第1-6条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文
, Singapore, 2022.5
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:221/95  |  提交时间:2023/06/07
Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文
, Lisboa, Portugal, October 10–14, 2022
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:252/149  |  提交时间:2023/06/05
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection 会议论文
, Singapore, Singapore, 2022.05
作者:  Minglun Han;  Linhao Dong;  Zhenlin Liang;  Meng Cai;  Shiyu Zhou;  Zejun Ma;  Bo Xu
Adobe PDF(463Kb)  |  收藏  |  浏览/下载:194/55  |  提交时间:2023/05/29
Automatic Speech Recognition  Context Biasing  Speech Recognition Customization  Continuous Integrate-and-Fire Mechanism  
Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire 期刊论文
Signal Processing Letters, 2022, 页码: 1551-1554
作者:  Fan ZY(范志赟);  Dong LH(董林昊);  Cai M(蔡猛);  Ma ZJ(马泽君);  Xu B(徐波)
Adobe PDF(404Kb)  |  收藏  |  浏览/下载:177/41  |  提交时间:2022/09/17
Compressing Speaker Extraction Model with Ultra-low Precision Quantization and Knowledge Distillation 期刊论文
Neural Networks, 2022, 卷号: 154, 页码: 13-21
作者:  Yating Huang;  Yunzhe Hao;  Jiaming Xu;  Bo Xu
Adobe PDF(801Kb)  |  收藏  |  浏览/下载:216/56  |  提交时间:2022/09/17
Train from scratch: Single-stage joint training of speech separation and recognition 期刊论文
COMPUTER SPEECH AND LANGUAGE, 2022, 卷号: 76, 页码: 15
作者:  Shi, Jing;  Chang, Xuankai;  Watanabe, Shinji;  Xu, Bo
收藏  |  浏览/下载:228/0  |  提交时间:2022/07/25
Cocktail party problem  Speech separation  Multi-speaker speech recognition  End-to-end  Joint-training