CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共7条,第1-7条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation 会议论文
, Dublin, Ireland, 2023-8-20
作者:  Minglun Han;  Feilong Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(563Kb)  |  收藏  |  浏览/下载:167/64  |  提交时间:2023/06/20
IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文
, Singapore, 2022.5
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:221/95  |  提交时间:2023/06/07
Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文
, Lisboa, Portugal, October 10–14, 2022
作者:  Feilong Chen;  Duzhen Zhang;  Xiuyi Chen;  Jing Shi;  Shang Xu;  Bo Xu
Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:252/149  |  提交时间:2023/06/05
Train from scratch: Single-stage joint training of speech separation and recognition 期刊论文
COMPUTER SPEECH AND LANGUAGE, 2022, 卷号: 76, 页码: 15
作者:  Shi, Jing;  Chang, Xuankai;  Watanabe, Shinji;  Xu, Bo
收藏  |  浏览/下载:228/0  |  提交时间:2022/07/25
Cocktail party problem  Speech separation  Multi-speaker speech recognition  End-to-end  Joint-training  
Distilled Binary Neural Network for Monaural Speech Separation 会议论文
, Rio de Janeiro, Brazil, 2018-07
作者:  Chen, Xiuyi;  Liu, Guangcan;  Shi, Jing;  Xu, Jiaming;  Xu, Bo
Adobe PDF(1770Kb)  |  收藏  |  浏览/下载:195/50  |  提交时间:2022/06/27
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Yunzhe Hao;  Jiaming Xu;  Jing Shi;  Peng Zhang;  Lei Qin;  Bo Xu
Adobe PDF(399Kb)  |  收藏  |  浏览/下载:234/59  |  提交时间:2022/06/23
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training 会议论文
0, 线上会议, 2021-7-18
作者:  Zhang Peng;  Xu Jiaming;  Shi Jing;  Hao Yunzhe;  Qin Lei;  Xu Bo
Adobe PDF(1900Kb)  |  收藏  |  浏览/下载:234/61  |  提交时间:2021/06/21
audio-visual speech separation  robust  adversarial training method  time-domain approach