CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共22条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation 会议论文
, Dublin, Ireland, 2023-8-20
作者:  Minglun Han;  Feilong Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(563Kb)  |  收藏  |  浏览/下载:135/50  |  提交时间:2023/06/20
Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire 期刊论文
Signal Processing Letters, 2022, 页码: 1551-1554
作者:  Fan ZY(范志赟);  Dong LH(董林昊);  Cai M(蔡猛);  Ma ZJ(马泽君);  Xu B(徐波)
Adobe PDF(404Kb)  |  收藏  |  浏览/下载:159/38  |  提交时间:2022/09/17
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection 会议论文
, Singapore, Singapore, 2022.05
作者:  Minglun Han;  Linhao Dong;  Zhenlin Liang;  Meng Cai;  Shiyu Zhou;  Zejun Ma;  Bo Xu
Adobe PDF(463Kb)  |  收藏  |  浏览/下载:162/46  |  提交时间:2023/05/29
Automatic Speech Recognition  Context Biasing  Speech Recognition Customization  Continuous Integrate-and-Fire Mechanism  
TWO-STAGE PRE-TRAINING FOR SEQUENCE TO SEQUENCE SPEECH RECOGNITION 会议论文
, 线上会议, 2021-7-18
作者:  Fan ZY(范志赟);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(230Kb)  |  收藏  |  浏览/下载:162/44  |  提交时间:2022/09/17
pre-training  speech recognition  encoder-decoder  sequence-to-sequence  
Exploring wav2vec 2.0 on speaker verification and language identification 会议论文
, 线上会议, 2021-8-30
作者:  Fan ZY(范志赟);  Li M(李蒙);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(2081Kb)  |  收藏  |  浏览/下载:164/31  |  提交时间:2022/09/17
self-supervised  speaker verification  language identification  multi-task learning  wav2vec 2.0  
CIF-Based Collaborative Decoding for End-to-End Contextual Speech Recognition 会议论文
, Toronto, Canada, 2021-06-06
作者:  Minglun Han;  Linhao Dong;  Shiyu Zhou;  Bo Xu
Adobe PDF(469Kb)  |  收藏  |  浏览/下载:123/37  |  提交时间:2023/05/29
Contextual Speech Recognition  Automatic Speech Recognition  Context Biasing  
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training 会议论文
0, 线上会议, 2021-7-18
作者:  Zhang Peng;  Xu Jiaming;  Shi Jing;  Hao Yunzhe;  Qin Lei;  Xu Bo
Adobe PDF(1900Kb)  |  收藏  |  浏览/下载:204/51  |  提交时间:2021/06/21
audio-visual speech separation  robust  adversarial training method  time-domain approach  
Online Audio-Visual Speech Separation with Generative Adversarial Training 会议论文
0, 线上会议, 2021-4-23
作者:  Zhang Peng;  Xu Jiaming;  Hao Yunzhe;  Xu Bo
Adobe PDF(532Kb)  |  收藏  |  浏览/下载:199/47  |  提交时间:2021/06/21
audio-visual speech separation  online processing  generative adversarial training  causal temporal convolutional network  
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition 会议论文
, 在线会议, 2020-05
作者:  Dong, Linhao;  Xu, Bo
浏览  |  Adobe PDF(641Kb)  |  收藏  |  浏览/下载:307/67  |  提交时间:2020/06/13
continuous integrate-and-fire  end-to-end model  soft and monotonic alignment  online speech recognition  acoustic boundary positioning  
SPEAKER-AWARE SPEECH-TRANSFORMER 会议论文
, 新加坡, 2019-12-14
作者:  Fan ZY(范志赟);  Li J(李杰);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(361Kb)  |  收藏  |  浏览/下载:152/51  |  提交时间:2022/09/17
Speech-Transformer, speaker adaptation, end-to-end speech recognition, speaker aware training, i-vector