已选(0)清除
条数/页: 排序方式: |
| Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation 会议论文 , Dublin, Ireland, 2023-8-20 作者: Minglun Han; Feilong Chen; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(563Kb)  |  收藏  |  浏览/下载:135/50  |  提交时间:2023/06/20 |
| Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire 期刊论文 Signal Processing Letters, 2022, 页码: 1551-1554 作者: Fan ZY(范志赟); Dong LH(董林昊); Cai M(蔡猛); Ma ZJ(马泽君); Xu B(徐波) Adobe PDF(404Kb)  |  收藏  |  浏览/下载:159/38  |  提交时间:2022/09/17 |
| Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection 会议论文 , Singapore, Singapore, 2022.05 作者: Minglun Han; Linhao Dong; Zhenlin Liang; Meng Cai; Shiyu Zhou; Zejun Ma; Bo Xu Adobe PDF(463Kb)  |  收藏  |  浏览/下载:162/46  |  提交时间:2023/05/29 Automatic Speech Recognition Context Biasing Speech Recognition Customization Continuous Integrate-and-Fire Mechanism |
| TWO-STAGE PRE-TRAINING FOR SEQUENCE TO SEQUENCE SPEECH RECOGNITION 会议论文 , 线上会议, 2021-7-18 作者: Fan ZY(范志赟); Zhou SY(周世玉); Xu B(徐波) Adobe PDF(230Kb)  |  收藏  |  浏览/下载:162/44  |  提交时间:2022/09/17 pre-training speech recognition encoder-decoder sequence-to-sequence |
| Exploring wav2vec 2.0 on speaker verification and language identification 会议论文 , 线上会议, 2021-8-30 作者: Fan ZY(范志赟); Li M(李蒙); Zhou SY(周世玉); Xu B(徐波) Adobe PDF(2081Kb)  |  收藏  |  浏览/下载:164/31  |  提交时间:2022/09/17 self-supervised speaker verification language identification multi-task learning wav2vec 2.0 |
| CIF-Based Collaborative Decoding for End-to-End Contextual Speech Recognition 会议论文 , Toronto, Canada, 2021-06-06 作者: Minglun Han; Linhao Dong; Shiyu Zhou; Bo Xu Adobe PDF(469Kb)  |  收藏  |  浏览/下载:123/37  |  提交时间:2023/05/29 Contextual Speech Recognition Automatic Speech Recognition Context Biasing |
| Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training 会议论文 0, 线上会议, 2021-7-18 作者: Zhang Peng; Xu Jiaming; Shi Jing; Hao Yunzhe; Qin Lei; Xu Bo Adobe PDF(1900Kb)  |  收藏  |  浏览/下载:204/51  |  提交时间:2021/06/21 audio-visual speech separation robust adversarial training method time-domain approach |
| Online Audio-Visual Speech Separation with Generative Adversarial Training 会议论文 0, 线上会议, 2021-4-23 作者: Zhang Peng; Xu Jiaming; Hao Yunzhe; Xu Bo Adobe PDF(532Kb)  |  收藏  |  浏览/下载:199/47  |  提交时间:2021/06/21 audio-visual speech separation online processing generative adversarial training causal temporal convolutional network |
| CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition 会议论文 , 在线会议, 2020-05 作者: Dong, Linhao; Xu, Bo 浏览  |  Adobe PDF(641Kb)  |  收藏  |  浏览/下载:307/67  |  提交时间:2020/06/13 continuous integrate-and-fire end-to-end model soft and monotonic alignment online speech recognition acoustic boundary positioning |
| SPEAKER-AWARE SPEECH-TRANSFORMER 会议论文 , 新加坡, 2019-12-14 作者: Fan ZY(范志赟); Li J(李杰); Zhou SY(周世玉); Xu B(徐波) Adobe PDF(361Kb)  |  收藏  |  浏览/下载:152/51  |  提交时间:2022/09/17 Speech-Transformer, speaker adaptation, end-to-end speech recognition, speaker aware training, i-vector |