已选(0)清除
条数/页: 排序方式: |
| CIF-Based Collaborative Decoding for End-to-End Contextual Speech Recognition 会议论文 , Toronto, Canada, 2021-06-06 作者: Minglun Han; Linhao Dong; Shiyu Zhou; Bo Xu Adobe PDF(469Kb)  |  收藏  |  浏览/下载:124/37  |  提交时间:2023/05/29 Contextual Speech Recognition Automatic Speech Recognition Context Biasing |
| Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire 期刊论文 Signal Processing Letters, 2022, 页码: 1551-1554 作者: Fan ZY(范志赟); Dong LH(董林昊); Cai M(蔡猛); Ma ZJ(马泽君); Xu B(徐波) Adobe PDF(404Kb)  |  收藏  |  浏览/下载:160/38  |  提交时间:2022/09/17 |
| A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments 会议论文 , Shanghai, China, October 25–29, 2020 作者: Yunzhe Hao; Jiaming Xu; Jing Shi; Peng Zhang; Lei Qin; Bo Xu Adobe PDF(399Kb)  |  收藏  |  浏览/下载:205/52  |  提交时间:2022/06/23 |
| WASE: LEARNING WHEN TO ATTEND FOR SPEAKER EXTRACTION IN COCKTAIL PARTY ENVIRONMENTS 会议论文 , Toronto, June 6-11, 2021 作者: Yunzhe Hao; Jiaming Xu; Peng Zhang; Bo Xu Adobe PDF(2034Kb)  |  收藏  |  浏览/下载:211/32  |  提交时间:2022/06/23 |
| Consecutive decoding for speech-to-text translation 会议论文 , Virtual, 2021-2 作者: Dong QQ(董倩倩); Mingxuan Wang(王明轩); Hao Zhou(周浩); Shuang Xu(徐爽); Bo Xu(徐波); Lei Li(李磊) Adobe PDF(586Kb)  |  收藏  |  浏览/下载:203/64  |  提交时间:2021/06/24 |
| Listen, understand and translate: triple supervision decouples end-to-endspeech-to-text translation 会议论文 , Virtual, 2021-2 作者: Dong QQ(董倩倩); Rong Ye(叶蓉); Mingxuan Wang(王明轩); Hao Zhou(周浩); Shuang Xu(徐爽); Bo Xu(徐波); Lei Li(李磊) Adobe PDF(991Kb)  |  收藏  |  浏览/下载:184/37  |  提交时间:2021/06/24 |
| Online Audio-Visual Speech Separation with Generative Adversarial Training 会议论文 0, 线上会议, 2021-4-23 作者: Zhang Peng; Xu Jiaming; Hao Yunzhe; Xu Bo Adobe PDF(532Kb)  |  收藏  |  浏览/下载:203/47  |  提交时间:2021/06/21 audio-visual speech separation online processing generative adversarial training causal temporal convolutional network |
| Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training 会议论文 0, 线上会议, 2021-7-18 作者: Zhang Peng; Xu Jiaming; Shi Jing; Hao Yunzhe; Qin Lei; Xu Bo Adobe PDF(1900Kb)  |  收藏  |  浏览/下载:207/52  |  提交时间:2021/06/21 audio-visual speech separation robust adversarial training method time-domain approach |
| CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition 会议论文 , 在线会议, 2020-05 作者: Dong, Linhao; Xu, Bo 浏览  |  Adobe PDF(641Kb)  |  收藏  |  浏览/下载:313/68  |  提交时间:2020/06/13 continuous integrate-and-fire end-to-end model soft and monotonic alignment online speech recognition acoustic boundary positioning |
| Self-Attention Aligner: A Latency-Control End-to-End Model for ASR using Self-attention Network and Chunk-hopping 会议论文 , Brighton, United Kingdom, 2019-05 作者: Dong, Linhao; Wang, Feng; Xu, Bo 浏览  |  Adobe PDF(930Kb)  |  收藏  |  浏览/下载:230/42  |  提交时间:2020/06/13 speech recognition self-attention network encoder-decoder end-to-end latency-control |