已选(0)清除
条数/页: 排序方式: |
| Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments 期刊论文 IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 151-163 作者: Xu, Jiaming; Cui, Jian; Hao, Yunzhe; Xu, Bo 收藏  |  浏览/下载:22/0  |  提交时间:2024/02/22 Cocktail party problem target speaker separation multi-cue guided separation semi-supervised learning |
| Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition 会议论文 , Washington D.C., USA, 2023-2-9 作者: Qingyu Wang; Tielin Zhang; Minglun Han; Yi Wang; Duzhen Zhang; Bo Xu Adobe PDF(1714Kb)  |  收藏  |  浏览/下载:112/40  |  提交时间:2023/06/20 |
| Knowledge Transfer from Pre-Trained Language Models to CIF-Based Speech Recognizers via Hierarchical Distillation 会议论文 , Dublin, Ireland, 2023-8-20 作者: Minglun Han; Feilong Chen; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(563Kb)  |  收藏  |  浏览/下载:117/45  |  提交时间:2023/06/20 |
| VLP: A Survey on Vision-language Pre-training 期刊论文 Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56 作者: Feilong Chen; Duzhen Zhang; Minglun Han; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(969Kb)  |  收藏  |  浏览/下载:100/25  |  提交时间:2023/06/21 |
| Train from scratch: Single-stage joint training of speech separation and recognition 期刊论文 COMPUTER SPEECH AND LANGUAGE, 2022, 卷号: 76, 页码: 15 作者: Shi, Jing; Chang, Xuankai; Watanabe, Shinji; Xu, Bo 收藏  |  浏览/下载:170/0  |  提交时间:2022/07/25 Cocktail party problem Speech separation Multi-speaker speech recognition End-to-end Joint-training |
| Compressing Speaker Extraction Model with Ultra-low Precision Quantization and Knowledge Distillation 期刊论文 Neural Networks, 2022, 卷号: 154, 页码: 13-21 作者: Yating Huang; Yunzhe Hao; Jiaming Xu; Bo Xu Adobe PDF(801Kb)  |  收藏  |  浏览/下载:167/43  |  提交时间:2022/09/17 |
| Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire 期刊论文 Signal Processing Letters, 2022, 页码: 1551-1554 作者: Fan ZY(范志赟); Dong LH(董林昊); Cai M(蔡猛); Ma ZJ(马泽君); Xu B(徐波) Adobe PDF(404Kb)  |  收藏  |  浏览/下载:146/35  |  提交时间:2022/09/17 |
| Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog 会议论文 , Lisboa, Portugal, October 10–14, 2022 作者: Feilong Chen; Duzhen Zhang; Xiuyi Chen; Jing Shi; Shang Xu; Bo Xu Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:212/138  |  提交时间:2023/06/05 |
| Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection 会议论文 , Singapore, Singapore, 2022.05 作者: Minglun Han; Linhao Dong; Zhenlin Liang; Meng Cai; Shiyu Zhou; Zejun Ma; Bo Xu Adobe PDF(463Kb)  |  收藏  |  浏览/下载:131/41  |  提交时间:2023/05/29 Automatic Speech Recognition Context Biasing Speech Recognition Customization Continuous Integrate-and-Fire Mechanism |
| IMPROVING CROSS-MODAL UNDERSTANDING IN VISUAL DIALOG VIA CONTRASTIVE LEARNING 会议论文 , Singapore, 2022.5 作者: Feilong Chen; Duzhen Zhang; Xiuyi Chen; Jing Shi; Shuang Xu; Bo Xu Adobe PDF(9035Kb)  |  收藏  |  浏览/下载:137/64  |  提交时间:2023/06/07 |