| CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing. Journal article. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 2241-2254. Authors: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi  |  Views/Downloads: 220/0  |  Submitted: 2022/09/19. Keywords: Speech processing; Decoding; Predictive models; Acoustics; Transfer learning; Training; Task analysis; Coarse-to-fine decoding; mask prediction; one-shot learning; text-based speech editing; text-to-speech |
| Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition. Journal article. IEEE SIGNAL PROCESSING LETTERS, 2022, Pages: 762-766. Authors: Zhengkun Tian; Jiangyan Yi; Jianhua Tao; Shuai Zhang; Zhengqi Wen. Adobe PDF (934 KB)  |  Views/Downloads: 279/77  |  Submitted: 2022/06/14 |
| FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization. Conference paper. Brno, Czechia, 30 August – 3 September. Authors: Zhengkun Tian; Jiangyan Yi; Ye Bai; Jianhua Tao; Shuai Zhang; Zhengqi Wen. Adobe PDF (839 KB)  |  Views/Downloads: 198/45  |  Submitted: 2022/06/14 |
| Self-Attention Transducers for End-to-End Speech Recognition. Conference paper. Graz, Austria, September 15–19, 2019. Authors: Zhengkun Tian; Jiangyan Yi; Jianhua Tao; Ye Bai; Zhengqi Wen. Adobe PDF (278 KB)  |  Views/Downloads: 105/40  |  Submitted: 2022/06/14 |
| Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition. Conference paper. Shanghai, China, October 25–29, 2020. Authors: Zhengkun Tian; Jiangyan Yi; Jianhua Tao; Ye Bai; Shuai Zhang; Zhengqi Wen. Adobe PDF (629 KB)  |  Views/Downloads: 167/54  |  Submitted: 2022/06/14 |
| One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition. Conference paper. Tokyo, Japan, 14-17 December 2021. Authors: Zhengkun Tian; Jiangyan Yi; Ye Bai; Jianhua Tao; Shuai Zhang; Zhengqi Wen. Adobe PDF (563 KB)  |  Views/Downloads: 179/42  |  Submitted: 2022/06/14 |
| NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation. Journal article. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 865-878. Authors: Wang, Tao; Fu, Ruibo; Yi, Jiangyan; Tao, Jianhua; Wen, Zhengqi  |  Views/Downloads: 267/0  |  Submitted: 2022/06/06. Keywords: Vocoders; Stochastic processes; Neural networks; Speech processing; Signal to noise ratio; Acoustics; Speech enhancement; Vocoder; speech synthesis; deterministic plus stochastic; multiband excitation; noise control |
| Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition. Conference paper. Graz, 2019. Authors: Ye Bai; Jiangyan Yi; Jianhua Tao; Zhengkun Tian; Zhengqi Wen. Adobe PDF (779 KB)  |  Views/Downloads: 73/9  |  Submitted: 2021/06/25 |
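| Data Augmentation for DNN-HMM Acoustic Models Based on Random Time-Frequency Masking (original title: 基于随机时频掩蔽的 DNN-HMM 声学模型数据扩增). Conference paper. Xining, Qinghai, 2019. Authors: Ye Bai; Jiangyan Yi; Jianhua Tao; Zhengqi Wen. Adobe PDF (444 KB)  |  Views/Downloads: 167/36  |  Submitted: 2021/06/25 |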
| A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting. Conference paper. Graz, 2019. Authors: Ye Bai; Jiangyan Yi; Zhengqi Wen; Zhengkun Tian; Chenghao Zhao; Cunhang Fan. Adobe PDF (290 KB)  |  Views/Downloads: 159/57  |  Submitted: 2021/06/25 |