已选(0)清除
条数/页: 排序方式: |
| CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文 IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254 作者: Wang, Tao ; Yi, Jiangyan ; Fu, Ruibo ; Tao, Jianhua ; Wen, Zhengqi![](/image/person.jpg)
![](/themes/default/image/downing1.png) 收藏  |  浏览/下载:229/0  |  提交时间:2022/09/19 Speech processing Decoding Predictive models Acoustics Transfer learning Training Task analysis Coarse-to-fine decoding mask prediction one-shot learning text-based speech editing text-to-speech |
| Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition 期刊论文 IEEE SIGNAL PROCESSING LETTERS, 2022, 页码: 762-766 作者: Zhengkun Tian ; Jiangyan Yi ; Jianhua Tao ; Shuai Zhang ; Zhengqi Wen![](/image/person.jpg)
Adobe PDF(934Kb)  |   收藏  |  浏览/下载:284/77  |  提交时间:2022/06/14 |
| FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization 会议论文 , Brno, Czechia, 30 August – 3 September 作者: Zhengkun Tian ; Jiangyan Yi ; Ye Bai ; Jianhua Tao ; Shuai Zhang ; Zhengqi Wen![](/image/person.jpg)
Adobe PDF(839Kb)  |   收藏  |  浏览/下载:206/48  |  提交时间:2022/06/14 |
| Self-Attention Transducers for End-to-End Speech Recognition 会议论文 , Graz, Austria, September 15–19, 2019 作者: Zhengkun Tian ; Jiangyan Yi ; Jianhua Tao ; Ye Bai ; Zhengqi Wen![](/image/person.jpg)
Adobe PDF(278Kb)  |   收藏  |  浏览/下载:114/43  |  提交时间:2022/06/14 |
| Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition 会议论文 , Shanghai, China, October 25–29, 2020 作者: Zhengkun Tian ; Jiangyan Yi ; Jianhua Tao ; Ye Bai ; Shuai Zhang ; Zhengqi Wen![](/image/person.jpg)
Adobe PDF(629Kb)  |   收藏  |  浏览/下载:172/56  |  提交时间:2022/06/14 |
| One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition 会议论文 , Tokyo, Japan, 14-17 December 2021 作者: Zhengkun Tian ; Jiangyan Yi ; Ye Bai ; Jianhua Tao ; Shuai Zhang ; Zhengqi Wen![](/image/person.jpg)
Adobe PDF(563Kb)  |   收藏  |  浏览/下载:182/42  |  提交时间:2022/06/14 |
| NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文 IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878 作者: Wang, Tao ; Fu, Ruibo ; Yi, Jiangyan ; Tao, Jianhua ; Wen, Zhengqi![](/image/person.jpg)
![](/themes/default/image/downing1.png) 收藏  |  浏览/下载:274/0  |  提交时间:2022/06/06 Vocoders Stochastic processes Neural networks Speech processing Signal to noise ratio Acoustics Speech enhancement Vocoder speech synthesis deterministic plus stochastic multiband excitation noise control |
| Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition 会议论文 , Graz, 2019 作者: Ye Bai ; Jiangyan Yi ; Jianhua Tao ; Zhengkun Tian ; Zhengqi Wen![](/image/person.jpg)
Adobe PDF(779Kb)  |   收藏  |  浏览/下载:87/11  |  提交时间:2021/06/25 |
| 基于随机时频掩蔽的 DNN-HMM 声学模型数据扩增 会议论文 , 青海西宁, 2019 作者: 白烨 ; 易江燕 ; 陶建华 ; 温正棋![](/image/person.jpg)
Adobe PDF(444Kb)  |   收藏  |  浏览/下载:173/38  |  提交时间:2021/06/25 |
| voice activity detection based on time-delay neural networks 会议论文 , Gansu, Lanzhou, 2019 作者: Ye Bai ; Jiangyan Yi ; Jianhua Tao ; Zhengqi Wen ; Bin Liu![](/image/person.jpg)
Adobe PDF(2438Kb)  |   收藏  |  浏览/下载:115/37  |  提交时间:2021/06/25 |