CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共30条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:236/0  |  提交时间:2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:361/47  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 1340-1351
作者:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Tian, Zhengkun;  Zhang, Shuai
收藏  |  浏览/下载:157/0  |  提交时间:2021/06/07
End-to-End  language modeling  speech recognition  teacher-student learning  transfer learning  
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 期号: 29, 页码: 1897 - 1911
作者:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(1163Kb)  |  收藏  |  浏览/下载:177/49  |  提交时间:2021/06/25
端到端语音识别、迁移学习、知识蒸馏、老师-学生学习、BERT、非自回归语音识别  
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning 会议论文
, Hong Kong, 24-27 Jan. 2021
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi;  Song, Leichao
Adobe PDF(934Kb)  |  收藏  |  浏览/下载:205/48  |  提交时间:2021/06/01
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(158Kb)  |  收藏  |  浏览/下载:179/53  |  提交时间:2021/06/01
Gated Recurrent Fusion of Spatial and Spectral Features for Multi-channel Speech Separation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(260Kb)  |  收藏  |  浏览/下载:161/44  |  提交时间:2021/06/01
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS 会议论文
, 网上虚拟会议, 2020-5
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Tao
浏览  |  Adobe PDF(154Kb)  |  收藏  |  浏览/下载:331/84  |  提交时间:2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
Synchronous Transformers for end-to-end Speech Recognition 会议论文
, Barcelona, Spain, 2020.05.04-2020.05.08
作者:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
浏览  |  Adobe PDF(496Kb)  |  收藏  |  浏览/下载:149/47  |  提交时间:2020/10/22
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 卷号: 28, 期号: 28, 页码: 1303-1314
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Adobe PDF(1344Kb)  |  收藏  |  浏览/下载:269/57  |  提交时间:2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training