CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共17条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS 会议论文
, 网上虚拟会议, 2020-5
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Tao
浏览  |  Adobe PDF(154Kb)  |  收藏  |  浏览/下载:333/84  |  提交时间:2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition 会议论文
, shanghai, 2020
作者:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(801Kb)  |  收藏  |  浏览/下载:110/28  |  提交时间:2021/06/25
A Public Chinese Dataset for Language Model Adaptation 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 页码: 13
作者:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Fan, Cunhang
收藏  |  浏览/下载:266/0  |  提交时间:2019/12/16
Chinese dataset  Language model adaptation  Speech recognition  N-gram  RNNLM  
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation 会议论文
, Brighton,UK, MAY 12-17,2019
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
浏览  |  Adobe PDF(429Kb)  |  收藏  |  浏览/下载:213/74  |  提交时间:2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation  
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 卷号: 27, 期号: 3, 页码: 621-630
作者:  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Bai, Ye
浏览  |  Adobe PDF(907Kb)  |  收藏  |  浏览/下载:392/84  |  提交时间:2019/07/12
Adversarial training  transfer learning  cross-lingual  low-resource  speech recognition  
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition 会议论文
, Graz, 2019
作者:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen
Adobe PDF(779Kb)  |  收藏  |  浏览/下载:57/8  |  提交时间:2021/06/25
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis 会议论文
, 印度海得拉巴, 2018-9
作者:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
浏览  |  Adobe PDF(340Kb)  |  收藏  |  浏览/下载:229/46  |  提交时间:2020/06/27
Progressive Neural Networks based Features Prediction for the Target Cost in Unit-Selection Speech Synthesizer 会议论文
, 北京, 2018-8
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
浏览  |  Adobe PDF(1188Kb)  |  收藏  |  浏览/下载:197/61  |  提交时间:2020/06/27
speech synthesis  progressive neural networks  unit-selection  target cost  
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 1039-1052
作者:  Zheng, Yibin;  Li, Ya;  Wen, Zhengqi;  Liu, Bin;  Tao, Jianhua;  Jianhua Tao
收藏  |  浏览/下载:105/0  |  提交时间:2020/10/27
Speech Synthesis  Excitation Parameters  Deep Neural Network Adaptation  Exclamatory Speech  Interrogative Speech  
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 985-997
作者:  Jiangyan Yi;  Zhengqi Wen;  Jianhua Tao;  Hao Ni;  Bin Liu
浏览  |  Adobe PDF(1416Kb)  |  收藏  |  浏览/下载:132/50  |  提交时间:2020/10/22
multi-accent, Mandarin speech recognition,LSTM-RNN-CTC, model adaptation, CTC regularization