CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共17条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:41/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:237/0  |  提交时间:2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS 会议论文
, 网上虚拟会议, 2020-5
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Tao
浏览  |  Adobe PDF(154Kb)  |  收藏  |  浏览/下载:333/84  |  提交时间:2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation 会议论文
, Brighton,UK, MAY 12-17,2019
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
浏览  |  Adobe PDF(429Kb)  |  收藏  |  浏览/下载:213/74  |  提交时间:2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation  
Deep Metric Learning for the Target Cost in Unit-Selection Speech Synthesizer 会议论文
, 印度海得拉巴, 2018-9
作者:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
Adobe PDF(323Kb)  |  收藏  |  浏览/下载:255/57  |  提交时间:2020/06/27
speech synthesis  unit-selection  target cost  deep metric learning  
Progressive Neural Networks based Features Prediction for the Target Cost in Unit-Selection Speech Synthesizer 会议论文
, 北京, 2018-8
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
Adobe PDF(1188Kb)  |  收藏  |  浏览/下载:197/61  |  提交时间:2020/06/27
speech synthesis  progressive neural networks  unit-selection  target cost  
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 1039-1052
作者:  Zheng, Yibin;  Li, Ya;  Wen, Zhengqi;  Liu, Bin;  Tao, Jianhua;  Jianhua Tao
收藏  |  浏览/下载:105/0  |  提交时间:2020/10/27
Speech Synthesis  Excitation Parameters  Deep Neural Network Adaptation  Exclamatory Speech  Interrogative Speech  
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 1025-1037
作者:  Wen, Zhengqi;  Li, Kehuang;  Huang, Zhen;  Lee, Chin-Hui;  Tao, Jianhua;  Zhengqi Wen
收藏  |  浏览/下载:93/0  |  提交时间:2020/10/27
Dnn-based Speech Synthesis  Vocoder  Speech Parametrization  Blstm  Phoneme Embedded Vector  Multi-task Learning  Pitch-scaled Spectrum  
An Initial Research: Towards Accurate Pitch Extraction for Speech Synthesis Based on BLSTM 会议论文
, Chengdou, China, 6-10, Nov, 2016
作者:  Zheng, Yibin;  Wen, Zhengqi;  Liu, Bin;  Li, Ya;  Tao, Jianhua
收藏  |  浏览/下载:69/0  |  提交时间:2020/10/27
Itch Extraction  Voicing Decision  Blstm  Log-frequency Power Spectrogram  Speech Synthesis  
The Parameterized Phoneme Identity Feature as a Continuous Real-Valued Vector for Neural Network based Speech Synthesis 会议论文
INTERSPEECH, San Francisco,USA, Sep 8-12, 2016
作者:  Wen ZQ(温正棋);  Li Y(李雅);  Tao JH(陶建华);  Wen, Zhengqi
收藏  |  浏览/下载:73/0  |  提交时间:2020/10/27
Phoneme Embedded Vector  Word Embedding  Speech Synthesis  Blstm-rnn