CASIA OpenIR

Browse/Search Results: 1-10 of 14

NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 865-878
Authors:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
View/Download: 79/0  |  Submit date: 2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 2241-2254
Authors:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
View/Download: 58/0  |  Submit date: 2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
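Research on Personalized Speech Synthesis Methods (个性化语音合成方法研究) (thesis)
University of Chinese Academy of Sciences, 2020
Authors:  Fu, Ruibo (傅睿博)
Adobe PDF (3985 Kb)  |  View/Download: 311/15  |  Submit date: 2020/06/21
speech synthesis  personalized customization  acoustic modeling  speaker feature space modeling  prosody modeling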
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS (conference paper)
Online virtual conference, 2020-5
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Tao
Adobe PDF (154 Kb)  |  View/Download: 234/70  |  Submit date: 2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
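Development and Challenges of Speech Forgery and Detection (语音伪造与鉴伪的发展与挑战) (journal article)
Journal of Cyber Security (信息安全学报), 2020, Volume: 5, Issue: 2, Pages: 28-38
Authors:  Tao, Jianhua (陶建华);  Fu, Ruibo (傅睿博);  Yi, Jiangyan (易江燕);  Wang, Chenglong (王成龙);  Wang, Tao (汪涛)
Adobe PDF (432 Kb)  |  View/Download: 378/75  |  Submit date: 2020/06/27
speech forgery  speech forgery detection  development and challenges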
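Research on Automatic Optimization Methods for Multilingual Speech Databases (多语言语音数据库自动优化方法研究) (conference paper)
Xining, Qinghai, 2019-8
Authors:  Fu, Ruibo (傅睿博);  Tao, Jianhua (陶建华);  Wen, Zhengqi (温正棋);  Yi, Jiangyan (易江燕);  Wang, Shiming (王诗明);  Qiang, Chunyu (强春雨)
Adobe PDF (542 Kb)  |  View/Download: 206/72  |  Submit date: 2020/06/24
speech database optimization  speech synthesis  multilingual  data pair matching degree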
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation (conference paper)
Brighton, UK, May 12-17, 2019
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
Adobe PDF (429 Kb)  |  View/Download: 124/45  |  Submit date: 2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation  
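Automatic Prosodic Boundary Labeling Based on Hierarchical Fusion of Content and Acoustic Features (基于内容和声学特征层级融合的自动韵律边界标注) (journal article)
Chinese Journal of Phonetics (中国语音学报), 2018, Issue: 10, Pages: 103-110
Authors:  Fu, Ruibo (傅睿博);  Tao, Jianhua (陶建华);  Wen, Zhengqi (温正棋)
Adobe PDF (1209 Kb)  |  View/Download: 169/51  |  Submit date: 2020/06/27
prosodic boundary labeling  hierarchical feature fusion  corpus construction  speech synthesis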
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis (conference paper)
Hyderabad, India, 2018-9
Authors:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
Adobe PDF (340 Kb)  |  View/Download: 101/14  |  Submit date: 2020/06/27
Deep Metric Learning for the Target Cost in Unit-Selection Speech Synthesizer (conference paper)
Hyderabad, India, 2018-9
Authors:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
Adobe PDF (323 Kb)  |  View/Download: 142/21  |  Submit date: 2020/06/27
speech synthesis  unit-selection  target cost  deep metric learning