CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共15条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
SceneFake: An initial dataset and benchmarks for scene fake audio detection 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 152, 页码: 12
作者:  Yi, Jiangyan;  Wang, Chenglong;  Tao, Jianhua;  Zhang, Chu Yuan;  Fan, Cunhang;  Tian, Zhengkun;  Ma, Haoxin;  Fu, Ruibo
收藏  |  浏览/下载:20/0  |  提交时间:2024/07/04
Scene manipulation  Fake audio detection  Speech enhancement  SceneFake dateset  
Emotion selectable end-to-end text-based speech editing 期刊论文
ARTIFICIAL INTELLIGENCE, 2024, 卷号: 329, 页码: 16
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zhang, Chu Yuan
收藏  |  浏览/下载:14/0  |  提交时间:2024/07/03
Emotion selectable  Text-based speech editing  Emotion decoupling  Mask prediction  Few-shot learning  Text-to-speech  
Pseudo Labels Regularization for Imbalanced Partial-Label Learning 会议论文
, 韩国首尔, 2024年4月14-19
作者:  Mingyu Xu;  Zheng Lian;  Bin Liu;  Zerui Chen;  Jianhua Tao
Adobe PDF(918Kb)  |  收藏  |  浏览/下载:65/26  |  提交时间:2024/05/31
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:248/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:289/0  |  提交时间:2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
The NLPR Speech Synthesis entry for Blizzard Challenge 2017 会议论文
, Stockholm, Sweden, 2017.8.25
作者:  Jianhua Tao;  Ruibo Fu;  Yibin Zheng;  Zhengqi Wen;  Ya Li;  Biu Liu
收藏  |  浏览/下载:93/0  |  提交时间:2020/10/27
基于内容和声学特征层级融合的自动韵律边界标注 期刊论文
中国语音学报, 2018, 期号: 10, 页码: 103-110
作者:  傅睿博;  陶建华;  温正棋
Adobe PDF(1209Kb)  |  收藏  |  浏览/下载:318/103  |  提交时间:2020/06/27
韵律边界标注  特征层级融合  语料库构建  语音合成  
Progressive Neural Networks based Features Prediction for the Target Cost in Unit-Selection Speech Synthesizer 会议论文
, 北京, 2018-8
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
Adobe PDF(1188Kb)  |  收藏  |  浏览/下载:245/73  |  提交时间:2020/06/27
speech synthesis  progressive neural networks  unit-selection  target cost  
基于静音时长和文本特征融合的韵律边界自动标注 会议论文
, 江苏连云港, 2017-10
作者:  傅睿博;  李雅;  温正棋;  陶建华
浏览  |  Adobe PDF(877Kb)  |  收藏  |  浏览/下载:268/98  |  提交时间:2020/06/27
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis 会议论文
, 印度海得拉巴, 2018-9
作者:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
浏览  |  Adobe PDF(340Kb)  |  收藏  |  浏览/下载:272/53  |  提交时间:2020/06/27