CASIA OpenIR

浏览/检索结果: 共6条,第1-6条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:112/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:228/0  |  提交时间:2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:182/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
语音伪造与鉴伪的发展与挑战 期刊论文
信息安全学报, 2020, 卷号: 5, 期号: 2, 页码: 28-38
作者:  陶建华;  傅睿博;  易江燕;  王成龙;  汪涛
浏览  |  Adobe PDF(432Kb)  |  收藏  |  浏览/下载:580/104  |  提交时间:2020/06/27
语音伪造  语音鉴伪  发展与挑战  
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 1025-1037
作者:  Wen, Zhengqi;  Li, Kehuang;  Huang, Zhen;  Lee, Chin-Hui;  Tao, Jianhua;  Zhengqi Wen
收藏  |  浏览/下载:87/0  |  提交时间:2020/10/27
Dnn-based Speech Synthesis  Vocoder  Speech Parametrization  Blstm  Phoneme Embedded Vector  Multi-task Learning  Pitch-scaled Spectrum  
Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 卷号: 74, 期号: 3, 页码: 423-435
作者:  Wen, Zhengqi;  Tao, Jianhua;  Pan, Shifeng;  Wang, Yang;  Zhengqi Wen
收藏  |  浏览/下载:16/0  |  提交时间:2020/10/27
Speech Synthesis  Hmm-based Speech Synthesis  Parametric Representation Of Speech  Excitation Model  Pitch-scaled Spectrum