CASIA OpenIR

浏览/检索结果: 共32条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:44/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 865-878
作者:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:242/0  |  提交时间:2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
Amplitude spectrum based excitation model for HMM-based speech synthesis 会议论文
Annual Conference of the International Speech Communication Association (INTERSPEECH), 美国, 2012
作者:  Wen, Zhengqi;  Tao, Jianhua;  Zhengqi Wen
收藏  |  浏览/下载:21/0  |  提交时间:2020/10/27
Speech Synthesis  Hmm-based Speech Synthesis  Excitation Model  Amplitude Spectrum  
GATING RECURRENT MIXTURE DENSITY NETWORKS FOR ACOUSTIC MODELING IN STATISTICAL PARAMETRIC SPEECH SYNTHESIS 会议论文
, Shanghai, China, 2016-3-21
作者:  Wang, Wenfu;  Xu, Shuang;  Xu, Bo
收藏  |  浏览/下载:55/0  |  提交时间:2020/10/27
Statistical Parametric Speech Synthesis  Gating Units  Gru  Gating Recurrent Mixture Density Network  
The Parameterized Phoneme Identity Feature as a Continuous Real-Valued Vector for Neural Network based Speech Synthesis 会议论文
INTERSPEECH, San Francisco,USA, Sep 8-12, 2016
作者:  Wen ZQ(温正棋);  Li Y(李雅);  Tao JH(陶建华);  Wen, Zhengqi
收藏  |  浏览/下载:75/0  |  提交时间:2020/10/27
Phoneme Embedded Vector  Word Embedding  Speech Synthesis  Blstm-rnn  
An Initial Research: Towards Accurate Pitch Extraction for Speech Synthesis Based on BLSTM 会议论文
, Chengdou, China, 6-10, Nov, 2016
作者:  Zheng, Yibin;  Wen, Zhengqi;  Liu, Bin;  Li, Ya;  Tao, Jianhua
收藏  |  浏览/下载:70/0  |  提交时间:2020/10/27
Itch Extraction  Voicing Decision  Blstm  Log-frequency Power Spectrogram  Speech Synthesis  
COMBINING UNIDIRECTIONAL LONG SHORT-TERM MEMORY WITH CONVOLUTIONAL OUTPUT LAYER FOR HIGH-PERFORMANCE SPEECH SYNTHESIS 会议论文
, New Orleans, USA, 2017-3-5
作者:  Wang, Wenfu;  Xu, Bo
收藏  |  浏览/下载:55/0  |  提交时间:2020/10/27
Statistical Parametric Speech Synthesis  Lstm  Convolutional Output Layer  High-performance  Trajectory Smoother  
Realistic Visual Speech Synthesis Based on Hybrid Concatenation Method 期刊论文
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 卷号: 17, 期号: 3;3, 页码: 469-477
作者:  Tao, Jianhua;  Xin, Le;  Yin, Panrong
收藏  |  浏览/下载:49/0  |  提交时间:2020/10/27
Fused Hidden Markov Model (Hmm)  Inversion  Speech-driven Facial Animation  Unit Concatenation  Visual Speech Synthesis  
Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 卷号: 74, 期号: 3, 页码: 423-435
作者:  Wen, Zhengqi;  Tao, Jianhua;  Pan, Shifeng;  Wang, Yang;  Zhengqi Wen
收藏  |  浏览/下载:17/0  |  提交时间:2020/10/27
Speech Synthesis  Hmm-based Speech Synthesis  Parametric Representation Of Speech  Excitation Model  Pitch-scaled Spectrum  
Introduction to the Issue on Statistical Parametric Speech Synthesis 期刊论文
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 卷号: 8, 期号: 2, 页码: 170-172
作者:  Tao, Jianhua;  Hirose, Keikichi;  Tokuda, Keiichi;  Black, Alan W.;  King, Simon
收藏  |  浏览/下载:19/0  |  提交时间:2020/10/27
Speech Synthesis