CASIA OpenIR

浏览/检索结果: 共18条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, 卷号: 74, 期号: 3, 页码: 423-435
作者:  Wen, Zhengqi;  Tao, Jianhua;  Pan, Shifeng;  Wang, Yang;  Zhengqi Wen
收藏  |  浏览/下载:17/0  |  提交时间:2020/10/27
Speech Synthesis  Hmm-based Speech Synthesis  Parametric Representation Of Speech  Excitation Model  Pitch-scaled Spectrum  
Guest Editorial: Advances in Machine Learning for Speech Processing 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 卷号: 82, 期号: 2, 页码: 137-140
作者:  Dong, Minghui;  Tao, Jianhua;  Mak, Man Wai
收藏  |  浏览/下载:12/0  |  提交时间:2020/10/27
Speech Recognition  Speech Classification  
Investigating Effect of Rich Syntactic Features on Mandarin Prosodic Boundaries Prediction 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 卷号: 82, 期号: 2, 页码: 263-271
作者:  Che, Hao;  Li, Ya;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:69/0  |  提交时间:2020/10/27
Rich Syntactic Features  Prosodic Phrase Boundaries  Dependency Features  Syntactic Phrase Features  
Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 卷号: 82, 期号: 2, 页码: 141-150
作者:  Liu, Bin;  Tao, Jianhua;  Wen, Zhengqi;  Mo, Fuyuan;  Bin Liu
收藏  |  浏览/下载:48/0  |  提交时间:2020/10/27
Analysis-synthesis Framework  Multi-band Summary Correlogram  Denoising Autoencoder  Speech Enhancement  Speech Coding  
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 1025-1037
作者:  Wen, Zhengqi;  Li, Kehuang;  Huang, Zhen;  Lee, Chin-Hui;  Tao, Jianhua;  Zhengqi Wen
收藏  |  浏览/下载:95/0  |  提交时间:2020/10/27
Dnn-based Speech Synthesis  Vocoder  Speech Parametrization  Blstm  Phoneme Embedded Vector  Multi-task Learning  Pitch-scaled Spectrum  
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 1039-1052
作者:  Zheng, Yibin;  Li, Ya;  Wen, Zhengqi;  Liu, Bin;  Tao, Jianhua;  Jianhua Tao
收藏  |  浏览/下载:108/0  |  提交时间:2020/10/27
Speech Synthesis  Excitation Parameters  Deep Neural Network Adaptation  Exclamatory Speech  Interrogative Speech  
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 985-997
作者:  Jiangyan Yi;  Zhengqi Wen;  Jianhua Tao;  Hao Ni;  Bin Liu
浏览  |  Adobe PDF(1416Kb)  |  收藏  |  浏览/下载:133/50  |  提交时间:2020/10/22
multi-accent, Mandarin speech recognition,LSTM-RNN-CTC, model adaptation, CTC regularization  
Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2020, 卷号: 92, 期号: 8, 页码: 831-838
作者:  Li, Yongwei;  Sakakibara, Ken-Ichi;  Akagi, Masato
收藏  |  浏览/下载:170/0  |  提交时间:2020/08/03
Glottal source waveform  Vocal tract shape  ARX-LF model  
CLOSE: Coupled content-semantic embedding 期刊论文
SIGNAL IMAGE AND VIDEO PROCESSING, 2019, 卷号: 13, 期号: 6, 页码: 1087-1095
作者:  Ren, Junhong;  Zhang, Wensheng
收藏  |  浏览/下载:212/0  |  提交时间:2019/12/16
Video captioning  Coupled content-semantic embedding  Multi-content embedding  
A Public Chinese Dataset for Language Model Adaptation 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, 页码: 13
作者:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Fan, Cunhang
收藏  |  浏览/下载:271/0  |  提交时间:2019/12/16
Chinese dataset  Language model adaptation  Speech recognition  N-gram  RNNLM