CASIA OpenIR

Browse/Search results: 16 items in total, showing items 1-10

Pitch-Scaled Spectrum Based Excitation Model for HMM-based Speech Synthesis (Journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2014, Volume: 74, Issue: 3, Pages: 423-435
Authors:  Wen, Zhengqi;  Tao, Jianhua;  Pan, Shifeng;  Wang, Yang;  Zhengqi Wen
Views/Downloads: 18/0  |  Submitted: 2020/10/27
Speech Synthesis  HMM-based Speech Synthesis  Parametric Representation Of Speech  Excitation Model  Pitch-scaled Spectrum
Guest Editorial: Advances in Machine Learning for Speech Processing (Journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, Volume: 82, Issue: 2, Pages: 137-140
Authors:  Dong, Minghui;  Tao, Jianhua;  Mak, Man Wai
Views/Downloads: 13/0  |  Submitted: 2020/10/27
Speech Recognition  Speech Classification  
Investigating Effect of Rich Syntactic Features on Mandarin Prosodic Boundaries Prediction (Journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, Volume: 82, Issue: 2, Pages: 263-271
Authors:  Che, Hao;  Li, Ya;  Tao, Jianhua;  Wen, Zhengqi
Views/Downloads: 70/0  |  Submitted: 2020/10/27
Rich Syntactic Features  Prosodic Phrase Boundaries  Dependency Features  Syntactic Phrase Features  
Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement (Journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, Volume: 82, Issue: 2, Pages: 141-150
Authors:  Liu, Bin;  Tao, Jianhua;  Wen, Zhengqi;  Mo, Fuyuan;  Bin Liu
Views/Downloads: 50/0  |  Submitted: 2020/10/27
Analysis-synthesis Framework  Multi-band Summary Correlogram  Denoising Autoencoder  Speech Enhancement  Speech Coding  
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning (Journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 1025-1037
Authors:  Wen, Zhengqi;  Li, Kehuang;  Huang, Zhen;  Lee, Chin-Hui;  Tao, Jianhua;  Zhengqi Wen
Views/Downloads: 97/0  |  Submitted: 2020/10/27
DNN-based Speech Synthesis  Vocoder  Speech Parametrization  BLSTM  Phoneme Embedded Vector  Multi-task Learning  Pitch-scaled Spectrum
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin (Journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 1039-1052
Authors:  Zheng, Yibin;  Li, Ya;  Wen, Zhengqi;  Liu, Bin;  Tao, Jianhua;  Jianhua Tao
Views/Downloads: 112/0  |  Submitted: 2020/10/27
Speech Synthesis  Excitation Parameters  Deep Neural Network Adaptation  Exclamatory Speech  Interrogative Speech  
Simultaneous Estimation of Glottal Source Waveforms and Vocal Tract Shapes from Speech Signals Based on ARX-LF Model (Journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2020, Volume: 92, Issue: 8, Pages: 831-838
Authors:  Li, Yongwei;  Sakakibara, Ken-Ichi;  Akagi, Masato
Views/Downloads: 176/0  |  Submitted: 2020/08/03
Glottal source waveform  Vocal tract shape  ARX-LF model  
CLOSE: Coupled content-semantic embedding (Journal article)
SIGNAL IMAGE AND VIDEO PROCESSING, 2019, Volume: 13, Issue: 6, Pages: 1087-1095
Authors:  Ren, Junhong;  Zhang, Wensheng
Views/Downloads: 215/0  |  Submitted: 2019/12/16
Video captioning  Coupled content-semantic embedding  Multi-content embedding  
A Public Chinese Dataset for Language Model Adaptation (Journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, Pages: 13
Authors:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Fan, Cunhang
Views/Downloads: 274/0  |  Submitted: 2019/12/16
Chinese dataset  Language model adaptation  Speech recognition  N-gram  RNNLM  
Efficient underwater image and video enhancement based on Retinex (Journal article)
SIGNAL IMAGE AND VIDEO PROCESSING, 2019, Volume: 13, Issue: 5, Pages: 1011-1018
Authors:  Tang, Chong;  von Lukas, Uwe Freiherr;  Vahl, Matthias;  Wang, Shuo;  Wang, Yu;  Tan, Min
Adobe PDF (1159Kb)  |  Views/Downloads: 383/70  |  Submitted: 2019/09/30
Underwater video processing  Color correction  Contrast improvement  Retinex  MSRCP