CASIA OpenIR
(Note: the search results are based on claimed items)

Browse/Search Results:  1-10 of 34

BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in A Text-to-Speech Front-End (conference paper)
Hyderabad, 2-6 September 2018
Authors:  Yibin Zheng;  Jianhua Tao;  Zhengqi Wen;  Ya Li
Adobe PDF (642 KB)  |  View/Download: 79/24  |  Submit date: 2019/05/02
Keywords:  Prosodic Boundary Prediction;  BLSTM-CRF;  Attention;  Context Sensitive Embeddings;  End-to-end
On The Application and Compression of Deep Time Delay Neural Network for Embedded Statistical Parametric Speech Synthesis (conference paper)
Hyderabad, 2-6 September 2018
Authors:  Yibin Zheng;  Jianhua Tao;  Zhengqi Wen;  Ruibo Fu
Adobe PDF (531 KB)  |  View/Download: 34/10  |  Submit date: 2019/05/02
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin (journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 1039-1052
Authors:  Zheng, Yibin;  Li, Ya;  Wen, Zhengqi;  Liu, Bin;  Tao, Jianhua
Adobe PDF (1339 KB)  |  View/Download: 134/26  |  Submit date: 2018/01/04
Keywords:  Speech Synthesis;  Excitation Parameters;  Deep Neural Network Adaptation;  Exclamatory Speech;  Interrogative Speech
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning (journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 1025-1037
Authors:  Wen, Zhengqi;  Li, Kehuang;  Huang, Zhen;  Lee, Chin-Hui;  Tao, Jianhua
Adobe PDF (995 KB)  |  View/Download: 103/29  |  Submit date: 2018/01/05
Keywords:  DNN-based Speech Synthesis;  Vocoder;  Speech Parametrization;  BLSTM;  Phoneme Embedded Vector;  Multi-task Learning;  Pitch-scaled Spectrum
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition (journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 985-997
Authors:  Yi, Jiangyan;  Wen, Zhengqi;  Tao, Jianhua;  Ni, Hao;  Liu, Bin
Adobe PDF (1416 KB)  |  View/Download: 192/69  |  Submit date: 2018/01/04
Keywords:  Multi-accent;  Mandarin Speech Recognition;  LSTM-RNN-CTC;  Model Adaptation;  CTC Regularization
Reducing Tongue Shape Dimensionality from Hundreds of Available Resources Using Autoencoder (conference paper)
Beijing, 2018.08.20-2018.08.24
Authors:  Minghao Yang;  Dawei Zhang;  Jianhua Tao
Adobe PDF (658 KB)  |  View/Download: 41/3  |  Submit date: 2019/10/12
Keywords:  Vocal Tract;  Neural Network;  Tongue Shape;  PCA
Investigating Efficient Feature Representation Methods and Training Objective for BLSTM-Based Phone Duration Prediction (conference paper)
Stockholm, Sweden, August 20–24, 2017
Authors:  Zheng, Yibin;  Tao, Jianhua;  Wen, Zhengqi;  Li, Ya;  Liu, Bin
Adobe PDF (251 KB)  |  View/Download: 127/30  |  Submit date: 2018/01/04
A Domain Knowledge-Assisted Nonlinear Model for Head-Related Transfer Functions Based on Bottleneck Deep Neural Network (conference paper)
Stockholm, Sweden, August 20–24, 2017
Authors:  Xiaoke Qi;  Jianhua Tao
Adobe PDF (343 KB)  |  View/Download: 87/17  |  Submit date: 2017/09/22
An Initial Research: Towards Accurate Pitch Extraction for Speech Synthesis Based on BLSTM (conference paper)
Chengdu, China, 6-10 Nov. 2016
Authors:  Zheng, Yibin;  Wen, Zhengqi;  Liu, Bin;  Li, Ya;  Tao, Jianhua
Adobe PDF (823 KB)  |  View/Download: 75/4  |  Submit date: 2018/01/04
Keywords:  Pitch Extraction;  Voicing Decision;  BLSTM;  Log-frequency Power Spectrogram;  Speech Synthesis
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin (conference paper)
Tianjin, China, 17-20 Oct. 2016
Authors:  Zheng, Yibin;  Li, Ya;  Wen, Zhengqi;  Liu, Bin;  Tao, Jianhua
Adobe PDF (754 KB)  |  View/Download: 85/10  |  Submit date: 2018/01/04