CASIA OpenIR
(Note: the search results are based on claimed items)

Browse/Search Results:  1-10 of 11

Deep imitator: handwriting calligraphy imitation via deep attention networks  Journal article
Pattern Recognition, 2019, Issue: accepted, Pages: accepted
Authors:  Zhao, Bocheng;  Tao, Jianhua;  Yang, Minghao;  Tian, Zhengkun;  Fan, Cunhang;  Bai, Ye
Adobe PDF(2498Kb)  |  View/Download:152/56  |  Submit date:2020/01/05
calligraphy imitation, attention, meta-style matrix, conditional Gated Recurrent Unit  
Research on automatic optimization methods for multilingual speech databases  Conference paper
, Xining, Qinghai, 2019-8
Authors:  傅睿博;  陶建华;  温正棋;  易江燕;  王诗明;  强春雨
Adobe PDF(542Kb)  |  View/Download:99/44  |  Submit date:2020/06/24
speech database optimization  speech synthesis  multilingual  data pair matching degree  
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation  Conference paper
, Brighton, UK, May 12-17, 2019
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
Adobe PDF(429Kb)  |  View/Download:33/6  |  Submit date:2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation  
Deep Learning Based Speech Separation via NMF-style Reconstructions  Journal article
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2018, Volume: 26, Issue: 11, Pages: 2043-2055
Authors:  Shuai Nie;  Shan Liang;  Wenju Liu;  Xueliang Zhang;  Jianhua Tao
Adobe PDF(2922Kb)  |  View/Download:26/14  |  Submit date:2020/10/22
Speech separation  deep neural network (DNN)  nonnegative matrix factorization (NMF)  spectro-temporal structures  
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition  Journal article
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 985-997
Authors:  Jiangyan Yi;  Zhengqi Wen;  Jianhua Tao;  Hao Ni;  Bin Liu
Adobe PDF(1416Kb)  |  View/Download:6/1  |  Submit date:2020/10/22
multi-accent, Mandarin speech recognition, LSTM-RNN-CTC, model adaptation, CTC regularization  
Reducing Tongue Shape Dimensionality from Hundreds of Available Resources Using Autoencoder  Conference paper
, Beijing, 2018.08.20-2018.08.24
Authors:  Minghao Yang;  Dawei Zhang;  Jianhua Tao
Adobe PDF(658Kb)  |  View/Download:94/15  |  Submit date:2019/10/12
Vocal Tract  Neural Network  Tongue Shape  PCA  
Hierarchical stress generation with Fujisaki model in expressive speech synthesis  Conference paper
Proceedings of the International Conference on Speech Prosody, Ireland, 2014
Authors:  Ya Li;  Jianhua Tao;  Keikichi Hirose;  Wei Lai;  Xiaoying Xu
Adobe PDF(146Kb)  |  View/Download:105/20  |  Submit date:2018/11/26
Prosody conversion from neutral speech to emotional speech  Journal article
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, Volume: 14, Issue: 4, Pages: 1145-1154
Authors:  Tao, JH;  Kang, YG;  Li, AJ
Adobe PDF(557Kb)  |  View/Download:164/81  |  Submit date:2015/11/07
Emotional Speech  Prosody Analysis  Speech Synthesis  
Affective computing: A review  Journal article
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, Volume: 3784, Issue: 0, Pages: 981-995
Authors:  Tao, JH;  Tan, TN;  Tao, J;  Picard, RW
Adobe PDF(214Kb)  |  View/Download:561/433  |  Submit date:2015/11/06
Acreview  
Dynamic mapping method based speech driven face animation system  Journal article
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, Volume: 3784, Issue: 0, Pages: 755-763
Authors:  Yin, PR;  Tao, JH;  Tao, J;  Picard, RW
Adobe PDF(1001Kb)  |  View/Download:51/6  |  Submit date:2015/11/06
Face animation