CASIA OpenIR  > 智能交互
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis
Ruibo Fu1,2; Jianhua Tao1,2,3; Yibin Zheng1,2; Zhengqi Wen1
2018-09
Conference Nameinterspeech2018
Conference Date2018-9
Conference Place印度海得拉巴
PublisherISCA
Abstract

The fundamental frequency and the spectrum of the speech are related thus one of their learned mapping from the linguistic features can be leveraged for another. The conventional methods treat all the acoustic features as one stream for acoustic modeling. And the multi-task learning methods are applied to train the model simultaneously with serval targets and a combined cost. To improve the accuracy of the acoustic model, the progressive deep neural networks (PDNN) is applied for acoustic modeling in statistical parametric speech synthesis (SPSS) in our method. Each type of the acoustic features is modeled in different networks with its own cost function and the knowledge transfers through lateral connections. Each networks in the progressive networks can be trained to reach its own optimal step by step. Experiments are conducted to compare the PDNN based SPSS and the DNN based SPSS. The DNN and PDNN with the multi-task learning (MTL) method for acoustic modeling are also the tested. The computational complexity, prediction sequences and quantity of hierarchies of the PDNN are investigated. Both objective and subjective experimental results demonstrate the effectiveness of the proposed technique.

Indexed ByEI
Language英语
Document Type会议论文
Identifierhttp://ir.ia.ac.cn/handle/173211/39598
Collection智能交互
Corresponding AuthorRuibo Fu
Affiliation1.National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
2.CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation, Chinese Academy of Sciences
3.School of Artificial Intelligence, University of Chinese Academy of Sciences
First Author AffilicationChinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;  Institute of Automation, Chinese Academy of Sciences
Corresponding Author AffilicationChinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;  Institute of Automation, Chinese Academy of Sciences
Recommended Citation
GB/T 7714
Ruibo Fu,Jianhua Tao,Yibin Zheng,et al. Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis[C]:ISCA,2018.
Files in This Item: Download All
File Name/Size DocType Version Access License
INTERSPEECH2018_1265(340KB)会议论文 开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Ruibo Fu]'s Articles
[Jianhua Tao]'s Articles
[Yibin Zheng]'s Articles
Baidu academic
Similar articles in Baidu academic
[Ruibo Fu]'s Articles
[Jianhua Tao]'s Articles
[Yibin Zheng]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Ruibo Fu]'s Articles
[Jianhua Tao]'s Articles
[Yibin Zheng]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: INTERSPEECH2018_1265(语音顶会).pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.