CASIA OpenIR

Browse/Search Results:  1-10 of 62

FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS Conference paper
, Online virtual conference, 2020-5
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Tao
Submit date: 2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features Journal article
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, Volume: 28, Pages: 1303-1314
Authors:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Submit date: 2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training  
Synchronous Transformers for end-to-end Speech Recognition Conference paper
, Barcelona, Spain, 2020.05.04-2020.05.08
Authors:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Submit date: 2020/10/22
Forward-Backward Decoding Sequence for Regularizing End-to-End TTS Journal article
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, Volume: 27, Issue: 12, Pages: 2067-2079
Authors:  Zheng, Yibin;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan
Submit date: 2020/03/30
Decoding  Training  Speech processing  Linguistics  Acoustics  Speech recognition  Forward-backward  regularization  encoder-decoder with attention  end-to-end  joint-training  TTS  
A Public Chinese Dataset for Language Model Adaptation Journal article
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2019, Pages: 13
Authors:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Fan, Cunhang
Submit date: 2019/12/16
Chinese dataset  Language model adaptation  Speech recognition  N-gram  RNNLM  
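Research on Automatic Optimization Methods for Multilingual Speech Databases (多语言语音数据库自动优化方法研究) Conference paper
, Xining, Qinghai, 2019-8
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Shiming;  Qiang, Chunyu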
Submit date: 2020/06/24
speech database optimization  speech synthesis  multilingual  data-pair matching degree
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation Conference paper
, Brighton, UK, May 12-17, 2019
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
Submit date: 2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation  
Language-Adversarial Transfer Learning for Low-Resource Speech Recognition Journal article
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, Volume: 27, Issue: 3, Pages: 621-630
Authors:  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Bai, Ye
Submit date: 2019/07/12
Adversarial training  transfer learning  cross-lingual  low-resource  speech recognition  
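Automatic Prosodic Boundary Labeling Based on Hierarchical Fusion of Content and Acoustic Features (基于内容和声学特征层级融合的自动韵律边界标注) Journal article
Chinese Journal of Phonetics (中国语音学报), 2018, Issue: 10, Pages: 103-110
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi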
Submit date: 2020/06/27
prosodic boundary labeling  hierarchical feature fusion  corpus construction  speech synthesis
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis Conference paper
, Hyderabad, India, 2018-9
Authors:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
Submit date: 2020/06/27