CASIA OpenIR

Browse/Search Results: 1-10 of 129

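Methods and Applications of Image Aesthetic Quality Assessment [Thesis]
Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2019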
Authors:  盛柯恺
Adobe PDF(18854Kb)  |  View/Download:62/2  |  Submit date:2019/06/18
Image Aesthetic Quality Assessment  Attention Mechanism  Self-supervised Learning  Regularization Strategy  Learning to Rank  Deep Learning
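Research on Deep Learning Based Speech Synthesis Methods [Thesis]
Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2019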
Authors:  郑艺斌
Adobe PDF(8630Kb)  |  View/Download:32/1  |  Submit date:2019/06/18
Speech Synthesis  Deep Learning  Prosody Modeling  End-to-end Acoustic Modeling  Multi-style Modeling
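Deep Neural Network Architectures: From Manual Design to Automatic Learning [Thesis]
Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2019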
Authors:  钟钊
Adobe PDF(8590Kb)  |  View/Download:2770/27  |  Submit date:2019/06/17
Deep Neural Network  Deep Learning  Network Architecture Search  Reinforcement Learning  Machine Learning
On The Application and Compression of Deep Time Delay Neural Network for Embedded Statistical Parametric Speech Synthesis [Conference paper]
Hyderabad, 2-6 September 2018
Authors:  Yibin Zheng;  Jianhua Tao;  Zhengqi Wen;  Ruibo Fu
Adobe PDF(531Kb)  |  View/Download:26/9  |  Submit date:2019/05/02
Listen, Think and Listen Again: Capturing Top-down Auditory Attention for Speaker-independent Speech Separation [Conference paper]
Stockholm, Sweden, 2018/07/13-2018/07/19
Authors:  Shi, Jing;  Xu, Jiaming;  Liu, Guangcan;  Xu, Bo
Adobe PDF(1145Kb)  |  View/Download:103/41  |  Submit date:2018/10/09
Investigating Deep Neural Network Adaptation for Generating Exclamatory and Interrogative Speech in Mandarin [Journal article]
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 1039-1052
Authors:  Zheng, Yibin;  Li, Ya;  Wen, Zhengqi;  Liu, Bin;  Tao, Jianhua
Adobe PDF(1339Kb)  |  View/Download:114/25  |  Submit date:2018/01/04
Speech Synthesis  Excitation Parameters  Deep Neural Network Adaptation  Exclamatory Speech  Interrogative Speech  
Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning [Journal article]
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 1025-1037
Authors:  Wen, Zhengqi;  Li, Kehuang;  Huang, Zhen;  Lee, Chin-Hui;  Tao, Jianhua
Adobe PDF(995Kb)  |  View/Download:81/24  |  Submit date:2018/01/05
DNN-based Speech Synthesis  Vocoder  Speech Parametrization  BLSTM  Phoneme Embedded Vector  Multi-task Learning  Pitch-scaled Spectrum
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition [Journal article]
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 985-997
Authors:  Yi, Jiangyan;  Wen, Zhengqi;  Tao, Jianhua;  Ni, Hao;  Liu, Bin
Adobe PDF(1416Kb)  |  View/Download:145/42  |  Submit date:2018/01/04
Multi-accent  Mandarin Speech Recognition  LSTM-RNN-CTC  Model Adaptation  CTC Regularization
Restricted-access item [Thesis]
Authors:  易江燕
Adobe PDF(2091Kb)  |  View/Download:13/2  |  Submit date:2018/05/31
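Research on Speech Enhancement Methods with Deep Awareness of Spectrogram Characteristics and Noisy Acoustic Environments [Thesis]
Beijing: University of Chinese Academy of Sciences, 2018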
Authors:  聂帅
Adobe PDF(5903Kb)  |  View/Download:119/11  |  Submit date:2018/05/31
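Speech Enhancement  Speech Separation  Deep Learning  Non-negative Matrix Factorization  Spectrogram Characteristics  Acoustic Environment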