CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:196/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:165/44  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Focal Loss for Punctuation Prediction 会议论文
, 北京,中国, 2020.10.25-2020.10.29
作者:  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Ye Bai;  Cunhang Fan
浏览  |  Adobe PDF(247Kb)  |  收藏  |  浏览/下载:167/54  |  提交时间:2020/10/22
多语言语音数据库自动优化方法研究 会议论文
, 青海西宁, 2019-8
作者:  傅睿博;  陶建华;  温正棋;  易江燕;  王诗明;  强春雨
浏览  |  Adobe PDF(542Kb)  |  收藏  |  浏览/下载:356/113  |  提交时间:2020/06/24
语音数据库优化  语音合成  多语言  数据对匹配度  
Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition 会议论文
, Brighton, UK, 2019.05.12-2019.05.18
作者:  Jiangyan Yi;  Jianhua Tao;  Ye Bai
浏览  |  Adobe PDF(295Kb)  |  收藏  |  浏览/下载:73/34  |  提交时间:2020/10/22
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis 会议论文
, 印度海得拉巴, 2018-9
作者:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
浏览  |  Adobe PDF(340Kb)  |  收藏  |  浏览/下载:229/46  |  提交时间:2020/06/27
基于静音时长和文本特征融合的韵律边界自动标注 会议论文
, 江苏连云港, 2017-10
作者:  傅睿博;  李雅;  温正棋;  陶建华
浏览  |  Adobe PDF(877Kb)  |  收藏  |  浏览/下载:221/80  |  提交时间:2020/06/27
Multimodal Emotion Recognition with Transfer Learning of Deep Neural Network 期刊论文
ZTE Communications, 2017, 卷号: 15, 期号: S2, 页码: 1673-5188
作者:  Huang, Jian;  Li, Ya;  Tao, Jianhua;  Yi, Jianyan
浏览  |  Adobe PDF(828Kb)  |  收藏  |  浏览/下载:170/52  |  提交时间:2020/06/20
deep neutral network  ensemble method  multimodal emotion recognition  transfer learning  
Hierarchical stress generation with Fujisaki model in expressive speech synthesis 会议论文
Proceedings of the International Conference on Speech Prosody, Ireland, 2014
作者:  Ya Li;  Jianhua Tao;  Keikichi Hirose;  Wei Lai;  Xiaoying Xu
浏览  |  Adobe PDF(146Kb)  |  收藏  |  浏览/下载:281/91  |  提交时间:2018/11/26
Hierarchical stress modeling in Mandarin text-to-speech 会议论文
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Italy, 2011.9
作者:  Ya Li;  Jianhua Tao;  Xiaoying Xu
浏览  |  Adobe PDF(238Kb)  |  收藏  |  浏览/下载:248/90  |  提交时间:2018/11/26