已选(0)清除
条数/页: 排序方式: |
| CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文 IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254 作者: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi 收藏  |  浏览/下载:196/0  |  提交时间:2022/09/19 Speech processing Decoding Predictive models Acoustics Transfer learning Training Task analysis Coarse-to-fine decoding mask prediction one-shot learning text-based speech editing text-to-speech |
| A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文 Neurocomputing, 2021, 期号: 450, 页码: 208-218 作者: Mingyue Niu; Bin Liu; Jianhua Tao; Qifei Li Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:165/44  |  提交时间:2021/06/01 Sphere embedding normalization DenseNet Transition layer Time-frequency channel attention block Time-frequency vectorization block Depression detection |
| Focal Loss for Punctuation Prediction 会议论文 , 北京,中国, 2020.10.25-2020.10.29 作者: Jiangyan Yi; Jianhua Tao; Zhengkun Tian; Ye Bai; Cunhang Fan 浏览  |  Adobe PDF(247Kb)  |  收藏  |  浏览/下载:167/54  |  提交时间:2020/10/22 |
| 多语言语音数据库自动优化方法研究 会议论文 , 青海西宁, 2019-8 作者: 傅睿博; 陶建华; 温正棋; 易江燕; 王诗明; 强春雨 浏览  |  Adobe PDF(542Kb)  |  收藏  |  浏览/下载:356/113  |  提交时间:2020/06/24 语音数据库优化 语音合成 多语言 数据对匹配度 |
| Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition 会议论文 , Brighton, UK, 2019.05.12-2019.05.18 作者: Jiangyan Yi; Jianhua Tao; Ye Bai 浏览  |  Adobe PDF(295Kb)  |  收藏  |  浏览/下载:73/34  |  提交时间:2020/10/22 |
| Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis 会议论文 , 印度海得拉巴, 2018-9 作者: Fu, Ruibo; Tao, Jianhua; Zheng, Yibin; Wen, Zhengqi 浏览  |  Adobe PDF(340Kb)  |  收藏  |  浏览/下载:229/46  |  提交时间:2020/06/27 |
| 基于静音时长和文本特征融合的韵律边界自动标注 会议论文 , 江苏连云港, 2017-10 作者: 傅睿博; 李雅; 温正棋; 陶建华 浏览  |  Adobe PDF(877Kb)  |  收藏  |  浏览/下载:221/80  |  提交时间:2020/06/27 |
| Multimodal Emotion Recognition with Transfer Learning of Deep Neural Network 期刊论文 ZTE Communications, 2017, 卷号: 15, 期号: S2, 页码: 1673-5188 作者: Huang, Jian; Li, Ya; Tao, Jianhua; Yi, Jianyan 浏览  |  Adobe PDF(828Kb)  |  收藏  |  浏览/下载:170/52  |  提交时间:2020/06/20 deep neutral network ensemble method multimodal emotion recognition transfer learning |
| Hierarchical stress generation with Fujisaki model in expressive speech synthesis 会议论文 Proceedings of the International Conference on Speech Prosody, Ireland, 2014 作者: Ya Li; Jianhua Tao; Keikichi Hirose; Wei Lai; Xiaoying Xu 浏览  |  Adobe PDF(146Kb)  |  收藏  |  浏览/下载:281/91  |  提交时间:2018/11/26 |
| Hierarchical stress modeling in Mandarin text-to-speech 会议论文 Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Italy, 2011.9 作者: Ya Li; Jianhua Tao; Xiaoying Xu 浏览  |  Adobe PDF(238Kb)  |  收藏  |  浏览/下载:248/90  |  提交时间:2018/11/26 |