CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共10条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:245/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:209/58  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
User behavior fusion in dialog management with multi-modal history cues 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 卷号: 74, 期号: 22, 页码: 10025-10051
作者:  Yang, Minghao;  Tao, Jianhua;  Chao, Linlin;  Li, Hao;  Zhang, Dawei;  Che, Hao;  Gao, Tingli;  Liu, Bin
Adobe PDF(1839Kb)  |  收藏  |  浏览/下载:119/9  |  提交时间:2020/10/27
Dialog Management (Dm)  Multi-modal Data Fusion  Human Computer Interaction (Hci)  Emotion Detection  
Focal Loss for Punctuation Prediction 会议论文
, 北京,中国, 2020.10.25-2020.10.29
作者:  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Ye Bai;  Cunhang Fan
Adobe PDF(247Kb)  |  收藏  |  浏览/下载:206/58  |  提交时间:2020/10/22
Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition 会议论文
, Brighton, UK, 2019.05.12-2019.05.18
作者:  Jiangyan Yi;  Jianhua Tao;  Ye Bai
浏览  |  Adobe PDF(295Kb)  |  收藏  |  浏览/下载:117/50  |  提交时间:2020/10/22
Transfer Learning based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis 会议论文
, 印度海得拉巴, 2018-9
作者:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
浏览  |  Adobe PDF(340Kb)  |  收藏  |  浏览/下载:270/53  |  提交时间:2020/06/27
FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS 会议论文
, 网上虚拟会议, 2020-5
作者:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Wang, Tao
浏览  |  Adobe PDF(154Kb)  |  收藏  |  浏览/下载:366/87  |  提交时间:2020/06/27
prosody transfer  optimization strategy  speaker adaptation  attention  speech synthesis  
多语言语音数据库自动优化方法研究 会议论文
, 青海西宁, 2019-8
作者:  傅睿博;  陶建华;  温正棋;  易江燕;  王诗明;  强春雨
浏览  |  Adobe PDF(542Kb)  |  收藏  |  浏览/下载:404/122  |  提交时间:2020/06/24
语音数据库优化  语音合成  多语言  数据对匹配度  
基于静音时长和文本特征融合的韵律边界自动标注 期刊论文
清华大学学报(自然科学版), 2018, 卷号: 58, 期号: 1, 页码: 61-66,74
作者:  傅睿博;  陶建华;  李雅;  温正棋
浏览  |  Adobe PDF(1160Kb)  |  收藏  |  浏览/下载:341/127  |  提交时间:2020/06/21
韵律边界标注  决策融合  静音时长  语料库构建  语音合成  
Multimodal Emotion Recognition with Transfer Learning of Deep Neural Network 期刊论文
ZTE Communications, 2017, 卷号: 15, 期号: S2, 页码: 1673-5188
作者:  Huang, Jian;  Li, Ya;  Tao, Jianhua;  Yi, Jianyan
浏览  |  Adobe PDF(828Kb)  |  收藏  |  浏览/下载:205/57  |  提交时间:2020/06/20
deep neutral network  ensemble method  multimodal emotion recognition  transfer learning