CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:199/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:169/46  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
A multimodal approach of generating 3D human-like talking agent 期刊论文
JOURNAL ON MULTIMODAL USER INTERFACES, 2012, 卷号: 5, 期号: 1-2, 页码: 61-68
作者:  Yang, Minghao;  Tao, Jianhua;  Mu, Kaihui;  Li, Ya;  Che, Jianfeng
收藏  |  浏览/下载:68/0  |  提交时间:2020/10/27
Multimodal 3d Talking Agent  Lip Movement  Head Motion  Mfcc  Facial Expression  Gesture Animation  
Multiple style exploration for story unit segmentation of broadcast news video 期刊论文
MULTIMEDIA SYSTEMS, 2014, 卷号: 20, 期号: 4;4, 页码: 347-361
作者:  Feng, Bailan;  Chen, Zhineng;  Zheng, Rong;  Xu, Bo
收藏  |  浏览/下载:45/0  |  提交时间:2020/10/27
Multiple Style Exploration  Story Unit Pre-location  Story Unit Description  Story Unit Segmentation  
Hierarchical stress modeling and generation in mandarin for expressive Text-to-Speech 期刊论文
SPEECH COMMUNICATION, 2015, 卷号: 72, 页码: 59-73
作者:  Li, Ya;  Tao, Jianhua;  Hirose, Keikichi;  Xu, Xiaoying;  Lai, Wei
收藏  |  浏览/下载:64/0  |  提交时间:2020/10/27
Prosody  Stress  Hierarchical Modeling  Fujisaki Model  Speech Synthesis  
Guest Editorial: Advances in Machine Learning for Speech Processing 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 卷号: 82, 期号: 2, 页码: 137-140
作者:  Dong, Minghui;  Tao, Jianhua;  Mak, Man Wai
收藏  |  浏览/下载:12/0  |  提交时间:2020/10/27
Speech Recognition  Speech Classification  
Emotional head motion predicting from prosodic and linguistic features 期刊论文
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 卷号: 75, 期号: 9, 页码: 5125-5146
作者:  Yang, Minghao;  Jiang, Jinlin;  Tao, Jianhua;  Mu, Kaihui;  Li, Hao
收藏  |  浏览/下载:53/0  |  提交时间:2020/10/27
Visual Prosody  Head Gesture  Prosody Clustering  
Quantitative intonation modeling of interrogative sentences for Mandarin speech synthesis 期刊论文
SPEECH COMMUNICATION, 2017, 卷号: 89, 期号: 1, 页码: 92-102
作者:  Li, Ya;  Tao, Jianhua;  Lai, Wei;  Xu, Xiaoying
收藏  |  浏览/下载:73/0  |  提交时间:2020/10/27
F0 Declination  Intonation  Interrogative Sentences  Final Lowering  Prosody  
Prosody conversion from neutral speech to emotional speech 期刊论文
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 卷号: 14, 期号: 4, 页码: 1145-1154
作者:  Tao, JH;  Kang, YG;  Li, AJ
浏览  |  Adobe PDF(557Kb)  |  收藏  |  浏览/下载:252/106  |  提交时间:2015/11/07
Emotional Speech  Prosody Analysis  Speech Synthesis  
Features importance analysis for emotional speech classification 期刊论文
AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 卷号: 3784, 期号: 0, 页码: 449-457
作者:  Tao, JH;  Kang, YG;  Tao, J;  Picard, RW
浏览  |  Adobe PDF(188Kb)  |  收藏  |  浏览/下载:202/74  |  提交时间:2015/11/06
Emotionspeech