CASIA OpenIR

Browse/Search results: 13 items in total, showing items 1-10

SceneFake: An initial dataset and benchmarks for scene fake audio detection (journal article)
PATTERN RECOGNITION, 2024, Volume: 152, Pages: 12
Authors: Yi, Jiangyan; Wang, Chenglong; Tao, Jianhua; Zhang, Chu Yuan; Fan, Cunhang; Tian, Zhengkun; Ma, Haoxin; Fu, Ruibo
Views/Downloads: 27/0  |  Submitted: 2024/07/04
Scene manipulation  Fake audio detection  Speech enhancement  SceneFake dataset  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 2241-2254
Authors: Wang, Tao; Yi, Jiangyan; Fu, Ruibo; Tao, Jianhua; Wen, Zhengqi
Views/Downloads: 254/0  |  Submitted: 2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
A time-frequency channel attention and vectorization network for automatic depression level prediction (journal article)
Neurocomputing, 2021, Issue: 450, Pages: 208-218
Authors: Mingyue Niu; Bin Liu; Jianhua Tao; Qifei Li
Adobe PDF (2001 KB)  |  Views/Downloads: 215/63  |  Submitted: 2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
A multimodal approach of generating 3D human-like talking agent (journal article)
JOURNAL ON MULTIMODAL USER INTERFACES, 2012, Volume: 5, Issue: 1-2, Pages: 61-68
Authors: Yang, Minghao; Tao, Jianhua; Mu, Kaihui; Li, Ya; Che, Jianfeng
Views/Downloads: 87/0  |  Submitted: 2020/10/27
Multimodal 3d Talking Agent  Lip Movement  Head Motion  Mfcc  Facial Expression  Gesture Animation  
Multiple style exploration for story unit segmentation of broadcast news video (journal article)
MULTIMEDIA SYSTEMS, 2014, Volume: 20, Issue: 4, Pages: 347-361
Authors: Feng, Bailan; Chen, Zhineng; Zheng, Rong; Xu, Bo
Views/Downloads: 66/0  |  Submitted: 2020/10/27
Multiple Style Exploration  Story Unit Pre-location  Story Unit Description  Story Unit Segmentation  
Hierarchical stress modeling and generation in mandarin for expressive Text-to-Speech (journal article)
SPEECH COMMUNICATION, 2015, Volume: 72, Pages: 59-73
Authors: Li, Ya; Tao, Jianhua; Hirose, Keikichi; Xu, Xiaoying; Lai, Wei
Views/Downloads: 99/0  |  Submitted: 2020/10/27
Prosody  Stress  Hierarchical Modeling  Fujisaki Model  Speech Synthesis  
User behavior fusion in dialog management with multi-modal history cues (journal article)
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, Volume: 74, Issue: 22, Pages: 10025-10051
Authors: Yang, Minghao; Tao, Jianhua; Chao, Linlin; Li, Hao; Zhang, Dawei; Che, Hao; Gao, Tingli; Liu, Bin
Adobe PDF (1839 KB)  |  Views/Downloads: 130/13  |  Submitted: 2020/10/27
Dialog Management (Dm)  Multi-modal Data Fusion  Human Computer Interaction (Hci)  Emotion Detection  
Guest Editorial: Advances in Machine Learning for Speech Processing (journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, Volume: 82, Issue: 2, Pages: 137-140
Authors: Dong, Minghui; Tao, Jianhua; Mak, Man Wai
Views/Downloads: 14/0  |  Submitted: 2020/10/27
Speech Recognition  Speech Classification  
Emotional head motion predicting from prosodic and linguistic features (journal article)
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, Volume: 75, Issue: 9, Pages: 5125-5146
Authors: Yang, Minghao; Jiang, Jinlin; Tao, Jianhua; Mu, Kaihui; Li, Hao
Adobe PDF (804 KB)  |  Views/Downloads: 92/10  |  Submitted: 2020/10/27
Visual Prosody  Head Gesture  Prosody Clustering  
Quantitative intonation modeling of interrogative sentences for Mandarin speech synthesis (journal article)
SPEECH COMMUNICATION, 2017, Volume: 89, Issue: 1, Pages: 92-102
Authors: Li, Ya; Tao, Jianhua; Lai, Wei; Xu, Xiaoying
Views/Downloads: 118/0  |  Submitted: 2020/10/27
F0 Declination  Intonation  Interrogative Sentences  Final Lowering  Prosody