CASIA OpenIR

Browse/Search Results: 41 items in total, showing items 1-10

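Research on Model Fingerprint Analysis for Generated Speech [面向生成语音的模型指纹分析研究] (Thesis)
2024
Authors:  ZHANG, CHU YUAN
Adobe PDF(2152Kb)  |  Views/Downloads: 30/0  |  Submitted: 2024/06/25
Generated speech  Speech generation method identification  Acoustic model  Vocoder  Model fingerprint analysis  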
Multi-Scale Permutation Entropy for Audio Deepfake Detection (Conference paper)
Seoul, South Korea, 2024-4-14
Authors:  Chenglong Wang;  He JY (何佳毅);  Jiangyan Yi;  Jianhua Tao;  Chu Yuan Zhang;  Xiaohui Zhang
Adobe PDF(997Kb)  |  Views/Downloads: 68/22  |  Submitted: 2024/06/13
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition (Journal article)
Information Fusion, 2024, Volume: 108, Pages: 1-20
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  Views/Downloads: 67/17  |  Submitted: 2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
CHINESE INTONATION ASSESSMENT USING SEV FEATURES (Conference paper)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009
Authors:  Ke, DF;  Xu, B
Views/Downloads: 21/0  |  Submitted: 2020/10/27
User behavior fusion in dialog management with multi-modal history cues (Journal article)
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, Volume: 74, Issue: 22, Pages: 10025-10051
Authors:  Yang, Minghao;  Tao, Jianhua;  Chao, Linlin;  Li, Hao;  Zhang, Dawei;  Che, Hao;  Gao, Tingli;  Liu, Bin
Adobe PDF(1839Kb)  |  Views/Downloads: 130/13  |  Submitted: 2020/10/27
Dialog Management (DM)  Multi-modal Data Fusion  Human Computer Interaction (HCI)  Emotion Detection  
Emotional head motion predicting from prosodic and linguistic features (Journal article)
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, Volume: 75, Issue: 9, Pages: 5125-5146
Authors:  Yang, Minghao;  Jiang, Jinlin;  Tao, Jianhua;  Mu, Kaihui;  Li, Hao
Adobe PDF(804Kb)  |  Views/Downloads: 92/10  |  Submitted: 2020/10/27
Visual Prosody  Head Gesture  Prosody Clustering  
Quantitative intonation modeling of interrogative sentences for Mandarin speech synthesis (Journal article)
SPEECH COMMUNICATION, 2017, Volume: 89, Issue: 1, Pages: 92-102
Authors:  Li, Ya;  Tao, Jianhua;  Lai, Wei;  Xu, Xiaoying
Views/Downloads: 118/0  |  Submitted: 2020/10/27
F0 Declination  Intonation  Interrogative Sentences  Final Lowering  Prosody  
Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition (Conference paper)
Brighton, UK, 2019.05.12-2019.05.18
Authors:  Jiangyan Yi;  Jianhua Tao;  Ye Bai
Adobe PDF(295Kb)  |  Views/Downloads: 123/54  |  Submitted: 2020/10/22
Progressive Neural Networks based Features Prediction for the Target Cost in Unit-Selection Speech Synthesizer (Conference paper)
Beijing, 2018-8
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
Adobe PDF(1188Kb)  |  Views/Downloads: 252/76  |  Submitted: 2020/06/27
speech synthesis  progressive neural networks  unit-selection  target cost  
Deep Metric Learning for the Target Cost in Unit-Selection Speech Synthesizer (Conference paper)
Hyderabad, India, 2018-9
Authors:  Fu, Ruibo;  Tao, Jianhua;  Zheng, Yibin;  Wen, Zhengqi
Adobe PDF(323Kb)  |  Views/Downloads: 304/72  |  Submitted: 2020/06/27
speech synthesis  unit-selection  target cost  deep metric learning