CASIA OpenIR

Browse/Search results: 6 items in total, showing items 1-6

SceneFake: An initial dataset and benchmarks for scene fake audio detection (Journal article)
PATTERN RECOGNITION, 2024, Volume: 152, Pages: 12
Authors:  Yi, Jiangyan;  Wang, Chenglong;  Tao, Jianhua;  Zhang, Chu Yuan;  Fan, Cunhang;  Tian, Zhengkun;  Ma, Haoxin;  Fu, Ruibo
Views/Downloads: 27/0  |  Submitted: 2024/07/04
Scene manipulation  Fake audio detection  Speech enhancement  SceneFake dataset
Emotion selectable end-to-end text-based speech editing (Journal article)
ARTIFICIAL INTELLIGENCE, 2024, Volume: 329, Pages: 16
Authors:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zhang, Chu Yuan
Views/Downloads: 20/0  |  Submitted: 2024/07/03
Emotion selectable  Text-based speech editing  Emotion decoupling  Mask prediction  Few-shot learning  Text-to-speech  
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms (Conference paper)
Taiyuan, Shanxi, China, 2024-07-27
Authors:  Zhang, Chu Yuan;  Yi, Jiangyan;  Tao, Jianhua;  Wang, Chenglong;  Yan, Xinrui
Adobe PDF (2254Kb)  |  Views/Downloads: 38/14  |  Submitted: 2024/06/26
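面向生成语音的模型指纹分析研究 (Research on Model Fingerprint Analysis for Generated Speech) (Thesis)
2024
Author:  Zhang, Chu Yuan
Adobe PDF (2152Kb)  |  Views/Downloads: 30/0  |  Submitted: 2024/06/25
Generated speech  Speech generation method identification  Acoustic model  Vocoder  Model fingerprint analysis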
Multi-Scale Permutation Entropy for Audio Deepfake Detection (Conference paper)
Seoul, South Korea, 2024-04-14
Authors:  Chenglong Wang;  He JY (何佳毅);  Jiangyan Yi;  Jianhua Tao;  Chu Yuan Zhang;  Xiaohui Zhang
Adobe PDF (997Kb)  |  Views/Downloads: 68/22  |  Submitted: 2024/06/13
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, Volume: 31, Pages: 2963-2973
Authors:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
Views/Downloads: 87/0  |  Submitted: 2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings