CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共59条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms 会议论文
, Taiyuan, Shanxi, China, 2024-07-27
作者:  Zhang, Chu Yuan;  Yi, Jiangyan;  Tao, Jianhua;  Wang, Chenglong;  Yan, Xinrui
Adobe PDF(2254Kb)  |  收藏  |  浏览/下载:2/2  |  提交时间:2024/06/26
Multi-Scale Permutation Entropy for Audio Deepfake Detection 会议论文
, 韩国首尔, 2024-4-14
作者:  Chenglong Wang;  He JY(何佳毅);  Jiangyan Yi;  Jianhua Tao;  Chu Yuan Zhang;  Xiaohui Zhang
Adobe PDF(997Kb)  |  收藏  |  浏览/下载:20/8  |  提交时间:2024/06/13
GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios 会议论文
, 希腊罗得岛, 2023年6月
作者:  Li GJ(李冠君);  Liu WJ(刘文举);  Yi JY(易江燕);  Tao JH(陶建华)
Adobe PDF(3463Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/06
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning 会议论文
, Ottawa, ON, Canada, October 29-November 3, 2023
作者:  Zheng Lian;  Haiyang Sun;  Licai Sun;  Kang Chen;  Mingyu Xu;  Kexin Wang;  Ke Xu;  Yu He;  Ying Li;  Jinming Zhao;  Ye Liu;  Bin Liu;  Jiangyan Yi;  Meng Wang;  Erik Cambria;  Guoying Zhao;  Björn W. Schuller;  Jianhua Tao
Adobe PDF(993Kb)  |  收藏  |  浏览/下载:29/8  |  提交时间:2024/05/31
Subband fusion of complex spectrogram for fake speech detection 期刊论文
SPEECH COMMUNICATION, 2023, 卷号: 155, 页码: 8
作者:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:40/0  |  提交时间:2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:66/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition 期刊论文
APPLIED ACOUSTICS, 2023, 卷号: 212, 页码: 10
作者:  Fan, Cunhang;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
收藏  |  浏览/下载:45/0  |  提交时间:2023/11/16
Robust end-to-end ASR  Speech enhancement  Masking and mapping  Speech distortion  Deep spectrum fusion  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:232/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
SpecMNet: Spectrum mend network for monaural speech enhancement 期刊论文
APPLIED ACOUSTICS, 2022, 卷号: 194, 页码: 9
作者:  Fan, Cunhang;  Zhang, Hongmei;  Yi, Jiangyan;  Lv, Zhao;  Tao, Jianhua;  Li, Taihao;  Pei, Guanxiong;  Wu, Xiaopei;  Li, Sheng
收藏  |  浏览/下载:252/0  |  提交时间:2022/07/25
Monaural speech enhancement  Speech distortion  Spectrum mend network  SI-SNR  BLSTM  
普通话语音识别中的神经网络语言模型的比较研究 会议论文
, 青海西宁, 2019-8
作者:  马浩鑫;  白烨;  易江燕;  陶建华
Adobe PDF(621Kb)  |  收藏  |  浏览/下载:226/75  |  提交时间:2022/06/20