CASIA OpenIR

Browse/Search Results:  1-10 of 55 Help

  Show only claimed items
Selected(0)Clear Items/Page:    Sort:
Subband fusion of complex spectrogram for fake speech detection 期刊论文
SPEECH COMMUNICATION, 2023, 卷号: 155, 页码: 8
Authors:  Fan, Cunhang;  Xue, Jun;  Dong, Shunbo;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
Favorite  |  View/Download:19/0  |  Submit date:2024/03/26
Automatic speaker verification  Complex spectrogram  Fake speech detection  Phase information  Subband  
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
Authors:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
Favorite  |  View/Download:48/0  |  Submit date:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
Two-stage deep spectrum fusion for noise-robust end-to-end speech recognition 期刊论文
APPLIED ACOUSTICS, 2023, 卷号: 212, 页码: 10
Authors:  Fan, Cunhang;  Ding, Mingming;  Yi, Jiangyan;  Li, Jinpeng;  Lv, Zhao
Favorite  |  View/Download:29/0  |  Submit date:2023/11/16
Robust end-to-end ASR  Speech enhancement  Masking and mapping  Speech distortion  Deep spectrum fusion  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
Authors:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
Favorite  |  View/Download:212/0  |  Submit date:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
SpecMNet: Spectrum mend network for monaural speech enhancement 期刊论文
APPLIED ACOUSTICS, 2022, 卷号: 194, 页码: 9
Authors:  Fan, Cunhang;  Zhang, Hongmei;  Yi, Jiangyan;  Lv, Zhao;  Tao, Jianhua;  Li, Taihao;  Pei, Guanxiong;  Wu, Xiaopei;  Li, Sheng
Favorite  |  View/Download:224/0  |  Submit date:2022/07/25
Monaural speech enhancement  Speech distortion  Spectrum mend network  SI-SNR  BLSTM  
普通话语音识别中的神经网络语言模型的比较研究 会议论文
, 青海西宁, 2019-8
Authors:  马浩鑫;  白烨;  易江燕;  陶建华
Adobe PDF(621Kb)  |  Favorite  |  View/Download:196/64  |  Submit date:2022/06/20
Hypersphere Embedding and Additive Margin for Query-by-example Keyword Spotting 会议论文
, 中国兰州, 2019-11
Authors:  Ma Haoxin;  Bai Ye;  Yi Jiangyan;  Tao Jianhua
Adobe PDF(3993Kb)  |  Favorite  |  View/Download:144/46  |  Submit date:2022/06/20
Continual Learning for Fake Audio Detection 会议论文
, 线上(捷克), 2021-9
Authors:  Ma Haoxin;  Yi Jiangyan;  Tao Jianhua;  Bai Ye;  Tian Zhengkun;  Wang Chenglong
Adobe PDF(2113Kb)  |  Favorite  |  View/Download:241/62  |  Submit date:2022/06/20
fake audio detection  continual learning  detecting fake without forgetting  
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2022, 页码: 762-766
Authors:  Zhengkun Tian;  Jiangyan Yi;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(934Kb)  |  Favorite  |  View/Download:271/76  |  Submit date:2022/06/14
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization 会议论文
, Brno, Czechia, 30 August – 3 September
Authors:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(839Kb)  |  Favorite  |  View/Download:191/44  |  Submit date:2022/06/14