CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共9条,第1-9条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Continual Learning for Fake Audio Detection 会议论文
, 线上(捷克), 2021-9
作者:  Ma Haoxin;  Yi Jiangyan;  Tao Jianhua;  Bai Ye;  Tian Zhengkun;  Wang Chenglong
Adobe PDF(2113Kb)  |  收藏  |  浏览/下载:267/67  |  提交时间:2022/06/20
fake audio detection  continual learning  detecting fake without forgetting  
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization 会议论文
, Brno, Czechia, 30 August – 3 September
作者:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(839Kb)  |  收藏  |  浏览/下载:212/49  |  提交时间:2022/06/14
One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition 会议论文
, Tokyo, Japan, 14-17 December 2021
作者:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(563Kb)  |  收藏  |  浏览/下载:194/46  |  提交时间:2022/06/14
F-0-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 3375-3383
作者:  Li, Yongwei;  Tao, Jianhua;  Erickson, Donna;  Liu, Bin;  Akagi, Masato
收藏  |  浏览/下载:149/0  |  提交时间:2021/12/28
Speech recognition  Iterative methods  Production  Estimation  Brain modeling  Shape  Low-frequency noise  Glottal source  vocal tract  source-filter model  ARX-LF model  
Self-supervised graph representation learning via bootstrapping 期刊论文
NEUROCOMPUTING, 2021, 卷号: 456, 页码: 88-96
作者:  Che, Feihu;  Yang, Guohua;  Zhang, Dawei;  Tao, Jianhua;  Liu, Tong
Adobe PDF(1379Kb)  |  收藏  |  浏览/下载:391/67  |  提交时间:2021/11/03
Graph representation learning  Self-supervised  Bootstrapping  Graph neural network  
Integrating Knowledge Into End-to-End Speech Recognition From External Text-Only Data 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 1340-1351
作者:  Bai, Ye;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi;  Tian, Zhengkun;  Zhang, Shuai
收藏  |  浏览/下载:195/0  |  提交时间:2021/06/07
End-to-End  language modeling  speech recognition  teacher-student learning  transfer learning  
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning 会议论文
, Hong Kong, 24-27 Jan. 2021
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi;  Song, Leichao
Adobe PDF(934Kb)  |  收藏  |  浏览/下载:250/60  |  提交时间:2021/06/01
CTNet: Conversational Transformer Network for Emotion Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 985-1000
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  收藏  |  浏览/下载:391/62  |  提交时间:2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:428/55  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer