CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共6条,第1-6条 帮助

限定条件                            
已选(0)清除 条数/页:   排序方式:
MULTIMODAL CROSS- AND SELF-ATTENTION NETWORK FOR SPEECH EMOTION RECOGNITION 会议论文
, Toronto, Canada, 6-12 June 2021
作者:  Licai Sun;  Bin Liu;  Jianhua Tao;  Zheng Lian
Adobe PDF(1078Kb)  |  收藏  |  浏览/下载:43/12  |  提交时间:2024/06/03
Continual Learning for Fake Audio Detection 会议论文
, 线上(捷克), 2021-9
作者:  Ma Haoxin;  Yi Jiangyan;  Tao Jianhua;  Bai Ye;  Tian Zhengkun;  Wang Chenglong
Adobe PDF(2113Kb)  |  收藏  |  浏览/下载:280/72  |  提交时间:2022/06/20
fake audio detection  continual learning  detecting fake without forgetting  
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization 会议论文
, Brno, Czechia, 30 August – 3 September
作者:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(839Kb)  |  收藏  |  浏览/下载:226/55  |  提交时间:2022/06/14
Multi-aspect self-supervised learning for heterogeneous information network 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2021, 卷号: 233, 页码: 14
作者:  Che, Feihu;  Tao, Jianhua;  Yang, Guohua;  Liu, Tong;  Zhang, Dawei
Adobe PDF(2661Kb)  |  收藏  |  浏览/下载:267/54  |  提交时间:2021/12/28
Heterogeneous information network  Self-supervised  Contrastive learning  Graph neural network  
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 期号: 29, 页码: 1897 - 1911
作者:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(1163Kb)  |  收藏  |  浏览/下载:216/69  |  提交时间:2021/06/25
端到端语音识别、迁移学习、知识蒸馏、老师-学生学习、BERT、非自回归语音识别  
CTNet: Conversational Transformer Network for Emotion Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 985-1000
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  收藏  |  浏览/下载:408/64  |  提交时间:2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling