CASIA OpenIR
(This search is based on the user's claimed works)

Browse/Search results: 10 items in total, items 1-10

EmotionNAS: Two-stream Neural Architecture Search for Speech Emotion Recognition (conference paper)
Dublin, Ireland, 20-24 August 2023
Authors: Haiyang Sun; Zheng Lian; Bin Liu; Ying Li; Licai Sun; Cong Cai; Jianhua Tao; Meng Wang; Yuan Cheng
Adobe PDF(826Kb) | Views/Downloads: 29/8 | Submitted: 2024/05/31
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization (conference paper)
Brno, Czechia, 30 August – 3 September
Authors: Zhengkun Tian; Jiangyan Yi; Ye Bai; Jianhua Tao; Shuai Zhang; Zhengqi Wen
Adobe PDF(839Kb) | Views/Downloads: 210/49 | Submitted: 2022/06/14
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition (conference paper)
Shanghai, China, October 25–29, 2020
Authors: Zhengkun Tian; Jiangyan Yi; Jianhua Tao; Ye Bai; Shuai Zhang; Zhengqi Wen
Adobe PDF(629Kb) | Views/Downloads: 175/58 | Submitted: 2022/06/14
One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition (conference paper)
Tokyo, Japan, 14-17 December 2021
Authors: Zhengkun Tian; Jiangyan Yi; Ye Bai; Jianhua Tao; Shuai Zhang; Zhengqi Wen
Adobe PDF(563Kb) | Views/Downloads: 190/45 | Submitted: 2022/06/14
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT (journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, Issue: 29, Pages: 1897-1911
Authors: Ye Bai; Jiangyan Yi; Jianhua Tao; Zhengkun Tian; Zhengqi Wen; Shuai Zhang
Adobe PDF(1163Kb) | Views/Downloads: 203/62 | Submitted: 2021/06/25
End-to-end speech recognition, transfer learning, knowledge distillation, teacher-student learning, BERT, non-autoregressive speech recognition
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition (journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Issue: 29, Pages: 198-209
Authors: Fan, Cunhang; Yi, Jiangyan; Tao, Jianhua; Tian, Zhengkun; Liu, Bin; Wen, Zhengqi
Adobe PDF(2534Kb) | Views/Downloads: 423/55 | Submitted: 2021/03/08
Speech enhancement, Speech recognition, Training, Noise measurement, Logic gates, Acoustic distortion, Task analysis, Gated recurrent fusion, robust end-to-end speech recognition, speech distortion, speech enhancement, speech transformer
End-to-end Keywords Spotting Based on Connectionist Temporal Classification for Mandarin (conference paper)
Tianjin, 2016.10
Authors: Bai Y (白烨); Yi JY (易江燕); Ni H (倪浩); Wen ZQ (温正棋); Liu B (刘斌); Li Y (李雅); Tao JH (陶建华)
Views/Downloads: 51/0 | Submitted: 2020/10/27
Self-attention Based Model for Punctuation Prediction Using Word and Speech Embeddings (conference paper)
Brighton, UK, 2019.05.12-2019.05.15
Authors: Jiangyan Yi; Jianhua Tao
Adobe PDF(273Kb) | Views/Downloads: 44/17 | Submitted: 2020/10/22
Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition (conference paper)
Brighton, UK, 2019.05.12-2019.05.18
Authors: Jiangyan Yi; Jianhua Tao; Ye Bai
Adobe PDF(295Kb) | Views/Downloads: 105/45 | Submitted: 2020/10/22
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition (journal article)
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, Volume: 90, Issue: 7, Pages: 985-997
Authors: Jiangyan Yi; Zhengqi Wen; Jianhua Tao; Hao Ni; Bin Liu
Adobe PDF(1416Kb) | Views/Downloads: 164/62 | Submitted: 2020/10/22
multi-accent, Mandarin speech recognition, LSTM-RNN-CTC, model adaptation, CTC regularization